{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,6]],"date-time":"2025-12-06T16:58:44Z","timestamp":1765040324384},"reference-count":41,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2011,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>While there are many methods for predicting protein-protein interaction, very few can determine the specific site of interaction on each protein. Characterization of the specific sequence regions mediating interaction (binding sites) is crucial for an understanding of cellular pathways. Experimental methods often report false binding sites due to experimental limitations, while computational methods tend to require data which is not available at the proteome-scale. Here we present PIPE-Sites, a novel method of protein specific binding site prediction based on pairs of re-occurring polypeptide sequences, which have been previously shown to accurately predict protein-protein interactions. PIPE-Sites operates at high specificity and requires only the sequences of query proteins and a database of known binary interactions with no binding site data, making it applicable to binding site prediction at the proteome-scale.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>PIPE-Sites was evaluated using a dataset of 265 yeast and 423 human interacting protein pairs with experimentally-determined binding sites. We found that PIPE-Sites predictions were closer to the confirmed binding site than those of two existing binding site prediction methods based on domain-domain interactions, when applied to the same dataset. 
Finally, we applied PIPE-Sites to two datasets of 2347 yeast and 14,438 human novel interacting protein pairs predicted to interact with high confidence. An analysis of the predicted interaction sites revealed a number of protein subsequences which are highly re-occurring in binding sites and which may represent novel binding motifs.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>PIPE-Sites is an accurate method for predicting protein binding sites and is applicable to the proteome-scale. Thus, PIPE-Sites could be useful for exhaustive analysis of protein binding patterns in whole proteomes as well as discovery of novel binding motifs. PIPE-Sites is available online at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/pipe-sites.cgmlab.org\/\" ext-link-type=\"uri\">http:\/\/pipe-sites.cgmlab.org\/<\/jats:ext-link>.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-12-225","type":"journal-article","created":{"date-parts":[[2011,6,3]],"date-time":"2011-06-03T04:44:09Z","timestamp":1307076249000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":34,"title":["Binding Site Prediction for Protein-Protein Interactions and Novel Motif Discovery using Re-occurring Polypeptide 
Sequences"],"prefix":"10.1186","volume":"12","author":[{"given":"Adam","family":"Amos-Binks","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Catalin","family":"Patulea","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sylvain","family":"Pitre","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andrew","family":"Schoenrock","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yuan","family":"Gui","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"James R","family":"Green","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ashkan","family":"Golshani","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Frank","family":"Dehne","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2011,6,2]]},"reference":[{"key":"4573_CR1","doi-asserted-by":"publisher","first-page":"917","DOI":"10.1006\/jmbi.2000.4092","volume":"302","author":"X Gallet","year":"2000","unstructured":"Gallet X, Charloteaux B, Thomas A, Brasseur R: A fast method to predict protein interaction sites from sequences. Journal of molecular biology 2000, 302: 917\u201326. 10.1006\/jmbi.2000.4092","journal-title":"Journal of molecular biology"},{"key":"4573_CR2","doi-asserted-by":"publisher","first-page":"4569","DOI":"10.1073\/pnas.061034498","volume":"98","author":"T Ito","year":"2001","unstructured":"Ito T, Chiba T, Ozawa R, et al.: A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proceedings of the National Academy of Sciences of the United States of America 2001, 98: 4569\u201374. 
10.1073\/pnas.061034498","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"key":"4573_CR3","doi-asserted-by":"publisher","first-page":"623","DOI":"10.1038\/35001009","volume":"403","author":"P Uetz","year":"2000","unstructured":"Uetz P, Giot L, Cagney G, et al.: A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature 2000, 403: 623\u20137. 10.1038\/35001009","journal-title":"Nature"},{"key":"4573_CR4","doi-asserted-by":"publisher","first-page":"637","DOI":"10.1038\/nature04670","volume":"440","author":"NJ Krogan","year":"2006","unstructured":"Krogan NJ, Cagney G, Yu H, et al.: Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature 2006, 440: 637\u201343. 10.1038\/nature04670","journal-title":"Nature"},{"key":"4573_CR5","doi-asserted-by":"publisher","first-page":"141","DOI":"10.1038\/415141a","volume":"415","author":"A-C Gavin","year":"2002","unstructured":"Gavin A-C, B\u00f6sche M, Krause R, et al.: Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 2002, 415: 141\u20137. 10.1038\/415141a","journal-title":"Nature"},{"key":"4573_CR6","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1016\/S0968-0004(02)02197-7","volume":"27","author":"I Stagljar","year":"2002","unstructured":"Stagljar I: Analysis of membrane protein interactions using yeast-based technologies. Trends in Biochemical Sciences 2002, 27: 559\u2013563. 10.1016\/S0968-0004(02)02197-7","journal-title":"Trends in Biochemical Sciences"},{"key":"4573_CR7","doi-asserted-by":"publisher","first-page":"357","DOI":"10.1016\/0022-2836(78)90408-4","volume":"125","author":"J Janin","year":"1978","unstructured":"Janin J, Wodak SJ, Levitt M, Maigret B: Conformation of amino acid side-chains in proteins. Journal of Molecular Biology 1978, 125: 357\u2013386. 
10.1016\/0022-2836(78)90408-4","journal-title":"Journal of Molecular Biology"},{"key":"4573_CR8","doi-asserted-by":"publisher","first-page":"1","DOI":"10.2174\/138920308783565741","volume":"9","author":"DW Ritchie","year":"2008","unstructured":"Ritchie DW: Recent Progress and Future Directions in Protein-Protein Docking. Current Protein and Peptide Science 2008, 9: 1\u201315. 10.2174\/138920308783565741","journal-title":"Current Protein and Peptide Science"},{"key":"4573_CR9","doi-asserted-by":"publisher","first-page":"102","DOI":"10.1038\/nature01160","volume":"420","author":"CD Snow","year":"2002","unstructured":"Snow CD, Nguyen H, Pande VS, Gruebele M: Absolute comparison of simulated and experimental protein-folding dynamics. Nature 2002, 420: 102\u20136. 10.1038\/nature01160","journal-title":"Nature"},{"key":"4573_CR10","doi-asserted-by":"publisher","first-page":"269","DOI":"10.1186\/1471-2105-7-269","volume":"7","author":"H Lee","year":"2006","unstructured":"Lee H, Deng M, Sun F, Chen T: An integrated approach to the prediction of domain-domain interactions. BMC bioinformatics 2006, 7: 269. 10.1186\/1471-2105-7-269","journal-title":"BMC bioinformatics"},{"key":"4573_CR11","doi-asserted-by":"publisher","first-page":"R104","DOI":"10.1186\/gb-2006-7-11-r104","volume":"7","author":"KS Guimar\u00e3es","year":"2006","unstructured":"Guimar\u00e3es KS, Jothi R, Zotenko E, Przytycka TM: Predicting domain-domain interactions using a parsimony approach. Genome biology 2006, 7: R104. 10.1186\/gb-2006-7-11-r104","journal-title":"Genome biology"},{"key":"4573_CR12","doi-asserted-by":"publisher","first-page":"R89","DOI":"10.1186\/gb-2005-6-10-r89","volume":"6","author":"R Riley","year":"2005","unstructured":"Riley R, Lee C, Sabatti C, Eisenberg D: Inferring protein domain interactions from databases of interacting proteins. Genome biology 2005, 6: R89. 
10.1186\/gb-2005-6-10-r89","journal-title":"Genome biology"},{"key":"4573_CR13","doi-asserted-by":"publisher","first-page":"R192","DOI":"10.1186\/gb-2007-8-9-r192","volume":"8","author":"H Wang","year":"2007","unstructured":"Wang H, Segal E, Ben-Hur A, et al.: InSite: a computational method for identifying protein-protein interaction binding sites on a proteome-wide scale. Genome biology 2007, 8: R192. 10.1186\/gb-2007-8-9-r192","journal-title":"Genome biology"},{"key":"4573_CR14","doi-asserted-by":"publisher","first-page":"236","DOI":"10.1016\/S0014-5793(03)00456-3","volume":"544","author":"Y Ofran","year":"2003","unstructured":"Ofran Y, Rost B: Predicted protein-protein interaction sites from local sequence information. FEBS Letters 2003, 544: 236\u2013239. 10.1016\/S0014-5793(03)00456-3","journal-title":"FEBS Letters"},{"key":"4573_CR15","doi-asserted-by":"publisher","first-page":"2496","DOI":"10.1093\/bioinformatics\/bti340","volume":"21","author":"I Res","year":"2005","unstructured":"Res I, Mihalek I, Lichtarge O: An evolution based classifier for prediction of protein interfaces without using protein structures. Bioinformatics (Oxford, England) 2005, 21: 2496\u2013501. 10.1093\/bioinformatics\/bti340","journal-title":"Bioinformatics (Oxford, England)"},{"key":"4573_CR16","doi-asserted-by":"publisher","first-page":"597","DOI":"10.1093\/bioinformatics\/btl660","volume":"23","author":"M-H Li","year":"2007","unstructured":"Li M-H, Lin L, Wang X-L, Liu T: Protein-protein interaction site prediction based on conditional random fields. Bioinformatics (Oxford, England) 2007, 23: 597\u2013604. 
10.1093\/bioinformatics\/btl660","journal-title":"Bioinformatics (Oxford, England)"},{"key":"4573_CR17","doi-asserted-by":"publisher","first-page":"2002","DOI":"10.1093\/nar\/gkn016","volume":"36","author":"J Guo","year":"2008","unstructured":"Guo J, Wu X, Zhang D-Y, Lin K: Genome-wide inference of protein interaction sites: lessons from the yeast high-quality negative protein-protein interaction dataset. Nucleic acids research 2008, 36: 2002\u201311. 10.1093\/nar\/gkn016","journal-title":"Nucleic acids research"},{"key":"4573_CR18","doi-asserted-by":"publisher","first-page":"D637","DOI":"10.1093\/nar\/gkm1001","volume":"36","author":"B-J Breitkreutz","year":"2008","unstructured":"Breitkreutz B-J, Stark C, Reguly T, et al.: The BioGRID Interaction Database: 2008 update. Nucleic acids research 2008, 36: D637\u201340.","journal-title":"Nucleic acids research"},{"key":"4573_CR19","doi-asserted-by":"publisher","first-page":"419","DOI":"10.1186\/1471-2105-10-419","volume":"10","author":"Y Park","year":"2009","unstructured":"Park Y: Critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences. BMC bioinformatics 2009, 10: 419. 10.1186\/1471-2105-10-419","journal-title":"BMC bioinformatics"},{"key":"4573_CR20","doi-asserted-by":"publisher","first-page":"445","DOI":"10.1126\/science.1083653","volume":"300","author":"T Pawson","year":"2003","unstructured":"Pawson T, Nash P: Assembly of cell regulatory systems through protein interaction domains. Science (New York, N.Y.) 2003, 300: 445\u201352. 10.1126\/science.1083653","journal-title":"Science (New York, N.Y.)"},{"key":"4573_CR21","first-page":"218","volume-title":"Bioinformatics","author":"S Martin","year":"2005","unstructured":"Martin S, Roe D, Faulon J-L: Predicting protein-protein interactions using signature products. In Bioinformatics. Volume 21. Oxford, England; 2005:218\u201326. 
10.1093\/bioinformatics\/bth483"},{"key":"4573_CR22","doi-asserted-by":"publisher","first-page":"4337","DOI":"10.1073\/pnas.0607879104","volume":"104","author":"J Shen","year":"2007","unstructured":"Shen J, Zhang J, Luo X, et al.: Predicting protein-protein interactions based only on sequences information. Proceedings of the National Academy of Sciences of the United States of America 2007, 104: 4337\u201341. 10.1073\/pnas.0607879104","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"key":"4573_CR23","doi-asserted-by":"publisher","first-page":"3025","DOI":"10.1093\/nar\/gkn159","volume":"36","author":"Y Guo","year":"2008","unstructured":"Guo Y, Yu L, Wen Z, Li M: Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences. Nucleic acids research 2008, 36: 3025\u201330. 10.1093\/nar\/gkn159","journal-title":"Nucleic acids research"},{"key":"4573_CR24","doi-asserted-by":"publisher","first-page":"365","DOI":"10.1186\/1471-2105-7-365","volume":"7","author":"S Pitre","year":"2006","unstructured":"Pitre S, Dehne F, Chan A, et al.: PIPE: a protein-protein interaction prediction engine based on the re-occurring short polypeptide sequences between known interacting protein pairs. BMC bioinformatics 2006, 7: 365. 10.1186\/1471-2105-7-365","journal-title":"BMC bioinformatics"},{"key":"4573_CR25","doi-asserted-by":"publisher","first-page":"D656","DOI":"10.1093\/nar\/gkm761","volume":"36","author":"B Raghavachari","year":"2008","unstructured":"Raghavachari B, Tasneem A, Przytycka TM, Jothi R: DOMINE: a database of protein domain interactions. 
Nucleic acids research 2008, 36: D656\u201361.","journal-title":"Nucleic acids research"},{"key":"4573_CR26","doi-asserted-by":"publisher","first-page":"D557","DOI":"10.1093\/nar\/gkl961","volume":"35","author":"A Ceol","year":"2007","unstructured":"Ceol A, Chatr-aryamontri A, Santonico E, et al.: DOMINO: a database of domain-peptide interactions. Nucleic acids research 2007, 35: D557\u201360. 10.1093\/nar\/gkl961","journal-title":"Nucleic acids research"},{"key":"4573_CR27","doi-asserted-by":"publisher","first-page":"142","DOI":"10.1093\/molbev\/msh263","volume":"22","author":"D Wang","year":"2005","unstructured":"Wang D, Hsieh M, Li W-H: A general tendency for conservation of protein length across eukaryotic kingdoms. Molecular biology and evolution 2005, 22: 142\u20137.","journal-title":"Molecular biology and evolution"},{"key":"4573_CR28","doi-asserted-by":"publisher","first-page":"2164","DOI":"10.1039\/c0mb00038h","volume":"6","author":"E Pang","year":"2010","unstructured":"Pang E, Lin K: Yeast protein-protein interaction binding sites: prediction from the motif-motif, motif-domain and domain-domain levels. Molecular bioSystems 2010, 6: 2164\u201373. 10.1039\/c0mb00038h","journal-title":"Molecular bioSystems"},{"key":"4573_CR29","doi-asserted-by":"publisher","first-page":"D211","DOI":"10.1093\/nar\/gkp985","volume":"38","author":"RD Finn","year":"2010","unstructured":"Finn RD, Mistry J, Tate J, et al.: The Pfam protein families database. Nucleic acids research 2010, 38: D211\u201322. 10.1093\/nar\/gkp985","journal-title":"Nucleic acids research"},{"key":"4573_CR30","doi-asserted-by":"publisher","first-page":"4286","DOI":"10.1093\/nar\/gkn390","volume":"36","author":"S Pitre","year":"2008","unstructured":"Pitre S, North C, Alamgir M, et al.: Global investigation of protein-protein interactions in yeast Saccharomyces cerevisiae using re-occurring short polypeptide sequences. Nucleic acids research 2008, 36: 4286\u201394. 
10.1093\/nar\/gkn390","journal-title":"Nucleic acids research"},{"key":"4573_CR31","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1155\/ASP\/2006\/35909","volume":"2006","author":"X Jiang","year":"2006","unstructured":"Jiang X, Marti C, Irniger C, Bunke H: Distance Measures for Image Segmentation Evaluation. EURASIP Journal on Advances in Signal Processing 2006, 2006: 1\u201311.","journal-title":"EURASIP Journal on Advances in Signal Processing"},{"key":"4573_CR32","doi-asserted-by":"publisher","first-page":"D211","DOI":"10.1093\/nar\/gkn785","volume":"37","author":"S Hunter","year":"2009","unstructured":"Hunter S, Apweiler R, Attwood TK, et al.: InterPro: the integrative protein signature database. Nucleic acids research 2009, 37: D211\u20135. 10.1093\/nar\/gkn785","journal-title":"Nucleic acids research"},{"key":"4573_CR33","doi-asserted-by":"publisher","first-page":"847","DOI":"10.1093\/bioinformatics\/17.9.847","volume":"17","author":"EM Zdobnov","year":"2001","unstructured":"Zdobnov EM: InterProScan - an integration platform for the signature-recognition methods in InterPro. Bioinformatics 2001, 17: 847\u2013848. 10.1093\/bioinformatics\/17.9.847","journal-title":"Bioinformatics"},{"key":"4573_CR34","doi-asserted-by":"publisher","first-page":"4157","DOI":"10.1093\/nar\/gkg466","volume":"31","author":"A Grigoriev","year":"2003","unstructured":"Grigoriev A: On the number of protein-protein interactions in the yeast proteome. Nucleic Acids Research 2003, 31: 4157\u20134161. 10.1093\/nar\/gkg466","journal-title":"Nucleic Acids Research"},{"key":"4573_CR35","doi-asserted-by":"publisher","first-page":"5699","DOI":"10.1074\/jbc.R100065200","volume":"277","author":"AY Hung","year":"2002","unstructured":"Hung AY, Sheng M: PDZ domains: structural modules for protein complex assembly. The Journal of biological chemistry 2002, 277: 5699\u2013702. 
10.1074\/jbc.R100065200","journal-title":"The Journal of biological chemistry"},{"key":"4573_CR36","doi-asserted-by":"publisher","first-page":"615","DOI":"10.1016\/S0960-9822(00)00134-2","volume":"4","author":"CJ Morton","year":"1994","unstructured":"Morton CJ, Campbell ID: SH3 Domains: Molecular \"Velcro.\". Current Biology 1994, 4: 615\u2013617. 10.1016\/S0960-9822(00)00134-2","journal-title":"Current Biology"},{"key":"4573_CR37","doi-asserted-by":"publisher","first-page":"785","DOI":"10.1038\/sj.emboj.7600982","volume":"25","author":"O Kristensen","year":"2006","unstructured":"Kristensen O, Guenat S, Dar I, et al.: A unique set of SH3-SH3 interactions controls IB1 homodimerization. The EMBO journal 2006, 25: 785\u201397. 10.1038\/sj.emboj.7600982","journal-title":"The EMBO journal"},{"key":"4573_CR38","doi-asserted-by":"publisher","first-page":"509","DOI":"10.1126\/science.279.5350.509","volume":"279","author":"A Hall","year":"1998","unstructured":"Hall A: Rho GTPases and the Actin Cytoskeleton. Science 1998, 279: 509\u2013514. 10.1126\/science.279.5350.509","journal-title":"Science"},{"key":"4573_CR39","doi-asserted-by":"publisher","first-page":"67","DOI":"10.1038\/43025","volume":"387","author":"JM Cherry","year":"1997","unstructured":"Cherry JM, Ball C, Weng S, et al.: Genetic and physical maps of Saccharomyces cerevisiae. Nature 1997, 387: 67\u201373. 10.1038\/387067a0","journal-title":"Nature"},{"key":"4573_CR40","doi-asserted-by":"publisher","first-page":"D229","DOI":"10.1093\/nar\/gkn808","volume":"37","author":"I Letunic","year":"2009","unstructured":"Letunic I, Doerks T, Bork P: SMART 6: recent updates and new developments. Nucleic acids research 2009, 37: D229\u201332. 
10.1093\/nar\/gkn808","journal-title":"Nucleic acids research"},{"key":"4573_CR41","doi-asserted-by":"publisher","first-page":"2129","DOI":"10.1101\/gr.772403","volume":"13","author":"PD Thomas","year":"2003","unstructured":"Thomas PD, Campbell MJ, Kejariwal A, et al.: PANTHER: a library of protein families and subfamilies indexed by function. Genome research 2003, 13: 2129\u201341. 10.1101\/gr.772403","journal-title":"Genome research"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-12-225.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T14:23:54Z","timestamp":1630506234000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-12-225"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,6,2]]},"references-count":41,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2011,12]]}},"alternative-id":["4573"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-12-225","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,6,2]]},"assertion":[{"value":"1 March 2011","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 June 2011","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 June 2011","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"225"}}