{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,9]],"date-time":"2026-01-09T12:44:32Z","timestamp":1767962672219,"version":"3.49.0"},"reference-count":21,"publisher":"Springer Science and Business Media LLC","issue":"S10","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2011,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Protein-protein interactions (PPIs) are a critical component for many underlying biological processes. A PPI network can provide insight into the mechanisms of these processes, as well as the relationships among different proteins and toxicants that are potentially involved in the processes. There are many PPI databases publicly available, each with a specific focus. The challenge is how to effectively combine their contents to generate a robust and biologically relevant PPI network.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Methods<\/jats:title>\n            <jats:p>In this study, seven public PPI databases, BioGRID, DIP, HPRD, IntAct, MINT, REACTOME, and SPIKE, were used to explore a powerful approach to combine multiple PPI databases for an integrated PPI network. We developed a novel method called <jats:italic>k<\/jats:italic>-votes to create seven different integrated networks by using values of <jats:italic>k<\/jats:italic> ranging from 1-7. Functional modules were mined by using SCAN, a Structural Clustering Algorithm for Networks. Overall module qualities were evaluated for each integrated network using the following statistical and biological measures: (1) modularity, (2) similarity-based modularity, (3) clustering score, and (4) enrichment.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>Each integrated human PPI network was constructed based on the number of votes (<jats:italic>k<\/jats:italic>) for a particular interaction from the committee of the original seven PPI databases. The performance of functional modules obtained by SCAN from each integrated network was evaluated. The optimal value for <jats:italic>k<\/jats:italic> was determined by the functional module analysis. Our results demonstrate that the <jats:italic>k<\/jats:italic>-votes method outperforms the traditional union approach in terms of both statistical significance and biological meaning. The best network is achieved at <jats:italic>k<\/jats:italic>=2, which is composed of interactions that are confirmed in at least two PPI databases. In contrast, the traditional union approach yields an integrated network that consists of all interactions of seven PPI databases, which might be subject to high false positives.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>We determined that the k-votes method for constructing a robust PPI network by integrating multiple public databases outperforms previously reported approaches and that a value of k=2 provides the best results. The developed strategies for combining databases show promise in the advancement of network construction and modeling.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-12-s10-s7","type":"journal-article","created":{"date-parts":[[2011,10,20]],"date-time":"2011-10-20T06:39:22Z","timestamp":1319092762000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":23,"title":["Constructing a robust protein-protein interaction network by integrating multiple public databases"],"prefix":"10.1186","volume":"12","author":[{"given":"Venkata-Swamy","family":"Martha","sequence":"first","affiliation":[]},{"given":"Zhichao","family":"Liu","sequence":"additional","affiliation":[]},{"given":"Li","family":"Guo","sequence":"additional","affiliation":[]},{"given":"Zhenqiang","family":"Su","sequence":"additional","affiliation":[]},{"given":"Yanbin","family":"Ye","sequence":"additional","affiliation":[]},{"given":"Hong","family":"Fang","sequence":"additional","affiliation":[]},{"given":"Don","family":"Ding","sequence":"additional","affiliation":[]},{"given":"Weida","family":"Tong","sequence":"additional","affiliation":[]},{"given":"Xiaowei","family":"Xu","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2011,10,18]]},"reference":[{"issue":"7325","key":"4847_CR1","doi-asserted-by":"publisher","first-page":"851","DOI":"10.1038\/468851a","volume":"468","author":"L Bonetta","year":"2010","unstructured":"Bonetta L: Interactome under construction. Nature 2010, 468(7325):851\u2013854. 10.1038\/468851a","journal-title":"Nature"},{"issue":"7307","key":"4847_CR2","doi-asserted-by":"publisher","first-page":"761","DOI":"10.1038\/nature09182","volume":"466","author":"YY Ahn","year":"2010","unstructured":"Ahn YY, Bagrow JP, Lehmann S: Link communities reveal multiscale complexity in networks. Nature 2010, 466(7307):761-U711. 10.1038\/nature09182","journal-title":"Nature"},{"key":"4847_CR3","doi-asserted-by":"publisher","first-page":"D685","DOI":"10.1093\/nar\/gkq1039","volume":"39","author":"EG Cerami","year":"2011","unstructured":"Cerami EG, Gross BE, Demir E, Rodchenkov I, Babur O, Anwar N, Schultz N, Bader GD, Sander C: Pathway Commons, a web resource for biological pathway data. Nucleic Acids Research 2011, 39: D685-D690. 10.1093\/nar\/gkq1039","journal-title":"Nucleic Acids Research"},{"key":"4847_CR4","doi-asserted-by":"publisher","first-page":"D712","DOI":"10.1093\/nar\/gkq1156","volume":"39","author":"A Kamburov","year":"2011","unstructured":"Kamburov A, Pentchev K, Galicka H, Wierling C, Lehrach H, Herwig R: ConsensusPathDB: toward a more complete picture of cell biology. Nucleic Acids Research 2011, 39: D712-D717. 10.1093\/nar\/gkq1156","journal-title":"Nucleic Acids Research"},{"key":"4847_CR5","doi-asserted-by":"publisher","first-page":"824","DOI":"10.1145\/1281192.1281280","volume-title":"In Proceedings of the 13th ACM SIGKDD international conference on Knowledge Discovery and Data Mining","author":"X Xu","year":"2007","unstructured":"Xu X, Yuruk N, Feng Z, Schweiger T: SCAN: a structural clustering algorithm for networks. In In Proceedings of the 13th ACM SIGKDD international conference on Knowledge Discovery and Data Mining. San Jose, California, USA; 2007:824\u2013833."},{"key":"4847_CR6","doi-asserted-by":"publisher","first-page":"D535","DOI":"10.1093\/nar\/gkj109","volume":"34","author":"C Stark","year":"2006","unstructured":"Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M: BioGRID: a general repository for interaction datasets. Nucleic Acids Research 2006, 34: D535-D539. 10.1093\/nar\/gkj109","journal-title":"Nucleic Acids Research"},{"issue":"1","key":"4847_CR7","doi-asserted-by":"publisher","first-page":"289","DOI":"10.1093\/nar\/28.1.289","volume":"28","author":"I Xenarios","year":"2000","unstructured":"Xenarios I, Rice DW, Salwinski L, Baron MK, Marcotte EM, Eisenberg D: DIP: the Database of Interacting Proteins. Nucleic Acids Research 2000, 28(1):289\u2013291. 10.1093\/nar\/28.1.289","journal-title":"Nucleic Acids Research"},{"key":"4847_CR8","doi-asserted-by":"publisher","first-page":"D767","DOI":"10.1093\/nar\/gkn892","volume":"37","author":"TSK Prasad","year":"2009","unstructured":"Prasad TSK, Goel R, Kandasamy K, Keerthikumar S, Kumar S, Mathivanan S, Telikicherla D, Raju R, Shafreen B, Venugopal A, et al.: Human Protein Reference Database-2009 update. Nucleic Acids Research 2009, 37: D767-D772. 10.1093\/nar\/gkn892","journal-title":"Nucleic Acids Research"},{"key":"4847_CR9","doi-asserted-by":"publisher","first-page":"D525","DOI":"10.1093\/nar\/gkp878","volume":"38","author":"B Aranda","year":"2010","unstructured":"Aranda B, Achuthan P, Alam-Faruque Y, Armean I, Bridge A, Derow C, Feuermann M, Ghanbarian AT, Kerrien S, Khadake J, et al.: The IntAct molecular interaction database in 2010. Nucleic Acids Research 2010, 38: D525-D531. 10.1093\/nar\/gkp878","journal-title":"Nucleic Acids Research"},{"key":"4847_CR10","doi-asserted-by":"publisher","first-page":"D532","DOI":"10.1093\/nar\/gkp983","volume":"38","author":"A Ceol","year":"2010","unstructured":"Ceol A, Aryamontri AC, Licata L, Peluso D, Briganti L, Perfetto L, Castagnoli L, Cesareni G: MINT, the molecular interaction database: 2009 update. Nucleic Acids Research 2010, 38: D532-D539. 10.1093\/nar\/gkp983","journal-title":"Nucleic Acids Research"},{"key":"4847_CR11","doi-asserted-by":"publisher","first-page":"D619","DOI":"10.1093\/nar\/gkn863","volume":"37","author":"L Matthews","year":"2009","unstructured":"Matthews L, Gopinath G, Gillespie M, Caudy M, Croft D, de Bono B, Garapati P, Hemish J, Hermjakob H, Jassal B, et al.: Reactome knowledgebase of human biological pathways and processes. Nucleic Acids Research 2009, 37: D619-D622. 10.1093\/nar\/gkn863","journal-title":"Nucleic Acids Research"},{"key":"4847_CR12","doi-asserted-by":"publisher","first-page":"D793","DOI":"10.1093\/nar\/gkq1167","volume":"39","author":"A Paz","year":"2011","unstructured":"Paz A, Brownstein Z, Ber Y, Bialik S, David E, Sagir D, Ulitsky I, Elkon R, Kimchi A, Avraham KB, et al.: SPIKE: a database of highly curated human signaling pathways. Nucleic Acids Research 2011, 39: D793-D799. 10.1093\/nar\/gkq1167","journal-title":"Nucleic Acids Research"},{"key":"4847_CR13","volume-title":"Set Theory: Third Millennium Edition","author":"T Jech","year":"2003","unstructured":"Jech T: Set Theory: Third Millennium Edition. Berlin, New York: Springer-Verlag: Springer Monographs in Mathematics; 2003."},{"key":"4847_CR14","first-page":"358","volume":"64","author":"Z Feng","year":"2007","unstructured":"Feng Z, Xu X, Yuruk N, Schweiger T: A novel similarity-based modularity function for graph partitioning. Lect Notes Comp Sci 2007, 64: 358\u2013396.","journal-title":"Lect Notes Comp Sci"},{"issue":"2","key":"4847_CR15","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1103\/PhysRevE.69.026113","volume":"69","author":"MEJ Newman","year":"2004","unstructured":"Newman MEJ, Girvan M: Finding and evaluating community structure in networks. Physical Review E 2004, 69(2):15.","journal-title":"Physical Review E"},{"issue":"1","key":"4847_CR16","doi-asserted-by":"publisher","first-page":"36","DOI":"10.1073\/pnas.0605965104","volume":"104","author":"S Fortunato","year":"2007","unstructured":"Fortunato S, Barthelemy M: Resolution limit in community detection. Proceedings of the National Academy of Sciences of the United States of America 2007, 104(1):36\u201341. 10.1073\/pnas.0605965104","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"issue":"9","key":"4847_CR17","doi-asserted-by":"publisher","first-page":"2443","DOI":"10.1093\/nar\/gkg340","volume":"31","author":"DB Bu","year":"2003","unstructured":"Bu DB, Zhao Y, Cai L, Xue H, Zhu XP, Lu HC, Zhang JF, Sun SW, Ling LJ, Zhang N, et al.: Topological structure analysis of the protein-protein interaction network in budding yeast. Nucleic Acids Research 2003, 31(9):2443\u20132450. 10.1093\/nar\/gkg340","journal-title":"Nucleic Acids Research"},{"key":"4847_CR18","first-page":"i29","volume-title":"An ensemble framework for clustering protein-protein interaction networks","author":"S Asur","year":"2007","unstructured":"Asur S, Ucar D, Parthasarathy S: An ensemble framework for clustering protein-protein interaction networks. 2007, 23(13):i29-i40."},{"key":"4847_CR19","volume-title":"Bmc Bioinformatics","author":"M Mete","year":"2008","unstructured":"Mete M, Tang FS, Xu X, Yuruk N: A structural approach for finding functional modules from large biological networks. Bmc Bioinformatics 2008., 9:"},{"key":"4847_CR20","volume-title":"Molecular Systems Biology","author":"HY Chuang","year":"2007","unstructured":"Chuang HY, Lee E, Liu YT, Lee D, Ideker T: Network-based classification of breast cancer metastasis. Molecular Systems Biology 2007., 3:"},{"issue":"6","key":"4847_CR21","first-page":"958","volume":"3","author":"M Liu","year":"2007","unstructured":"Liu M, Liberzon A, Kong SW, Lai WR, Park PJ, Kohane IS, Kasif S: Network-based analysis of affected biological processes in type 2 diabetes models. Plos Genetics 2007, 3(6):958\u2013972.","journal-title":"Plos Genetics"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-12-S10-S7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T16:39:04Z","timestamp":1630514344000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-12-S10-S7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,10,18]]},"references-count":21,"journal-issue":{"issue":"S10","published-print":{"date-parts":[[2011,12]]}},"alternative-id":["4847"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-12-s10-s7","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,10,18]]},"assertion":[{"value":"18 October 2011","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"S7"}}