{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,6]],"date-time":"2026-05-06T08:43:55Z","timestamp":1778057035087,"version":"3.51.4"},"reference-count":36,"publisher":"Oxford University Press (OUP)","issue":"7","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":3141,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,4,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: \u00a0Recent improvements in high-throughput Mass Spectrometry (MS) technology have expedited genome-wide discovery of protein\u2013protein interactions by providing a capability of detecting protein complexes in a physiological setting. Computational inference of protein interaction networks and protein complexes from MS data are challenging. Advances are required in developing robust and seamlessly integrated procedures for assessment of protein\u2013protein interaction affinities, mathematical representation of protein interaction networks, discovery of protein complexes and evaluation of their biological relevance.<\/jats:p>\n               <jats:p>Results: A multi-step but easy-to-follow framework for identifying protein complexes from MS pull-down data is introduced. It assesses interaction affinity between two proteins based on similarity of their co-purification patterns derived from MS data. It constructs a protein interaction network by adopting a knowledge-guided threshold selection method. Based on the network, it identifies protein complexes and infers their core components using a graph-theoretical approach. It deploys a statistical evaluation procedure to assess biological relevance of each found complex. On Saccharomyces cerevisiae pull-down data, the framework outperformed other more complicated schemes by at least 10% in F1-measure and identified 610 protein complexes with high-functional homogeneity based on the enrichment in Gene Ontology (GO) annotation. Manual examination of the complexes brought forward the hypotheses on cause of false identifications. Namely, co-purification of different protein complexes as mediated by a common non-protein molecule, such as DNA, might be a source of false positives. Protein identification bias in pull-down technology, such as the hydrophilic bias could result in false negatives.<\/jats:p>\n               <jats:p>Contact: \u00a0samatovan@ornl.gov<\/jats:p>\n               <jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn036","type":"journal-article","created":{"date-parts":[[2008,2,28]],"date-time":"2008-02-28T01:14:32Z","timestamp":1204161272000},"page":"979-986","source":"Crossref","is-referenced-by-count":120,"title":["From pull-down data to protein interaction networks and complexes with biological relevance"],"prefix":"10.1093","volume":"24","author":[{"given":"Bing","family":"Zhang","sequence":"first","affiliation":[{"name":"1 Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831 and 2Computer Science Department, North Carolina State University, Raleigh, NC 27695, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Byung-Hoon","family":"Park","sequence":"additional","affiliation":[{"name":"1 Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831 and 2Computer Science Department, North Carolina State University, Raleigh, NC 27695, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tatiana","family":"Karpinets","sequence":"additional","affiliation":[{"name":"1 Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831 and 2Computer Science Department, North Carolina State University, Raleigh, NC 27695, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nagiza F.","family":"Samatova","sequence":"additional","affiliation":[{"name":"1 Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831 and 2Computer Science Department, North Carolina State University, Raleigh, NC 27695, USA"},{"name":"1 Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831 and 2Computer Science Department, North Carolina State University, Raleigh, NC 27695, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2008,2,26]]},"reference":[{"key":"2023020209530341100_B1","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1016\/S0092-8674(00)80922-8","article-title":"The cell as a collection of protein machines: preparing the next generation of molecular biologists","volume":"92","author":"Alberts","year":"1998","journal-title":"Cell"},{"key":"2023020209530341100_B2","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology. The Gene Ontology Consortium","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet"},{"key":"2023020209530341100_B3","doi-asserted-by":"crossref","first-page":"991","DOI":"10.1038\/nbt1002-991","article-title":"Analyzing yeast protein-protein interaction data obtained from different sources","volume":"20","author":"Bader","year":"2002","journal-title":"Nat. Biotechnol"},{"key":"2023020209530341100_B4","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1186\/1471-2105-4-2","article-title":"An automated method for finding molecular complexes in large protein interaction networks","volume":"4","author":"Bader","year":"2003","journal-title":"BMC Bioinformatics"},{"key":"2023020209530341100_B5","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1155\/JBB.2005.172","article-title":"Computational, integrative and comparative methods for the elucidation of genetic co-expression networks","volume":"2","author":"Baldwin","year":"2005","journal-title":"J. Biomed. Biotechnol"},{"key":"2023020209530341100_B6","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1038\/nrg1272","article-title":"Network biology: understanding the cell's functional organization","volume":"5","author":"Barabasi","year":"2004","journal-title":"Nat. Rev. Genet"},{"key":"2023020209530341100_B7","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1038\/nature03239","article-title":"Interaction network containing conserved and essential protein complexes in Escherichia coli","volume":"433","author":"Butland","year":"2005","journal-title":"Nature"},{"key":"2023020209530341100_B8","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1074\/mcp.M600381-MCP200","article-title":"Toward a comprehensive atlas of the physical interactome of Saccharomyces cerevisiae","volume":"6","author":"Collins","year":"2007","journal-title":"Mol. Cell Proteomics"},{"key":"2023020209530341100_B9","doi-asserted-by":"crossref","first-page":"15312","DOI":"10.1074\/jbc.M600890200","article-title":"Vma9p (subunit e) is an integral membrane V0 subunit of the yeast V-ATPase","volume":"281","author":"Compton","year":"2006","journal-title":"J. Biol. Chem"},{"key":"2023020209530341100_B10","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1083\/jcb.139.1.75","article-title":"A septin-based hierarchy of proteins required for localized deposition of chitin in the Saccharomyces cerevisiae cell wall","volume":"139","author":"DeMarini","year":"1997","journal-title":"J. Cell Biol"},{"key":"2023020209530341100_B11","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1016\/S0955-0674(03)00005-X","article-title":"Global approaches to protein-protein interactions","volume":"15","author":"Drewes","year":"2003","journal-title":"Curr. Opin. Cell Biol"},{"key":"2023020209530341100_B12","doi-asserted-by":"crossref","first-page":"1575","DOI":"10.1093\/nar\/30.7.1575","article-title":"An efficient algorithm for large-scale detection of protein families","volume":"30","author":"Enright","year":"2002","journal-title":"Nucl. Acids Res"},{"key":"2023020209530341100_B13","first-page":"M406767200","article-title":"Topological characterization of the c, c\u2032, and c\u2033 subunits of the Vacuolar ATPase from the yeast Saccharomyces cerevisiae","author":"Flannery","year":"2004","journal-title":"J. Biol. Chem"},{"key":"2023020209530341100_B14","doi-asserted-by":"crossref","first-page":"631","DOI":"10.1038\/nature04532","article-title":"Proteome survey reveals modularity of the yeast cell machinery","volume":"440","author":"Gavin","year":"2006","journal-title":"Nature"},{"key":"2023020209530341100_B15","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1038\/415141a","article-title":"Functional organization of the yeast proteome by systematic analysis of protein complexes","volume":"415","author":"Gavin","year":"2002","journal-title":"Nature"},{"key":"2023020209530341100_B16","doi-asserted-by":"crossref","first-page":"689","DOI":"10.1093\/bioinformatics\/btg469","article-title":"A statistical framework for combining and interpreting proteomic datasets","volume":"20","author":"Gilchrist","year":"2004","journal-title":"Bioinformatics"},{"key":"2023020209530341100_B17","doi-asserted-by":"crossref","first-page":"4372","DOI":"10.1073\/pnas.0735871100","article-title":"Assessing experimentally derived interactions in a small world","volume":"100","author":"Goldberg","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020209530341100_B18","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1023\/A:1025772730586","article-title":"Structure and assembly of the yeast V-ATPase","volume":"35","author":"Graham","year":"2003","journal-title":"J. Bioenerg. Biomembr"},{"key":"2023020209530341100_B19","doi-asserted-by":"crossref","first-page":"C47","DOI":"10.1038\/35011540","article-title":"From molecular to modular cell biology","volume":"402","author":"Hartwell","year":"1999","journal-title":"Nature"},{"key":"2023020209530341100_B20","doi-asserted-by":"crossref","first-page":"637","DOI":"10.1038\/nature04670","article-title":"Global landscape of protein complexes in the yeast Saccharomyces cerevisiae","volume":"440","author":"Krogan","year":"2006","journal-title":"Nature"},{"key":"2023020209530341100_B21","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1016\/0022-2836(82)90515-0","article-title":"A simple method for displaying the hydropathic character of a protein","volume":"157","author":"Kyte","year":"1982","journal-title":"J. Mol. Biol"},{"key":"2023020209530341100_B22","doi-asserted-by":"crossref","first-page":"4193","DOI":"10.1021\/ac0498563","article-title":"A model for random sampling and estimation of relative protein abundance in shotgun proteomics","volume":"76","author":"Liu","year":"2004","journal-title":"Anal. Chem"},{"key":"2023020209530341100_B23","doi-asserted-by":"crossref","first-page":"1275","DOI":"10.1093\/bioinformatics\/btg153","article-title":"Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation","volume":"19","author":"Lord","year":"2003","journal-title":"Bioinformatics"},{"key":"2023020209530341100_B24","doi-asserted-by":"crossref","first-page":"459","DOI":"10.1261\/rna.7231505","article-title":"Decay of mRNAs targeted by RISC requires XRN1, the Ski complex, and the exosome","volume":"11","author":"Orban","year":"2005","journal-title":"RNA"},{"key":"2023020209530341100_B25","doi-asserted-by":"crossref","first-page":"814","DOI":"10.1038\/nature03607","article-title":"Uncovering the overlapping community structure of complex networks in nature and society","volume":"435","author":"Palla","year":"2005","journal-title":"Nature"},{"key":"2023020209530341100_B26","doi-asserted-by":"crossref","first-page":"012061","DOI":"10.1088\/1742-6596\/78\/1\/012061","article-title":"Data-driven, data-intensive computing for modelling and analysis of biological networks: application to bioethanol production","volume":"78","author":"Park","year":"2007","journal-title":"J. Phys. Conf. Ser"},{"key":"2023020209530341100_B27","doi-asserted-by":"crossref","first-page":"R93","DOI":"10.1186\/gb-2004-5-11-r93","article-title":"How biologically relevant are interaction-based modules in protein networks?","volume":"5","author":"Poyatos","year":"2004","journal-title":"Genome Biol"},{"key":"2023020209530341100_B28","doi-asserted-by":"crossref","first-page":"6826","DOI":"10.1128\/MCB.18.11.6826","article-title":"Nuclear mRNA export requires complex formation between Mex67p and Mtr2p at the nuclear pores","volume":"18","author":"Santos-Rosa","year":"1998","journal-title":"Mol. Cell Biol"},{"key":"2023020209530341100_B29","doi-asserted-by":"crossref","first-page":"3548","DOI":"10.1093\/bioinformatics\/bti567","article-title":"Local modeling of global interactome networks","volume":"21","author":"Scholtens","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020209530341100_B30","doi-asserted-by":"crossref","first-page":"12123","DOI":"10.1073\/pnas.2032324100","article-title":"Protein complexes and functional modules in molecular networks","volume":"100","author":"Spirin","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020209530341100_B31","doi-asserted-by":"crossref","first-page":"823","DOI":"10.1093\/bioinformatics\/btl014","article-title":"Predicting interactions in protein networks by completing defective cliques","volume":"22","author":"Yu","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020209530341100_B32","doi-asserted-by":"crossref","first-page":"W741","DOI":"10.1093\/nar\/gki475","article-title":"WebGestalt: an integrated system for exploring gene sets in various biological contexts","volume":"33","author":"Zhang","year":"2005","journal-title":"Nucl. Acids Res"},{"key":"2023020209530341100_B33","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1186\/1471-2105-5-16","article-title":"GOTree Machine (GOTM): a web-based platform for interpreting sets of interesting genes using gene ontology hierarchies","volume":"5","author":"Zhang","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023020209530341100_B34","doi-asserted-by":"crossref","first-page":"2909","DOI":"10.1021\/pr0600273","article-title":"Detecting differential and correlated protein expression in label-free shotgun proteomics","volume":"5","author":"Zhang","year":"2006","journal-title":"J. Proteome Res"},{"key":"2023020209530341100_B35","doi-asserted-by":"crossref","first-page":"801","DOI":"10.1021\/pr050366g","article-title":"Fast and accurate method for identifying high-quality protein-interaction modules by clique merging and its application to yeast","volume":"5","author":"Zhang","year":"2006","journal-title":"J. Proteome Res"},{"key":"2023020209530341100_B36","article-title":"Genome-scale computational approaches to memory-intensive applications in systems biology","author":"Zhang","year":"2005"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/7\/979\/49046240\/bioinformatics_24_7_979.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/7\/979\/49046240\/bioinformatics_24_7_979.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T10:50:06Z","timestamp":1675335006000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/7\/979\/296061"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,2,26]]},"references-count":36,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2008,4,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn036","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2008,4,1]]},"published":{"date-parts":[[2008,2,26]]}}}