{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,21]],"date-time":"2026-03-21T00:23:47Z","timestamp":1774052627902,"version":"3.50.1"},"reference-count":30,"publisher":"Oxford University Press (OUP)","issue":"14","license":[{"start":{"date-parts":[[2016,10,1]],"date-time":"2016-10-01T00:00:00Z","timestamp":1475280000000},"content-version":"vor","delay-in-days":2319,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,7,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Many algorithms that integrate multiple functional association networks for predicting gene function construct a composite network as a weighted sum of the individual networks and then use the composite network to predict gene function. The weight assigned to an individual network represents the usefulness of that network in predicting a given gene function. However, because many categories of gene function have a small number of annotations, the process of assigning these network weights is prone to overfitting.<\/jats:p><jats:p>Results: Here, we address this problem by proposing a novel approach to combining multiple functional association networks. In particular, we present a method where network weights are simultaneously optimized on sets of related function categories. The method is simpler and faster than existing approaches. Further, we show that it produces composite networks with improved function prediction accuracy using five example species (yeast, mouse, fly, Esherichia coli and human).<\/jats:p><jats:p>Availability: Networks and code are available from: http:\/\/morrislab.med.utoronto.ca\/\u02dcsara\/SW<\/jats:p><jats:p>Contact: \u00a0smostafavi@cs.toronto.edu; quaid.morris@utoronto.ca<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq262","type":"journal-article","created":{"date-parts":[[2010,5,28]],"date-time":"2010-05-28T00:46:57Z","timestamp":1275007617000},"page":"1759-1765","source":"Crossref","is-referenced-by-count":115,"title":["Fast integration of heterogeneous data sources for predicting gene function with limited annotation"],"prefix":"10.1093","volume":"26","author":[{"given":"Sara","family":"Mostafavi","sequence":"first","affiliation":[{"name":"1 Department of Computer Science and 2 Center for Cellular and Biomolecular Research, University of Toronto, Canada"},{"name":"1 Department of Computer Science and 2 Center for Cellular and Biomolecular Research, University of Toronto, Canada"}]},{"given":"Quaid","family":"Morris","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and 2 Center for Cellular and Biomolecular Research, University of Toronto, Canada"},{"name":"1 Department of Computer Science and 2 Center for Cellular and Biomolecular Research, University of Toronto, Canada"}]}],"member":"286","published-online":{"date-parts":[[2010,5,27]]},"reference":[{"key":"2023012507575580500_B1","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for unification of biology","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet."},{"key":"2023012507575580500_B2","doi-asserted-by":"crossref","first-page":"304","DOI":"10.1093\/nar\/28.1.304","article-title":"The enzyme database in 2000","volume":"28","author":"Bairoch","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023012507575580500_B3","first-page":"367","article-title":"On kernel target alignment","volume-title":"Proceedings of the Fourteen Conference on Advances in Neural Information Processing Systems","author":"Cristianini","year":"2002"},{"key":"2023012507575580500_B4","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1093\/nar\/30.1.207","article-title":"Gene expression omnibus: NCBI gene expression and hybridization array data repository","volume":"30","author":"Edgar","year":"2002","journal-title":"Nucleic Acids Res."},{"key":"2023012507575580500_B5","doi-asserted-by":"crossref","first-page":"407","DOI":"10.1214\/009053604000000067","article-title":"Least angle regression","volume":"32","author":"Efron","year":"2004","journal-title":"Ann. Stat."},{"key":"2023012507575580500_B6","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-21606-5","volume-title":"The Elements of Statistical Learning.","author":"Hastie","year":"2001"},{"key":"2023012507575580500_B7","doi-asserted-by":"crossref","first-page":"e96","DOI":"10.1371\/journal.pbio.1000096","article-title":"Global functional atlas of escherichia coli encompassing previously uncharacterized proteins","volume":"7","author":"Hu","year":"2009","journal-title":"PLoS Biol."},{"key":"2023012507575580500_B8","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1093\/nar\/28.1.27","article-title":"KEGG: Kyoto encyclopedia of genes and genome","volume":"28","author":"Kanehisa","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023012507575580500_B9","doi-asserted-by":"crossref","first-page":"2888","DOI":"10.1073\/pnas.0307326101","article-title":"Whole-genome annotation by using evidence integration in functional-linkage networks","volume":"101","author":"Karaoz","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012507575580500_B10","doi-asserted-by":"crossref","first-page":"160","DOI":"10.1101\/gr.1645104","article-title":"Ensmart: a generic system for fast and flexible access to biological data","volume":"14","author":"Kasprzyk","year":"2004","journal-title":"Genome Res."},{"key":"2023012507575580500_B11","first-page":"463","article-title":"Diffusion kernels on graphs and other discrete structures","volume":"11","author":"Kondor","year":"2002","journal-title":"Int. Conf. Mach. Learn. (ICML)"},{"key":"2023012507575580500_B12","doi-asserted-by":"crossref","first-page":"2626","DOI":"10.1093\/bioinformatics\/bth294","article-title":"A statistical framework for genomic data fusion","volume":"20","author":"Lanckriet","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012507575580500_B13","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1038\/47048","article-title":"A combined algorithm for genome-wide prediction of protein function","volume":"42","author":"Marcotte","year":"1999","journal-title":"Nature"},{"key":"2023012507575580500_B14","article-title":"Using the gene ontology hierarchy when predicting gene function","volume-title":"Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence.","author":"Mostafavi","year":"2009"},{"issue":"Suppl. 1","key":"2023012507575580500_B15","doi-asserted-by":"crossref","first-page":"S4","DOI":"10.1186\/gb-2008-9-s1-s4","article-title":"Genemania: a real-time multiple association network integration algorithm for predicting gene function","volume":"9","author":"Mostafavi","year":"2008","journal-title":"Genome Biol."},{"key":"2023012507575580500_B16","doi-asserted-by":"crossref","first-page":"2322","DOI":"10.1093\/bioinformatics\/btm332","article-title":"Context-sensitive data integration and prediction of biological networks","volume":"23","author":"Myers","year":"2007","journal-title":"Bioinformatics"},{"issue":"Suppl. 1","key":"2023012507575580500_B17","article-title":"Whole-proteome prediction of protein function via graph-theoretic analysis of interaction maps","volume":"2","author":"Nabieva","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012507575580500_B18","doi-asserted-by":"crossref","DOI":"10.1002\/9783527619368.ch35","article-title":"Integrating Information for Protein Function Prediction","volume-title":"Bioinformatics-From Genomes to Therapies.","author":"Noble","year":"2007"},{"key":"2023012507575580500_B19","volume-title":"Numerical Optimization.","author":"Nocedal","year":"2006"},{"key":"2023012507575580500_B20","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1089\/10665270252935539","article-title":"Learning gene functional classification from multiple data types","volume":"9","author":"Pavlidis","year":"2002","journal-title":"J. Comput. Biol."},{"issue":"Suppl. 1","key":"2023012507575580500_B21","doi-asserted-by":"crossref","first-page":"S2","DOI":"10.1186\/gb-2008-9-s1-s2","article-title":"A critical assessment of Mus musculus gene function prediction using integrated genomic evidence","volume":"9","author":"Pena-Castillo","year":"2008","journal-title":"Genome Biol."},{"key":"2023012507575580500_B22","doi-asserted-by":"crossref","first-page":"D411","DOI":"10.1093\/nar\/gkj141","article-title":"Human protein reference database \u2013 2006 update","volume":"34","author":"Prasad","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012507575580500_B23","doi-asserted-by":"crossref","first-page":"1991","DOI":"10.1101\/gr.077693.108","article-title":"Finding friends and enemies in an enemies-only network: a graph diffusion kernel for predicting novel genetic interactions and co-complex membership from yeast genetic intearctions","volume":"18","author":"Qi","year":"2008","journal-title":"Genome Res."},{"key":"2023012507575580500_B24","first-page":"D539","article-title":"BioGRID: a general repository for interaction datasets","volume":"1","author":"Stark","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012507575580500_B25","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the lasso","volume":"58","author":"Tibshirani","year":"1996","journal-title":"J. R. Stat. Soc. B."},{"issue":"Suppl. 2","key":"2023012507575580500_B26","doi-asserted-by":"crossref","first-page":"ii59","DOI":"10.1093\/bioinformatics\/bti1110","article-title":"Fast protein classification with multiple networks","volume":"21","author":"Tsuda","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012507575580500_B27","doi-asserted-by":"crossref","first-page":"697","DOI":"10.1038\/nbt825","article-title":"Global protein function prediction from protein-protein interaction networks","volume":"21","author":"Vazquez","year":"2003","journal-title":"Nat. Biotechnol."},{"key":"2023012507575580500_B28","first-page":"321","article-title":"Learning with local and global consistency","volume":"16","author":"Zhou","year":"2004","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"2023012507575580500_B29","article-title":"Semi-supervised learning using Gaussian fields and harmonic functions","volume-title":"Proceedings of the Twentieth International Conference on Machine Learning","author":"Zhu","year":"2003"},{"key":"2023012507575580500_B30","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1111\/j.1467-9868.2005.00503.x","article-title":"Regularization and variable selection via the elastic net","volume":"67","author":"Zou","year":"2005","journal-title":"J. R. Stat. Soc. B."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/14\/1759\/48852190\/bioinformatics_26_14_1759.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/14\/1759\/48852190\/bioinformatics_26_14_1759.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T08:46:19Z","timestamp":1740127579000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/14\/1759\/177586"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,5,27]]},"references-count":30,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2010,7,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq262","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,7,15]]},"published":{"date-parts":[[2010,5,27]]}}}