{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,12]],"date-time":"2026-03-12T00:24:07Z","timestamp":1773275047007,"version":"3.50.1"},"reference-count":49,"publisher":"Oxford University Press (OUP)","issue":"11","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2013,6,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Compared with sequence and structure similarity, functional similarity is more informative for understanding the biological roles and functions of genes. Many important applications in computational molecular biology require functional similarity, such as gene clustering, protein function prediction, protein interaction evaluation and disease gene prioritization. Gene Ontology (GO) is now widely used as the basis for measuring gene functional similarity. Some existing methods combined semantic similarity scores of single term pairs to estimate gene functional similarity, whereas others compared terms in groups to measure it. However, these methods may make error-prone judgments about gene functional similarity. It remains a challenge that measuring gene functional similarity reliably.<\/jats:p>\n               <jats:p>Result: We propose a novel method called SORA to measure gene functional similarity in GO context. First of all, SORA computes the information content (IC) of a term making use of semantic specificity and coverage. Second, SORA measures the IC of a term set by means of combining inherited and extended IC of the terms based on the structure of GO. Finally, SORA estimates gene functional similarity using the IC overlap ratio of term sets. SORA is evaluated against five state-of-the-art methods in the file on the public platform for collaborative evaluation of GO-based semantic similarity measure. The carefully comparisons show SORA is superior to other methods in general. Further analysis suggests that it primarily benefits from the structure of GO, which implies expressive information about gene function. SORA offers an effective and reliable way to compare gene function.<\/jats:p>\n               <jats:p>Availability: The web service of SORA is freely available at http:\/\/nclab.hit.edu.cn\/SORA\/.<\/jats:p>\n               <jats:p>Contact: \u00a0maozuguo@hit.edu.cn<\/jats:p>","DOI":"10.1093\/bioinformatics\/btt160","type":"journal-article","created":{"date-parts":[[2013,4,10]],"date-time":"2013-04-10T02:22:46Z","timestamp":1365560566000},"page":"1424-1432","source":"Crossref","is-referenced-by-count":80,"title":["Measuring gene functional similarity based on group-wise comparison of GO terms"],"prefix":"10.1093","volume":"29","author":[{"given":"Zhixia","family":"Teng","sequence":"first","affiliation":[{"name":"1 Department of Computer Science and Engineering, Harbin Institute of Technology, Harbin 150001, P.R. China, 2Department of Information Management and Information System, Northeast Forestry University, Harbin 150040, P.R. China and 3Department of Computer Science and Technology, Heilongjiang University, Harbin 150080, P.R. China"},{"name":"1 Department of Computer Science and Engineering, Harbin Institute of Technology, Harbin 150001, P.R. China, 2Department of Information Management and Information System, Northeast Forestry University, Harbin 150040, P.R. China and 3Department of Computer Science and Technology, Heilongjiang University, Harbin 150080, P.R. China"}]},{"given":"Maozu","family":"Guo","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and Engineering, Harbin Institute of Technology, Harbin 150001, P.R. China, 2Department of Information Management and Information System, Northeast Forestry University, Harbin 150040, P.R. China and 3Department of Computer Science and Technology, Heilongjiang University, Harbin 150080, P.R. China"}]},{"given":"Xiaoyan","family":"Liu","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and Engineering, Harbin Institute of Technology, Harbin 150001, P.R. China, 2Department of Information Management and Information System, Northeast Forestry University, Harbin 150040, P.R. China and 3Department of Computer Science and Technology, Heilongjiang University, Harbin 150080, P.R. China"}]},{"given":"Qiguo","family":"Dai","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and Engineering, Harbin Institute of Technology, Harbin 150001, P.R. China, 2Department of Information Management and Information System, Northeast Forestry University, Harbin 150040, P.R. China and 3Department of Computer Science and Technology, Heilongjiang University, Harbin 150080, P.R. China"}]},{"given":"Chunyu","family":"Wang","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and Engineering, Harbin Institute of Technology, Harbin 150001, P.R. China, 2Department of Information Management and Information System, Northeast Forestry University, Harbin 150040, P.R. China and 3Department of Computer Science and Technology, Heilongjiang University, Harbin 150080, P.R. China"}]},{"given":"Ping","family":"Xuan","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and Engineering, Harbin Institute of Technology, Harbin 150001, P.R. China, 2Department of Information Management and Information System, Northeast Forestry University, Harbin 150040, P.R. China and 3Department of Computer Science and Technology, Heilongjiang University, Harbin 150080, P.R. China"},{"name":"1 Department of Computer Science and Engineering, Harbin Institute of Technology, Harbin 150001, P.R. China, 2Department of Information Management and Information System, Northeast Forestry University, Harbin 150040, P.R. China and 3Department of Computer Science and Technology, Heilongjiang University, Harbin 150080, P.R. China"}]}],"member":"286","published-online":{"date-parts":[[2013,4,9]]},"reference":[{"key":"2023062610153390000_btt160-B1","doi-asserted-by":"crossref","first-page":"681","DOI":"10.1142\/S0219720011005641","article-title":"A graph-based semantic similarity measure for the gene ontology","volume":"9","author":"Alvarez","year":"2011","journal-title":"J. Bioinform. Comput. Biol."},{"key":"2023062610153390000_btt160-B2","first-page":"9","article-title":"Ontology-driven similarity approaches to supporting gene functional assessment","volume-title":"Proceeding of the ISMB 2005 SIG Meeting on Bio-ontologies","author":"Azuaje","year":"2005"},{"key":"2023062610153390000_btt160-B3","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1016\/j.jbi.2010.09.002","article-title":"An ontology-based measure to compute semantic similarity in biomedicine","volume":"44","author":"Batet","year":"2011","journal-title":"J. Biomed. Inform."},{"key":"2023062610153390000_btt160-B4","doi-asserted-by":"crossref","first-page":"160","DOI":"10.1016\/j.jbi.2006.05.001","article-title":"Co-clustering and visualization of gene expression data and gene ontology terms for Saccharomyces cerevisiae using self-organizing maps","volume":"40","author":"Brameier","year":"2007","journal-title":"J. Biomed. Inform."},{"key":"2023062610153390000_btt160-B5","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1186\/1471-2105-8-235","article-title":"A transversal approach to predict gene product networks from ontology-based similarity","volume":"8","author":"Chabalier","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023062610153390000_btt160-B6","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1186\/1471-2105-10-73","article-title":"Disease candidate gene identification and prioritization using protein interaction networks","volume":"10","author":"Chen","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023062610153390000_btt160-B51","doi-asserted-by":"crossref","first-page":"131","DOI":"10.1016\/j.gene.2012.07.078","article-title":"A sensitive method for computing GO-based functional similarities among genes with \u2018shallow annotation'","volume":"509","author":"Chen","year":"2012","journal-title":"Gene"},{"key":"2023062610153390000_btt160-B7","doi-asserted-by":"crossref","first-page":"6414","DOI":"10.1093\/nar\/gkh978","article-title":"Genome-scale protein function prediction in yeast Saccha-romyces cerevisiae through integrating multiple sources of high throughput data","volume":"32","author":"Chen","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023062610153390000_btt160-B8","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1186\/1471-2105-8-265","article-title":"Semantic integration to identify overlapping functional modules in protein interaction networks","volume":"8","author":"Cho","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023062610153390000_btt160-B9","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1504\/IJDMB.2009.026705","article-title":"Semantic similarity based feature extraction from microarray expression data","volume":"3","author":"Cho","year":"2009","journal-title":"Int. J. Data Min. Bioinform."},{"key":"2023062610153390000_btt160-B10","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1145\/1099554.1099658","article-title":"Semantic similarity over the gene ontology: Family correla-tion and selecting disjunctive ancestors","volume-title":"Proceedings of the 14th ACM International Conference on Information and knowledge Management","author":"Couto","year":"2005"},{"key":"2023062610153390000_btt160-B11","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1186\/2041-1480-2-5","article-title":"Disjunctive shared information between ontology concepts: application to Gene Ontology","volume":"2","author":"Couto","year":"2011","journal-title":"J. Biomed. Semantics"},{"key":"2023062610153390000_btt160-B12","doi-asserted-by":"crossref","first-page":"170","DOI":"10.1007\/0-387-29362-0","article-title":"Visualizing distances","volume-title":"Bioinformatics and computational biology solutions using R and Bioconductor","author":"Gentleman","year":"2005"},{"key":"2023062610153390000_btt160-B13","doi-asserted-by":"crossref","first-page":"D258","DOI":"10.1093\/nar\/gkh036","article-title":"The Gene Ontology (GO) database and informatics resource","volume":"32","author":"GO-Consortium","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023062610153390000_btt160-B14","first-page":"569","article-title":"Semantic similarity analysis of protein data: assessment with biological features and issues","volume":"5","author":"Guzzi","year":"2011","journal-title":"Brief. Bioinform."},{"key":"2023062610153390000_btt160-B15","doi-asserted-by":"crossref","first-page":"R183","DOI":"10.1186\/gb-2007-8-9-r183","article-title":"David gene functional classification tool: A novel biological module centric algorithm to functionally analyze large gene list","volume":"8","author":"Huang","year":"2007","journal-title":"Genome Biol."},{"key":"2023062610153390000_btt160-B17","doi-asserted-by":"crossref","first-page":"562","DOI":"10.1186\/1471-2105-11-562","article-title":"An improved method for scoring protein-protein interactions using semantic similarity within the gene ontology","volume":"11","author":"Jain","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023062610153390000_btt160-B18","doi-asserted-by":"crossref","first-page":"635","DOI":"10.1093\/bioinformatics\/btg036","article-title":"Prediction of human protein function according to Gene Ontology categories","volume":"19","author":"Jensen","year":"2003","journal-title":"Bioinformatics"},{"key":"2023062610153390000_btt160-B19","first-page":"19","article-title":"Semantic similarity based on corpus statistics and lexical taxonomy","volume-title":"Proceedings of International Conference on Research in Computational Linguistics","author":"Jiang","year":"1998"},{"key":"2023062610153390000_btt160-B21","doi-asserted-by":"crossref","first-page":"1085","DOI":"10.1101\/gr.1910904","article-title":"Coexpression analysis of human genes across many microarray data sets","volume":"14","author":"Lee","year":"2004","journal-title":"Genome Res."},{"key":"2023062610153390000_btt160-B23","doi-asserted-by":"crossref","first-page":"1043","DOI":"10.1074\/mcp.M700287-MCP200","article-title":"PRINCESS, a protein interaction confidence evaluation system with multiple data sources","volume":"7","author":"Li","year":"2008","journal-title":"Mol. Cell. Proteomics"},{"key":"2023062610153390000_btt160-B24","first-page":"296","article-title":"An information-theoretic definition of similarity","volume-title":"Proceeding of the 15th International Conference on Machine learning","author":"Lin","year":"1998"},{"key":"2023062610153390000_btt160-B25","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1186\/1471-2105-5-154","article-title":"Information assessment on predicting protein-protein interactions","volume":"5","author":"Lin","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023062610153390000_btt160-B26","doi-asserted-by":"crossref","first-page":"1275","DOI":"10.1093\/bioinformatics\/btg153","article-title":"Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation","volume":"19","author":"Lord","year":"2003","journal-title":"Bioinformatics"},{"key":"2023062610153390000_btt160-B27","doi-asserted-by":"crossref","first-page":"e337","DOI":"10.1371\/journal.pone.0000337","article-title":"Probabilistic protein function prediction from heterogeneous genome-wide data","volume":"2","author":"Nariai","year":"2007","journal-title":"PLoS One"},{"key":"2023062610153390000_btt160-B28","doi-asserted-by":"crossref","first-page":"363","DOI":"10.1016\/j.jbi.2011.11.017","article-title":"Finding disease similarity based on implicit semantic similarity","volume":"45","author":"Mathur","year":"2011","journal-title":"J. Biomed. Inform."},{"key":"2023062610153390000_btt160-B52","doi-asserted-by":"crossref","first-page":"R101","DOI":"10.1186\/gb-2004-5-12-r101","article-title":"GOToolBox: functional analysis of gene datasets based on Gene Ontology","volume":"5","author":"Martin","year":"2004","journal-title":"Genome Biol."},{"key":"2023062610153390000_btt160-B29","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1186\/1471-2105-9-327","article-title":"Gene ontology term overlap as a measure of gene functional similarity","volume":"9","author":"Mistry","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023062610153390000_btt160-B30","doi-asserted-by":"crossref","first-page":"622","DOI":"10.1093\/nar\/gkn982","article-title":"Identification of candidate disease genes by inte-grating Gene Ontologies and protein interaction networks: case study of primary immune deficiencies","volume":"37","author":"Ortutay","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023062610153390000_btt160-B31","doi-asserted-by":"crossref","first-page":"S4","DOI":"10.1186\/1471-2105-9-S5-S4","article-title":"Metrics for go based protein semantic similarity: a systematic evaluation","volume":"9","author":"Pesquita","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023062610153390000_btt160-B32","doi-asserted-by":"crossref","first-page":"e1000443","DOI":"10.1371\/journal.pcbi.1000443","article-title":"Semantic similarity in biomedical ontologies","volume":"5","author":"Pesquita","year":"2009","journal-title":"PLoS Comput. Biol."},{"key":"2023062610153390000_btt160-B33","article-title":"CESSM: collaborative evaluation of semantic similarity measures","volume-title":"Proceeding of JB 2009: Challenges in Bioinformatics","author":"Pesquita","year":"2009"},{"key":"2023062610153390000_btt160-B34","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1186\/1471-2105-9-50","article-title":"Defining functional distances over gene ontology","volume":"9","author":"Pozo","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023062610153390000_btt160-B35","doi-asserted-by":"crossref","first-page":"1905","DOI":"10.1093\/bioinformatics\/bth177","article-title":"Supervised cluster analysis for microarray data based on multivariate Gaussian mixture","volume":"20","author":"Qu","year":"2004","journal-title":"Bioinformatics"},{"key":"2023062610153390000_btt160-B36","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1613\/jair.514","article-title":"Semantic similarity in a taxonomy: an information based measure and its application to problems of ambiguity in natural language","volume":"11","author":"Resnik","year":"1999","journal-title":"J. Artif. Intell. Res."},{"key":"2023062610153390000_btt160-B37","first-page":"99","article-title":"XOA: Web-enabled cross-ontological analytics","volume-title":"Proceeding of IEEE Congress on Services","author":"Riensche","year":"2007"},{"key":"2023062610153390000_btt160-B38","doi-asserted-by":"crossref","first-page":"859","DOI":"10.1093\/bioinformatics\/btm012","article-title":"Functional evaluation of domain-domain interactions and human protein interaction networks","volume":"23","author":"Schlicker","year":"2007","journal-title":"Bioinformatics"},{"key":"2023062610153390000_btt160-B39","doi-asserted-by":"crossref","first-page":"i561","DOI":"10.1093\/bioinformatics\/btq384","article-title":"Improving disease gene prioritization using the semantic similarity of Gene Ontology terms","volume":"26","author":"Schlicker","year":"2010","journal-title":"Bioinformatics"},{"key":"2023062610153390000_btt160-B40","first-page":"1089","article-title":"An intrinsic information content metric for semantic similarity in WordNet","volume-title":"Proceedings of 16th European Conference on Artificial Intelligence","author":"Seco","year":"2004"},{"key":"2023062610153390000_btt160-B41","doi-asserted-by":"crossref","first-page":"330","DOI":"10.1109\/TCBB.2005.50","article-title":"Correlation between gene expression and go semantic similarity","volume":"2","author":"Sevilla","year":"2005","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform."},{"key":"2023062610153390000_btt160-B42","doi-asserted-by":"crossref","first-page":"468","DOI":"10.1186\/1471-2105-9-468","article-title":"A relation based measure of semantic similarity for gene ontology annotations","volume":"9","author":"Sheehan","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023062610153390000_btt160-B43","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1037\/0033-295X.84.4.327","article-title":"Features of similarity","volume":"84","author":"Tversky","year":"1977","journal-title":"Psychol. Rev."},{"key":"2023062610153390000_btt160-B45","doi-asserted-by":"crossref","first-page":"1274","DOI":"10.1093\/bioinformatics\/btm087","article-title":"A new method to measure the semantic similarity of go terms","volume":"23","author":"Wang","year":"2007","journal-title":"Bioinformatics"},{"key":"2023062610153390000_btt160-B46","doi-asserted-by":"crossref","first-page":"290","DOI":"10.1186\/1471-2105-11-290","article-title":"Revealing and avoiding bias in semantic similarity scores for protein pairs","volume":"11","author":"Wang","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023062610153390000_btt160-B47","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1093\/bioinformatics\/btm558","article-title":"Gaining confidence in biological interpretation of the microarray data: the functional consistence of the significant GO categories","volume":"24","author":"Yang","year":"2008","journal-title":"Bioinformatics"},{"key":"2023062610153390000_btt160-B48","doi-asserted-by":"crossref","first-page":"1383","DOI":"10.1093\/bioinformatics\/bts129","article-title":"Improving GO semantic similarity measures by exploring the ontology beneath the terms and modelling uncertainty","volume":"28","author":"Yang","year":"2012","journal-title":"Bioinformatics"},{"key":"2023062610153390000_btt160-B49","doi-asserted-by":"crossref","first-page":"0026","DOI":"10.1038\/msb4100034","article-title":"Gene function prediction from congruent synthetic lethal interactions in yeast","volume":"1","author":"Ye","year":"2005","journal-title":"Mol. Syst. Biol."},{"key":"2023062610153390000_btt160-B50","doi-asserted-by":"crossref","first-page":"230","DOI":"10.1093\/bioinformatics\/btn612","article-title":"Gene-disease relationship discovery based on model-driven data integration and database view definition","volume":"25","author":"Yilmaz","year":"2009","journal-title":"Bioinformatics"},{"key":"2023062610153390000_btt160-B53","doi-asserted-by":"crossref","first-page":"2163","DOI":"10.1093\/bioinformatics\/btm291","article-title":"Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications","volume":"23","author":"Yu","year":"2007","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/11\/1424\/50701012\/bioinformatics_29_11_1424.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/11\/1424\/50701012\/bioinformatics_29_11_1424.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,26]],"date-time":"2023-06-26T10:18:13Z","timestamp":1687774693000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/29\/11\/1424\/220386"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,4,9]]},"references-count":49,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2013,6,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btt160","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2013,6,1]]},"published":{"date-parts":[[2013,4,9]]}}}