{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,7,11]],"date-time":"2023-07-11T18:51:53Z","timestamp":1689101513696},"reference-count":17,"publisher":"Springer Science and Business Media LLC","issue":"S1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2011,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>The Gene Ontology (GO) provides a controlled vocabulary for describing genes and gene products. In spite of the undoubted importance of GO, several drawbacks associated with GO and GO-based annotations have been introduced. We identified three types of semantic inconsistencies in GO-based annotations; semantically redundant, biological-domain inconsistent and taxonomy inconsistent annotations.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Methods<\/jats:title>\n            <jats:p>To determine the semantic inconsistencies in GO annotation, we used the hierarchical structure of GO graph and tree structure of NCBI taxonomy. Twenty seven biological databases were collected for finding semantic inconsistent annotation.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>The distributions and possible causes of the semantic inconsistencies were investigated using twenty seven biological databases with GO-based annotations. We found that some evidence codes of annotation were associated with the inconsistencies. The numbers of gene products and species in a database that are related to the complexity of database management are also in correlation with the inconsistencies. Consequently, numerous annotation errors arise and are propagated throughout biological databases and GO-based high-level analyses. GOChase-II is developed to detect and correct both syntactic and semantic errors in GO-based annotations.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>We identified some inconsistencies in GO-based annotation and provided software, GOChase-II, for correcting these semantic inconsistencies in addition to the previous corrections for the syntactic errors by GOChase-I.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-12-s1-s40","type":"journal-article","created":{"date-parts":[[2011,2,18]],"date-time":"2011-02-18T20:10:58Z","timestamp":1298059858000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["GOChase-II: correcting semantic inconsistencies from Gene Ontology-based annotations for gene products"],"prefix":"10.1186","volume":"12","author":[{"given":"Yu Rang","family":"Park","sequence":"first","affiliation":[]},{"given":"Jihun","family":"Kim","sequence":"additional","affiliation":[]},{"given":"Hye Won","family":"Lee","sequence":"additional","affiliation":[]},{"given":"Young Jo","family":"Yoon","sequence":"additional","affiliation":[]},{"given":"Ju Han","family":"Kim","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2011,2,15]]},"reference":[{"issue":"1","key":"4398_CR1","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1038\/75556","volume":"25","author":"M Ashburner","year":"2000","unstructured":"Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al.: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000, 25(1):25\u201329. 10.1038\/75556","journal-title":"Nat Genet"},{"key":"4398_CR2","first-page":"219","volume-title":"Data Structures and Algorithms","author":"HJ Aho AV","year":"1983","unstructured":"Aho AV HJ, Ullman JD: Directed graphs. In Data Structures and Algorithms. Massachusetts: Addison-Wesley; 1983:219\u2013221."},{"issue":"1","key":"4398_CR3","doi-asserted-by":"publisher","first-page":"103","DOI":"10.1186\/gb-2004-6-1-103","volume":"6","author":"SE Lewis","year":"2005","unstructured":"Lewis SE: Gene Ontology: looking backwards and forwards. Genome Biol 2005, 6(1):103. 10.1186\/gb-2004-6-1-103","journal-title":"Genome Biol"},{"issue":"Database issue","key":"4398_CR4","doi-asserted-by":"publisher","first-page":"D262","DOI":"10.1093\/nar\/gkh021","volume":"32","author":"E Camon","year":"2004","unstructured":"Camon E, Magrane M, Barrell D, Lee V, Dimmer E, Maslen J, Binns D, Harte N, Lopez R, Apweiler R: The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology. Nucleic Acids Res 2004, 32(Database issue):D262\u2013266. 10.1093\/nar\/gkh021","journal-title":"Nucleic Acids Res"},{"issue":"Database issue","key":"4398_CR5","doi-asserted-by":"publisher","first-page":"D500","DOI":"10.1093\/nar\/gkj054","volume":"34","author":"NA Stover","year":"2006","unstructured":"Stover NA, Krieger CJ, Binkley G, Dong Q, Fisk DG, Nash R, Sethuraman A, Weng S, Cherry JM: Tetrahymena Genome Database (TGD): a new genomic resource for Tetrahymena thermophila research. Nucleic Acids Res 2006, 34(Database issue):D500\u2013503. 10.1093\/nar\/gkj054","journal-title":"Nucleic Acids Res"},{"key":"4398_CR6","volume-title":"J Struct Biol","author":"MI Sadowski","year":"2010","unstructured":"Sadowski MI, Taylor WR: On the evolutionary origins of \"Fold Space Continuity\": A study of topological convergence and divergence in mixed alpha-beta domains. J Struct Biol 2010. [Epub ahead of print] [Epub ahead of print]"},{"issue":"1","key":"4398_CR7","doi-asserted-by":"publisher","first-page":"S62","DOI":"10.1186\/1471-2105-11-S1-S62","volume":"11","author":"MR Mehan","year":"2010","unstructured":"Mehan MR, Nunez-Iglesias J, Dai C, Waterman MS, Zhou XJ: An integrative modular approach to systematically predict gene-phenotype associations. BMC Bioinformatics 2010, 11(1):S62. 10.1186\/1471-2105-11-S1-S62","journal-title":"BMC Bioinformatics"},{"key":"4398_CR8","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1186\/1471-2105-11-91","volume":"11","author":"A Martin","year":"2010","unstructured":"Martin A, Ochagavia ME, Rabasa LC, Miranda J, Fernandez-de-Cossio J, Bringas R: BisoGanet: a new tool for gene network building, visualizatiion and analysis. BMC Bioinformatics 2010, 11: 91. 10.1186\/1471-2105-11-91","journal-title":"BMC Bioinformatics"},{"issue":"Database issue","key":"4398_CR9","doi-asserted-by":"publisher","first-page":"D267","DOI":"10.1093\/nar\/gkh061","volume":"32","author":"O Bodenreider","year":"2004","unstructured":"Bodenreider O: The Unified Medical Language System (UMLS): integrating biomedical terminology. Nucleic Acids Res 2004, 32(Database issue):D267\u2013270. 10.1093\/nar\/gkh061","journal-title":"Nucleic Acids Res"},{"issue":"7-8","key":"4398_CR10","doi-asserted-by":"publisher","first-page":"731","DOI":"10.1016\/j.compbiomed.2005.04.008","volume":"36","author":"M Masseroli","year":"2006","unstructured":"Masseroli M, Pinciroli F: Using Gene Ontology and genomic controlled vocabularies to analyze high-throughput gene lists: three tool comparison. Comput Biol Med 2006, 36(7\u20138):731\u2013747. 10.1016\/j.compbiomed.2005.04.008","journal-title":"Comput Biol Med"},{"issue":"1","key":"4398_CR11","doi-asserted-by":"publisher","first-page":"i136","DOI":"10.1093\/bioinformatics\/bti1019","volume":"21 Suppl 1","author":"ME Dolan","year":"2005","unstructured":"Dolan ME, Ni L, Camon E, Blake JA: A procedure for assessing GO annotation consistency. Bioinformatics 2005, 21 Suppl 1(1):i136\u2013143. 10.1093\/bioinformatics\/bti1019","journal-title":"Bioinformatics"},{"issue":"6","key":"4398_CR12","doi-asserted-by":"publisher","first-page":"829","DOI":"10.1093\/bioinformatics\/bti106","volume":"21","author":"YR Park","year":"2005","unstructured":"Park YR, Park CH, Kim JH: GOChase: correcting errors from Gene Ontology-based annotations for gene products. Bioinformatics 2005, 21(6):829\u2013831. 10.1093\/bioinformatics\/bti106","journal-title":"Bioinformatics"},{"issue":"8","key":"4398_CR13","doi-asserted-by":"publisher","first-page":"1425","DOI":"10.1101\/gr.180801","volume":"11","author":"The Gene Ontology Consortium","year":"2001","unstructured":"The Gene Ontology Consortium: Creating the gene ontology resource: design and implementation. Genome Res 2001, 11(8):1425\u20131433. 10.1101\/gr.180801","journal-title":"Genome Res"},{"issue":"18","key":"4398_CR14","doi-asserted-by":"publisher","first-page":"3587","DOI":"10.1093\/bioinformatics\/bti565","volume":"21","author":"P Khatri","year":"2005","unstructured":"Khatri P, Draghici S: Ontological analysis of gene expression data: current tools, limitations, and open problems. Bioinformatics 2005, 21(18):3587\u20133595. 10.1093\/bioinformatics\/bti565","journal-title":"Bioinformatics"},{"issue":"16","key":"4398_CR15","doi-asserted-by":"publisher","first-page":"2198","DOI":"10.1093\/bioinformatics\/btm112","volume":"23","author":"J Day-Richter","year":"2007","unstructured":"Day-Richter J, Harris MA, Haendel M, Gene Ontology OBO-Edit Working Groups, Lewis S: OBO-Edit-- an ontology editor for biologists. Bioinformatics 2007, 23(16):2198\u20132200. 10.1093\/bioinformatics\/btm112","journal-title":"Bioinformatics"},{"issue":"Database issue","key":"4398_CR16","first-page":"D173","volume":"38","author":"EW Sayers","year":"2010","unstructured":"Sayers EW, Barrett T, Benson DA, Bolton E, Bryant SH, Canese K, Chetvernin V, Church DM, Dicuccio M, Federhen S, et al.: Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 2010, 38(Database issue):D173\u2013180.","journal-title":"Nucleic Acids Res"},{"issue":"Database issue","key":"4398_CR17","doi-asserted-by":"publisher","first-page":"D331","DOI":"10.1093\/nar\/gkp1018","volume":"38","author":"The Gene Ontology Consortium","year":"2010","unstructured":"The Gene Ontology Consortium: The Gene Ontology in 2010: extensions and refinemens. Nucleic Acids Research 2010, 38(Database issue):D331\u2013335. 10.1093\/nar\/gkp1018","journal-title":"Nucleic Acids Research"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-12-S1-S40.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T14:04:03Z","timestamp":1630505043000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-12-S1-S40"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,2,15]]},"references-count":17,"journal-issue":{"issue":"S1","published-print":{"date-parts":[[2011,12]]}},"alternative-id":["4398"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-12-s1-s40","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,2,15]]},"assertion":[{"value":"15 February 2011","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"S40"}}