{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,2]],"date-time":"2025-11-02T16:28:36Z","timestamp":1762100916041},"reference-count":30,"publisher":"Springer Science and Business Media LLC","issue":"S2","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2009,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>The evolving complexity of genome-scale experiments has increasingly centralized the role of a highly computable, accurate, and comprehensive resource spanning multiple biological scales and viewpoints. To provide a resource to meet this need, we have significantly extended the PhenoGO database with gene-disease specific annotations and included an additional ten species. This a computationally-derived resource is primarily intended to provide phenotypic context (cell type, tissue, organ, and disease) for mining existing associations between gene products and GO terms specified in the Gene Ontology Databases Automated natural language processing (BioMedLEE) and computational ontology (PhenOS) methods were used to derive these relationships from the literature, expanding the database with information from ten additional species to include over 600,000 phenotypic contexts spanning eleven species from five GO annotation databases. A comprehensive evaluation evaluating the mappings (<jats:italic>n<\/jats:italic> = 300) found precision (positive predictive value) at 85%, and recall (sensitivity) at 76%. Phenotypes are encoded in general purpose ontologies such as Cell Ontology, the Unified Medical Language System, and in specialized ontologies such as the Mouse Anatomy and the Mammalian Phenotype Ontology. A web portal has also been developed, allowing for advanced filtering and querying of the database as well as download of the entire dataset <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/www.phenogo.org\" ext-link-type=\"uri\">http:\/\/www.phenogo.org<\/jats:ext-link>.<\/jats:p>","DOI":"10.1186\/1471-2105-10-s2-s8","type":"journal-article","created":{"date-parts":[[2009,2,5]],"date-time":"2009-02-05T16:29:10Z","timestamp":1233851350000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":21,"title":["PhenoGO: an integrated resource for the multiscale mining of clinical and biological data"],"prefix":"10.1186","volume":"10","author":[{"given":"Lee T","family":"Sam","sequence":"first","affiliation":[]},{"given":"Eneida A","family":"Mendon\u00e7a","sequence":"additional","affiliation":[]},{"given":"Jianrong","family":"Li","sequence":"additional","affiliation":[]},{"given":"Judith","family":"Blake","sequence":"additional","affiliation":[]},{"given":"Carol","family":"Friedman","sequence":"additional","affiliation":[]},{"given":"Yves A","family":"Lussier","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2009,2,5]]},"reference":[{"key":"3267_CR1","first-page":"76","volume-title":"Pac Symp Biocomput","author":"L Sam","year":"2007","unstructured":"Sam L, Liu Y, Li J, Friedman C, Lussier YA: Discovery of protein interaction networks shared by diseases. Pac Symp Biocomput 2007, 76\u201387."},{"issue":"3","key":"3267_CR2","doi-asserted-by":"publisher","first-page":"309","DOI":"10.1038\/nbt1295","volume":"25","author":"K Lage","year":"2007","unstructured":"Lage K, Karlberg EO, Storling ZM, Olason PI, Pedersen AG, Rigina O, Hinsby AM, Tumer Z, Pociot F, Tommerup N, et al.: A human phenome-interactome network of protein complexes implicated in genetic disorders. Nat Biotechnol 2007, 25(3):309\u2013316.","journal-title":"Nat Biotechnol"},{"issue":"6","key":"3267_CR3","doi-asserted-by":"publisher","first-page":"1011","DOI":"10.1086\/504300","volume":"78","author":"L Franke","year":"2006","unstructured":"Franke L, Bakel H, Fokkens L, de Jong ED, Egmont-Petersen M, Wijmenga C: Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes. Am J Hum Genet 2006, 78(6):1011\u20131025.","journal-title":"Am J Hum Genet"},{"issue":"5","key":"3267_CR4","doi-asserted-by":"publisher","first-page":"535","DOI":"10.1038\/sj.ejhg.5201585","volume":"14","author":"MA van Driel","year":"2006","unstructured":"van Driel MA, Bruggeman J, Vriend G, Brunner HG, Leunissen JA: A text-mining analysis of the human phenome. Eur J Hum Genet 2006, 14(5):535\u2013542.","journal-title":"Eur J Hum Genet"},{"issue":"19","key":"3267_CR5","doi-asserted-by":"publisher","first-page":"e130","DOI":"10.1093\/nar\/gkl707","volume":"34","author":"RA George","year":"2006","unstructured":"George RA, Liu JY, Feng LL, Bryson-Richardson RJ, Fatkin D, Wouters MA: Analysis of protein sequence and interaction data for candidate disease gene prediction. Nucleic Acids Res 2006, 34(19):e130.","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"3267_CR6","first-page":"5","volume":"4","author":"E Camon","year":"2004","unstructured":"Camon E, Barrell D, Lee V, Dimmer E, Apweiler R: The Gene Ontology Annotation (GOA) Database \u2013 an integrated resource of GO annotations to the UniProt Knowledgebase. In Silico Biol 2004, 4(1):5\u20136.","journal-title":"In Silico Biol"},{"issue":"2","key":"3267_CR7","doi-asserted-by":"publisher","first-page":"R21","DOI":"10.1186\/gb-2005-6-2-r21","volume":"6","author":"J Bard","year":"2005","unstructured":"Bard J, Rhee SY, Ashburner M: An ontology for cell types. Genome Biol 2005, 6(2):R21.","journal-title":"Genome Biol"},{"issue":"5","key":"3267_CR8","first-page":"40","volume":"61","author":"C Lindberg","year":"1990","unstructured":"Lindberg C: The Unified Medical Language System (UMLS) of the National Library of Medicine. J Am Med Rec Assoc 1990, 61(5):40\u201342.","journal-title":"J Am Med Rec Assoc"},{"key":"3267_CR9","first-page":"114","volume":"51","author":"FB Rogers","year":"1963","unstructured":"Rogers FB: Medical subject headings. Bull Med Libr Assoc 1963, 51: 114\u2013116.","journal-title":"Bull Med Libr Assoc"},{"issue":"1","key":"3267_CR10","doi-asserted-by":"publisher","first-page":"R7","DOI":"10.1186\/gb-2004-6-1-r7","volume":"6","author":"CL Smith","year":"2005","unstructured":"Smith CL, Goldsmith CA, Eppig JT: The Mammalian Phenotype Ontology as a tool for annotating, analyzing and comparing phenotypic information. Genome Biol 2005, 6(1):R7.","journal-title":"Genome Biol"},{"issue":"3","key":"3267_CR11","doi-asserted-by":"publisher","first-page":"R29","DOI":"10.1186\/gb-2005-6-3-r29","volume":"6","author":"TF Hayamizu","year":"2005","unstructured":"Hayamizu TF, Mangan M, Corradi JP, Kadin JA, Ringwald M: The Adult Mouse Anatomical Dictionary: a tool for annotating and integrating data. Genome Biol 2005, 6(3):R29.","journal-title":"Genome Biol"},{"issue":"1","key":"3267_CR12","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1093\/nar\/28.1.10","volume":"28","author":"DL Wheeler","year":"2000","unstructured":"Wheeler DL, Chappey C, Lash AE, Leipe DD, Madden TL, Schuler GD, Tatusova TA, Rapp BA: Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 2000, 28(1):10\u201314.","journal-title":"Nucleic Acids Res"},{"key":"3267_CR13","first-page":"D577","volume-title":"Nucleic Acids Res","author":"EL Hong","year":"2008","unstructured":"Hong EL, Balakrishnan R, Dong Q, Christie KR, Park J, Binkley G, Costanzo MC, Dwight SS, Engel SR, Fisk DG, et al.: Gene Ontology annotations at SGD: new data sources and annotation methods. Nucleic Acids Res 2008, (36 Database):D577\u2013581."},{"key":"3267_CR14","first-page":"D411","volume-title":"Nucleic Acids Res","author":"TW Harris","year":"2004","unstructured":"Harris TW, Chen N, Cunningham F, Tello-Ruiz M, Antoshechkin I, Bastiani C, Bieri T, Blasiar D, Bradnam K, Chan J, et al.: WormBase: a multi-species resource for nematode biology and genomics. Nucleic Acids Res 2004, (32 Database):D411\u2013417."},{"key":"3267_CR15","first-page":"D588","volume-title":"Nucleic Acids Res","author":"RJ Wilson","year":"2008","unstructured":"Wilson RJ, Goodman JL, Strelets VB: FlyBase: integration and improvements to query tools. Nucleic Acids Res 2008, (36 Database):D588\u2013593."},{"key":"3267_CR16","first-page":"D581","volume-title":"Nucleic Acids Res","author":"J Sprague","year":"2006","unstructured":"Sprague J, Bayraktaroglu L, Clements D, Conlin T, Fashena D, Frazer K, Haendel M, Howe DG, Mani P, Ramachandran S, et al.: The Zebrafish Information Network: the zebrafish model organism database. Nucleic Acids Res 2006, (34 Database):D581\u2013585."},{"key":"3267_CR17","first-page":"D471","volume-title":"Nucleic Acids Res","author":"JT Eppig","year":"2005","unstructured":"Eppig JT, Bult CJ, Kadin JA, Richardson JE, Blake JA, Anagnostopoulos A, Baldarelli RM, Baya M, Beal JS, Bello SM, et al.: The Mouse Genome Database (MGD): from genes to mice \u2013 a community resource for mouse biology. Nucleic Acids Res 2005, (33 Database):D471\u2013475."},{"key":"3267_CR18","first-page":"D658","volume-title":"Nucleic Acids Res","author":"SN Twigger","year":"2007","unstructured":"Twigger SN, Shimoyama M, Bromberg S, Kwitek AE, Jacob HJ: The Rat Genome Database, update 2007 \u2013 easing the path from disease to data and back again. Nucleic Acids Res 2007, (35 Database):D658\u2013662."},{"issue":"13","key":"3267_CR19","doi-asserted-by":"publisher","first-page":"i529","DOI":"10.1093\/bioinformatics\/btm195","volume":"23","author":"Y Tao","year":"2007","unstructured":"Tao Y, Sam L, Li J, Friedman C, Lussier YA: Information theory applied to the sparse gene ontology annotation network to predict novel gene function. Bioinformatics 2007, 23(13):i529\u2013538.","journal-title":"Bioinformatics"},{"issue":"5","key":"3267_CR20","doi-asserted-by":"publisher","first-page":"896","DOI":"10.1101\/gr.440803","volume":"13","author":"OD King","year":"2003","unstructured":"King OD, Foulger RE, Dwight SS, White JV, Roth FP: Predicting gene function from patterns of annotation. Genome Res 2003, 13(5):896\u2013904.","journal-title":"Genome Res"},{"key":"3267_CR21","doi-asserted-by":"publisher","first-page":"116","DOI":"10.1186\/1471-2105-5-116","volume":"5","author":"A Vinayagam","year":"2004","unstructured":"Vinayagam A, Konig R, Moormann J, Schubert F, Eils R, Glatting KH, Suhai S: Applying Support Vector Machines for Gene Ontology based gene function prediction. BMC Bioinformatics 2004, 5: 116.","journal-title":"BMC Bioinformatics"},{"issue":"4","key":"3267_CR22","doi-asserted-by":"publisher","first-page":"707","DOI":"10.1006\/jmbi.1998.2144","volume":"283","author":"P Bork","year":"1998","unstructured":"Bork P, Dandekar T, Diaz-Lazcoz Y, Eisenhaber F, Huynen M, Yuan Y: Predicting function: from genes to genomes and back. J Mol Biol 1998, 283(4):707\u2013725.","journal-title":"J Mol Biol"},{"key":"3267_CR23","unstructured":"Mouse Genome Database (MGD) MGIWS, The Jackson Laboratory, Bar Harbor, Maine[http:\/\/www.informatics.jax.org] [August 15, 2005]."},{"key":"3267_CR24","volume-title":"ISMB","author":"Y Lussier","year":"2007","unstructured":"Lussier Y, Friedman C: BiomedLEE: a natural-language processor for extracting and representing phenotypes, underlying molecular mechanisms and their relationships. ISMB 2007. [http:\/\/www.iscb.org\/uploaded\/css\/O02Lussier.pdf]"},{"issue":"Pt 2","key":"3267_CR25","first-page":"758","volume":"107","author":"L Chen","year":"2004","unstructured":"Chen L, Friedman C: Extracting phenotypic information from the literature via natural language processing. Stud Health Technol Inform 2004, 107(Pt 2):758\u2013762.","journal-title":"Stud Health Technol Inform"},{"key":"3267_CR26","first-page":"202","volume-title":"Pac Symp Biocomput","author":"YA Lussier","year":"2004","unstructured":"Lussier YA, Li J: Terminological mapping for high throughput comparative biology of phenotypes. Pac Symp Biocomput 2004, 202\u2013213."},{"key":"3267_CR27","first-page":"439","volume-title":"Pacific Symposium on Biocomputing","author":"IN Sarkar","year":"2003","unstructured":"Sarkar IN, Cantor MN, Gelman R, Hartel F, Lussier YA: Linking biomedical language information and knowledge resources: GO and UMLS. Pacific Symposium on Biocomputing 2003, 439\u2013450."},{"key":"3267_CR28","first-page":"103","volume-title":"Pac Symp Biocomput","author":"MN Cantor","year":"2005","unstructured":"Cantor MN, Sarkar IN, Bodenreider O, Lussier YA: Genestrace: phenomic knowledge discovery via structured terminology. Pac Symp Biocomput 2005, 103\u2013114."},{"key":"3267_CR29","first-page":"64","volume-title":"Pac Symp Biocomput","author":"Y Lussier","year":"2006","unstructured":"Lussier Y, Borlawsky T, Rappaport D, Liu Y, Friedman C: PhenoGO: assigning phenotypic context to gene ontology annotations with natural language processing. Pac Symp Biocomput 2006, 64\u201375."},{"issue":"21","key":"3267_CR30","doi-asserted-by":"publisher","first-page":"8685","DOI":"10.1073\/pnas.0701361104","volume":"104","author":"KI Goh","year":"2007","unstructured":"Goh KI, Cusick ME, Valle D, Childs B, Vidal M, Barabasi AL: The human disease network. Proc Natl Acad Sci USA 2007, 104(21):8685\u20138690.","journal-title":"Proc Natl Acad Sci USA"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-10-S2-S8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,8,31]],"date-time":"2021-08-31T21:36:29Z","timestamp":1630445789000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-10-S2-S8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,2]]},"references-count":30,"journal-issue":{"issue":"S2","published-print":{"date-parts":[[2009,2]]}},"alternative-id":["3267"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-10-s2-s8","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2009,2]]},"assertion":[{"value":"5 February 2009","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"S8"}}