{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T07:01:46Z","timestamp":1773730906318,"version":"3.50.1"},"reference-count":26,"publisher":"Springer Science and Business Media LLC","issue":"S5","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2008,4]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Although the UniProt KnowledgeBase is not a medical-oriented database, it contains information on more than 2,000 human proteins involved in pathologies. However, these annotations are not standardized, which impairs the interoperability between biological and clinical resources. In order to make these data easily accessible to clinical researchers, we have developed a procedure to link diseases described in the UniProtKB\/Swiss-Prot entries to the MeSH disease terminology.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We mapped disease names extracted either from the UniProtKB\/Swiss-Prot entry comment lines or from the corresponding OMIM entry to the MeSH. Different methods were assessed on a benchmark set of 200 disease names manually mapped to MeSH terms. The performance of the retained procedure in term of precision and recall was 86% and 64% respectively. Using the same procedure, more than 3,000 disease names in Swiss-Prot were mapped to MeSH with comparable efficiency.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>This study is a first attempt to link proteins in UniProtKB to the medical resources. The indexing we provided will help clinicians and researchers navigate from diseases to genes and from genes to diseases in an efficient way. The mapping is available at: <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/research.isb-sib.ch\/unimed\" ext-link-type=\"uri\">http:\/\/research.isb-sib.ch\/unimed<\/jats:ext-link>.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-9-s5-s3","type":"journal-article","created":{"date-parts":[[2008,4,29]],"date-time":"2008-04-29T18:14:29Z","timestamp":1209492869000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":45,"title":["Mapping proteins to disease terminologies: from UniProt to MeSH"],"prefix":"10.1186","volume":"9","author":[{"given":"Ana\u00efs","family":"Mottaz","sequence":"first","affiliation":[]},{"given":"Yum L","family":"Yip","sequence":"additional","affiliation":[]},{"given":"Patrick","family":"Ruch","sequence":"additional","affiliation":[]},{"given":"Anne-Lise","family":"Veuthey","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2008,4,29]]},"reference":[{"key":"2609_CR1","doi-asserted-by":"publisher","first-page":"D193","DOI":"10.1093\/nar\/gkl929","volume":"35","author":"The UniProt Consortium","year":"2007","unstructured":"The UniProt Consortium: The Universal Protein Resource (UniProt) Nucleic Acids Res 2007, 35: D193-D197.","journal-title":"Nucleic Acids Res"},{"issue":"Pt 1","key":"2609_CR2","first-page":"67","volume":"11","author":"SJ Nelson","year":"2004","unstructured":"Nelson SJ, Schopen M, Savage AG, Schulman JL, Arluk N: The MeSH Translation Maintenance System: Structure, Interface Design, and Implementation. Medinfo 2004, 11(Pt 1):67\u201369.","journal-title":"Medinfo"},{"key":"2609_CR3","unstructured":"International Statistical Classification of Diseases and Health Related Problems In (The) ICD-10. Second Edition edition. WHO Press, Geneva;"},{"key":"2609_CR4","first-page":"79","volume":"121","author":"K Donnelly","year":"2006","unstructured":"Donnelly K, SNOMED-CT: The advanced terminology and coding system for eHealth. Stud Health Techno Inform 2006, 121: 79\u201390.","journal-title":"Stud Health Techno Inform"},{"key":"2609_CR5","doi-asserted-by":"publisher","first-page":"D267","DOI":"10.1093\/nar\/gkh061","volume":"32","author":"O Bodenreider","year":"2004","unstructured":"Bodenreider O: The Unified Medical Language System (UMLS): integrating biomedical terminology. Nucleic Acids Res 2004, 32: D267-D270.","journal-title":"Nucleic Acids Res"},{"key":"2609_CR6","doi-asserted-by":"publisher","first-page":"D322","DOI":"10.1093\/nar\/gkj021","volume":"34","author":"Gene Ontology Consortium","year":"2006","unstructured":"Gene Ontology Consortium: The Gene Ontology (GO) project in 2006 Nucleic Acids Res 2006, 34: D322-D326.","journal-title":"Nucleic Acids Res"},{"key":"2609_CR7","first-page":"227","volume-title":"Cold Spring Harbor Symp Quant Biol","author":"M Ashburner","year":"2003","unstructured":"Ashburner M, Mungall CJ, Lewis SE: Ontologies for biologists: a community model for the annotation of genomic data. Cold Spring Harbor Symp Quant Biol 2003, 227\u2013236."},{"key":"2609_CR8","unstructured":"National Library of Medicine: UMLS Lexical Tools . [http:\/\/www.nlm.nih.gov\/research\/umls\/tools.html]"},{"key":"2609_CR9","first-page":"439","volume-title":"Pac Symp Biocomput","author":"IN Sarkar","year":"2003","unstructured":"Sarkar IN, Cantor MN, Gelman R, Hartel F, Lussier YA: Linking biomedical language information and knowledge resources: GO and UMLS. Pac Symp Biocomput 2003, 439\u2013450."},{"key":"2609_CR10","first-page":"62","volume":"95","author":"MN Cantor","year":"2003","unstructured":"Cantor MN, Sarkar IN, Gelman R, Hartel F, Bodenreider O, Lussier YA: An evaluation of hybrid methods for matching biomedical terminologies: Mapping the Gene Ontology to the UMLS. Stud Health Technol Inform 2003, 95: 62\u201367.","journal-title":"Stud Health Technol Inform"},{"key":"2609_CR11","doi-asserted-by":"publisher","first-page":"227","DOI":"10.1016\/j.artmed.2006.12.002","volume":"39","author":"S Zhang","year":"2007","unstructured":"Zhang S, Mork P, Bodenreider O, Bernstein PA: Comparing two approaches for aligning representations of anatomy. Artif Intell Med 2007, 39: 227\u2013236.","journal-title":"Artif Intell Med"},{"key":"2609_CR12","first-page":"202","volume-title":"Pac Symp Biocomput","author":"YA Lussier","year":"2004","unstructured":"Lussier YA, Li J: Terminological mapping for high throughput comparative biology of phenotypes. Pac Symp Biocomput 2004, 202\u2013213."},{"key":"2609_CR13","first-page":"103","volume-title":"Pac Symp Biocomput","author":"MN Cantor","year":"2005","unstructured":"Cantor MN, Sarkar IN, Bodenreider O, Lussier YA: GenesTrace: Phenomic knowledge discovery via structured terminology. Pac Symp Biocomput 2005, 103\u2013114."},{"key":"2609_CR14","first-page":"28","volume-title":"Pac Symp Biocomput","author":"HL Johnson","year":"2006","unstructured":"Johnson HL, Cohen KB, Baumgartner WA, Lu Z, Bada M, Kester T, Kim H, Hunter L: Evaluation of lexical methods for detecting relationships between concepts from multiple ontologies. Pac Symp Biocomput 2006, 28\u201339."},{"key":"2609_CR15","doi-asserted-by":"publisher","first-page":"D514","DOI":"10.1093\/nar\/gki033","volume":"33","author":"A Hamosh","year":"2005","unstructured":"Hamosh A, Scott AF, Amberger JS, Bocchini CA, McKusick VA: Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res 2005, 33: D514\u2013517.","journal-title":"Nucleic Acids Res"},{"key":"2609_CR16","unstructured":"The Specialist Lexical Tools [http:\/\/lexsrv3.nlm.nih.gov\/SPECIALIST\/index.html]"},{"key":"2609_CR17","doi-asserted-by":"publisher","first-page":"222","DOI":"10.1093\/bib\/6.3.222","volume":"6","author":"H Shatkay","year":"2005","unstructured":"Shatkay H: Hairpins in a bookstacks: Information retrieval from biomedical text. Brief Bioinform 2005, 6: 222\u201338.","journal-title":"Brief Bioinform"},{"key":"2609_CR18","volume-title":"BioLINK SIG 2007, ISMB\/ECCB","author":"V Ha-Thuc","year":"2007","unstructured":"Ha-Thuc V, Srinivasan P: Exploiting synonym relationships in biomedical named entity matching. In BioLINK SIG 2007, ISMB\/ECCB. Vienna; 2007. July"},{"key":"2609_CR19","doi-asserted-by":"publisher","first-page":"16","DOI":"10.1109\/MIS.2003.1234765","volume":"18","author":"M Bilenko","year":"2003","unstructured":"Bilenko M, Mooney R, Cohen W, Ravikumar P, Fienberg S: Adaptive name matching in information integration. IEEE Intellig Sys. 2003, 18: 16\u201323.","journal-title":"IEEE Intellig Sys"},{"key":"2609_CR20","first-page":"73","volume-title":"Proc JCCAI Conf","author":"W Cohen","year":"2003","unstructured":"Cohen W, Ravikumar P, Fienberg S: A comparison of string distance metrics. for name-matching tasks. Proc JCCAI Conf 2003, 73\u201378."},{"key":"2609_CR21","doi-asserted-by":"publisher","first-page":"658","DOI":"10.1093\/bioinformatics\/bti783","volume":"22","author":"P Ruch","year":"2006","unstructured":"Ruch P: Automatic assignment of biomedical categories: toward a generic approach. Bioinformatics 2006, 22: 658\u2013664.","journal-title":"Bioinformatics"},{"key":"2609_CR22","first-page":"17","volume-title":"AMIA Annu SympProc","author":"AR Aronson","year":"2001","unstructured":"Aronson AR: Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. AMIA Annu SympProc 2001, 17\u201321."},{"key":"2609_CR23","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1038\/nbt1150","volume":"24","author":"AJ Butte","year":"2006","unstructured":"Butte AJ, Kohane IS: Creation and implications of a phenome-genome network. Nat Biotechnol 2006, 24: 55\u201362.","journal-title":"Nat Biotechnol"},{"key":"2609_CR24","first-page":"106","volume-title":"AMIA Annu SympProc","author":"AJ Butte","year":"2006","unstructured":"Butte AJ, Chen R: Finding disease-related genomic experiments within an international repository: first steps in translational bioinformatics. AMIA Annu SympProc 2006, 106\u2013110."},{"key":"2609_CR25","doi-asserted-by":"publisher","first-page":"296","DOI":"10.1186\/1471-2105-8-296","volume":"8","author":"NH Shah","year":"2007","unstructured":"Shah NH, Rubin DL, Espinosa I, Montgomery K, Musen MA: Annotation and query of tissue microarray data using the NCI Thesaurus. BMC Bioinformatics 2007, 8: 296.","journal-title":"BMC Bioinformatics"},{"key":"2609_CR26","doi-asserted-by":"publisher","first-page":"535","DOI":"10.1038\/sj.ejhg.5201585","volume":"14","author":"MA van Driel","year":"2006","unstructured":"van Driel MA, Bruggeman J, Vriend G, Brunner HG, Leunissen JA: A text-mining analysis of the human phenome. Eur J Hum Genet 2006, 14: 535\u2013542.","journal-title":"Eur J Hum Genet"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-9-S5-S3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,8,31]],"date-time":"2021-08-31T21:24:51Z","timestamp":1630445091000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-9-S5-S3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,4]]},"references-count":26,"journal-issue":{"issue":"S5","published-print":{"date-parts":[[2008,4]]}},"alternative-id":["2609"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-9-s5-s3","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2008,4]]},"assertion":[{"value":"29 April 2008","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"S3"}}