{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,2]],"date-time":"2026-06-02T04:26:12Z","timestamp":1780374372027,"version":"3.54.1"},"reference-count":39,"publisher":"Oxford University Press (OUP)","issue":"10","license":[{"start":{"date-parts":[[2020,9,12]],"date-time":"2020-09-12T00:00:00Z","timestamp":1599868800000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1657306"],"award-info":[{"award-number":["1657306"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1931134"],"award-info":[{"award-number":["1931134"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R21CA231904"],"award-info":[{"award-number":["R21CA231904"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R01NS116287"],"award-info":[{"award-number":["R01NS116287"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["1UL1TR003167"],"award-info":[{"award-number":["1UL1TR003167"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R01AG066749"],"award-info":[{"award-number":["R01AG066749"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100004917","name":"Cancer Prevention and Research Institute of Texas","doi-asserted-by":"publisher","award":["RP170668"],"award-info":[{"award-number":["RP170668"]}],"id":[{"id":"10.13039\/100004917","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National Science Foundation, National Institutes of Health, or Cancer Prevention and Research Institute of Texas"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,10,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Objective<\/jats:title>\n                  <jats:p>The Unified Medical Language System (UMLS) integrates various source terminologies to support interoperability between biomedical information systems. In this article, we introduce a novel transformation-based auditing method that leverages the UMLS knowledge to systematically identify missing hierarchical IS-A relations in the source terminologies.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Materials and Methods<\/jats:title>\n                  <jats:p>Given a concept name in the UMLS, we first identify its base and secondary noun chunks. For each identified noun chunk, we generate replacement candidates that are more general than the noun chunk. Then, we replace the noun chunks with their replacement candidates to generate new potential concept names that may serve as supertypes of the original concept. If a newly generated name is an existing concept name in the same source terminology with the original concept, then a potentially missing IS-A relation between the original and the new concept is identified.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Applying our transformation-based method to English-language concept names in the UMLS (2019AB release), a total of 39\u00a0359 potentially missing IS-A relations were detected in 13 source terminologies. Domain experts evaluated a random sample of 200 potentially missing IS-A relations identified in the SNOMED CT (U.S. edition) and 100 in Gene Ontology. A total of 173 of 200 and 63 of 100 potentially missing IS-A relations were confirmed by domain experts, indicating that our method achieved a precision of 86.5% and 63% for the SNOMED CT and Gene Ontology, respectively.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Conclusions<\/jats:title>\n                  <jats:p>Our results showed that our transformation-based method is effective in identifying missing IS-A relations in the UMLS source terminologies.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/jamia\/ocaa123","type":"journal-article","created":{"date-parts":[[2020,5,30]],"date-time":"2020-05-30T11:11:56Z","timestamp":1590837116000},"page":"1568-1575","source":"Crossref","is-referenced-by-count":11,"title":["A transformation-based method for auditing the IS-A hierarchy of biomedical terminologies in the Unified Medical Language System"],"prefix":"10.1093","volume":"27","author":[{"given":"Fengbo","family":"Zheng","sequence":"first","affiliation":[{"name":"Department of Computer Science, University of Kentucky, Lexington, Kentucky, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jay","family":"Shi","sequence":"additional","affiliation":[{"name":"Department of Internal Medicine, University of Kentucky, Lexington, Kentucky, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yuntao","family":"Yang","sequence":"additional","affiliation":[{"name":"School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"W Jim","family":"Zheng","sequence":"additional","affiliation":[{"name":"School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Licong","family":"Cui","sequence":"additional","affiliation":[{"name":"School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2020,10,12]]},"reference":[{"key":"2020110613102046300_ocaa123-B1","author":"Humphreys","year":"1989"},{"issue":"1","key":"2020110613102046300_ocaa123-B2","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1055\/s-0038-1637976","article-title":"The Unified Medical Language System","volume":"2","author":"Lindberg","year":"1993","journal-title":"Yearb Med Inform"},{"issue":"1","key":"2020110613102046300_ocaa123-B3","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1136\/jamia.1998.0050001","article-title":"The Unified Medical Language System: an informatics research collaboration","volume":"5","author":"Humphreys","year":"1998","journal-title":"J Am Med Inform Assoc"},{"issue":"90001","key":"2020110613102046300_ocaa123-B4","doi-asserted-by":"crossref","first-page":"267D","DOI":"10.1093\/nar\/gkh061","article-title":"The Unified Medical Language System (UMLS): integrating biomedical terminology","volume":"32","author":"Bodenreider","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2020110613102046300_ocaa123-B5","year":"2020"},{"key":"2020110613102046300_ocaa123-B6","first-page":"185","author":"Chute","year":"1991"},{"issue":"1","key":"2020110613102046300_ocaa123-B7","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1136\/jamia.2001.0080080","article-title":"UMLS concept indexing for production databases: a feasibility study","volume":"8","author":"Nadkarni","year":"2001","journal-title":"J Am Med Inform Assoc"},{"key":"2020110613102046300_ocaa123-B8","first-page":"34","author":"Hersh","year":"2000"},{"issue":"1","key":"2020110613102046300_ocaa123-B9","first-page":"1","article-title":"Query expansion using UMLS tools for health information retrieval","volume":"46","author":"Lu","year":"2009","journal-title":"Proc Am Soc Info Sci Technol"},{"key":"2020110613102046300_ocaa123-B10","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1016\/j.jbi.2014.04.013","article-title":"Improving search over electronic health records using UMLS-based query expansion through random walks","volume":"51","author":"Martinez","year":"2014","journal-title":"J Biomed Inform"},{"issue":"2","key":"2020110613102046300_ocaa123-B11","first-page":"184","article-title":"UMLS knowledge for biomedical language processing","volume":"81","author":"McCray","year":"1993","journal-title":"Bull Med Library Assoc"},{"key":"2020110613102046300_ocaa123-B12","first-page":"17","author":"Aronson","year":"2001"},{"issue":"11","key":"2020110613102046300_ocaa123-B13","doi-asserted-by":"crossref","first-page":"1218","DOI":"10.1093\/jamia\/ocz109","article-title":"Clinical trial cohort selection based on multi-level rule-based natural language processing system","volume":"26","author":"Chen","year":"2019","journal-title":"J Am Med Inform Assoc"},{"issue":"S3","key":"2020110613102046300_ocaa123-B14","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1186\/s12911-019-0781-4","article-title":"Clinical text classification with rule-based features and knowledge-guided convolutional neural networks","volume":"19","author":"Yao","year":"2019","journal-title":"BMC Med Inform Decis Mak"},{"key":"2020110613102046300_ocaa123-B15","first-page":"543","article-title":"Adversarial learning of knowledge embeddings for the Unified Medical Language System","author":"Maldonado","year":"2019","journal-title":"AMIA Jt Summits Transl Sci Proc"},{"issue":"1","key":"2020110613102046300_ocaa123-B16","doi-asserted-by":"crossref","first-page":"e5","DOI":"10.2196\/medinform.3172","article-title":"Next generation phenotyping using the Unified Medical Language System","volume":"2","author":"Adamusiak","year":"2014","journal-title":"JMIR Med Inform"},{"issue":"4","key":"2020110613102046300_ocaa123-B17","doi-asserted-by":"crossref","first-page":"351","DOI":"10.1136\/jamia.2001.0080351","article-title":"A UMLS-based knowledge acquisition tool for rule-based clinical decision support system development","volume":"8","author":"Achour","year":"2001","journal-title":"J Am Med Inform Assoc"},{"key":"2020110613102046300_ocaa123-B18","author":"Lee","year":"2019"},{"key":"2020110613102046300_ocaa123-B19","year":"2020"},{"issue":"1","key":"2020110613102046300_ocaa123-B20","doi-asserted-by":"crossref","first-page":"79","DOI":"10.3233\/SW-190378","article-title":"Ontologies as nested facet systems for human\u2013data interaction","volume":"11","author":"Zhang","year":"2020","journal-title":"Seman Web"},{"issue":"1","key":"2020110613102046300_ocaa123-B21","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1136\/jamia.1998.0050041","article-title":"Auditing the Unified Medical Language system with semantic methods","volume":"5","author":"Cimino","year":"1998","journal-title":"J Am Med Inform Assoc"},{"key":"2020110613102046300_ocaa123-B22","first-page":"57","author":"Bodenreider","year":"2001"},{"issue":"3","key":"2020110613102046300_ocaa123-B23","doi-asserted-by":"crossref","first-page":"452","DOI":"10.1016\/j.jbi.2008.08.006","article-title":"Structural group-based auditing of missing hierarchical relationships in UMLS","volume":"42","author":"Chen","year":"2009","journal-title":"J Biomed Inform"},{"issue":"1","key":"2020110613102046300_ocaa123-B24","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1016\/j.artmed.2015.03.002","article-title":"A comparative analysis of the density of the SNOMED CT conceptual content for semantic harmonization","volume":"64","author":"He","year":"2015","journal-title":"Artif Intell Med"},{"key":"2020110613102046300_ocaa123-B25","first-page":"618","article-title":"Topological-pattern-based recommendation of UMLS concepts for National Cancer Institute Thesaurus","volume":"2016","author":"He","year":"2016","journal-title":"AMIA Annu Symp Proc"},{"key":"2020110613102046300_ocaa123-B26","first-page":"456","article-title":"COHeRE: Cross-ontology hierarchical relation examination for ontology quality assurance","volume":"2015","author":"Cui","year":"2015","journal-title":"AMIA Annu Symp Proc"},{"key":"2020110613102046300_ocaa123-B27","first-page":"1982","author":"Abeysinghe","year":"2018"},{"key":"2020110613102046300_ocaa123-B28","author":"SpaCy: Industrial-Strength Natural Language Processing","year":"2020"},{"key":"2020110613102046300_ocaa123-B29","year":"2020"},{"key":"2020110613102046300_ocaa123-B30","first-page":"11","author":"Hagberg","year":"2008"},{"key":"2020110613102046300_ocaa123-B31","author":"Liu","year":"2018"},{"key":"2020110613102046300_ocaa123-B32","first-page":"378","article-title":"Validating auto-suggested changes for SNOMED CT in non-lattice subgraphs using relational machine learning","volume":"264","author":"Sun","year":"2019","journal-title":"Stud Health Technol Inform"},{"key":"2020110613102046300_ocaa123-B33","first-page":"364","author":"Abeysinghe","year":"2017"},{"issue":"4","key":"2020110613102046300_ocaa123-B34","doi-asserted-by":"crossref","first-page":"788","DOI":"10.1093\/jamia\/ocw175","article-title":"Mining non-lattice subgraphs for detecting missing hierarchical relations and concepts in SNOMED CT","volume":"24","author":"Cui","year":"2017","journal-title":"J Am Med Inform Assoc"},{"key":"2020110613102046300_ocaa123-B35","author":"Bodenreider"},{"key":"2020110613102046300_ocaa123-B36","first-page":"1242","author":"Abeysinghe","year":"2017"},{"key":"2020110613102046300_ocaa123-B37","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1016\/j.jbi.2017.12.010","article-title":"Auditing SNOMED CT hierarchical relations based on lexical features of concepts in non-lattice subgraphs","volume":"78","author":"Cui","year":"2018","journal-title":"J Biomed Inform"},{"key":"2020110613102046300_ocaa123-B38","first-page":"982","article-title":"Leveraging non-lattice subgraphs to audit hierarchical relations in NCI Thesaurus","volume":"2019","author":"Abeysinghe","year":"2019","journal-title":"AMIA Annu Symp Proc"},{"key":"2020110613102046300_ocaa123-B39","author":"LuiNorm"}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/27\/10\/1568\/34153215\/ocaa123.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/27\/10\/1568\/34153215\/ocaa123.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,11,6]],"date-time":"2020-11-06T19:33:05Z","timestamp":1604691185000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/27\/10\/1568\/5904423"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,1]]},"references-count":39,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2020,10,12]]},"published-print":{"date-parts":[[2020,10,1]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocaa123","relation":{},"ISSN":["1067-5027","1527-974X"],"issn-type":[{"value":"1067-5027","type":"print"},{"value":"1527-974X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,10]]},"published":{"date-parts":[[2020,10,1]]}}}