{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,8]],"date-time":"2026-03-08T05:14:51Z","timestamp":1772946891906,"version":"3.50.1"},"reference-count":36,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,5,1]],"date-time":"2024-05-01T00:00:00Z","timestamp":1714521600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,5,1]],"date-time":"2024-05-01T00:00:00Z","timestamp":1714521600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["R01NS116287"],"award-info":[{"award-number":["R01NS116287"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["R01LM013335"],"award-info":[{"award-number":["R01LM013335"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["R01NS116287"],"award-info":[{"award-number":["R01NS116287"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["R01LM013335"],"award-info":[{"award-number":["R01LM013335"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"name":"National Science Foundation","award":["2047001"],"award-info":[{"award-number":["2047001"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Biomed Semant"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Biomedical terminologies play a vital role in managing biomedical data. Missing IS-A relations in a biomedical terminology could be detrimental to its downstream usages. In this paper, we investigate an approach combining logical definitions and lexical features to discover missing IS-A relations in two biomedical terminologies: SNOMED CT and the National Cancer Institute (NCI) thesaurus. The method is applied to unrelated concept-pairs within non-lattice subgraphs: graph fragments within a terminology likely to contain various inconsistencies. Our approach first compares whether the logical definition of a concept is more general than \u00a0that of\u00a0the other concept. Then, we check whether the lexical features of the concept are contained in those of\u00a0the other concept. If both constraints are satisfied, we suggest a potentially missing IS-A relation between the two concepts. The method identified 982 potential missing IS-A relations for SNOMED CT and 100 for NCI thesaurus. In order to assess the efficacy of our approach, a random sample of results belonging to the \u201cClinical Findings\u201d and \u201cProcedure\u201d subhierarchies of SNOMED CT and results belonging to the \u201cDrug, Food, Chemical or Biomedical Material\u201d subhierarchy of the NCI thesaurus were evaluated by domain experts. The evaluation results revealed that 118 out of 150 suggestions are valid for SNOMED CT and 17 out of 20 are valid for NCI thesaurus.<\/jats:p>","DOI":"10.1186\/s13326-024-00309-y","type":"journal-article","created":{"date-parts":[[2024,5,1]],"date-time":"2024-05-01T15:09:55Z","timestamp":1714576195000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Leveraging logical definitions and lexical features to detect missing IS-A relations in biomedical terminologies"],"prefix":"10.1186","volume":"15","author":[{"given":"Rashmie","family":"Abeysinghe","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fengbo","family":"Zheng","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jay","family":"Shi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Samden D.","family":"Lhatoo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Licong","family":"Cui","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,5,1]]},"reference":[{"key":"309_CR1","unstructured":"National Library of Medicine. SNOMED CT. 2023. https:\/\/www.nlm.nih.gov\/healthit\/snomedct\/index.html. Accessed 14 July 2023."},{"issue":"8","key":"309_CR2","doi-asserted-by":"publisher","first-page":"648","DOI":"10.1002\/cfg.445","volume":"5","author":"G Fragoso","year":"2004","unstructured":"Fragoso G, de Coronado S, Haber M, Hartel F, Wright L. Overview and utilization of the NCI thesaurus. Comp Funct Genom. 2004;5(8):648\u201354.","journal-title":"Comp Funct Genom."},{"key":"309_CR3","unstructured":"Kazakov Y, Kr\u00f6tzsch M, Simancik F. ELK Reasoner: Architecture and Evaluation. In: Proceedings of the OWL Reasoner Evaluation Workshop (ORE 2012).\u00a0Aachen:\u00a0CEUR-WS.org; 2012."},{"key":"309_CR4","unstructured":"Lawley MJ, Bousquet C. Fast classification in Prot\u00e9g\u00e9: Snorocket as an OWL 2 EL reasoner. In: Proc. 6th Australasian Ontology Workshop (IAOA\u201910). Conferences in Research and Practice in Information Technology. vol. 122.\u00a0Sydney:\u00a0Australian Computer Society Inc.; 2010. p. 45\u20139."},{"key":"309_CR5","unstructured":"SNOMED International. SNOMED CT Logical Model. https:\/\/confluence.ihtsdotools.org\/display\/DOCSTART\/5.+SNOMED+CT+Logical+Model. Accessed 12 July 2023."},{"key":"309_CR6","unstructured":"National Cancer Institute. NCI Thesaurus Semantics. https:\/\/evs.nci.nih.gov\/ftp1\/ThesaurusSemantics\/NCI%20Thesaurus%20Semantics.pdf. Accessed 12 July 2023."},{"key":"309_CR7","unstructured":"SNOMED International. High Level Logical Model of SNOMED CT. https:\/\/confluence.ihtsdotools.org\/display\/DOCRELFMT\/2.1+High+Level+Logical+Model+of+SNOMED+CT. Accessed 14 Dec 2023."},{"key":"309_CR8","unstructured":"SNOMED International. attribute group. https:\/\/confluence.ihtsdotools.org\/display\/DOCGLOSS\/attribute+group. Accessed 12 July 2023."},{"key":"309_CR9","unstructured":"SNOMED International. SNOMED CT Starter Guide. https:\/\/confluence.ihtsdotools.org\/display\/DOCSTART\/SNOMED+CT+Starter+Guide. Accessed 1 June 2021."},{"key":"309_CR10","unstructured":"SNOMED International. SNOMED International Release Management Home. https:\/\/confluence.ihtsdotools.org\/display\/RMT\/. Accessed 1 June 2021."},{"issue":"3","key":"309_CR11","doi-asserted-by":"publisher","first-page":"413","DOI":"10.1016\/j.jbi.2009.03.003","volume":"42","author":"X Zhu","year":"2009","unstructured":"Zhu X, Fan JW, Baorto DM, Weng C, Cimino JJ. A review of auditing methods applied to the content of controlled biomedical terminologies. J Biomed Inform. 2009;42(3):413\u201325.","journal-title":"J Biomed Inform."},{"key":"309_CR12","unstructured":"Bodenreider O. Identifying missing hierarchical relations in SNOMED CT from logical definitions based on the lexical features of concept names. In: CEUR workshop proceedings. vol. 1747.\u00a0Aachen: NIH Public Access; 2016."},{"issue":"3","key":"309_CR13","doi-asserted-by":"publisher","first-page":"628","DOI":"10.1136\/amiajnl-2014-003173","volume":"22","author":"C Ochs","year":"2015","unstructured":"Ochs C, Geller J, Perl Y, Chen Y, Agrawal A, Case JT, et al. A tribal abstraction network for SNOMED CT target hierarchies without attribute relationships. J Am Med Inform Assoc. 2015;22(3):628\u201339.","journal-title":"J Am Med Inform Assoc."},{"key":"309_CR14","unstructured":"Wei D, Bodenreider O. Using the abstraction network in complement to description logics for quality assurance in biomedical terminologies-a case study in SNOMED CT. Stud Health Technol Inform. 2010;160(0 2):1070."},{"issue":"3","key":"309_CR15","doi-asserted-by":"publisher","first-page":"507","DOI":"10.1136\/amiajnl-2014-003151","volume":"22","author":"C Ochs","year":"2015","unstructured":"Ochs C, Geller J, Perl Y, Chen Y, Xu J, Min H, et al. Scalable quality assurance for large SNOMED CT hierarchies using subject-based subtaxonomies. J Am Med Inform Assoc. 2015;22(3):507\u201318.","journal-title":"J Am Med Inform Assoc."},{"key":"309_CR16","unstructured":"Ochs C, Perl Y, Geller J, Halper M, Gu H, Chen Y, et\u00a0al. Scalability of abstraction-network-based quality assurance to large SNOMED hierarchies. In: AMIA Annual Symposium Proceedings. vol. 2013.\u00a0Washington, DC: American Medical Informatics Association; 2013. p. 1071."},{"key":"309_CR17","doi-asserted-by":"crossref","unstructured":"Agrawal A, Perl Y, Ochs C, Elhanan G. Algorithmic detection of inconsistent modeling among SNOMED CT concepts by combining lexical and structural indicators. In: 2015 IEEE international conference on bioinformatics and biomedicine (BIBM).\u00a0New York: IEEE; 2015. p. 476\u201383.","DOI":"10.1109\/BIBM.2015.7359731"},{"key":"309_CR18","doi-asserted-by":"publisher","first-page":"111","DOI":"10.1016\/j.ymeth.2020.05.019","volume":"179","author":"A Agrawal","year":"2020","unstructured":"Agrawal A, Qazi K. Detecting modeling inconsistencies in SNOMED CT using a machine learning technique. Methods. 2020;179:111\u20138.","journal-title":"Methods."},{"key":"309_CR19","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1186\/s12911-018-0673-z","volume":"18","author":"A Agrawal","year":"2018","unstructured":"Agrawal A. Evaluating lexical similarity and modeling discrepancies in the procedure hierarchy of SNOMED CT. BMC Med Inform Decis Making. 2018;18:27\u201333.","journal-title":"BMC Med Inform Decis Making."},{"key":"309_CR20","unstructured":"Liu H, Zheng L, Perl Y, Geller J, Elhanan G. Can a convolutional neural network support auditing of NCI thesaurus neoplasm concepts? In: 2018 International Conference on Biomedical Ontology (ICBO-2018).\u00a0Aachen:\u00a0CEUR-WS.org; 2018."},{"key":"309_CR21","doi-asserted-by":"publisher","first-page":"103607","DOI":"10.1016\/j.jbi.2020.103607","volume":"112","author":"H Liu","year":"2020","unstructured":"Liu H, Perl Y, Geller J. Concept placement using BERT trained by transforming and summarizing biomedical ontology structure. J Biomed Inf. 2020;112:103607.","journal-title":"J Biomed Inf."},{"issue":"3","key":"309_CR22","doi-asserted-by":"publisher","first-page":"475","DOI":"10.1093\/jamia\/ocac248","volume":"30","author":"R Abeysinghe","year":"2023","unstructured":"Abeysinghe R, Zheng F, Bernstam EV, Shi J, Bodenreider O, Cui L. A deep learning approach to identify missing is-a relations in SNOMED CT. J Am Med Inform Assoc. 2023;30(3):475\u201384.","journal-title":"J Am Med Inform Assoc."},{"key":"309_CR23","doi-asserted-by":"publisher","first-page":"177","DOI":"10.1016\/j.jbi.2017.12.010","volume":"78","author":"L Cui","year":"2018","unstructured":"Cui L, Bodenreider O, Shi J, Zhang GQ. Auditing SNOMED CT hierarchical relations based on lexical features of concepts in non-lattice subgraphs. J Biomed Inform. 2018;78:177\u201384.","journal-title":"J Biomed Inform."},{"key":"309_CR24","unstructured":"Abeysinghe R, Brooks MA, Cui L. Leveraging non-lattice subgraphs to audit hierarchical relations in NCI Thesaurus. In: AMIA annual symposium proceedings. vol. 2019.\u00a0Washington, DC: American Medical Informatics Association; 2019. p. 982."},{"key":"309_CR25","unstructured":"Zheng F, Shi J, Cui L. A lexical-based approach for exhaustive detection of missing hierarchical IS-A relations in SNOMED CT. In: AMIA Annual Symposium Proceedings. vol. 2020.\u00a0Washington, DC: American Medical Informatics Association; 2020. p. 1392."},{"issue":"1","key":"309_CR26","doi-asserted-by":"publisher","first-page":"22","DOI":"10.1186\/s13326-022-00276-2","volume":"13","author":"W Manuel","year":"2022","unstructured":"Manuel W, Abeysinghe R, He Y, Tao C, Cui L. Identification of missing hierarchical relations in the Vaccine Ontology using acquired term pairs. J Biomed Semant. 2022;13(1):22.","journal-title":"J Biomed Semant."},{"key":"309_CR27","doi-asserted-by":"crossref","unstructured":"Hao X, Abeysinghe R, Shi J, Cui L. A substring replacement approach for identifying missing IS-A relations in SNOMED CT. In: 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).\u00a0New York: IEEE; 2022. p. 2611\u20138.","DOI":"10.1109\/BIBM55620.2022.9995595"},{"key":"309_CR28","doi-asserted-by":"crossref","unstructured":"Mohtashamian M, Abeysinghe R, Hao X, Cui L. Identifying Missing IS-A Relations in Orphanet Rare Disease Ontology. In: 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).\u00a0New York: IEEE; 2022. p. 3274\u20139.","DOI":"10.1109\/BIBM55620.2022.9995614"},{"issue":"3","key":"309_CR29","doi-asserted-by":"publisher","first-page":"bbac122","DOI":"10.1093\/bib\/bbac122","volume":"23","author":"R Abeysinghe","year":"2022","unstructured":"Abeysinghe R, Yang Y, Bartels M, Zheng WJ, Cui L. An evidence-based lexical pattern approach for quality assurance of Gene Ontology relations. Brief Bioinforma. 2022;23(3):bbac122.","journal-title":"Brief Bioinforma."},{"issue":"4","key":"309_CR30","doi-asserted-by":"publisher","first-page":"788","DOI":"10.1093\/jamia\/ocw175","volume":"24","author":"L Cui","year":"2017","unstructured":"Cui L, Zhu W, Tao S, Case JT, Bodenreider O, Zhang GQ. Mining non-lattice subgraphs for detecting missing hierarchical relations and concepts in SNOMED CT. J Am Med Inform Assoc. 2017;24(4):788\u201398.","journal-title":"J Am Med Inform Assoc."},{"key":"309_CR31","unstructured":"Abeysinghe R, Brooks MA, Talbert J, Licong C. Quality assurance of NCI Thesaurus by mining structural-lexical patterns. In: AMIA Annual Symposium Proceedings. vol. 2017. American Medical Informatics Association; 2017. p. 364."},{"issue":"10","key":"309_CR32","first-page":"1","volume":"20","author":"F Zheng","year":"2020","unstructured":"Zheng F, Abeysinghe R, Sioutos N, Whiteman L, Remennik L, Cui L. Detecting missing IS-A relations in the NCI Thesaurus using an enhanced hybrid approach. BMC Med Inform Decis Making. 2020;20(10):1\u201311.","journal-title":"BMC Med Inform Decis Making."},{"key":"309_CR33","doi-asserted-by":"publisher","first-page":"106","DOI":"10.1016\/j.jbi.2018.03.004","volume":"80","author":"GQ Zhang","year":"2018","unstructured":"Zhang GQ, Xing G, Cui L. An efficient, large-scale, non-lattice-detection algorithm for exhaustive structural auditing of biomedical ontologies. J Biomed Inform. 2018;80:106\u201319.","journal-title":"J Biomed Inform."},{"key":"309_CR34","unstructured":"Explosion. NCI Thesaurus Semantics. https:\/\/evs.nci.nih.gov\/ftp1\/ThesaurusSemantics\/NCI%20Thesaurus%20Semantics.pdf. Accessed 12 July 2023."},{"key":"309_CR35","first-page":"384","volume":"228","author":"M Quesada-Mart\u00ednez","year":"2016","unstructured":"Quesada-Mart\u00ednez M, Fern\u00e1ndez-Breis JT, Karlsson D. Suggesting missing relations in biomedical ontologies based on lexical regularities. Stud Health Technol Inform. 2016;228:384\u20138.","journal-title":"Stud Health Technol Inform"},{"issue":"5","key":"309_CR36","doi-asserted-by":"publisher","first-page":"2569","DOI":"10.1007\/s11280-023-01169-9","volume":"26","author":"J Chen","year":"2023","unstructured":"Chen J, He Y, Geng Y, Jim\u00e9nez-Ruiz E, Dong H, Horrocks I. Contextual semantic embeddings for ontology subsumption prediction. World Wide Web. 2023;26(5):2569\u201391.","journal-title":"World Wide Web."}],"container-title":["Journal of Biomedical Semantics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13326-024-00309-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13326-024-00309-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13326-024-00309-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,1]],"date-time":"2024-05-01T15:12:26Z","timestamp":1714576346000},"score":1,"resource":{"primary":{"URL":"https:\/\/jbiomedsem.biomedcentral.com\/articles\/10.1186\/s13326-024-00309-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,5,1]]},"references-count":36,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["309"],"URL":"https:\/\/doi.org\/10.1186\/s13326-024-00309-y","relation":{},"ISSN":["2041-1480"],"issn-type":[{"value":"2041-1480","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,5,1]]},"assertion":[{"value":"17 August 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 April 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 May 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"6"}}