{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,13]],"date-time":"2026-06-13T04:26:26Z","timestamp":1781324786850,"version":"3.54.1"},"reference-count":53,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T00:00:00Z","timestamp":1750118400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"SATURN project"},{"name":"Federal Ministry of Health in Germany","award":["2520DAT02B"],"award-info":[{"award-number":["2520DAT02B"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2026,1,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Objective<\/jats:title>\n                    <jats:p>This study aims to enhance the diagnostic process for rare diseases using case-based reasoning (CBR). CBR compares new cases with historical data, utilizing both structured and unstructured clinical data.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Materials and Methods<\/jats:title>\n                    <jats:p>The study uses a dataset of 4295 patient cases from the University Hospital Frankfurt. Data were standardized using the OMOP Common Data Model. Three methods\u2014TF, TF-IDF, and TF-IDF with semantic vector embeddings\u2014were employed to represent patient records. Similarity search effectiveness was evaluated using cross-validation to assess diagnostic precision. High-weighted concepts were rated by medical experts for relevance. Additionally, the impact of different levels of ICD-10 code granularity on prediction outcomes was analyzed.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>The TF-IDF method showed a high degree of precision, with an average positive predictive value of 91% in the 10 most similar cases. The differences between the methods were not statistically significant. The expert evaluation rated the medical relevance of high-weighted concepts as moderate. The granularity of ICD-10 coding significantly influences the precision of predictions, with more granular codes showing decreased precision.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Discussion<\/jats:title>\n                    <jats:p>The methods effectively handle data from multiple medical specialties, suggesting broad applicability. The use of broader ICD-10 codes with high precision in prediction could improve initial diagnostic guidance. The use of Explainable AI could enhance diagnostic transparency, leading to better patient outcomes. Limitations include standardization issues and the need for more comprehensive lab value integration.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusion<\/jats:title>\n                    <jats:p>While CBR shows promise for rare disease diagnostics, its utility depends on the specific needs of the decision support system and its intended clinical application.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/jamia\/ocaf092","type":"journal-article","created":{"date-parts":[[2025,5,28]],"date-time":"2025-05-28T07:47:55Z","timestamp":1748418475000},"page":"98-111","source":"Crossref","is-referenced-by-count":4,"title":["Enhancing diagnostic precision for rare diseases using case-based reasoning"],"prefix":"10.1093","volume":"33","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5240-8484","authenticated-orcid":false,"given":"Richard","family":"Noll","sequence":"first","affiliation":[{"name":"Institute of Medical Informatics, Goethe University Frankfurt, University Medicine , 60590 Frankfurt am Main,","place":["Germany"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Alexandra","family":"Berger","sequence":"additional","affiliation":[{"name":"Frankfurt Reference Centre for Rare Diseases, University Medicine, Goethe University Frankfurt , 60596 Frankfurt am Main,","place":["Germany"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Carlo","family":"Facchinello","sequence":"additional","affiliation":[{"name":"Academic Teaching Practice of the Heinrich-Heine-University D\u00fcsseldorf , 40885 Ratingen,","place":["Germany"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Katharina","family":"Stratmann","sequence":"additional","affiliation":[{"name":"Medical Clinic 1, Goethe University Frankfurt, University Medicine , 60596 Frankfurt am Main,","place":["Germany"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jannik","family":"Schaaf","sequence":"additional","affiliation":[{"name":"Institute of Medical Informatics, Goethe University Frankfurt, University Medicine , 60590 Frankfurt am Main,","place":["Germany"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Holger","family":"Storf","sequence":"additional","affiliation":[{"name":"Institute of Medical Informatics, Goethe University Frankfurt, University Medicine , 60590 Frankfurt am Main,","place":["Germany"]}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2025,6,17]]},"reference":[{"key":"2026010211513881800_ocaf092-B1","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1038\/d41573-019-00180-y","article-title":"How many rare diseases are there?","volume":"19","author":"Haendel","year":"2020","journal-title":"Nat Rev Drug Discov"},{"key":"2026010211513881800_ocaf092-B2","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1186\/s13073-022-01026-w","article-title":"A guide for the diagnosis of rare and undiagnosed disease: beyond the exome","volume":"14","author":"Marwaha","year":"2022","journal-title":"Genome Med"},{"key":"2026010211513881800_ocaf092-B3","doi-asserted-by":"publisher","first-page":"4732","DOI":"10.3390\/ijerph20064732","article-title":"Enhancing equitable access to rare disease diagnosis and treatment around the world: a review of evidence, policies, and challenges","volume":"20","author":"Adachi","year":"2023","journal-title":"Int J Environ Res Public Health"},{"key":"2026010211513881800_ocaf092-B4","first-page":"340","article-title":"Learning to identify rare disease patients from electronic health records","volume":"2018","author":"Colbaugh","year":"2018","journal-title":"AMIA Annu Symp Proc"},{"key":"2026010211513881800_ocaf092-B5","doi-asserted-by":"publisher","first-page":"e0238277","DOI":"10.1371\/journal.pone.0235574","article-title":"Detecting rare diseases in electronic health records using machine learning and knowledge engineering: case study of acute hepatic porphyria","volume":"15","author":"Cohen","year":"2020","journal-title":"PLoS One"},{"key":"2026010211513881800_ocaf092-B6","doi-asserted-by":"publisher","first-page":"104073","DOI":"10.1016\/j.ijmedinf.2019.104073","article-title":"A patient-similarity-based model for diagnostic prediction","volume":"135","author":"Jia","year":"2020","journal-title":"Int J Med Inform"},{"key":"2026010211513881800_ocaf092-B7","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1016\/j.jbi.2017.07.016","article-title":"Finding patients using similarity measures in a rare diseases-oriented clinical data warehouse: Dr Warehouse and the needle in the needle stack","volume":"73","author":"Garcelon","year":"2017","journal-title":"J Biomed Inform"},{"key":"2026010211513881800_ocaf092-B8","doi-asserted-by":"publisher","first-page":"395","DOI":"10.1007\/978-3-031-14923-8_26","author":"Noll","year":"2022"},{"key":"2026010211513881800_ocaf092-B9","doi-asserted-by":"publisher","first-page":"181","DOI":"10.1136\/jamia.2010.007237","article-title":"Data from clinical notes: a perspective on the tension between structure and flexible documentation","volume":"18","author":"Rosenbloom","year":"2011","journal-title":"J Am Med Inform Assoc"},{"key":"2026010211513881800_ocaf092-B10","doi-asserted-by":"publisher","first-page":"400","DOI":"10.1197\/jamia.M2882","article-title":"LOINC codes for hospital information systems documents: a case study","volume":"16","author":"Dugas","year":"2009","journal-title":"J Am Med Inform Assoc"},{"key":"2026010211513881800_ocaf092-B11","doi-asserted-by":"publisher","first-page":"e569","DOI":"10.1016\/S2589-7500(20)30244-2","article-title":"Use of LOINC for interoperability between organisations poses a risk to safety","volume":"2","author":"Carter","year":"2020","journal-title":"Lancet Digit Health"},{"key":"2026010211513881800_ocaf092-B12","doi-asserted-by":"publisher","first-page":"229","DOI":"10.5858\/arpa.2018-0477-RA","article-title":"Logical Observation Identifiers Names and Codes for laboratorians","volume":"144","author":"Stram","year":"2020","journal-title":"Arch Pathol Lab Med"},{"key":"2026010211513881800_ocaf092-B13","doi-asserted-by":"publisher","first-page":"853","DOI":"10.3233\/sw-160238","article-title":"A systematic analysis of term reuse and term overlap across biomedical ontologies","volume":"8","author":"Kamdar","year":"2017","journal-title":"Semant Web"},{"key":"2026010211513881800_ocaf092-B14","doi-asserted-by":"publisher","author":"Yuan","DOI":"10.48550\/arXiv.2011.02947"},{"key":"2026010211513881800_ocaf092-B15","doi-asserted-by":"publisher","first-page":"1538","DOI":"10.1093\/jamia\/ocaa136","article-title":"Use of word and graph embedding to measure semantic relatedness between Unified Medical Language System concepts","volume":"27","author":"Mao","year":"2020","journal-title":"J Am Med Inform Assoc"},{"key":"2026010211513881800_ocaf092-B16","doi-asserted-by":"publisher","first-page":"12","DOI":"10.1016\/j.jbi.2018.09.008","article-title":"A comparison of word embeddings for the biomedical natural language processing","volume":"87","author":"Wang","year":"2018","journal-title":"J Biomed Inform"},{"key":"2026010211513881800_ocaf092-B17","doi-asserted-by":"publisher","first-page":"O10","DOI":"10.1186\/1750-1172-9-S1-O10","article-title":"German approach of coding rare diseases with ICD-10-GM and Orpha numbers in routine settings","volume":"9","author":"Weber","year":"2014","journal-title":"Orphanet J Rare Dis"},{"key":"2026010211513881800_ocaf092-B18","doi-asserted-by":"publisher","first-page":"2152","DOI":"10.1164\/ajrccm.162.6.2003112","article-title":"Gender-related differences in severe, early-onset chronic obstructive pulmonary disease","volume":"162","author":"Silverman","year":"2000","journal-title":"Am J Respir Crit Care Med"},{"key":"2026010211513881800_ocaf092-B19","doi-asserted-by":"publisher","first-page":"97","DOI":"10.1016\/j.phrs.2017.03.014","article-title":"Gender differences in liver disease and the drug-dose gender gap","volume":"120","author":"Buzzetti","year":"2017","journal-title":"Pharmacol Res"},{"key":"2026010211513881800_ocaf092-B20","doi-asserted-by":"publisher","first-page":"298","DOI":"10.1186\/s13023-024-03312-9","article-title":"How to customize common data models for rare diseases: an OMOP-based implementation and lessons learned","volume":"19","author":"Ahmadi","year":"2024","journal-title":"Orphanet J Rare Dis"},{"key":"2026010211513881800_ocaf092-B21","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-91563-6","volume-title":"Health Informatics on FHIR: How HL7\u2019s API is Transforming Healthcare","author":"Braunstein","year":"2022","edition":"2nd ed"},{"key":"2026010211513881800_ocaf092-B22","doi-asserted-by":"publisher","first-page":"e50","DOI":"10.3414\/ME18-03-0003","article-title":"German Medical Informatics initiative","volume":"57","author":"Semler","year":"2018","journal-title":"Methods Inf Med"},{"key":"2026010211513881800_ocaf092-B23","doi-asserted-by":"publisher","first-page":"95","DOI":"10.3233\/SHTI210546","article-title":"The usage of OHDSI OMOP\u2014a scoping review","volume":"283","author":"Reinecke","year":"2021","journal-title":"Stud Health Technol Inform"},{"key":"2026010211513881800_ocaf092-B24","doi-asserted-by":"publisher","first-page":"138","DOI":"10.3233\/SHTI210136","article-title":"Adaption of the OMOP CDM for rare diseases","volume":"281","author":"Zoch","year":"2021","journal-title":"Stud Health Technol Inform"},{"key":"2026010211513881800_ocaf092-B25","doi-asserted-by":"publisher","first-page":"160018","DOI":"10.1038\/sdata.2016.18","article-title":"The FAIR guiding principles for scientific data management and stewardship","volume":"3","author":"Wilkinson","year":"2016","journal-title":"Sci Data"},{"key":"2026010211513881800_ocaf092-B26","doi-asserted-by":"publisher","first-page":"264","DOI":"10.1007\/s00347-020-01177-4","article-title":"Merkmalsextraktion aus klinischen Routinedaten mittels Text-Mining","volume":"118","author":"Grundel","year":"2021","journal-title":"Ophthalmologe"},{"key":"2026010211513881800_ocaf092-B27","doi-asserted-by":"publisher","first-page":"D1333","DOI":"10.1093\/nar\/gkad1005","article-title":"The Human Phenotype Ontology in 2024: phenotypes around the world","volume":"52","author":"Gargano","year":"2024","journal-title":"Nucleic Acids Res"},{"key":"2026010211513881800_ocaf092-B28","doi-asserted-by":"publisher","first-page":"120","DOI":"10.1016\/j.nbt.2023.08.004","article-title":"Machine translation of standardised medical terminology using natural language processing: a scoping review","volume":"77","author":"Noll","year":"2023","journal-title":"N Biotechnol"},{"key":"2026010211513881800_ocaf092-B29","doi-asserted-by":"publisher","first-page":"89","DOI":"10.3233\/SHTI230933","article-title":"Translation of ontological concepts from English into German using commercial translation software and expert evaluation","volume":"310","author":"Noll","year":"2024","journal-title":"Stud Health Technol Inform"},{"key":"2026010211513881800_ocaf092-B30","first-page":"279","article-title":"SNOMED-CT: the advanced terminology and coding system for eHealth","volume":"121","author":"Donnelly","year":"2006","journal-title":"Stud Health Technol Inform"},{"key":"2026010211513881800_ocaf092-B31","volume-title":"Automatic Information Organization and Retrieval","author":"Salton","year":"1968"},{"key":"2026010211513881800_ocaf092-B32","author":"scikit-learn"},{"key":"2026010211513881800_ocaf092-B33","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1108\/eb026526","article-title":"A statistical interpretation of term specificity and its application in retrieval","volume":"28","author":"Sparck Jones","year":"1972","journal-title":"J Doc"},{"key":"2026010211513881800_ocaf092-B34","doi-asserted-by":"publisher","first-page":"103867","DOI":"10.1016\/j.jbi.2021.103867","article-title":"Improved biomedical word embeddings in the transformer era","volume":"120","author":"Noh","year":"2021","journal-title":"J Biomed Inform"},{"key":"2026010211513881800_ocaf092-B35","doi-asserted-by":"publisher","first-page":"D267","DOI":"10.1093\/nar\/gkh061","article-title":"The Unified Medical Language System (UMLS): integrating biomedical terminology","volume":"32","author":"Bodenreider","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2026010211513881800_ocaf092-B36","doi-asserted-by":"publisher","first-page":"e0127428","DOI":"10.1371\/journal.pone.0127428","article-title":"Personalized mortality prediction driven by electronic medical data and a patient similarity metric","volume":"10","author":"Lee","year":"2015","journal-title":"PLoS One"},{"key":"2026010211513881800_ocaf092-B37","doi-asserted-by":"publisher","first-page":"249","DOI":"10.1136\/amiajnl-2011-000480","article-title":"A vector space model approach to identify genetically related diseases","volume":"19","author":"Sarkar","year":"2012","journal-title":"J Am Med Inform Assoc"},{"key":"2026010211513881800_ocaf092-B38","doi-asserted-by":"publisher","first-page":"561","DOI":"10.3389\/fphys.2016.00561","article-title":"Patient similarity: emerging concepts in systems and precision medicine","volume":"7","author":"Brown","year":"2016","journal-title":"Front Physiol"},{"key":"2026010211513881800_ocaf092-B39","doi-asserted-by":"publisher","first-page":"2839","DOI":"10.1016\/j.patcog.2015.03.009","article-title":"Performance evaluation of classification algorithms by k-fold and leave-one-out cross validation","volume":"48","author":"Wong","year":"2015","journal-title":"Pattern Recognit"},{"key":"2026010211513881800_ocaf092-B40","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1007\/978-3-030-05318-5_1","volume-title":"Automated Machine Learning: Methods, Systems, Challenges","author":"Feurer","year":"2019"},{"key":"2026010211513881800_ocaf092-B41","first-page":"2579","article-title":"Visualizing data using t-SNE","volume":"9","author":"van der Maaten","year":"2008","journal-title":"J Mach Learn Res"},{"key":"2026010211513881800_ocaf092-B42","author":"scikit-learn: machine learning in Python\u2013scikit-learn 1.4.2 documentation"},{"key":"2026010211513881800_ocaf092-B43","author":"GitHub"},{"key":"2026010211513881800_ocaf092-B44","doi-asserted-by":"publisher","first-page":"588","DOI":"10.1089\/wound.2013.0478","article-title":"ICD-9-CM to ICD-10-CM codes: what? Why? How?","volume":"2","author":"Cartwright","year":"2013","journal-title":"Adv Wound Care (New Rochelle)"},{"key":"2026010211513881800_ocaf092-B45","doi-asserted-by":"publisher","first-page":"473","DOI":"10.1016\/S2213-8587(18)30371-1","article-title":"Reference intervals in the diagnosis of thyroid dysfunction: treating patients not numbers","volume":"7","author":"Jonklaas","year":"2019","journal-title":"Lancet Diabetes Endocrinol"},{"key":"2026010211513881800_ocaf092-B46","doi-asserted-by":"publisher","first-page":"87","DOI":"10.1016\/j.jbi.2018.01.004","article-title":"Estimating summary statistics for electronic health record laboratory data for use in high-throughput phenotyping algorithms","volume":"78","author":"Albers","year":"2018","journal-title":"J Biomed Inform"},{"key":"2026010211513881800_ocaf092-B47","doi-asserted-by":"publisher","first-page":"223","DOI":"10.5582\/irdr.2021.01134","article-title":"Current diagnosis and management of rare pediatric diseases in China","volume":"10","author":"Yang","year":"2021","journal-title":"Intractable Rare Dis Res"},{"key":"2026010211513881800_ocaf092-B48","doi-asserted-by":"publisher","first-page":"58","DOI":"10.1186\/s12911-019-0766-3","article-title":"Time-sensitive clinical concept embeddings learned from large electronic health records","volume":"19","author":"Xiang","year":"2019","journal-title":"BMC Med Inform Decis Mak"},{"key":"2026010211513881800_ocaf092-B49","author":"SATURN Projekt"},{"key":"2026010211513881800_ocaf092-B50","doi-asserted-by":"publisher","first-page":"144","DOI":"10.1186\/s12911-023-02245-w","article-title":"Requirements analysis for an AI-based clinical decision support system for general practitioners: a user-centered design process","volume":"23","author":"Sch\u00fctze","year":"2023","journal-title":"BMC Med Inform Decis Mak"},{"key":"2026010211513881800_ocaf092-B51","doi-asserted-by":"publisher","first-page":"1051","DOI":"10.3233\/SHTI231125","article-title":"Initial user-centred design of an AI-based clinical decision support system for primary care","volume":"310","author":"Neff","year":"2024","journal-title":"Stud Health Technol Inform"},{"key":"2026010211513881800_ocaf092-B52","doi-asserted-by":"publisher","author":"Keane","year":"2021","DOI":"10.48550\/arXiv.2104.14461"},{"key":"2026010211513881800_ocaf092-B53","first-page":"543","article-title":"Adversarial learning of knowledge embeddings for the Unified Medical Language System","volume":"2019","author":"Maldonado","year":"2019","journal-title":"AMIA Jt Summits Transl Sci Proc"}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/33\/1\/98\/63508886\/ocaf092.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/33\/1\/98\/63508886\/ocaf092.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,2]],"date-time":"2026-01-02T16:51:47Z","timestamp":1767372707000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/33\/1\/98\/8165644"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,17]]},"references-count":53,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,6,17]]},"published-print":{"date-parts":[[2026,1,1]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocaf092","relation":{},"ISSN":["1067-5027","1527-974X"],"issn-type":[{"value":"1067-5027","type":"print"},{"value":"1527-974X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2026,1]]},"published":{"date-parts":[[2025,6,17]]}}}