{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T08:09:36Z","timestamp":1772179776349,"version":"3.50.1"},"reference-count":61,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2023,12,22]],"date-time":"2023-12-22T00:00:00Z","timestamp":1703203200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"PTC Therapeutics, South Plainfield","award":["SRA-21-164 PO#109719"],"award-info":[{"award-number":["SRA-21-164 PO#109719"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,2,16]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Objectives<\/jats:title>\n                  <jats:p>Electronic health record (EHR) data may facilitate the identification of rare diseases in patients, such as aromatic l-amino acid decarboxylase deficiency (AADCd), an autosomal recessive disease caused by pathogenic variants in the dopa decarboxylase gene. Deficiency of the AADC enzyme results in combined severe reductions in monoamine neurotransmitters: dopamine, serotonin, epinephrine, and norepinephrine. This leads to widespread neurological complications affecting motor, behavioral, and autonomic function. The goal of this study was to use EHR data to identify previously undiagnosed patients who may have AADCd without available training cases for the disease.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Materials and Methods<\/jats:title>\n                  <jats:p>A multiple symptom and related disease annotated dataset was created and used to train individual concept classifiers on annotated sentence data. A multistep algorithm was then used to combine concept predictions into a single patient rank value.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Using an 8000-patient dataset that the algorithms had not seen before ranking, the top and bottom 200 ranked patients were manually reviewed for clinical indications of performing an AADCd diagnostic screening test. The top-ranked patients were 22.5% positively assessed for diagnostic screening, with 0% for the bottom-ranked patients. This result is statistically significant at P &amp;lt; .0001.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Conclusion<\/jats:title>\n                  <jats:p>This work validates the approach that large-scale rare-disease screening can be accomplished by combining predictions for relevant individual symptoms and related conditions which are much more common and for which training data is easier to create.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/jamia\/ocad244","type":"journal-article","created":{"date-parts":[[2023,12,23]],"date-time":"2023-12-23T00:24:22Z","timestamp":1703291062000},"page":"692-704","source":"Crossref","is-referenced-by-count":2,"title":["Automatically pre-screening patients for the rare disease aromatic <scp>l<\/scp>-amino acid decarboxylase deficiency using knowledge engineering, natural language processing, and machine learning on a large EHR population"],"prefix":"10.1093","volume":"31","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4610-9912","authenticated-orcid":false,"given":"Aaron M","family":"Cohen","sequence":"first","affiliation":[{"name":"Department of Medical Informatics and Clinical Epidemiology, School of Medicine, Oregon Health & Science University , Portland, OR 97239, United States"}]},{"given":"Jolie","family":"Kaner","sequence":"additional","affiliation":[{"name":"Department of Medical Informatics and Clinical Epidemiology, School of Medicine, Oregon Health & Science University , Portland, OR 97239, United States"}]},{"given":"Ryan","family":"Miller","sequence":"additional","affiliation":[{"name":"PTC Therapeutics , South Plainfield, NJ 07080, United States"}]},{"given":"Jeffrey W","family":"Kopesky","sequence":"additional","affiliation":[{"name":"PTC Therapeutics , South Plainfield, NJ 07080, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4114-5148","authenticated-orcid":false,"given":"William","family":"Hersh","sequence":"additional","affiliation":[{"name":"Department of Medical Informatics and Clinical Epidemiology, School of Medicine, Oregon Health & Science University , Portland, OR 97239, United States"}]}],"member":"286","published-online":{"date-parts":[[2023,12,22]]},"reference":[{"issue":"5","key":"2024021710214604700_ocad244-B1","doi-asserted-by":"crossref","first-page":"1121","DOI":"10.1002\/jimd.12247","article-title":"AADC deficiency from infancy to adulthood: symptoms and developmental outcome in an international cohort of 63 patients","volume":"43","author":"Pearson","year":"2020","journal-title":"J Inherit Metab Dis"},{"issue":"4","key":"2024021710214604700_ocad244-B2","doi-asserted-by":"crossref","first-page":"107647","DOI":"10.1016\/j.ymgme.2023.107647","article-title":"Prevalence of DDC genotypes in patients with aromatic L-amino acid decarboxylase (AADC) deficiency and in silico prediction of structural protein changes","volume":"139","author":"Himmelreich","year":"2023","journal-title":"Mol Genet Metab"},{"key":"2024021710214604700_ocad244-B3","first-page":"2210555","article-title":"Clinical features in aromatic L-Amino acid decarboxylase (AADC) deficiency: a systematic review","volume":"2022","author":"Rizzi","year":"2022"},{"issue":"2","key":"2024021710214604700_ocad244-B4","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1055\/s-0040-1714690","article-title":"Clinical profile and outcome of Indian children with aromatic L-amino acid decarboxylase deficiency: a primary CSF neurotransmitter disorder mimicking as dyskinetic cerebral palsy","volume":"10","author":"Gowda","year":"2021","journal-title":"J Pediatr Genet"},{"key":"2024021710214604700_ocad244-B5","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1007\/8904_2014_327","article-title":"Widening phenotypic spectrum of AADC deficiency, a disorder of dopamine and serotonin synthesis","volume":"17","author":"Helman","year":"2014","journal-title":"JIMD Rep"},{"issue":"5","key":"2024021710214604700_ocad244-B6","doi-asserted-by":"crossref","first-page":"467","DOI":"10.1001\/jama.2020.26148","article-title":"Molecular diagnostic yield of exome sequencing in patients with cerebral palsy","volume":"325","author":"Moreno-De-Luca","year":"2021","journal-title":"JAMA"},{"issue":"3","key":"2024021710214604700_ocad244-B7","doi-asserted-by":"crossref","first-page":"427","DOI":"10.1016\/j.ejpn.2019.02.001","article-title":"The genetic etiology in cerebral palsy mimics: the results from a Greek tertiary care center","volume":"23","author":"Zouvelou","year":"2019","journal-title":"Eur J Paediatr Neurol"},{"key":"2024021710214604700_ocad244-B8","first-page":"625428","article-title":"Insights from genetic studies of cerebral palsy","volume":"11","author":"Lewis","year":"2020"},{"issue":"3","key":"2024021710214604700_ocad244-B9","doi-asserted-by":"crossref","first-page":"371","DOI":"10.1007\/s10545-009-1076-1","article-title":"Aromatic L-amino acid decarboxylase deficiency: clinical features, drug therapy and follow-up","volume":"32","author":"Manegold","year":"2009","journal-title":"J Inherit Metab Dis"},{"issue":"1","key":"2024021710214604700_ocad244-B10","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1186\/s13023-016-0522-z","article-title":"Consensus guideline for the diagnosis and treatment of aromatic L-amino acid decarboxylase (AADC) deficiency","volume":"12","author":"Wassenberg","year":"2017","journal-title":"Orphanet J Rare Dis"},{"issue":"2","key":"2024021710214604700_ocad244-B11","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1038\/d41573-019-00180-y","article-title":"How many rare diseases are there?","volume":"19","author":"Haendel","year":"2020","journal-title":"Nat Rev Drug Discov"},{"issue":"1","key":"2024021710214604700_ocad244-B12","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1186\/s13023-020-01424-6","article-title":"The use of machine learning in rare diseases: a scoping review","volume":"15","author":"Schaefer","year":"2020","journal-title":"Orphanet J Rare Dis"},{"issue":"3","key":"2024021710214604700_ocad244-B13","doi-asserted-by":"crossref","first-page":"887","DOI":"10.3390\/biomedicines11030887","article-title":"The impact of artificial intelligence in the odyssey of rare diseases","volume":"11","author":"Visibelli","year":"2023","journal-title":"Biomedicines"},{"issue":"5","key":"2024021710214604700_ocad244-B14","doi-asserted-by":"crossref","first-page":"849","DOI":"10.1007\/s10545-012-9544-4","article-title":"The incidence of inherited porphyrias in Europe","volume":"36","author":"Elder","year":"2013","journal-title":"J Inherit Metab Dis"},{"issue":"12","key":"2024021710214604700_ocad244-B15","doi-asserted-by":"crossref","first-page":"1233","DOI":"10.1016\/j.amjmed.2014.06.036","article-title":"Acute porphyrias in the USA: features of 108 subjects from porphyrias consortium","volume":"127","author":"Bonkovsky","year":"2014","journal-title":"Am J Med"},{"issue":"7","key":"2024021710214604700_ocad244-B16","doi-asserted-by":"crossref","first-page":"e0235574","DOI":"10.1371\/journal.pone.0235574","article-title":"Detecting rare diseases in electronic health records using machine learning and knowledge engineering: case study of acute hepatic porphyria","volume":"15","author":"Cohen","year":"2020","journal-title":"PLoS One"},{"issue":"2","key":"2024021710214604700_ocad244-B17","doi-asserted-by":"crossref","first-page":"ooac053","DOI":"10.1093\/jamiaopen\/ooac053","article-title":"Clinical study applying machine learning to detect a rare disease: results and lessons learned","volume":"5","author":"Hersh","year":"2022","journal-title":"JAMIA Open"},{"key":"2024021710214604700_ocad244-B18","author":"Garg","year":"2016"},{"issue":"1","key":"2024021710214604700_ocad244-B19","doi-asserted-by":"crossref","first-page":"305","DOI":"10.1186\/s13075-019-2092-7","article-title":"Rule-based and machine learning algorithms identify patients with systemic sclerosis accurately in the electronic health record","volume":"21","author":"Jamian","year":"2019","journal-title":"Arthritis Res Ther"},{"key":"2024021710214604700_ocad244-B20","author":"Colbaugh","year":"2020"},{"issue":"5","key":"2024021710214604700_ocad244-B21","doi-asserted-by":"crossref","first-page":"968","DOI":"10.1038\/s41436-020-01039-z","article-title":"Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes","volume":"23","author":"Hully","year":"2021","journal-title":"Genet Med"},{"issue":"1","key":"2024021710214604700_ocad244-B22","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1186\/s12875-022-01804-w","article-title":"Detection of primary Sj\u00f6gren\u2019s syndrome in primary care: developing a classification model with the use of routine healthcare data and machine learning","volume":"23","author":"Dros","year":"2022","journal-title":"BMC Prim Care"},{"issue":"1","key":"2024021710214604700_ocad244-B23","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1186\/s13023-021-01936-9","article-title":"Improving early diagnosis of rare diseases using natural language processing in unstructured medical records: an illustration from Dravet syndrome","volume":"16","author":"Lo Barco","year":"2021","journal-title":"Orphanet J Rare Dis"},{"key":"2024021710214604700_ocad244-B24","first-page":"844","article-title":"Enriching UMLS-based phenotyping of rare diseases using deep-learning: evaluation on Jeune syndrome","volume":"294","author":"Faviez","year":"2022","journal-title":"Stud Health Technol Inform"},{"issue":"5","key":"2024021710214604700_ocad244-B25","doi-asserted-by":"crossref","first-page":"849","DOI":"10.1002\/acr.24522","article-title":"Developing and validating methods to assemble systemic lupus erythematosus births in the electronic health record","volume":"74","author":"Barnado","year":"2022","journal-title":"Arthritis Care Res (Hoboken)"},{"key":"2024021710214604700_ocad244-B26","doi-asserted-by":"crossref","first-page":"786710","DOI":"10.3389\/fphar.2022.786710","article-title":"Patient-patient similarity-based screening of a clinical data warehouse to support ciliopathy diagnosis","volume":"13","author":"Chen","year":"2022","journal-title":"Front Pharmacol"},{"key":"2024021710214604700_ocad244-B27","doi-asserted-by":"crossref","first-page":"1108222","DOI":"10.3389\/fneur.2023.1108222","article-title":"An artificial intelligence-based approach for identifying rare disease patients using retrospective electronic health records applied for Pompe disease","volume":"14","author":"Lin","year":"2023","journal-title":"Front Neurol"},{"issue":"10","key":"2024021710214604700_ocad244-B28","doi-asserted-by":"crossref","first-page":"3599","DOI":"10.3390\/jcm12103599","article-title":"Supporting the diagnosis of Fabry disease using a natural language processing-based approach","volume":"12","author":"Michalski","year":"2023","journal-title":"J Clin Med"},{"key":"2024021710214604700_ocad244-B29","first-page":"1","article-title":"Natural history of aromatic L-amino acid decarboxylase deficiency in Taiwan","volume":"40","author":"Hwu","year":"2018","journal-title":"JIMD Rep"},{"issue":"7","key":"2024021710214604700_ocad244-B30","doi-asserted-by":"crossref","first-page":"1058","DOI":"10.1212\/WNL.62.7.1058","article-title":"Aromatic L-amino acid decarboxylase deficiency: clinical features, treatment, and prognosis","volume":"62","author":"Pons","year":"2004","journal-title":"Neurology"},{"key":"2024021710214604700_ocad244-B31","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1007\/978-3-662-44578-5_344","article-title":"Erratum to: widening phenotypic spectrum of AADC deficiency, a disorder of dopamine and serotonin synthesis","volume":"17","author":"Helman","year":"2014","journal-title":"JIMD Rep"},{"issue":"3","key":"2024021710214604700_ocad244-B32","doi-asserted-by":"crossref","first-page":"e1143","DOI":"10.1002\/mgg3.1143","article-title":"Aromatic L-amino acid decarboxylase deficiency in 17 mainland China patients: clinical phenotype, molecular spectrum, and therapy overview","volume":"8","author":"Dai","year":"2020","journal-title":"Mol Genet Genomic Med"},{"key":"2024021710214604700_ocad244-B33","volume":"283","author":"Reinecke"},{"issue":"10","key":"2024021710214604700_ocad244-B34","doi-asserted-by":"crossref","first-page":"1331","DOI":"10.1093\/jamia\/ocy093","article-title":"Web services for data warehouses: OMOP and PCORnet on i2b2","volume":"25","author":"Klann","year":"2018","journal-title":"J Am Med Inform Assoc"},{"issue":"2","key":"2024021710214604700_ocad244-B35","doi-asserted-by":"crossref","first-page":"e0212463","DOI":"10.1371\/journal.pone.0212463","article-title":"Data model harmonization for the all of us research program: transforming i2b2 data into the OMOP common data model","volume":"14","author":"Klann","year":"2019","journal-title":"PLoS One"},{"key":"2024021710214604700_ocad244-B36","author":"Stenetorp","year":"2012:102-107."},{"key":"2024021710214604700_ocad244-B37","author":"Alsentzer"},{"issue":"12","key":"2024021710214604700_ocad244-B38","article-title":"Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion","volume":"11","author":"Vincent","year":"2010","journal-title":"J Mach Learn Res"},{"key":"2024021710214604700_ocad244-B39","first-page":"100030","article-title":"Using a neural network-based feature extraction method to facilitate citation screening for systematic reviews","volume":"6","author":"Kontonatsios","year":"2020","journal-title":"Exp Syst Applicat: X"},{"key":"2024021710214604700_ocad244-B40","first-page":"In:","author":"Cozman"},{"issue":"11","key":"2024021710214604700_ocad244-B41","doi-asserted-by":"crossref","first-page":"876","DOI":"10.1111\/j.1469-8749.2008.03094.x","article-title":"Aromatic L-amino acid decarboxylase deficiency associated with epilepsy mimicking non-epileptic involuntary movements","volume":"50","author":"Ito","year":"2008","journal-title":"Dev Med Child Neurol"},{"issue":"4","key":"2024021710214604700_ocad244-B42","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1007\/s11222-007-9033-z","article-title":"A tutorial on spectral clustering","volume":"17","author":"Von Luxburg","year":"2007","journal-title":"Stat Comput"},{"issue":"3","key":"2024021710214604700_ocad244-B43","first-page":"241","article-title":"Atypical presentations of celiac disease","volume":"53","author":"Celilo\u011flu","year":"2011","journal-title":"Turk J Pediatr"},{"key":"2024021710214604700_ocad244-B44","doi-asserted-by":"publisher","first-page":"637187","DOI":"10.1155\/2012\/637187","article-title":"Atypical celiac disease: from recognizing to managing","volume":"2012","author":"Admou","year":"2012","journal-title":"Gastroenterol Res Pract"},{"issue":"10377","key":"2024021710214604700_ocad244-B45","doi-asserted-by":"crossref","first-page":"641","DOI":"10.1016\/S0140-6736(23)00216-7","article-title":"The promise of large language models in health care","volume":"401","author":"Arora","year":"2023","journal-title":"Lancet"},{"key":"2024021710214604700_ocad244-B46","author":"Touvron","year":"2023"},{"issue":"6","key":"2024021710214604700_ocad244-B47","first-page":"7","article-title":"Alpaca: a strong, replicable instruction-following model","volume":"3","author":"Taori","year":"2023","journal-title":"Stanf Center Res Found Mod"},{"key":"2024021710214604700_ocad244-B48","author":"Kaplan","year":"2020"},{"key":"2024021710214604700_ocad244-B49","author":"Li","year":"2023"},{"key":"2024021710214604700_ocad244-B50","author":"Xiong","year":"2023"},{"key":"2024021710214604700_ocad244-B51","author":"Hu","year":"2021"},{"key":"2024021710214604700_ocad244-B52","first-page":"12697","author":"Zhao","year":"2021"},{"issue":"9","key":"2024021710214604700_ocad244-B53","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3560815","article-title":"Pre-train, prompt, and predict: a systematic survey of prompting methods in natural language processing","volume":"55","author":"Liu","year":"2023","journal-title":"ACM Comput Surv"},{"key":"2024021710214604700_ocad244-B54","first-page":"140","author":"Bommasani"},{"issue":"1","key":"2024021710214604700_ocad244-B55","doi-asserted-by":"crossref","first-page":"194","DOI":"10.1038\/s41746-022-00742-2","article-title":"A large language model for electronic health records","volume":"5","author":"Yang","year":"2022","journal-title":"NPJ Digit Med"},{"key":"2024021710214604700_ocad244-B56","first-page":"5185","author":"Bender","year":"2020"},{"key":"2024021710214604700_ocad244-B57","author":"Lu","year":"2021"},{"issue":"5","key":"2024021710214604700_ocad244-B58","first-page":"1090","article-title":"Performance of ChatGPT, GPT-4, and Google bard on a neurosurgery oral boards preparation question bank","volume":"93","author":"Ali"},{"issue":"4","key":"2024021710214604700_ocad244-B59","doi-asserted-by":"crossref","first-page":"219","DOI":"10.4103\/singaporemedj.SMJ-2023-055","article-title":"The rise of artificial intelligence: addressing the impact of large language models such as ChatGPT on scientific publications","volume":"64","author":"Ang","year":"2023","journal-title":"Singapore Med J"},{"issue":"1","key":"2024021710214604700_ocad244-B60","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1038\/s41597-023-01945-2","article-title":"MIMIC-IV, a freely accessible electronic health record dataset","volume":"10","author":"Johnson","year":"2023","journal-title":"Sci Data"},{"issue":"23","key":"2024021710214604700_ocad244-B61","doi-asserted-by":"crossref","first-page":"e215","DOI":"10.1161\/01.CIR.101.23.e215","article-title":"PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals","volume":"101","author":"Goldberger","year":"2000","journal-title":"Circulation"}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/31\/3\/692\/56691555\/ocad244.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/31\/3\/692\/56691555\/ocad244.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,17]],"date-time":"2024-02-17T10:22:43Z","timestamp":1708165363000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/31\/3\/692\/7491951"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,22]]},"references-count":61,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2023,12,22]]},"published-print":{"date-parts":[[2024,2,16]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocad244","relation":{},"ISSN":["1067-5027","1527-974X"],"issn-type":[{"value":"1067-5027","type":"print"},{"value":"1527-974X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,3,1]]},"published":{"date-parts":[[2023,12,22]]}}}