{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,20]],"date-time":"2026-03-20T23:08:31Z","timestamp":1774048111072,"version":"3.50.1"},"reference-count":96,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2025,10,16]],"date-time":"2025-10-16T00:00:00Z","timestamp":1760572800000},"content-version":"vor","delay-in-days":1,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Australian Medical Research Futures Fund"},{"name":"Research Data Infrastructure","award":["MRFFRD000154"],"award-info":[{"award-number":["MRFFRD000154"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2026,2,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Objective<\/jats:title>\n                    <jats:p>Clinical registries advance healthcare by tracking patient outcomes and intervention safety. Manually extracting information from clinical text for registries is labor- and resource-intensive and often inaccurate. Therefore, this systematic review aims to evaluate the use and effectiveness of natural language processing (NLP) methods in extracting information from clinical text for populating clinical registries.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Materials and Methods<\/jats:title>\n                    <jats:p>PubMed, Embase, Scopus, Web of Science, and ACM Digital Library were systematically searched. Studies were included if they used NLP techniques to populate clinical registries. The extracted data included details of the registry, the clinical text, the registry data elements extracted, the NLP methods used, and how their performance was evaluated.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Fifteen articles were included in the review. Since 2020, the use of NLP methods for extracting information to populate clinical registries has been increasing steadily. Initially, rule-based NLP methods dominated the field, but machine learning-based approaches have gradually gained popularity. However, only one of the included studies employed generative large language models (LLMs). The diversity of clinical text and extracted data elements posed challenges to the generalizability of the NLP methods.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusion<\/jats:title>\n                    <jats:p>To date, the application of NLP methods to clinical text for populating clinical registries has been limited in both the number of published studies and the scope of implementation. The NLP methods used thus far face significant challenges in effectively managing the complexity and diversity of clinical text and data elements. Moreover, the performance of the NLP methods varied significantly. This review underscores the need for a robust and adaptable NLP framework. Generative LLMs may provide direction for future research, but their use must account for challenges such as accuracy, cost, privacy, and limited supporting evidence.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/jamia\/ocaf176","type":"journal-article","created":{"date-parts":[[2025,9,25]],"date-time":"2025-09-25T11:58:30Z","timestamp":1758801510000},"page":"484-499","source":"Crossref","is-referenced-by-count":5,"title":["Using natural language processing to extract information from clinical text in electronic medical records for populating clinical registries: a systematic review"],"prefix":"10.1093","volume":"33","author":[{"given":"Leibo","family":"Liu","sequence":"first","affiliation":[{"name":"Centre for Big Data Research in Health, University of New South Wales , Sydney, NSW 2052,","place":["Australia"]}]},{"given":"Victoria","family":"Blake","sequence":"additional","affiliation":[{"name":"Centre for Big Data Research in Health, University of New South Wales , Sydney, NSW 2052,","place":["Australia"]}]},{"given":"Matthew","family":"Barman","sequence":"additional","affiliation":[{"name":"Centre for Big Data Research in Health, University of New South Wales , Sydney, NSW 2052,","place":["Australia"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3704-7975","authenticated-orcid":false,"given":"Blanca","family":"Gallego","sequence":"additional","affiliation":[{"name":"Centre for Big Data Research in Health, University of New South Wales , Sydney, NSW 2052,","place":["Australia"]}]},{"given":"Timothy","family":"Churches","sequence":"additional","affiliation":[{"name":"Ingham Institute for Applied Medical Research , Liverpool, NSW 2170,","place":["Australia"]}]},{"given":"Georgina","family":"Kennedy","sequence":"additional","affiliation":[{"name":"Ingham Institute for Applied Medical Research , Liverpool, NSW 2170,","place":["Australia"]},{"name":"South Western Sydney Clinical School, University of New South Wales , Sydney, NSW 2052,","place":["Australia"]}]},{"given":"Sze-Yuan","family":"Ooi","sequence":"additional","affiliation":[{"name":"School of Clinical Medicine, University of New South Wales , Sydney, NSW 2052,","place":["Australia"]},{"name":"Prince of Wales Hospital , Randwick, NSW 2031,","place":["Australia"]}]},{"given":"Geoffrey P","family":"Delaney","sequence":"additional","affiliation":[{"name":"Ingham Institute for Applied Medical Research , Liverpool, NSW 2170,","place":["Australia"]},{"name":"South Western Sydney Clinical School, University of New South Wales , Sydney, NSW 2052,","place":["Australia"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0390-661X","authenticated-orcid":false,"given":"Louisa","family":"Jorm","sequence":"additional","affiliation":[{"name":"Centre for Big Data Research in Health, University of New South Wales , Sydney, NSW 2052,","place":["Australia"]}]}],"member":"286","published-online":{"date-parts":[[2025,10,15]]},"reference":[{"key":"2026012716174370700_ocaf176-B1","doi-asserted-by":"publisher","author":"Gliklich","year":"2020","DOI":"10.23970\/AHRQEPCREGISTRIES4"},{"key":"2026012716174370700_ocaf176-B2","doi-asserted-by":"publisher","first-page":"605","DOI":"10.1093\/ejcts\/ezt018","article-title":"Clinical registries: governance, management, analysis and applications","volume":"44","author":"Hickey","year":"2013","journal-title":"Eur J Cardiothorac Surg"},{"key":"2026012716174370700_ocaf176-B3","doi-asserted-by":"publisher","first-page":"333","DOI":"10.1177\/2050640615618042","article-title":"Creating an effective clinical registry for rare diseases","volume":"4","author":"D\u2019Agnolo","year":"2016","journal-title":"United European Gastroenterol J"},{"key":"2026012716174370700_ocaf176-B4","doi-asserted-by":"publisher","first-page":"655","DOI":"10.1016\/j.ophtha.2018.12.030","article-title":"Clinical registries in ophthalmology","volume":"126","author":"Tan","year":"2019","journal-title":"Ophthalmology"},{"key":"2026012716174370700_ocaf176-B5","doi-asserted-by":"crossref","first-page":"220","DOI":"10.1377\/hlthaff.2011.0762","article-title":"Use of 13 disease registries in 5 countries demonstrates the potential to use outcome data to improve health care\u2019s value","volume":"31","author":"Larsson","year":"2012","journal-title":"Health Aff (Millwood)"},{"key":"2026012716174370700_ocaf176-B6","doi-asserted-by":"publisher","first-page":"106843","DOI":"10.1016\/j.cct.2022.106843","article-title":"Clinical registries data quality attributes to support registry-based randomised controlled trials: a scoping review","volume":"119","author":"Prang","year":"2022","journal-title":"Contemp Clin Trials"},{"key":"2026012716174370700_ocaf176-B7","doi-asserted-by":"crossref","first-page":"1614","DOI":"10.1038\/ajg.2016.464","article-title":"Oral contraceptive use and risk of ulcerative colitis progression: a nationwide study","volume":"111","author":"Khalili","year":"2016","journal-title":"Am J Gastroenterol"},{"key":"2026012716174370700_ocaf176-B8","doi-asserted-by":"crossref","first-page":"2311","DOI":"10.1007\/s00701-016-2980-4","article-title":"Multimodal analysis to predict shunt surgery outcome of 284 patients with suspected idiopathic normal pressure hydrocephalus","volume":"158","author":"Luikku","year":"2016","journal-title":"Acta Neurochir (Wien)"},{"key":"2026012716174370700_ocaf176-B9","doi-asserted-by":"crossref","first-page":"2375","DOI":"10.1016\/S0140-6736(16)31803-7","article-title":"Comparison of stapled haemorrhoidopexy with traditional excisional surgery for haemorrhoidal disease (eTHoS): a pragmatic, multicentre, randomised controlled trial. The","volume":"388","author":"Watson","year":"2016","journal-title":"Lancet"},{"key":"2026012716174370700_ocaf176-B10","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/B978-0-12-812898-5.00005-9","volume-title":"Quality and Safety in Neurosurgery","author":"Kerezoudis","year":"2018"},{"key":"2026012716174370700_ocaf176-B11","doi-asserted-by":"crossref","first-page":"138","DOI":"10.1002\/cncr.30315","article-title":"Quality of care received and patient-reported regret in prostate cancer: analysis of a population-based prospective cohort","volume":"123","author":"Holmes","year":"2017","journal-title":"Cancer"},{"key":"2026012716174370700_ocaf176-B12","doi-asserted-by":"publisher","first-page":"46226","DOI":"10.1038\/srep46226","article-title":"Analysis of free text in electronic health records for identification of cancer patient trajectories","volume":"7","author":"Jensen","year":"2017","journal-title":"Sci Rep"},{"key":"2026012716174370700_ocaf176-B13","doi-asserted-by":"publisher","first-page":"181","DOI":"10.1136\/jamia.2010.007237","article-title":"Data from clinical notes: a perspective on the tension between structure and flexible documentation","volume":"18","author":"Rosenbloom","year":"2011","journal-title":"J Am Med Inform Assoc"},{"key":"2026012716174370700_ocaf176-B14","doi-asserted-by":"publisher","first-page":"e66910","DOI":"10.2196\/66910","article-title":"Using structured codes and free-text notes to measure information complementarity in electronic health records: feasibility and validation study","volume":"27","author":"Seinen","year":"2025","journal-title":"J Med Internet Res."},{"key":"2026012716174370700_ocaf176-B15","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1016\/j.jbi.2016.03.008","article-title":"Predicting colorectal surgical complications using heterogeneous clinical data and kernel methods","volume":"61","author":"Soguero-Ruiz","year":"2016","journal-title":"J Biomed Inform"},{"key":"2026012716174370700_ocaf176-B16","doi-asserted-by":"publisher","first-page":"e12239","DOI":"10.2196\/12239","article-title":"Natural language processing of clinical notes on chronic diseases: systematic review","volume":"7","author":"Sheikhalishahi","year":"2019","journal-title":"JMIR Med Inform"},{"key":"2026012716174370700_ocaf176-B17","doi-asserted-by":"publisher","first-page":"104840","DOI":"10.1016\/j.cmpb.2019.01.012","article-title":"An automated data verification approach for improving data quality in a clinical registry","volume":"181","author":"Tian","year":"2019","journal-title":"Comput Methods Programs Biomed"},{"key":"2026012716174370700_ocaf176-B18","doi-asserted-by":"publisher","first-page":"97","DOI":"10.1093\/jamia\/ocab243","article-title":"Natural language inference for curation of structured clinical registries from unstructured text","volume":"29","author":"Percha","year":"2021","journal-title":"J Am Med Inform Assoc"},{"key":"2026012716174370700_ocaf176-B19","doi-asserted-by":"publisher","first-page":"E334","DOI":"10.1136\/amiajnl-2013-001999","article-title":"Automated extraction of clinical traits of multiple sclerosis in electronic medical records","volume":"20","author":"Davis","year":"2013","journal-title":"J Am Med Inform Assoc"},{"key":"2026012716174370700_ocaf176-B20","doi-asserted-by":"publisher","first-page":"1228","DOI":"10.1227\/neu.0000000000002568","article-title":"Developing an Automated Registry (Autoregistry) of spine surgery using natural language processing and health system scale databases","volume":"93","author":"Cheung","year":"2023","journal-title":"Neurosurgery"},{"key":"2026012716174370700_ocaf176-B21","doi-asserted-by":"crossref","first-page":"1490","DOI":"10.1007\/s11605-022-05282-4","article-title":"Capturing surgical data: comparing a quality improvement registry to natural language processing and manual chart review","volume":"26","author":"Miller","year":"2022","journal-title":"J Gastrointest Surg"},{"key":"2026012716174370700_ocaf176-B22","doi-asserted-by":"publisher","first-page":"567","DOI":"10.1007\/s10278-014-9762-4","article-title":"Evaluation of an Automated Information Extraction Tool for Imaging Data Elements to Populate a Breast Cancer Screening Registry","volume":"28","author":"Lacson","year":"2015","journal-title":"J Digit Imaging"},{"key":"2026012716174370700_ocaf176-B23","doi-asserted-by":"crossref","first-page":"e0279842","DOI":"10.1371\/journal.pone.0279842","article-title":"Adverse drug event detection using natural language processing: a scoping review of supervised learning methods","volume":"18","author":"Murphy","year":"2023","journal-title":"PLoS One"},{"key":"2026012716174370700_ocaf176-B24","doi-asserted-by":"publisher","first-page":"53","DOI":"10.1002\/9781394278695.ch3","article-title":"Natural Language Processing (NLP) in disease detection\u2013a discussion of how NLP techniques can be used to analyze and classify medical text data for disease diagnosis","volume":"3","author":"Kumar","year":"2025","journal-title":"AI in Disease Detection"},{"key":"2026012716174370700_ocaf176-B25","doi-asserted-by":"publisher","first-page":"688","DOI":"10.1111\/j.1477-2574.2010.00235.x","article-title":"Natural language processing for the development of a clinical registry: a validation study in intraductal papillary mucinous neoplasms","volume":"12","author":"Al-Haddad","year":"2010","journal-title":"HPB (Oxford)"},{"key":"2026012716174370700_ocaf176-B26","doi-asserted-by":"crossref","first-page":"1219","DOI":"10.1109\/TETC.2020.2983404","article-title":"Privacy-preserving deep learning NLP models for cancer registries","volume":"9","author":"Alawad","year":"2021","journal-title":"IEEE Trans Emerg Top Comput"},{"key":"2026012716174370700_ocaf176-B27","doi-asserted-by":"crossref","first-page":"528","DOI":"10.1093\/jamiaopen\/ooz040","article-title":"Using natural language processing to construct a metastatic breast cancer cohort from linked cancer registry and electronic medical records data","volume":"2","author":"Ling","year":"2019","journal-title":"JAMIA Open"},{"key":"2026012716174370700_ocaf176-B28","doi-asserted-by":"publisher","first-page":"316:685","DOI":"10.3233\/SHTI240507","volume-title":"Stud Health Technol Inform","author":"Mou","year":"2024"},{"key":"2026012716174370700_ocaf176-B29","doi-asserted-by":"publisher","first-page":"1007","DOI":"10.1093\/jamia\/ocv180","article-title":"Extracting information from the text of electronic medical records to improve case detection: a systematic review","volume":"23","author":"Ford","year":"2016","journal-title":"J Am Med Inform Assoc"},{"key":"2026012716174370700_ocaf176-B30","doi-asserted-by":"publisher","first-page":"102934","DOI":"10.1016\/j.artmed.2024.102934","article-title":"Deep learning algorithms for melanoma detection using dermoscopic images: a systematic review and meta-analysis","volume":"155","author":"Ye","year":"2024","journal-title":"Artif Intell Med"},{"key":"2026012716174370700_ocaf176-B31","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1093\/jamia\/ocab236","article-title":"Sepsis prediction, early detection, and identification using clinical text for machine learning: a systematic review","volume":"29","author":"Yan","year":"2022","journal-title":"J Am Med Inform Assoc"},{"key":"2026012716174370700_ocaf176-B32","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1186\/s12882-015-0028-2","article-title":"A global overview of renal registries: a systematic review","volume":"16","author":"Liu","year":"2015","journal-title":"BMC Nephrol"},{"key":"2026012716174370700_ocaf176-B33","doi-asserted-by":"publisher","first-page":"1031","DOI":"10.1016\/j.jalz.2017.04.005","article-title":"Dementia registries around the globe and their applications: a systematic review","volume":"13","author":"Krysinska","year":"2017","journal-title":"Alzheimers Dement"},{"key":"2026012716174370700_ocaf176-B34","doi-asserted-by":"publisher","first-page":"101049","DOI":"10.1016\/j.cpcardiol.2021.101049","article-title":"A global overview of acute coronary syndrome registries: a systematic review","volume":"48","author":"Nabovati","year":"2023","journal-title":"Curr Probl Cardiol"},{"key":"2026012716174370700_ocaf176-B35","doi-asserted-by":"publisher","first-page":"e0183667","DOI":"10.1371\/journal.pone.0183667","article-title":"Impact of clinical registries on quality of patient care and clinical outcomes: a systematic review","volume":"12","author":"Hoque","year":"2017","journal-title":"PLoS One"},{"key":"2026012716174370700_ocaf176-B36","doi-asserted-by":"publisher","first-page":"381","DOI":"10.1016\/j.surg.2014.08.097","article-title":"Clinical registries and quality measurement in surgery: a systematic review","volume":"157","author":"Stey","year":"2015","journal-title":"Surgery"},{"key":"2026012716174370700_ocaf176-B37","doi-asserted-by":"publisher","first-page":"841","DOI":"10.1001\/jamasurg.2018.1635","article-title":"The value of clinical colorectal cancer registries in colorectal cancer research: a systematic review","volume":"153","author":"MacCallum","year":"2018","journal-title":"JAMA Surg"},{"key":"2026012716174370700_ocaf176-B38","doi-asserted-by":"publisher","first-page":"e007963","DOI":"10.1161\/CIRCOUTCOMES.121.007963","article-title":"Characteristics and quality of national cardiac registries: a systematic review","volume":"14","author":"Dawson","year":"2021","journal-title":"Circ Cardiovasc Qual Outcomes"},{"key":"2026012716174370700_ocaf176-B39","doi-asserted-by":"publisher","first-page":"e017373","DOI":"10.1136\/bmjopen-2017-017373","article-title":"What are the essential features of a successful surgical registry? a systematic review","volume":"7","author":"Mandavia","year":"2017","journal-title":"BMJ Open"},{"key":"2026012716174370700_ocaf176-B40","doi-asserted-by":"publisher","first-page":"111","DOI":"10.1186\/s13643-024-02533-0","article-title":"The usage of population and disease registries as pre-screening tools for clinical trials, a systematic review","volume":"13","author":"Foucher","year":"2024","journal-title":"Syst Rev"},{"key":"2026012716174370700_ocaf176-B41","doi-asserted-by":"publisher","first-page":"n71","DOI":"10.1136\/bmj.n71","article-title":"The PRISMA 2020 statement: an updated guideline for reporting systematic reviews","volume":"372","author":"Page","year":"2021","journal-title":"BMJ"},{"key":"2026012716174370700_ocaf176-B42","author":"Covidence","year":"2025"},{"key":"2026012716174370700_ocaf176-B43","doi-asserted-by":"publisher","first-page":"e1001744","DOI":"10.1371\/journal.pmed.1001744","article-title":"Critical appraisal and data extraction for systematic reviews of prediction modelling studies: the CHARMS checklist","volume":"11","author":"Moons","year":"2014","journal-title":"PLoS Med"},{"key":"2026012716174370700_ocaf176-B44","doi-asserted-by":"crossref","first-page":"51","DOI":"10.7326\/M18-1376","article-title":"PROBAST: a tool to assess the risk of bias and applicability of prediction model studies","volume":"170","author":"Wolff","year":"2019","journal-title":"Ann Intern Med"},{"key":"2026012716174370700_ocaf176-B45","doi-asserted-by":"publisher","first-page":"945","DOI":"10.1111\/ceo.14156","article-title":"Automated disease registry using low-code natural language processing","volume":"50","author":"Macri","year":"2022","journal-title":"Clinical and Experimental Ophthalmology"},{"key":"2026012716174370700_ocaf176-B46","first-page":"899","article-title":"Automated extraction of free-text from pathology reports","author":"Currie","year":"2006","journal-title":"AMIA Annual Symposium proceedings\/AMIA Symposium AMIA Symposium"},{"key":"2026012716174370700_ocaf176-B47","doi-asserted-by":"publisher","first-page":"12","DOI":"10.1002\/pds.3324","article-title":"Automated identification of asthma patients within an electronical medical record database using machine learning","volume":"21","author":"Engelkes","year":"2012","journal-title":"Pharmacoepidemiol Drug Saf"},{"key":"2026012716174370700_ocaf176-B48","doi-asserted-by":"publisher","first-page":"S300","DOI":"10.1097\/MPG.0000000000002164","article-title":"The Boston children\u2019s hospital appendicitis database: development of a large, multimodal dataset optimized for deep learning","volume":"67","author":"Crowley","year":"2018","journal-title":"J Pediatr Gastroenterol Nutr"},{"key":"2026012716174370700_ocaf176-B49","doi-asserted-by":"publisher","DOI":"10.1161\/circ.146.suppl_1","article-title":"Design and development of Automatic Extraction Transformation and Load (ETL) process and machine learning for echocardiography data extraction and validation: Houston methodist CVD learning health system registry","volume":"146","author":"Bose","year":"2022","journal-title":"Circulation"},{"key":"2026012716174370700_ocaf176-B50","doi-asserted-by":"crossref","first-page":"A80","DOI":"10.1016\/j.jval.2016.03.1765","article-title":"Identification of persons with congenital hemophilia in a large electronic health record database","volume":"19","author":"Wang","year":"2016","journal-title":"Value in Health"},{"key":"2026012716174370700_ocaf176-B51","doi-asserted-by":"publisher","first-page":"426","DOI":"10.1002\/ppul.23840","article-title":"Leveraging a novel documentation paradigm to populate the CF foundation patient registry","volume":"52","author":"Leander","year":"2017","journal-title":"Pediatr Pulmonol"},{"key":"2026012716174370700_ocaf176-B52","doi-asserted-by":"publisher","first-page":"3955","DOI":"10.1002\/art.41966","article-title":"Natural language processing tool for extraction of patient-reported outcomes from a national multi-electronic health records registry","volume":"73","author":"Humbert-Droz","year":"2021","journal-title":"Arthritis Rheumatol"},{"key":"2026012716174370700_ocaf176-B53","doi-asserted-by":"publisher","first-page":"e121","DOI":"10.1097\/01.JU.0000555154.96959.37","article-title":"Patient-level validation of prostate cancer data collected via automated extraction from structured and unstructured electronic health record (EHR) records","volume":"201","author":"Cooperberg","year":"2019","journal-title":"J Urol"},{"key":"2026012716174370700_ocaf176-B54","doi-asserted-by":"crossref","DOI":"10.1093\/eurheartj\/ehad655.2952","article-title":"Using natural language processing to generate a large-scale database of aortic stenosis with long-term follow-up: the CASPER (cogstack aortic stenosis patient electronic registry) database","volume":"44","author":"Wu","year":"2023","journal-title":"European Heart Journal"},{"key":"2026012716174370700_ocaf176-B55","author":"Instituto de Investigacion Sanitaria La Fe","year":"2021"},{"key":"2026012716174370700_ocaf176-B56","doi-asserted-by":"publisher","first-page":"57","DOI":"10.1016\/j.urolonc.2024.12.144","article-title":"Automating renal cancer chart review using large language models","volume":"43","author":"Heller","year":"2025","journal-title":"Urologic Oncology: Seminars and Original Investigations"},{"key":"2026012716174370700_ocaf176-B57","doi-asserted-by":"publisher","first-page":"i2230","DOI":"10.1093\/ecco-jcc\/jjae190.1406","article-title":"A novel inflammatory bowel disease registry powered by Artificial Intelligence and Natural Language Processing","volume":"19","author":"Liu","year":"2025","journal-title":"J Crohn\u2019s Colitis"},{"key":"2026012716174370700_ocaf176-B58","doi-asserted-by":"publisher","first-page":"18","DOI":"10.3760\/cma.j.cn113565-113565-20220512-00082","article-title":"Application and design of lymphoma dataset based on real-world research","volume":"36","author":"Mi","year":"2023","journal-title":"Chin J Med Sci Res Manag"},{"key":"2026012716174370700_ocaf176-B59","doi-asserted-by":"publisher","first-page":"1145","DOI":"10.3969\/j.issn.1674-8115.2023.09.008","article-title":"Construction of Shanghai Diabetes Clinical Database and real-world study","volume":"43","author":"Xue","year":"2023","journal-title":"J Shanghai Jiaotong Univ Med Sci"},{"key":"2026012716174370700_ocaf176-B60","doi-asserted-by":"publisher","first-page":"996","DOI":"10.3969\/j.issn.1674-8115.2020.07.022","article-title":"Discussion on value of medical records-structured specialized disease database based on artificial intelligence in clinical research","volume":"40","author":"Rong","year":"2020","journal-title":"J Shanghai Jiaotong Univ Med Sci"},{"key":"2026012716174370700_ocaf176-B61","doi-asserted-by":"publisher","first-page":"712827","DOI":"10.3389\/fpubh.2021.712827","article-title":"Research on the construction and application of breast cancer-specific database system based on full data lifecycle","volume":"9","author":"Jin","year":"2021","journal-title":"Front Public Health"},{"key":"2026012716174370700_ocaf176-B62","doi-asserted-by":"publisher","first-page":"e088166","DOI":"10.1136\/bmjopen-2024-088166","article-title":"Akrivia Health Database-deep patient characterisation using a secondary mental healthcare dataset in England and Wales: cohort profile","volume":"14","author":"Todorovic","year":"2024","journal-title":"BMJ Open"},{"key":"2026012716174370700_ocaf176-B63","doi-asserted-by":"publisher","first-page":"588","DOI":"10.1007\/s12070-022-03173-3","article-title":"Feasibility of establishing an artificial intelligence based head and neck cancer registry: experience from a tertiary care hospital","volume":"74","author":"Gautamjit","year":"2022","journal-title":"Indian J Otolaryngol Head Neck Surg"},{"key":"2026012716174370700_ocaf176-B64","doi-asserted-by":"publisher","first-page":"e140","DOI":"10.1111\/ijlh.13781","article-title":"Construction of a bone marrow report registry using a clinical data warehouse","volume":"44","author":"Lee","year":"2022","journal-title":"Int J Lab Hematol"},{"key":"2026012716174370700_ocaf176-B65","doi-asserted-by":"publisher","first-page":"1375","DOI":"10.1109\/EMBC48229.2022.9871340","author":"Ieee","year":"2022"},{"key":"2026012716174370700_ocaf176-B66","doi-asserted-by":"publisher","first-page":"267","DOI":"10.2174\/157489312802460730","article-title":"SemanticDB: a semantic Web infrastructure for clinical research and quality reporting","volume":"7","author":"Pierce","year":"2012","journal-title":"CBIO"},{"key":"2026012716174370700_ocaf176-B67","doi-asserted-by":"publisher","first-page":"103849","DOI":"10.1016\/j.jbi.2021.103849","article-title":"A two-stage workflow to extract and harmonize drug mentions from clinical notes into observational databases","volume":"120","author":"Almeida","year":"2021","journal-title":"J Biomed Inform"},{"key":"2026012716174370700_ocaf176-B68","doi-asserted-by":"publisher","first-page":"373","DOI":"10.3414\/ME15-02-0019","article-title":"Automated classification of selected data elements from free-text diagnostic reports for clinical research","volume":"55","author":"L\u00f6pprich","year":"2016","journal-title":"Methods Inf Med"},{"key":"2026012716174370700_ocaf176-B69","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1186\/s12911-016-0255-x","article-title":"Modelling and extraction of variability in free-text medication prescriptions from an anonymised primary care electronic medical record research database","volume":"16","author":"Karystianis","year":"2016","journal-title":"BMC Med Inform Decis Mak"},{"key":"2026012716174370700_ocaf176-B70","doi-asserted-by":"publisher","first-page":"119811","DOI":"10.1016\/j.cca.2024.119811","article-title":"Computer-assisted patient identification tool in inborn errors of metabolism\u2014potential for rare disease patient registry and big data analysis","volume":"561","author":"Mak","year":"2024","journal-title":"Clin Chim Acta"},{"key":"2026012716174370700_ocaf176-B71","doi-asserted-by":"publisher","DOI":"10.20452\/pamw.16704","article-title":"Practical use case of natural language processing for observational clinical research data retrieval from electronic health records: AssistMED project","volume":"134","author":"Maciejewski","year":"2024","journal-title":"Poli Arch of Inter Medi"},{"key":"2026012716174370700_ocaf176-B72","doi-asserted-by":"publisher","first-page":"e14127","DOI":"10.1002\/acm2.14127","article-title":"Infrastructure tools to support an effective Radiation Oncology Learning Health System","volume":"24","author":"Kapoor","year":"2023","journal-title":"J Appl Clin Med Phys"},{"key":"2026012716174370700_ocaf176-B73","doi-asserted-by":"publisher","author":"Taseh","year":"2024","DOI":"10.1101\/2024.09.26.24314444"},{"key":"2026012716174370700_ocaf176-B74","doi-asserted-by":"publisher","author":"Dencker","year":"2025","DOI":"10.1101\/2025.04.07.25325369"},{"key":"2026012716174370700_ocaf176-B75","doi-asserted-by":"publisher","first-page":"78","DOI":"10.1016\/j.ijmedinf.2017.10.004","article-title":"Need of informatics in designing interoperable clinical registries","volume":"108","author":"Rastegar-Mojarad","year":"2017","journal-title":"Int J Med Inform"},{"key":"2026012716174370700_ocaf176-B76","doi-asserted-by":"publisher","first-page":"3335","DOI":"10.1007\/s00417-023-06190-2","article-title":"A case study in applying artificial intelligence-based named entity recognition to develop an automated ophthalmic disease registry","volume":"261","author":"Macri","year":"2023","journal-title":"Graefes Arch Clin Exp Ophthalmol"},{"key":"2026012716174370700_ocaf176-B77","doi-asserted-by":"publisher","first-page":"20543581231178963","DOI":"10.1177\/20543581231178963","article-title":"The development of a comprehensive clinicopathologic registry for glomerular diseases using natural language processing","volume":"10","author":"Barr","year":"2023","journal-title":"Can J Kidney Health Dis"},{"key":"2026012716174370700_ocaf176-B78","doi-asserted-by":"publisher","first-page":"8591","DOI":"10.1038\/s41598-023-35482-0","article-title":"Constructing a disease database and using natural language processing to capture and standardize free text clinical information","volume":"13","author":"Raza","year":"2023","journal-title":"Sci Rep"},{"key":"2026012716174370700_ocaf176-B79","doi-asserted-by":"publisher","DOI":"10.3390\/cancers15153808","article-title":"Reliability and efficiency of the CAPRI-3 metastatic prostate cancer registry driven by Artificial Intelligence","volume":"15","author":"Bosch","year":"2023","journal-title":"Cancers (Basel)"},{"key":"2026012716174370700_ocaf176-B80","doi-asserted-by":"publisher","first-page":"102847","DOI":"10.1016\/j.artmed.2024.102847","article-title":"Building large-scale registries from unstructured clinical notes using a low-resource natural language processing pipeline","volume":"151","author":"Tavabi","year":"2024","journal-title":"Artif Intell Med"},{"key":"2026012716174370700_ocaf176-B81","doi-asserted-by":"publisher","first-page":"322","DOI":"10.1016\/j.csbj.2024.04.007","article-title":"Integrating predictive coding and a user-centric interface for enhanced auditing and quality in cancer registry data","volume":"24","author":"Dai","year":"2024","journal-title":"Comput Struct Biotechnol J"},{"key":"2026012716174370700_ocaf176-B82","doi-asserted-by":"publisher","first-page":"ooae054","DOI":"10.1093\/jamiaopen\/ooae054","article-title":"Automating surgical procedure extraction for society of surgeons adult cardiac surgery registry using pretrained language models","volume":"7","author":"Lee","year":"2024","journal-title":"JAMIA Open"},{"key":"2026012716174370700_ocaf176-B83","doi-asserted-by":"publisher","first-page":"685","DOI":"10.3233\/SHTI240507","article-title":"Improving the quality of unstructured cancer data using large language models: a German oncological case study","volume":"316","author":"Mou","year":"2024","journal-title":"Stud Health Technol Inform"},{"key":"2026012716174370700_ocaf176-B84","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1002\/jrsm.1411","article-title":"Risk-of-bias VISualization (robvis): an R package and Shiny web app for visualizing risk-of-bias assessments","volume":"12","author":"McGuinness","year":"2021","journal-title":"Res Synth Methods"},{"key":"2026012716174370700_ocaf176-B85","doi-asserted-by":"publisher","first-page":"644","DOI":"10.1002\/cpt.1966","article-title":"An electronic health record text mining tool to collect real-world drug treatment outcomes: a validation study in patients with metastatic renal cell carcinoma","volume":"108","author":"van Laar","year":"2020","journal-title":"Clin Pharmacol Ther"},{"key":"2026012716174370700_ocaf176-B86","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1186\/s12913-020-06015-6","article-title":"Electronic medical record implementation in tertiary care: factors influencing adoption of an electronic medical record in a cancer centre","volume":"21","author":"Janssen","year":"2021","journal-title":"BMC Health Serv Res"},{"key":"2026012716174370700_ocaf176-B87","doi-asserted-by":"publisher","first-page":"38","DOI":"10.1007\/s11023-024-09692-y","article-title":"A justifiable investment in AI for healthcare: aligning ambition with reality","volume":"34","author":"Karpathakis","year":"2024","journal-title":"Minds & Machines"},{"key":"2026012716174370700_ocaf176-B88","first-page":"4171"},{"key":"2026012716174370700_ocaf176-B89","author":"Wei","year":"2024"},{"key":"2026012716174370700_ocaf176-B90","first-page":"3","article-title":"Quality of labeled data in machine learning: common sense and the controversial effect for user behavior models","volume":"33","author":"Bakaev","year":"2023","journal-title":"Eng Proc"},{"key":"2026012716174370700_ocaf176-B91","doi-asserted-by":"publisher","author":"Bowman","DOI":"10.18653\/v1\/D15-1075"},{"key":"2026012716174370700_ocaf176-B92","doi-asserted-by":"publisher","author":"Williams","DOI":"10.18653\/v1\/N18-1101"},{"key":"2026012716174370700_ocaf176-B93","doi-asserted-by":"publisher","author":"Thorne","DOI":"10.18653\/v1\/N18-1074"},{"key":"2026012716174370700_ocaf176-B94","doi-asserted-by":"publisher","author":"Nie","DOI":"10.18653\/v1\/2020.acl-main.441"},{"key":"2026012716174370700_ocaf176-B95","doi-asserted-by":"publisher","author":"Dong","DOI":"10.18653\/v1\/2024.emnlp-main.64"},{"key":"2026012716174370700_ocaf176-B96","doi-asserted-by":"publisher","first-page":"186357","DOI":"10.1007\/s11704-024-40555-y","article-title":"Large language models for generative information extraction: a survey","volume":"18","author":"Xu","year":"2024","journal-title":"Front Comput Sci"}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/33\/2\/484\/64713900\/ocaf176.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/33\/2\/484\/64713900\/ocaf176.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,27]],"date-time":"2026-01-27T21:17:51Z","timestamp":1769548671000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/33\/2\/484\/8287208"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,15]]},"references-count":96,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2025,10,15]]},"published-print":{"date-parts":[[2026,2,1]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocaf176","relation":{},"ISSN":["1067-5027","1527-974X"],"issn-type":[{"value":"1067-5027","type":"print"},{"value":"1527-974X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2026,2]]},"published":{"date-parts":[[2025,10,15]]}}}