{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,1]],"date-time":"2026-04-01T00:15:20Z","timestamp":1775002520590,"version":"3.50.1"},"reference-count":30,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2020,1,16]],"date-time":"2020-01-16T00:00:00Z","timestamp":1579132800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100000057","name":"National Institute of General Medical Sciences","doi-asserted-by":"publisher","award":["R01-GM123109"],"award-info":[{"award-number":["R01-GM123109"]}],"id":[{"id":"10.13039\/100000057","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,3,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Objective<\/jats:title>\n                    <jats:p>We developed medExtractR, a natural language processing system to extract medication information from clinical notes. Using a targeted approach, medExtractR focuses on individual drugs to facilitate creation of medication-specific research datasets from electronic health records.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Materials and Methods<\/jats:title>\n                    <jats:p>Written using the R programming language, medExtractR combines lexicon dictionaries and regular expressions to identify relevant medication entities (eg, drug name, strength, frequency). MedExtractR was developed on notes from Vanderbilt University Medical Center, using medications prescribed with varying complexity. We evaluated medExtractR and compared it with 3 existing systems: MedEx, MedXN, and CLAMP (Clinical Language Annotation, Modeling, and Processing). We also demonstrated how medExtractR can be easily tuned for better performance on an outside dataset using the MIMIC-III (Medical Information Mart for Intensive Care III) database.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>On 50 test notes per development drug and 110 test notes for an additional drug, medExtractR achieved high overall performance (F-measures &amp;gt;0.95), exceeding performance of the 3 existing systems across all drugs. MedExtractR achieved the highest F-measure for each individual entity, except drug name and dose amount for allopurinol. With tuning and customization, medExtractR achieved F-measures &amp;gt;0.90 in the MIMIC-III dataset.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Discussion<\/jats:title>\n                    <jats:p>The medExtractR system successfully extracted entities for medications of interest. High performance in entity-level extraction provides a strong foundation for developing robust research datasets for pharmacological research. When working with new datasets, medExtractR should be tuned on a small sample of notes before being broadly applied.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusions<\/jats:title>\n                    <jats:p>The medExtractR system achieved high performance extracting specific medications from clinical text, leading to higher-quality research datasets for drug-related studies than some existing general-purpose medication extraction tools.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/jamia\/ocz207","type":"journal-article","created":{"date-parts":[[2019,11,23]],"date-time":"2019-11-23T07:11:30Z","timestamp":1574493090000},"page":"407-418","source":"Crossref","is-referenced-by-count":28,"title":["medExtractR: A targeted, customizable approach to medication extraction from electronic health records"],"prefix":"10.1093","volume":"27","author":[{"given":"Hannah L","family":"Weeks","sequence":"first","affiliation":[{"name":"Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee, USA"}]},{"given":"Cole","family":"Beck","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee, USA"}]},{"given":"Elizabeth","family":"McNeer","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee, USA"}]},{"given":"Michael L","family":"Williams","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee, USA"}]},{"given":"Cosmin A","family":"Bejan","sequence":"additional","affiliation":[{"name":"Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA"}]},{"given":"Joshua C","family":"Denny","sequence":"additional","affiliation":[{"name":"Department of Biomedical Informatics, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA"}]},{"given":"Leena","family":"Choi","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee, USA"}]}],"member":"286","published-online":{"date-parts":[[2020,1,16]]},"reference":[{"issue":"6","key":"2020110613085822600_ocz207-B1","doi-asserted-by":"crossref","first-page":"1046","DOI":"10.1093\/jamia\/ocv202","article-title":"PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability","volume":"23","author":"Kirby","year":"2016","journal-title":"J Am Med Inform Assoc"},{"issue":"1","key":"2020110613085822600_ocz207-B2","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1097\/FPC.0b013e32834e1641","article-title":"Use of a DNA biobank linked to electronic medical records to characterize pharmacogenomic predictors of tacrolimus dose requirement in kidney transplant recipients","volume":"22","author":"Birdwell","year":"2012","journal-title":"Pharmacogenet Genomics"},{"issue":"1","key":"2020110613085822600_ocz207-B3","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1002\/cpt.1416","article-title":"Real-world data for pediatric pharmacometrics: can we upcycle clinical data for research use?","volume":"106","author":"Van","year":"2019","journal-title":"Clin Pharmacol Ther"},{"issue":"1","key":"2020110613085822600_ocz207-B4","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1197\/jamia.M3378","article-title":"MedEx: a medication information extraction system for clinical narratives","volume":"17","author":"Xu","year":"2010","journal-title":"J Am Med Inform Assoc"},{"issue":"5","key":"2020110613085822600_ocz207-B5","doi-asserted-by":"crossref","first-page":"858","DOI":"10.1136\/amiajnl-2013-002190","article-title":"MedXN: an open source medication extraction and normalization tool for clinical text","volume":"21","author":"Sohn","year":"2014","journal-title":"J Am Med Inform Assoc"},{"issue":"3","key":"2020110613085822600_ocz207-B6","doi-asserted-by":"crossref","first-page":"331","DOI":"10.1093\/jamia\/ocx132","article-title":"CLAMP\u2013a toolkit for efficiently building customized clinical natural language processing pipelines","volume":"25","author":"Soysal","year":"2018","journal-title":"J Am Med Inform Assoc"},{"issue":"1","key":"2020110613085822600_ocz207-B7","doi-asserted-by":"crossref","first-page":"160035","DOI":"10.1038\/sdata.2016.35","article-title":"MIMIC-III, a freely accessible critical care database","volume":"3","author":"Johnson","year":"2016","journal-title":"Sci Data"},{"key":"2020110613085822600_ocz207-B8","doi-asserted-by":"crossref","first-page":"34","DOI":"10.1016\/j.jbi.2017.11.011","article-title":"Clinical information extraction applications: a literature review","volume":"77","author":"Wang","year":"2018","journal-title":"J Biomed Inform"},{"key":"2020110613085822600_ocz207-B9","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1016\/j.jbi.2017.07.012","article-title":"Natural language processing systems for capturing and standardizing unstructured clinical information: a systematic review","volume":"73","author":"Kreimeyer","year":"2017","journal-title":"J Biomed Inform"},{"key":"2020110613085822600_ocz207-B10","first-page":"1880","article-title":"Ensembles of nlp tools for data element extraction from clinical notes","volume":"2016","author":"Kuo","year":"2016","journal-title":"AMIA Annu Symp Proc"},{"issue":"3","key":"2020110613085822600_ocz207-B11","doi-asserted-by":"crossref","first-page":"229","DOI":"10.1136\/jamia.2009.002733","article-title":"An overview of MetaMap: historical perspective and recent advances","volume":"17","author":"Aronson","year":"2010","journal-title":"J Am Med Inform Assoc"},{"issue":"5","key":"2020110613085822600_ocz207-B12","doi-asserted-by":"crossref","first-page":"507","DOI":"10.1136\/jamia.2009.001560","article-title":"Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications","volume":"17","author":"Savova","year":"2010","journal-title":"J Am Med Inform Assoc"},{"issue":"2","key":"2020110613085822600_ocz207-B13","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1136\/jamia.1994.95236146","article-title":"A general natural-language text processor for clinical radiology","volume":"1","author":"Friedman","year":"1994","journal-title":"J Am Med Inform Assoc"},{"issue":"5","key":"2020110613085822600_ocz207-B14","doi-asserted-by":"crossref","first-page":"580","DOI":"10.1136\/amiajnl-2011-000155","article-title":"Using machine learning for concept extraction on clinical documents from multiple data sources","volume":"18","author":"Torii","year":"2011","journal-title":"J Am Med Inform Assoc"},{"key":"2020110613085822600_ocz207-B15","first-page":"195","article-title":"The KnowledgeMap project: development of a concept-based medical school curriculum database","volume":"2003","author":"Denny","year":"2003","journal-title":"AMIA Annu Symp Proc"},{"key":"2020110613085822600_ocz207-B16","first-page":"931","volume-title":"AMIA Annu Symp Proc","author":"Goryachev","year":"2006"},{"key":"2020110613085822600_ocz207-B17","first-page":"188","author":"Nguyen","year":"2009"},{"key":"2020110613085822600_ocz207-B18","first-page":"237","article-title":"Extracting structured medication event information from discharge summaries","volume":"2008","author":"Gold","year":"2008","journal-title":"AMIA Annu Symp Proc"},{"issue":"2","key":"2020110613085822600_ocz207-B19","doi-asserted-by":"crossref","first-page":"e9","DOI":"10.1097\/MLR.0b013e3182a30373","article-title":"Validation of case finding algorithms for hepatocellular cancer from administrative data and electronic health records using natural language processing","volume":"54","author":"Sada","year":"2016","journal-title":"Med Care"},{"key":"2020110613085822600_ocz207-B20","doi-asserted-by":"crossref","first-page":"89","DOI":"10.2147\/JBM.S136060","article-title":"Identification of people with acquired hemophilia in a large electronic health record database","volume":"8","author":"Wang","year":"2017","journal-title":"J Blood Med"},{"issue":"1","key":"2020110613085822600_ocz207-B21","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1186\/s13012-014-0197-6","article-title":"Accuracy of using automated methods for detecting adverse events from electronic health record data: a research protocol","volume":"10","author":"Rochefort","year":"2015","journal-title":"Implement Sci"},{"issue":"3","key":"2020110613085822600_ocz207-B22","doi-asserted-by":"crossref","first-page":"229","DOI":"10.1093\/intqhc\/mzq012","article-title":"Automated detection of follow-up appointments using text mining of discharge records","volume":"22","author":"Ruud","year":"2010","journal-title":"Int J Qual Health Care"},{"key":"2020110613085822600_ocz207-B23","volume-title":"Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition","author":"Martin","year":"2009"},{"issue":"5","key":"2020110613085822600_ocz207-B24","doi-asserted-by":"crossref","first-page":"524","DOI":"10.1136\/jamia.2010.003939","article-title":"High accuracy information extraction of medication information from clinical notes: 2009 i2b2 medication extraction challenge","volume":"17","author":"Patrick","year":"2010","journal-title":"J Am Med Inform Assoc"},{"issue":"5","key":"2020110613085822600_ocz207-B25","doi-asserted-by":"crossref","first-page":"563","DOI":"10.1136\/jamia.2010.004077","article-title":"Lancet: a high precision medication event extraction system for clinical text","volume":"17","author":"Li","year":"2010","journal-title":"J Am Med Inform Assoc"},{"issue":"4","key":"2020110613085822600_ocz207-B26","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1016\/j.ijmedinf.2008.08.006","article-title":"Assessment of commercial NLP engines for medication information extraction from dictated clinical notes","volume":"78","author":"Jagannathan","year":"2009","journal-title":"Int J Med Inform"},{"issue":"5","key":"2020110613085822600_ocz207-B27","doi-asserted-by":"crossref","first-page":"514","DOI":"10.1136\/jamia.2010.003947","article-title":"Extracting medication information from clinical text","volume":"17","author":"Uzuner","year":"2010","journal-title":"J Am Med Inform Assoc"},{"issue":"3","key":"2020110613085822600_ocz207-B28","doi-asserted-by":"crossref","first-page":"362","DOI":"10.1038\/clpt.2008.89","article-title":"Development of a large-scale de-identified DNA biobank to enable personalized medicine","volume":"84","author":"Roden","year":"2008","journal-title":"Clin Pharmacol Ther"},{"issue":"4","key":"2020110613085822600_ocz207-B29","doi-asserted-by":"crossref","first-page":"441","DOI":"10.1136\/amiajnl-2011-000116","article-title":"Normalized names for clinical drugs: RxNorm at 6 years","volume":"18","author":"Nelson","year":"2011","journal-title":"J Am Med Inform Assoc"},{"key":"2020110613085822600_ocz207-B30","first-page":"102","author":"Stenetorp","year":"2012"}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/27\/3\/407\/34153278\/ocz207.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/27\/3\/407\/34153278\/ocz207.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,11,6]],"date-time":"2020-11-06T14:23:55Z","timestamp":1604672635000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/27\/3\/407\/5707371"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,1,16]]},"references-count":30,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2020,1,16]]},"published-print":{"date-parts":[[2020,3,1]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocz207","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/19007286","asserted-by":"object"}]},"ISSN":["1527-974X"],"issn-type":[{"value":"1527-974X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,3]]},"published":{"date-parts":[[2020,1,16]]}}}