{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,28]],"date-time":"2026-02-28T20:42:48Z","timestamp":1772311368445,"version":"3.50.1"},"reference-count":21,"publisher":"Springer Science and Business Media LLC","issue":"S5","license":[{"start":{"date-parts":[[2019,12,1]],"date-time":"2019-12-01T00:00:00Z","timestamp":1575158400000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2019,12,5]],"date-time":"2019-12-05T00:00:00Z","timestamp":1575504000000},"content-version":"vor","delay-in-days":4,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Med Inform Decis Mak"],"published-print":{"date-parts":[[2019,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Lung cancer is the second most common cancer for men and women; the wide adoption of electronic health records (EHRs) offers a potential to accelerate cohort-related epidemiological studies using informatics approaches. 
Since manual extraction from large volumes of text materials is time consuming and labor intensive, some efforts have emerged to automatically extract information from text for lung cancer patients using natural language processing (NLP), an artificial intelligence technique.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Methods<\/jats:title>\n                <jats:p>In this study, using an existing cohort of 2311 lung cancer patients with information about stage, histology, tumor grade, and therapies (chemotherapy, radiotherapy and surgery) manually ascertained, we developed and evaluated an NLP system to extract information on these variables automatically for the same patients from clinical narratives including clinical notes, pathology reports and surgery reports.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>Evaluation showed promising results with the recalls for stage, histology, tumor grade, and therapies achieving 89, 98, 78, and 100% respectively and the precisions were 70, 88, 90, and 100% respectively.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusion<\/jats:title>\n                <jats:p>This study demonstrated the feasibility and accuracy of automatically extracting pre-defined information from clinical narratives for lung cancer research.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12911-019-0931-8","type":"journal-article","created":{"date-parts":[[2019,12,5]],"date-time":"2019-12-05T01:02:23Z","timestamp":1575507743000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":27,"title":["Natural language processing for populating lung cancer clinical research 
data"],"prefix":"10.1186","volume":"19","author":[{"given":"Liwei","family":"Wang","sequence":"first","affiliation":[]},{"given":"Lei","family":"Luo","sequence":"additional","affiliation":[]},{"given":"Yanshan","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Jason","family":"Wampfler","sequence":"additional","affiliation":[]},{"given":"Ping","family":"Yang","sequence":"additional","affiliation":[]},{"given":"Hongfang","family":"Liu","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2019,12,5]]},"reference":[{"key":"931_CR1","unstructured":"American Cancer Society (ACS).. Cancer Facts & Figures 2017 [https:\/\/www.cancer.org\/research\/cancer-facts-statistics\/all-cancer-facts-figures\/cancer-facts-figures-2017.html] Access date: 25-Apr-2019."},{"key":"931_CR2","first-page":"469","volume-title":"Methods in Molecular Biology","author":"Ping Yang","year":"2009","unstructured":"Yang P. Epidemiology of lung cancer prognosis: quantity and quality of life. In: Cancer Epidemiology: Humana Press; 2009. p. 469\u201386."},{"issue":"3","key":"931_CR3","doi-asserted-by":"publisher","first-page":"659","DOI":"10.1002\/cncr.24831","volume":"116","author":"JA Barletta","year":"2010","unstructured":"Barletta JA, Yeap BY, Chirieac LR. Prognostic significance of grading in lung adenocarcinoma. Cancer. 2010;116(3):659\u201369.","journal-title":"Cancer"},{"issue":"suppl_9","key":"931_CR4","doi-asserted-by":"crossref","first-page":"ix135","DOI":"10.1093\/annonc\/mdm308","volume":"18","author":"B Besse","year":"2007","unstructured":"Besse B, Ropert S, Soria J. Targeted therapies in lung cancer. Ann Oncol. 2007;18(suppl_9):ix135\u201342.","journal-title":"Ann Oncol"},{"issue":"1","key":"931_CR5","doi-asserted-by":"publisher","first-page":"13050","DOI":"10.1038\/s41598-017-13495-w","volume":"7","author":"F Bie","year":"2017","unstructured":"Bie F, Qu X, Yang X, Pang Z, Yang Y, Liu S, Dong W, Du J. 
Appropriate surgical modalities for stages T2a and T2b in the eighth TNM classification of lung cancer. Sci Rep. 2017;7(1):13050.","journal-title":"Sci Rep"},{"key":"931_CR6","unstructured":"National Cancer Institute (NCI). Tumor Grade [https:\/\/www.cancer.gov\/about-cancer\/diagnosis-staging\/prognosis\/tumor-grade-fact-sheet] Access date: 25-Apr-2019."},{"key":"931_CR7","doi-asserted-by":"publisher","first-page":"139","DOI":"10.2147\/CLEP.S17191","volume":"3","author":"K Cetin","year":"2011","unstructured":"Cetin K, Ettinger DS, Y-j H, D O Malley C. Survival by histologic subtype in stage IV nonsmall cell lung cancer based on data from the surveillance, Epidemiology and End Results Program. Clin Epidemiol. 2011;3:139.","journal-title":"Clin Epidemiol"},{"key":"931_CR8","doi-asserted-by":"publisher","first-page":"61","DOI":"10.1146\/annurev-publhealth-032315-021353","volume":"37","author":"JA Casey","year":"2016","unstructured":"Casey JA, Schwartz BS, Stewart WF, Adler NE. Using electronic health records for population health research: a review of methods and applications. Annu Rev Public Health. 2016;37:61\u201381.","journal-title":"Annu Rev Public Health"},{"key":"931_CR9","doi-asserted-by":"publisher","first-page":"34","DOI":"10.1016\/j.jbi.2017.11.011","volume":"77","author":"Y Wang","year":"2018","unstructured":"Wang Y, Wang L, Rastegar-Mojarad M, Moon S, Shen F, Afzal N, Liu S, Zeng Y, Mehrabi S, Sohn S. Clinical information extraction applications: a literature review. J Biomed Inform. 2018;77:34\u201349.","journal-title":"J Biomed Inform"},{"issue":"4","key":"931_CR10","doi-asserted-by":"publisher","first-page":"440","DOI":"10.1136\/jamia.2010.003707","volume":"17","author":"AN Nguyen","year":"2010","unstructured":"Nguyen AN, Lawley MJ, Hansen DP, Bowman RV, Clarke BE, Duhig EE, Colquist S. Symbolic rule-based classification of lung cancer stages from free-text pathology reports. J Am Med Inform Assoc. 
2010;17(4):440\u20135.","journal-title":"J Am Med Inform Assoc"},{"issue":"2","key":"931_CR11","doi-asserted-by":"publisher","first-page":"157","DOI":"10.1200\/JOP.2015.004622","volume":"12","author":"JL Warner","year":"2015","unstructured":"Warner JL, Levy MA, Neuss MN. ReCAP: feasibility and accuracy of extracting cancer stage information from narrative electronic health record data. J Oncol Pract. 2015;12(2):157\u20138.","journal-title":"J Oncol Pract"},{"issue":"1","key":"931_CR12","doi-asserted-by":"publisher","first-page":"e8","DOI":"10.2196\/medinform.8662","volume":"6","author":"S Zheng","year":"2018","unstructured":"Zheng S, Jabbour SK, O'Reilly SE, Lu JJ, Dong L, Ding L, Xiao Y, Yue N, Wang F, Zou W. Automated information extraction on treatment and prognosis for non\u2013small cell lung cancer radiotherapy patients: clinical study. JMIR Med Inform. 2018;6(1):e8.","journal-title":"JMIR Med Inform"},{"key":"931_CR13","first-page":"268","volume":"2017","author":"E Soysal","year":"2017","unstructured":"Soysal E, Warner JL, Denny JC, Xu H. Identifying metastases-related information from pathology reports of lung cancer patients. AMIA Summits Transl Sci Proc. 2017;2017:268.","journal-title":"AMIA Summits Transl Sci Proc"},{"issue":"21","key":"931_CR14","doi-asserted-by":"publisher","first-page":"e115","DOI":"10.1158\/0008-5472.CAN-17-0615","volume":"77","author":"GK Savova","year":"2017","unstructured":"Savova GK, Tseytlin E, Finan S, Castine M, Miller T, Medvedeva O, Harris D, Hochheiser H, Lin C, Chavan G. DeepPhe: a natural language processing system for extracting cancer phenotypes from clinical records. Cancer Res. 2017;77(21):e115\u20138.","journal-title":"Cancer Res"},{"issue":"6","key":"931_CR15","doi-asserted-by":"publisher","first-page":"749","DOI":"10.1093\/aje\/kwt441","volume":"179","author":"DS Carrell","year":"2014","unstructured":"Carrell DS, Halgrim S, Tran D-T, Buist DS, Chubak J, Chapman WW, Savova G. 
Using natural language processing to improve efficiency of manual chart abstraction in research: the case of breast cancer recurrence. Am J Epidemiol. 2014;179(6):749\u201358.","journal-title":"Am J Epidemiol"},{"key":"931_CR16","first-page":"149","volume":"2013","author":"H Liu","year":"2013","unstructured":"Liu H, Bielinski SJ, Sohn S, Murphy S, Wagholikar KB, Jonnalagadda SR, Ravikumar K, Wu ST, Kullo IJ, Chute CG. An information extraction framework for cohort identification using electronic health records. AMIA Summits Transl Sci Proc. 2013;2013:149.","journal-title":"AMIA Summits Transl Sci Proc"},{"issue":"9","key":"931_CR17","doi-asserted-by":"publisher","first-page":"1243","DOI":"10.1097\/JTO.0000000000000630","volume":"10","author":"WD Travis","year":"2015","unstructured":"Travis WD, Brambilla E, Nicholson AG, Yatabe Y, Austin JH, Beasley MB, Chirieac LR, Dacic S, Duhig E, Flieder DB. The 2015 World Health Organization classification of lung tumors: impact of genetic, clinical and radiologic advances since the 2004 classification. J Thorac Oncol. 2015;10(9):1243\u201360.","journal-title":"J Thorac Oncol"},{"key":"931_CR18","first-page":"1524","volume-title":"AMIA Annual Symposium Proceedings: 2018: American Medical Informatics Association","author":"Y Si","year":"2018","unstructured":"Si Y, Roberts K. A frame-based NLP system for cancer-related information extraction. In: AMIA Annual Symposium Proceedings: 2018: American Medical Informatics Association; 2018. p. 1524."},{"issue":"11","key":"931_CR19","doi-asserted-by":"publisher","first-page":"2278","DOI":"10.1109\/5.726791","volume":"86","author":"Y LeCun","year":"1998","unstructured":"LeCun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proc IEEE. 
1998;86(11):2278\u2013324.","journal-title":"Proc IEEE"},{"key":"931_CR20","first-page":"746","volume-title":"Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: 2013","author":"T Mikolov","year":"2013","unstructured":"Mikolov T, W-t Y, Zweig G. Linguistic regularities in continuous space word representations. In: Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: 2013; 2013. p. 746\u201351."},{"key":"931_CR21","doi-asserted-by":"publisher","first-page":"12","DOI":"10.1016\/j.jbi.2018.09.008","volume":"87","author":"Y Wang","year":"2018","unstructured":"Wang Y, Liu S, Afzal N, Rastegar-Mojarad M, Wang L, Shen F, Liu H. A Comparison of Word Embeddings for the Biomedical Natural Language Processing. J Biomed Inform. 2018;87:12.","journal-title":"J Biomed Inform"}],"container-title":["BMC Medical Informatics and Decision 
Making"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-019-0931-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s12911-019-0931-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-019-0931-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,12,4]],"date-time":"2020-12-04T00:19:53Z","timestamp":1607041193000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcmedinformdecismak.biomedcentral.com\/articles\/10.1186\/s12911-019-0931-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,12]]},"references-count":21,"journal-issue":{"issue":"S5","published-print":{"date-parts":[[2019,12]]}},"alternative-id":["931"],"URL":"https:\/\/doi.org\/10.1186\/s12911-019-0931-8","relation":{},"ISSN":["1472-6947"],"issn-type":[{"value":"1472-6947","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,12]]},"assertion":[{"value":"5 December 2019","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"N\/A","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"N\/A","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"239"}}