{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,18]],"date-time":"2026-01-18T12:11:40Z","timestamp":1768738300758,"version":"3.49.0"},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,11,16]],"date-time":"2023-11-16T00:00:00Z","timestamp":1700092800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,11,16]],"date-time":"2023-11-16T00:00:00Z","timestamp":1700092800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100000050","name":"NHLBI","doi-asserted-by":"crossref","award":["5T32HL007745"],"award-info":[{"award-number":["5T32HL007745"]}],"id":[{"id":"10.13039\/100000050","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Med Inform Decis Mak"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Introduction<\/jats:title>\n                <jats:p>Accurate identification of venous thromboembolism (VTE) is critical to develop replicable epidemiological studies and rigorous predictions models. Traditionally, VTE studies have relied on international classification of diseases (ICD) codes which are inaccurate \u2013 leading to misclassification bias. Here, we developed ClotCatcher, a novel deep learning model that uses natural language processing to detect VTE from radiology reports.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Methods<\/jats:title>\n                <jats:p>Radiology reports to detect VTE were obtained from patients admitted to Emory University Hospital (EUH) and Grady Memorial Hospital (GMH). Data augmentation was performed using the Google PEGASUS paraphraser. This data was then used to fine-tune ClotCatcher, a novel deep learning model. ClotCatcher was validated on both the EUH dataset alone and GMH dataset alone.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>The dataset contained 1358 studies from EUH and 915 studies from GMH (<jats:italic>n<\/jats:italic>\u2009=\u20092273). The dataset contained 1506 ultrasound studies with 528 (35.1%) studies positive for VTE, and 767 CT studies with 91 (11.9%) positive for VTE. When validated on the EUH dataset, ClotCatcher performed best (AUC\u2009=\u20090.980) when trained on both EUH and GMH dataset without paraphrasing. When validated on the GMH dataset, ClotCatcher performed best (AUC\u2009=\u20090.995) when trained on both EUH and GMH dataset with paraphrasing.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusion<\/jats:title>\n                <jats:p>ClotCatcher, a novel deep learning model with data augmentation rapidly and accurately adjudicated the presence of VTE from radiology reports. Applying ClotCatcher to large databases would allow for rapid and accurate adjudication of incident VTE. This would reduce misclassification bias and form the foundation for future studies to estimate individual risk for patient to develop incident VTE.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12911-023-02369-z","type":"journal-article","created":{"date-parts":[[2023,11,16]],"date-time":"2023-11-16T10:02:41Z","timestamp":1700128961000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["ClotCatcher: a novel natural language model to accurately adjudicate venous thromboembolism from radiology reports"],"prefix":"10.1186","volume":"23","author":[{"given":"Jeffrey","family":"Wang","sequence":"first","affiliation":[]},{"given":"Joao Souza","family":"de Vale","sequence":"additional","affiliation":[]},{"given":"Saransh","family":"Gupta","sequence":"additional","affiliation":[]},{"given":"Pulakesh","family":"Upadhyaya","sequence":"additional","affiliation":[]},{"given":"Felipe A.","family":"Lisboa","sequence":"additional","affiliation":[]},{"given":"Seth A.","family":"Schobel","sequence":"additional","affiliation":[]},{"given":"Eric A.","family":"Elster","sequence":"additional","affiliation":[]},{"given":"Christopher J.","family":"Dente","sequence":"additional","affiliation":[]},{"given":"Timothy G.","family":"Buchman","sequence":"additional","affiliation":[]},{"given":"Rishikesan","family":"Kamaleswaran","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,11,16]]},"reference":[{"issue":"4","key":"2369_CR1","doi-asserted-by":"publisher","first-page":"312S","DOI":"10.1378\/chest.108.4_Supplement.312S","volume":"108","author":"GP Clagett","year":"1995","unstructured":"Clagett GP, Anderson FA Jr, Heit J, Levine MN, Wheeler HB. Prevention of Venous Thromboembolism. Chest. 1995;108(4):312S-334S. https:\/\/doi.org\/10.1378\/chest.108.4_Supplement.312S.","journal-title":"Chest"},{"issue":"1","key":"2369_CR2","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1016\/S0749-0690(05)70107-5","volume":"17","author":"JA Heit","year":"2001","unstructured":"Heit JA. Prevention of venous thromboembolism. Clin Geriatr Med. 2001;17(1):71\u201392. https:\/\/doi.org\/10.1016\/S0749-0690(05)70107-5.","journal-title":"Clin Geriatr Med"},{"issue":"3","key":"2369_CR3","doi-asserted-by":"publisher","first-page":"187","DOI":"10.1136\/bmjqs-2012-001782","volume":"23","author":"BD Lau","year":"2014","unstructured":"Lau BD, Haut ER. Practices to prevent venous thromboembolism: a brief review. BMJ Qual Saf. 2014;23(3):187\u201395. https:\/\/doi.org\/10.1136\/bmjqs-2012-001782.","journal-title":"BMJ Qual Saf"},{"issue":"4 Suppl","key":"2369_CR4","doi-asserted-by":"publisher","first-page":"S495","DOI":"10.1016\/j.amepre.2009.12.017","volume":"38","author":"MG Beckman","year":"2010","unstructured":"Beckman MG, Hooper WC, Critchley SE, Ortel TL. Venous thromboembolism: a public health concern. Am J Prev Med. 2010;38(4 Suppl):S495-501. https:\/\/doi.org\/10.1016\/j.amepre.2009.12.017.","journal-title":"Am J Prev Med"},{"issue":"3","key":"2369_CR5","doi-asserted-by":"publisher","first-page":"423","DOI":"10.1016\/j.surg.2014.10.005","volume":"157","author":"KP Cohoon","year":"2015","unstructured":"Cohoon KP, Leibson CL, Ransom JE, et al. Direct medical costs attributable to venous thromboembolism among persons hospitalized for major operation: a population-based longitudinal study. Surgery. 2015;157(3):423\u201331. https:\/\/doi.org\/10.1016\/j.surg.2014.10.005.","journal-title":"Surgery"},{"key":"2369_CR6","doi-asserted-by":"publisher","unstructured":"Correction to: Call to Action to Prevent Venous Thromboembolism in Hospitalized Patients: A Policy Statement From the American Heart Association. Circulation. 2021 143(7);e249-e249. https:\/\/doi.org\/10.1161\/CIR.0000000000000956.","DOI":"10.1161\/CIR.0000000000000956"},{"issue":"8","key":"2369_CR7","doi-asserted-by":"publisher","first-page":"1611","DOI":"10.1111\/j.1538-7836.2005.01415.x","volume":"3","author":"HEIT Ja","year":"2005","unstructured":"Ja HEIT. Venous thromboembolism: disease burden, outcomes and risk factors. J Thromb Haemost. 2005;3(8):1611\u20137. https:\/\/doi.org\/10.1111\/j.1538-7836.2005.01415.x.","journal-title":"J Thromb Haemost"},{"issue":"10","key":"2369_CR8","doi-asserted-by":"publisher","first-page":"829","DOI":"10.1161\/circulationaha.114.009107","volume":"130","author":"KK S\u00f8gaard","year":"2014","unstructured":"S\u00f8gaard KK, Schmidt M, Pedersen L, Horv\u00e1th-Puh\u00f3 E, S\u00f8rensen HT. 30-year mortality after venous thromboembolism: a population-based cohort study. Circulation. 2014;130(10):829\u201336. https:\/\/doi.org\/10.1161\/circulationaha.114.009107.","journal-title":"Circulation"},{"issue":"9","key":"2369_CR9","first-page":"190","volume":"63","author":"MB Streiff","year":"2014","unstructured":"Streiff MB, Brady JP, Grant AM, Grosse SD, Wong B, Popovic T. CDC Grand Rounds: preventing hospital-associated venous thromboembolism. MMWR Morb Mortal Wkly Rep. 2014;63(9):190\u20133.","journal-title":"MMWR Morb Mortal Wkly Rep"},{"key":"2369_CR10","unstructured":"(US) OotSG. The Surgeon General's Call to Action to Prevent Deep Vein Thrombosis and Pulmonary Embolism. Office of the Surgeon General (US); 2008.\u00a0https:\/\/www.ncbi.nlm.nih.gov\/books\/NBK44178\/."},{"issue":"11","key":"2369_CR11","doi-asserted-by":"publisher","first-page":"e2240373","DOI":"10.1001\/jamanetworkopen.2022.40373","volume":"5","author":"E Neeman","year":"2022","unstructured":"Neeman E, Liu V, Mishra P, et al. Trends and risk factors for venous thromboembolism among hospitalized medical patients. JAMA Netw Open. 2022;5(11):e2240373\u2013e2240373. https:\/\/doi.org\/10.1001\/jamanetworkopen.2022.40373.","journal-title":"JAMA Netw Open"},{"issue":"4","key":"2369_CR12","doi-asserted-by":"publisher","first-page":"636","DOI":"10.1016\/j.thromres.2015.01.026","volume":"135","author":"RE Nelson","year":"2015","unstructured":"Nelson RE, Grosse SD, Waitzman NJ, et al. Using multiple sources of data for surveillance of postoperative venous thromboembolism among surgical patients treated in Department of Veterans Affairs hospitals, 2005\u20132010. Thromb Res. 2015;135(4):636\u201342. https:\/\/doi.org\/10.1016\/j.thromres.2015.01.026.","journal-title":"Thromb Res"},{"issue":"19","key":"2369_CR13","doi-asserted-by":"publisher","first-page":"1774","DOI":"10.1001\/archinternmed.2010.336","volume":"170","author":"SL Boulet","year":"2010","unstructured":"Boulet SL, Grosse SD, Hooper WC, Beckman MG, Atrash HK. Prevalence of venous thromboembolism among privately insured US adults. Arch Intern Med. 2010;170(19):1774\u20135. https:\/\/doi.org\/10.1001\/archinternmed.2010.336.","journal-title":"Arch Intern Med"},{"key":"2369_CR14","doi-asserted-by":"publisher","first-page":"112","DOI":"10.1016\/j.thromres.2020.02.023","volume":"189","author":"C Baumgartner","year":"2020","unstructured":"Baumgartner C, Go AS, Fan D, et al. Administrative codes inaccurately identify recurrent venous thromboembolism: The CVRN VTE study. Thromb Res. 2020;189:112\u20138. https:\/\/doi.org\/10.1016\/j.thromres.2020.02.023.","journal-title":"Thromb Res"},{"issue":"2","key":"2369_CR15","doi-asserted-by":"publisher","first-page":"397","DOI":"10.1007\/s10877-021-00664-6","volume":"36","author":"T Pellathy","year":"2022","unstructured":"Pellathy T, Saul M, Clermont G, Dubrawski AW, Pinsky MR, Hravnak M. Accuracy of identifying hospital acquired venous thromboembolism by administrative coding: implications for big data and machine learning research. J Clin Monit Comput. 2022;36(2):397\u2013405. https:\/\/doi.org\/10.1007\/s10877-021-00664-6.","journal-title":"J Clin Monit Comput"},{"key":"2369_CR16","doi-asserted-by":"publisher","first-page":"107602962110131","DOI":"10.1177\/10760296211013108","volume":"27","author":"B Woller","year":"2021","unstructured":"Woller B, Daw A, Aston V, et al. Natural language processing performance for the identification of venous thromboembolism in an integrated healthcare system. Clin Appl Thromb Hemost. 2021;27:10760296211013108. https:\/\/doi.org\/10.1177\/10760296211013108.","journal-title":"Clin Appl Thromb Hemost"},{"issue":"3","key":"2369_CR17","doi-asserted-by":"publisher","first-page":"281","DOI":"10.1007\/s11239-017-1532-y","volume":"44","author":"JA G\u00e1lvez","year":"2017","unstructured":"G\u00e1lvez JA, Pappas JM, Ahumada L, et al. The use of natural language processing on pediatric diagnostic radiology reports in the electronic health record to identify deep venous thrombosis in children. J Thromb Thrombolysis. 2017;44(3):281\u201390. https:\/\/doi.org\/10.1007\/s11239-017-1532-y.","journal-title":"J Thromb Thrombolysis"},{"issue":"4","key":"2369_CR18","doi-asserted-by":"publisher","first-page":"1175","DOI":"10.1016\/j.surg.2021.04.027","volume":"170","author":"J Shi","year":"2021","unstructured":"Shi J, Hurdle JF, Johnson SA, et al. Natural language processing for the surveillance of postoperative venous thromboembolism. Surgery. 2021;170(4):1175\u201382. https:\/\/doi.org\/10.1016\/j.surg.2021.04.027.","journal-title":"Surgery"},{"key":"2369_CR19","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1016\/j.thromres.2021.11.020","volume":"209","author":"AA Verma","year":"2022","unstructured":"Verma AA, Masoom H, Pou-Prom C, et al. Developing and validating natural language processing algorithms for radiology reports compared to ICD-10 codes for identifying venous thromboembolism in hospitalized medical patients. Thromb Res. 2022;209:51\u20138. https:\/\/doi.org\/10.1016\/j.thromres.2021.11.020.","journal-title":"Thromb Res"},{"key":"2369_CR20","unstructured":"Huang K, Altosaar J, Ranganath R. Clinicalbert: Modeling clinical notes and predicting hospital readmission. arXiv preprint arXiv:190405342. 2019;\u00a0https:\/\/arxiv.org\/abs\/1904.05342."},{"key":"2369_CR21","unstructured":"Lee J-S, Hsiang J. Patentbert: Patent classification with fine-tuning a pre-trained bert model. arXiv preprint arXiv:190602124. 2019;\u00a0https:\/\/arxiv.org\/abs\/1906.02124."},{"issue":"4","key":"2369_CR22","doi-asserted-by":"publisher","first-page":"1234","DOI":"10.1093\/bioinformatics\/btz682","volume":"36","author":"J Lee","year":"2020","unstructured":"Lee J, Yoon W, Kim S, et al. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics. 2020;36(4):1234\u201340.","journal-title":"Bioinformatics"},{"key":"2369_CR23","unstructured":"Devlin J, Chang M-W, Lee K, Toutanova K. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:181004805. 2018;\u00a0https:\/\/arxiv.org\/abs\/1810.04805."},{"key":"2369_CR24","doi-asserted-by":"crossref","unstructured":"Feng SY, Gangal V, Wei J, et al. A survey of data augmentation approaches for NLP. arXiv preprint arXiv:210503075. 2021;\u00a0https:\/\/arxiv.org\/abs\/2105.03075.","DOI":"10.18653\/v1\/2021.findings-acl.84"},{"issue":"2","key":"2369_CR25","doi-asserted-by":"publisher","first-page":"e054669","DOI":"10.1136\/bmjopen-2021-054669","volume":"12","author":"SC Weller","year":"2022","unstructured":"Weller SC, Porterfield L, Davis J, Wilkinson GS, Chen L, Baillargeon J. Incidence of venous thrombotic events and events of special interest in a retrospective cohort of commercially insured US patients. BMJ Open. 2022;12(2):e054669. https:\/\/doi.org\/10.1136\/bmjopen-2021-054669.","journal-title":"BMJ Open"},{"issue":"1","key":"2369_CR26","doi-asserted-by":"publisher","first-page":"58","DOI":"10.7812\/tpp\/21.019","volume":"26","author":"K Higashiya","year":"2022","unstructured":"Higashiya K, Ford J, Yoon HC. Variation in positivity rates of computed tomography pulmonary angiograms for the evaluation of acute pulmonary embolism among emergency department physicians. Perm J. 2022;26(1):58\u201363. https:\/\/doi.org\/10.7812\/tpp\/21.019.","journal-title":"Perm J"},{"issue":"1","key":"2369_CR27","doi-asserted-by":"publisher","first-page":"1022","DOI":"10.1038\/s41598-022-26467-6","volume":"13","author":"RM Wichmann","year":"2023","unstructured":"Wichmann RM, Fernandes FT, Chiavegatto Filho ADP, et al. Improving the performance of machine learning algorithms for health outcomes predictions in multicentric cohorts. Sci Rep. 2023;13(1):1022. https:\/\/doi.org\/10.1038\/s41598-022-26467-6.","journal-title":"Sci Rep"},{"issue":"1","key":"2369_CR28","doi-asserted-by":"publisher","first-page":"195","DOI":"10.1186\/s12916-019-1426-2","volume":"17","author":"CJ Kelly","year":"2019","unstructured":"Kelly CJ, Karthikesalingam A, Suleyman M, Corrado G, King D. Key challenges for delivering clinical impact with artificial intelligence. BMC Med. 2019;17(1):195. https:\/\/doi.org\/10.1186\/s12916-019-1426-2.","journal-title":"BMC Med"}],"container-title":["BMC Medical Informatics and Decision Making"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-023-02369-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12911-023-02369-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-023-02369-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,16]],"date-time":"2023-11-16T10:03:43Z","timestamp":1700129023000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcmedinformdecismak.biomedcentral.com\/articles\/10.1186\/s12911-023-02369-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,11,16]]},"references-count":28,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["2369"],"URL":"https:\/\/doi.org\/10.1186\/s12911-023-02369-z","relation":{},"ISSN":["1472-6947"],"issn-type":[{"value":"1472-6947","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,11,16]]},"assertion":[{"value":"14 June 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 November 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 November 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"This study was approved by the Emory University Institutional Review Board (STUDY00000302). This study was approved by the Emory University Institutional Review Board under waiver of consent due to the retrospective nature of the study.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"262"}}