{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,24]],"date-time":"2026-03-24T15:46:08Z","timestamp":1774367168870,"version":"3.50.1"},"reference-count":48,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2024,2,28]],"date-time":"2024-02-28T00:00:00Z","timestamp":1709078400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Digit. Health"],"abstract":"<jats:p>Clinical narratives commonly use acronyms without explicitly defining their long forms. This makes it difficult to automatically interpret their sense as acronyms tend to be highly ambiguous. Supervised learning approaches to their disambiguation in the clinical domain are hindered by issues associated with patient privacy and manual annotation, which limit the size and diversity of training data. In this study, we demonstrate how scientific abstracts can be utilised to overcome these issues by creating a large automatically annotated dataset of artificially simulated global acronyms. A neural network trained on such a dataset achieved the F1-score of 95% on disambiguation of acronym mentions in scientific abstracts. This network was integrated with multi-word term recognition to extract a sense inventory of acronyms from a corpus of clinical narratives on the fly. Acronym sense extraction achieved the F1-score of 74% on a corpus of radiology reports. In clinical practice, the suggested approach can be used to facilitate development of institution-specific inventories.<\/jats:p>","DOI":"10.3389\/fdgth.2024.1282043","type":"journal-article","created":{"date-parts":[[2024,2,28]],"date-time":"2024-02-28T04:40:16Z","timestamp":1709095216000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Word sense disambiguation of acronyms in clinical narratives"],"prefix":"10.3389","volume":"6","author":[{"given":"Daphn\u00e9","family":"Chopard","sequence":"first","affiliation":[]},{"given":"Padraig","family":"Corcoran","sequence":"additional","affiliation":[]},{"given":"Irena","family":"Spasi\u0107","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2024,2,28]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","DOI":"10.4000\/lexis.713","article-title":"Submorphemic elements in the formation of acronyms, blends, clippings","author":"Fandrych","year":"2008","journal-title":"Lexis J Engl Lexicol"},{"key":"B2","doi-asserted-by":"publisher","first-page":"1158","DOI":"10.3758\/BF03193106","article-title":"The acronym superiority effect","volume":"14","author":"Laszlo","year":"2007","journal-title":"Psychon Bull Rev"},{"key":"B3","doi-asserted-by":"publisher","first-page":"299","DOI":"10.1136\/amiajnl-2012-001506","article-title":"A sense inventory for clinical abbreviations, acronyms created using clinical notes, medical dictionary resources","volume":"21","author":"Moon","year":"2014","journal-title":"J Am Med Inform Assoc"},{"key":"B4","doi-asserted-by":"publisher","first-page":"e15980","DOI":"10.2196\/15980","article-title":"Cohort selection for clinical trials from longitudinal patient records: text mining approach","volume":"7","author":"Spasi\u0107","year":"2019","journal-title":"JMIR Med Inform"},{"key":"B5","doi-asserted-by":"publisher","first-page":"1073","DOI":"10.1111\/imj.14442","article-title":"Ambiguous medical abbreviation study: challenges and opportunities","volume":"50","author":"Holper","year":"2020","journal-title":"Intern Med J"},{"key":"B6","doi-asserted-by":"publisher","first-page":"D267","DOI":"10.1093\/nar\/gkh061","article-title":"The Unified Medical Language System (UMLS): integrating biomedical terminology","volume":"32","author":"Bodenreider","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"B7","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1007\/978-1-4020-4809-8_8","article-title":"Knowledge sources for WSD","volume-title":"Word Sense Disambiguation","author":"Agirre","year":"2007"},{"key":"B8","doi-asserted-by":"publisher","first-page":"e17984","DOI":"10.2196\/17984","article-title":"Clinical text data in machine learning: systematic review","volume":"8","author":"Spasi\u0107","year":"2020","journal-title":"JMIR Med Inform"},{"key":"B9","doi-asserted-by":"publisher","first-page":"732381","DOI":"10.3389\/frai.2021.732381","article-title":"Acronyms, opportunities for improving deep nets","volume":"4","author":"Church","year":"2021","journal-title":"Front Artif Intell"},{"key":"B10","doi-asserted-by":"crossref","first-page":"451","DOI":"10.1142\/9789812776303_0042","article-title":"A simple algorithm for identifying abbreviation definitions in biomedical text","volume-title":"Biocomputing 2003","author":"Schwartz","year":"2002"},{"key":"B11","doi-asserted-by":"publisher","first-page":"402","DOI":"10.1186\/1471-2105-9-402","article-title":"Abbreviation definition identification based on automatic precision estimates","volume":"9","author":"Sohn","year":"2008","journal-title":"BMC Bioinf"},{"key":"B12","article-title":"Acronym identification, disambiguation shared tasks for scientific document understanding","author":"Veyseh","year":"2021"},{"key":"B13","article-title":"BERT-based acronym disambiguation with multiple training strategies","author":"Pan","year":"2021"},{"key":"B14","article-title":"Leveraging domain agnostic, specific knowledge for acronym disambiguation","author":"Zhong","year":"2021"},{"key":"B15","article-title":"Primer AI\u2019s systems for acronym identification, disambiguation","author":"Egan","year":"2021"},{"key":"B16","article-title":"SCIDR at SDU-2020: ideas - identifying and disambiguating everyday acronyms for scientific domain","author":"Singh","year":"2021"},{"key":"B17","article-title":"AI-NLM exploration of the acronym identification shared task at SDU@AAAI-21","author":"Rogers","year":"2021"},{"key":"B18","article-title":"Participation of UC3M in SDU@AAAI-21: a hybrid approach to disambiguate scientific acronyms","author":"Jaber","year":"2021"},{"key":"B19","article-title":"Acronym expander at SDU@AAAI-21: an acronym disambiguation module","author":"Pereira","year":"2021"},{"key":"B20","doi-asserted-by":"publisher","first-page":"1073","DOI":"10.1111\/imj.14442","article-title":"Ambiguous medical abbreviation study: challenges and opportunities","volume":"50","author":"Holper","year":"2020","journal-title":"Intern Med J"},{"key":"B21","year":""},{"key":"B22","doi-asserted-by":"publisher","first-page":"2813","DOI":"10.1093\/bioinformatics\/btl480","article-title":"ADAM: Another database of abbreviations in MEDLINE","volume":"22","author":"Zhou","year":"2006","journal-title":"Bioinformatics"},{"key":"B23","doi-asserted-by":"publisher","first-page":"62","DOI":"10.1016\/j.jbi.2018.11.004","article-title":"A method for harmonization of clinical abbreviation and acronym sense inventories","volume":"88","author":"Grossman","year":"2018","journal-title":"J Biomed Inform"},{"key":"B24","author":"Xia","year":""},{"key":"B25","author":"Jagannathan","year":""},{"key":"B26","author":"Patrick","year":""},{"key":"B27","author":"Wu","year":""},{"key":"B28","author":"Zweigenbaum","year":""},{"key":"B29","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1186\/s13326-016-0084-y","article-title":"Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe\/CLEF eHealth challenge 2013, task 2","volume":"7","author":"Mowery","year":"2016","journal-title":"J Biomed Semant"},{"key":"B30","author":"Devlin","year":""},{"key":"B31","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/2041-1480-4-27","article-title":"FlexiTerm: a flexible term recognition method","volume":"4","author":"Spasi\u0107","year":"2013","journal-title":"J Biomed Semant"},{"key":"B32","doi-asserted-by":"publisher","first-page":"3136","DOI":"10.1093\/bioinformatics\/btac298","article-title":"Simulation, annotation of global acronyms","volume":"38","author":"Filimonov","year":"2022","journal-title":"Bioinformatics"},{"key":"B33","first-page":"3509","article-title":"GlossBERT: BERT for word sense disambiguation with gloss knowledge","author":"Huang","year":"2019"},{"key":"B34","volume-title":"WordNet: an electronic lexical database","author":"Miller","year":"1998"},{"key":"B35","doi-asserted-by":"publisher","first-page":"e28632","DOI":"10.2196\/28632","article-title":"Text mining of adverse events in clinical trials: deep learning approach","volume":"9","author":"Chopard","year":"2021","journal-title":"JMIR Med Inform"},{"key":"B36","doi-asserted-by":"publisher","first-page":"e215","DOI":"10.1161\/01.cir.101.23.e215","article-title":"PhysioBank, PhysioToolkit, PhysioNet: components of a new research resource for complex physiologic signals","volume":"101","author":"Goldberger","year":"2000","journal-title":"Circulation"},{"key":"B37","doi-asserted-by":"publisher","first-page":"615","DOI":"10.1016\/j.jbi.2013.04.009","article-title":"Trak ontology: defining standard care for the rehabilitation of knee conditions","volume":"46","author":"Button","year":"2013","journal-title":"J Biomed Inform"},{"key":"B38","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13326-015-0033-1","article-title":"KneeTex: an ontology\u2013driven system for information extraction from MRI reports","volume":"6","author":"Spasi\u0107","year":"2015","journal-title":"J Biomed Semant"},{"key":"B39","doi-asserted-by":"publisher","DOI":"10.1038\/s41597-021-00929-4","article-title":"A deep database of medical abbreviations, acronyms for natural language processing","volume":"8","author":"Grossman Liu","year":"2021","journal-title":"Sci Data"},{"key":"B40","doi-asserted-by":"publisher","first-page":"8351","DOI":"10.1109\/ACCESS.2018.2807122","article-title":"Acronyms as an integral part of multi-word term recognition \u2013 a token of appreciation","volume":"6","author":"Spasi\u0107","year":"2018","journal-title":"IEEE Access"},{"key":"B41","author":"Spasi\u0107","year":""},{"key":"B42","first-page":"233","article-title":"One sense per discourse","author":"Gale","year":"1992"},{"key":"B43","doi-asserted-by":"publisher","first-page":"121","DOI":"10.5715\/jnlp.1.21","article-title":"A comparative study of automatic extraction of collocations from corpora: mutual information vs. cost criteria","volume":"1","author":"Kita","year":"1994","journal-title":"J Nat Lang Process"},{"key":"B44","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1142\/9789812776303_0039","article-title":"Mining terminological knowledge in large biomedical corpora","volume-title":"Biocomputing 2003","author":"Liu","year":"2002"},{"key":"B45","doi-asserted-by":"publisher","first-page":"104101","DOI":"10.1016\/j.ijmedinf.2020.104101","article-title":"The CLASSE GATOR (clinical acronym sense disambiguator): a method for predicting acronym sense from neonatal clinical notes","volume":"137","author":"Kashyap","year":"2020","journal-title":"Int J Med Inform"},{"key":"B46","first-page":"4699","article-title":"Exploring transformer text generation for medical dataset augmentation","author":"Amin-Nejad","year":"2020"},{"key":"B47","first-page":"610","article-title":"On the dangers of stochastic parrots: can language models be too big?","author":"Bender","year":"2021"},{"key":"B48","doi-asserted-by":"publisher","first-page":"ooad052","DOI":"10.1093\/jamiaopen\/ooad052","article-title":"A novel method to create realistic synthetic medication data","volume":"6","author":"Hodges","year":"2023","journal-title":"JAMIA Open"}],"container-title":["Frontiers in Digital Health"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fdgth.2024.1282043\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,28]],"date-time":"2024-02-28T04:40:20Z","timestamp":1709095220000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fdgth.2024.1282043\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,2,28]]},"references-count":48,"alternative-id":["10.3389\/fdgth.2024.1282043"],"URL":"https:\/\/doi.org\/10.3389\/fdgth.2024.1282043","relation":{},"ISSN":["2673-253X"],"issn-type":[{"value":"2673-253X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,2,28]]},"article-number":"1282043"}}