{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,22]],"date-time":"2025-02-22T00:45:24Z","timestamp":1740185124552,"version":"3.37.3"},"reference-count":28,"publisher":"Oxford University Press (OUP)","issue":"11","license":[{"start":{"date-parts":[[2022,4,28]],"date-time":"2022-04-28T00:00:00Z","timestamp":1651104000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,5,26]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Global acronyms are used in written text without their formal definitions. This makes it difficult to automatically interpret their sense as acronyms tend to be ambiguous. Supervised machine learning approaches to sense disambiguation require large training datasets. In clinical applications, large datasets are difficult to obtain due to patient privacy. Manual data annotation creates an additional bottleneck.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We proposed an approach to automatically modifying scientific abstracts to (i) simulate global acronym usage and (ii) annotate their senses without the need for external sources or manual intervention. We implemented it as a web-based application, which can create large datasets that in turn can be used to train supervised approaches to word sense disambiguation of biomedical acronyms.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>The datasets will be generated on demand based on a user query and will be downloadable from https:\/\/datainnovation.cardiff.ac.uk\/acronyms\/.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btac298","type":"journal-article","created":{"date-parts":[[2022,4,22]],"date-time":"2022-04-22T19:10:49Z","timestamp":1650654649000},"page":"3136-3138","source":"Crossref","is-referenced-by-count":1,"title":["Simulation and annotation of global acronyms"],"prefix":"10.1093","volume":"38","author":[{"given":"Maxim","family":"Filimonov","sequence":"first","affiliation":[{"name":"School of Computer Science and Informatics, Cardiff University , Cardiff CF24 4AG, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daphn\u00e9","family":"Chopard","sequence":"additional","affiliation":[{"name":"School of Computer Science and Informatics, Cardiff University , Cardiff CF24 4AG, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8132-3885","authenticated-orcid":false,"given":"Irena","family":"Spasi\u0107","sequence":"additional","affiliation":[{"name":"School of Computer Science and Informatics, Cardiff University , Cardiff CF24 4AG, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2022,4,28]]},"reference":[{"key":"2023041403083699500_","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1007\/978-1-4020-4809-8_8","volume-title":"Word Sense Disambiguation","author":"Agirre","year":"2006"},{"key":"2023041403083699500_","doi-asserted-by":"crossref","first-page":"576","DOI":"10.1197\/jamia.M1757","article-title":"ALICE: an algorithm to extract abbreviations from MEDLINE","volume":"12","author":"Ao","year":"2005","journal-title":"J. Am. Med. Inform. Assoc"},{"key":"2023041403083699500_","doi-asserted-by":"crossref","first-page":"D267","DOI":"10.1093\/nar\/gkh061","article-title":"The unified medical language system (UMLS): integrating biomedical terminology","volume":"32","author":"Bodenreider","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023041403083699500_","doi-asserted-by":"crossref","first-page":"612","DOI":"10.1197\/jamia.M1139","article-title":"Creating an online dictionary of abbreviations from MEDLINE","volume":"9","author":"Chang","year":"2002","journal-title":"J. Am. Med. Inform. Assoc"},{"first-page":"4171","year":"2018","author":"Devlin","key":"2023041403083699500_"},{"key":"2023041403083699500_","first-page":"713","article-title":"Submorphemic elements in the formation of acronyms, blends and clippings","volume":"2","author":"Fandrych","year":"2008","journal-title":"Lexis"},{"first-page":"233","year":"1992","author":"Gale","key":"2023041403083699500_"},{"key":"2023041403083699500_","doi-asserted-by":"crossref","first-page":"3658","DOI":"10.1093\/bioinformatics\/bti586","article-title":"Resolving abbreviations to their senses in MEDLINE","volume":"21","author":"Gaudan","year":"2005","journal-title":"Bioinformatics"},{"first-page":"3509","year":"2019","author":"Huang","key":"2023041403083699500_"},{"key":"2023041403083699500_","doi-asserted-by":"crossref","first-page":"1158","DOI":"10.3758\/BF03193106","article-title":"The acronym superiority effect","volume":"14","author":"Laszlo","year":"2007","journal-title":"Psychon. Bull. Rev"},{"first-page":"415","year":"2003","author":"Liu","key":"2023041403083699500_"},{"first-page":"464","year":"2002","author":"Liu","key":"2023041403083699500_"},{"year":"2021","key":"2023041403083699500_"},{"key":"2023041403083699500_","doi-asserted-by":"crossref","first-page":"299","DOI":"10.1136\/amiajnl-2012-001506","article-title":"A sense inventory for clinical abbreviations and acronyms created using clinical notes and medical dictionary resources","volume":"21","author":"Moon","year":"2014","journal-title":"J. Am. Med. Inform. Assoc"},{"first-page":"643","year":"2006","author":"Okazaki","key":"2023041403083699500_"},{"year":"2021","key":"2023041403083699500_"},{"key":"2023041403083699500_","first-page":"371","article-title":"Automatic extraction of acronym-meaning pairs from MEDLINE databases","volume":"84","author":"Pustejovsky","year":"2001","journal-title":"Stud. Health Technol. Inform"},{"first-page":"451","year":"2003","author":"Schwartz","key":"2023041403083699500_"},{"key":"2023041403083699500_","doi-asserted-by":"crossref","first-page":"402","DOI":"10.1186\/1471-2105-9-402","article-title":"Abbreviation definition identification based on automatic precision estimates","volume":"9","author":"Sohn","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023041403083699500_","doi-asserted-by":"crossref","first-page":"8351","DOI":"10.1109\/ACCESS.2018.2807122","article-title":"Acronyms as an integral part of multi-word term recognition\u2014a token of appreciation","volume":"6","author":"Spasi\u0107","year":"2018","journal-title":"IEEE Access"},{"year":"2021","author":"Spasi\u0107","key":"2023041403083699500_"},{"key":"2023041403083699500_","doi-asserted-by":"crossref","first-page":"e17984","DOI":"10.2196\/17984","article-title":"Clinical text data in machine learning: systematic review","volume":"8","author":"Spasi\u0107","year":"2020","journal-title":"JMIR Med. Inform"},{"key":"2023041403083699500_","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1186\/2041-1480-4-27","article-title":"FlexiTerm: a flexible term recognition method","volume":"4","author":"Spasi\u0107","year":"2013","journal-title":"J. Biomed. Semantics"},{"key":"2023041403083699500_","doi-asserted-by":"crossref","first-page":"e15980","DOI":"10.2196\/15980","article-title":"Cohort selection for clinical trials from longitudinal patient records: text mining approach","volume":"7","author":"Spasi\u0107","year":"2019","journal-title":"JMIR Med. Inform"},{"year":"2021","key":"2023041403083699500_"},{"key":"2023041403083699500_","doi-asserted-by":"crossref","first-page":"426","DOI":"10.1055\/s-0038-1634373","article-title":"Heuristics for identification of acronym-definition patterns within text: towards an automated construction of comprehensive acronym-definition dictionaries","volume":"41","author":"Wren","year":"2002","journal-title":"Methods Inf. Med"},{"key":"2023041403083699500_","doi-asserted-by":"crossref","first-page":"150","DOI":"10.1016\/j.jbi.2006.06.001","article-title":"Using MEDLINE as a knowledge source for disambiguating abbreviations and acronyms in full-text biomedical journal articles","volume":"40","author":"Yu","year":"2007","journal-title":"J. Biomed. Inform"},{"first-page":"57","year":"2003","author":"Yu","key":"2023041403083699500_"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btac298\/43615709\/btac298.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/11\/3136\/49878765\/btac298.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/11\/3136\/49878765\/btac298.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,20]],"date-time":"2023-11-20T12:15:23Z","timestamp":1700482523000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/38\/11\/3136\/6575439"}},"subtitle":[],"editor":[{"given":"Jonathan","family":"Wren","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2022,4,28]]},"references-count":28,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2022,5,26]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btac298","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2022,6,1]]},"published":{"date-parts":[[2022,4,28]]}}}