{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T19:06:10Z","timestamp":1775934370540,"version":"3.50.1"},"reference-count":83,"publisher":"MDPI AG","issue":"24","license":[{"start":{"date-parts":[[2022,12,8]],"date-time":"2022-12-08T00:00:00Z","timestamp":1670457600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Annual Funding track by the Deanship of Scientific Research, Vice Presidency for Graduate Studies and Scientific Research, King Faisal University, Saudi Arabia","award":["GRANT 2114"],"award-info":[{"award-number":["GRANT 2114"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>This study aims to develop and evaluate an automated system for extracting information related to patient substance use (smoking, alcohol, and drugs) from unstructured clinical text (medical discharge records). The authors propose a four-stage system for the extraction of the substance-use status and related attributes (type, frequency, amount, quit-time, and period). The first stage uses a keyword search technique to detect sentences related to substance use and to exclude unrelated records. In the second stage, an extension of the NegEx negation detection algorithm is developed and employed for detecting the negated records. The third stage involves identifying the temporal status of the substance use by applying windowing and chunking methodologies. Finally, in the fourth stage, regular expressions, syntactic patterns, and keyword search techniques are used in order to extract the substance-use attributes. The proposed system achieves an F1-score of up to 0.99 for identifying substance-use-related records, 0.98 for detecting the negation status, and 0.94 for identifying temporal status. Moreover, F1-scores of up to 0.98, 0.98, 1.00, 0.92, and 0.98 are achieved for the extraction of the amount, frequency, type, quit-time, and period attributes, respectively. Natural Language Processing (NLP) and rule-based techniques are employed efficiently for extracting substance-use status and attributes, with the proposed system being able to detect substance-use status and attributes over both sentence-level and document-level data. Results show that the proposed system outperforms the compared state-of-the-art substance-use identification system on an unseen dataset, demonstrating its generalisability.<\/jats:p>","DOI":"10.3390\/s22249609","type":"journal-article","created":{"date-parts":[[2022,12,8]],"date-time":"2022-12-08T03:35:53Z","timestamp":1670470553000},"page":"9609","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Automated Detection of Substance-Use Status and Related Information from Clinical Text"],"prefix":"10.3390","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6651-8342","authenticated-orcid":false,"given":"Raid","family":"Alzubi","sequence":"first","affiliation":[{"name":"Department of Computer Science, College of Computer Science and Information Technology, King Faisal University, Al-Ahsa 31982, Saudi Arabia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0846-4075","authenticated-orcid":false,"given":"Hadeel","family":"Alzoubi","sequence":"additional","affiliation":[{"name":"Department of Computer Science, College of Computer Science and Information Technology, King Faisal University, Al-Ahsa 31982, Saudi Arabia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9190-0941","authenticated-orcid":false,"given":"Stamos","family":"Katsigiannis","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Durham University, Upper Mountjoy Campus, Stockton Road, Durham DH1 3LE, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7245-8825","authenticated-orcid":false,"given":"Daune","family":"West","sequence":"additional","affiliation":[{"name":"School of Computing, Engineering and Physical Sciences, University of the West of Scotland, High St., Paisley PA1 2BE, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5088-1462","authenticated-orcid":false,"given":"Naeem","family":"Ramzan","sequence":"additional","affiliation":[{"name":"School of Computing, Engineering and Physical Sciences, University of the West of Scotland, High St., Paisley PA1 2BE, UK"}]}],"member":"1968","published-online":{"date-parts":[[2022,12,8]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"64","DOI":"10.1093\/ajcp\/107.1.64","article-title":"Smoking, alcohol consumption, and leukocyte counts","volume":"107","author":"Parry","year":"1997","journal-title":"Am. J. Clin. Pathol."},{"key":"ref_2","unstructured":"Centers for Disease Control and Prevention (CDC) (2021, October 20). Unintentional Drug Poisoning in the United States, Available online: https:\/\/www.cdc.gov\/medicationsafety\/pdfs\/cdc_5538_ds1.pdf."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"2093","DOI":"10.1016\/S0140-6736(11)60512-6","article-title":"Global burden of disease in young people aged 10\u201324 years: A systematic analysis","volume":"377","author":"Gore","year":"2011","journal-title":"Lancet"},{"key":"ref_4","unstructured":"World Health Organization and Research for International Tobacco Control (2008). WHO Report on the Global Tobacco Epidemic, 2008: The MPOWER Package, World Health Organization."},{"key":"ref_5","first-page":"129","article-title":"Neurophysiologic findings in chronic alcohol abuse","volume":"37","author":"Koch","year":"1985","journal-title":"Psychiatr. Neurol. Und Med. Psychol."},{"key":"ref_6","first-page":"371","article-title":"Alcoholic diseases in hepato-gastroenterology: A point of view","volume":"55","author":"Testino","year":"2008","journal-title":"Hepato-Gastroenterol."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Caan, W., and De Belleroche, J. (2002). Drink, Drugs and Dependence: From Science to Clinical Practice, Routledge.","DOI":"10.4324\/9780203219812"},{"key":"ref_8","unstructured":"(2021, October 20). Health Consequences of Drug Misuse, by National Institute On Drug Abuse, Available online: https:\/\/www.drugabuse.gov\/related-topics\/health-consequences-drug-misuse."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"American Psychiatric Association and Others (2013). Diagnostic and Statistical Manual of Mental Disorders (DSM-5\u00ae), American Psychiatric Association Publishing.","DOI":"10.1176\/appi.books.9780890425596"},{"key":"ref_10","unstructured":"NHS Digital (2021, October 20). Statistics on Smoking, England 2020. Available online: https:\/\/digital.nhs.uk\/data-and-information\/publications\/statistical\/statistics-on-smoking\/statistics-on-smoking-england-2020."},{"key":"ref_11","unstructured":"Office for National Statistics (2021, October 20). Adult Smoking Habits in the UK: 2019, Available online: https:\/\/www.ons.gov.uk\/peoplepopulationandcommunity\/healthandsocialcare\/healthandlifeexpectancies\/bulletins\/adultsmokinghabitsingreatbritain\/2019."},{"key":"ref_12","unstructured":"Alcohol Change, UK (2021, October 20). The Alcohol Change Report. Available online: https:\/\/alcoholchange.org.uk\/get-involved\/campaigns\/the-alcohol-change-report."},{"key":"ref_13","unstructured":"Burton, R., Henn, C., Lavoie, D., O\u2019Connor, R., Perkins, C., Sweeney, K., Greaves, F., Ferguson, B., Beynon, C., and Belloni, A. (2021, October 20). The Public Health Burden of Alcohol and the Effectiveness and Cost-Effectiveness of Alcohol Control Policies: An Evidence Review, Available online: https:\/\/www.gov.uk\/government\/publications\/the-public-health-burden-of-alcohol-evidence-review."},{"key":"ref_14","unstructured":"Office for National Statistics (2021, October 20). Drug Misuse in England and Wales: Year Ending March 2020, Available online: https:\/\/www.ons.gov.uk\/peoplepopulationandcommunity\/crimeandjustice\/articles\/drugmisuseinenglandandwales\/yearendingmarch2020."},{"key":"ref_15","unstructured":"Office for National Statistics (2021, October 20). Deaths Related to Drug Poisoning in England and Wales: 2019 Registrations, Available online: https:\/\/www.ons.gov.uk\/peoplepopulationandcommunity\/birthsdeathsandmarriages\/deaths\/bulletins\/deathsrelatedtodrugpoisoninginenglandandwales\/2019registrations."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"102083","DOI":"10.1016\/j.artmed.2021.102083","article-title":"Multi-domain clinical natural language processing with MedCAT: The Medical Concept Annotation Toolkit","volume":"117","author":"Kraljevic","year":"2021","journal-title":"Artif. Intell. Med."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"214","DOI":"10.1007\/s10916-018-1075-6","article-title":"The use of Electronic Health Records to Support Population Health: A Systematic Review of the Literature","volume":"42","author":"Kruse","year":"2018","journal-title":"J. Med. Syst."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1136\/amiajnl-2013-001935","article-title":"A review of approaches to identifying patient phenotype cohorts using electronic health records","volume":"21","author":"Shivade","year":"2014","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Alzoubi, H., Alzubi, R., Ramzan, N., West, D., Al-Hadhrami, T., and Alazab, M. (2019). A review of automatic phenotyping approaches using electronic health records. Electronics, 8.","DOI":"10.3390\/electronics8111235"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Al-Qahtani, M., Katsigiannis, S., and Ramzan, N. (2021). Information Retrieval from Electronic Health Records. Engineering and Technology for Healthcare, Wiley-IEEE. Chapter 6.","DOI":"10.1002\/9781119644316.ch6"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"128","DOI":"10.1055\/s-0038-1638592","article-title":"Extracting information from textual documents in the electronic health record: A review of recent research","volume":"17","author":"Meystre","year":"2008","journal-title":"Yearb. Med. Inform."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1007\/s10916-018-1018-2","article-title":"Data mining algorithms and techniques in mental health: A systematic review","volume":"42","author":"Alonso","year":"2018","journal-title":"J. Med. Syst."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"102086","DOI":"10.1016\/j.artmed.2021.102086","article-title":"Med7: A transferable clinical natural language processing model for electronic health records","volume":"118","author":"Kormilitzin","year":"2021","journal-title":"Artif. Intell. Med."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"209","DOI":"10.1007\/s10916-018-1066-7","article-title":"Extraction of Ejection Fraction from Echocardiography Notes for Constructing a Cohort of Patients having Heart Failure with reduced Ejection Fraction (HFrEF)","volume":"42","author":"Wagholikar","year":"2018","journal-title":"J. Med. Syst."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"100511","DOI":"10.1016\/j.cosrev.2022.100511","article-title":"Neural Natural Language Processing for Unstructured Data in Electronic Health Records: A Review","volume":"46","author":"Li","year":"2022","journal-title":"Comput. Sci. Rev."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"880","DOI":"10.1001\/jama.2011.1219","article-title":"The promise of electronic records: Around the corner or down the road?","volume":"306","author":"Jha","year":"2011","journal-title":"Jama"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"364","DOI":"10.1016\/j.anai.2013.07.022","article-title":"Automated chart review for asthma cohort identification using natural language processing: An exploratory study","volume":"111","author":"Wu","year":"2013","journal-title":"Ann. Allergy Asthma Immunol."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Kullo, I.J., Ding, K., Jouni, H., Smith, C.Y., and Chute, C.G. (2010). A genome-wide association study of red blood cell traits using the electronic medical record. PLoS ONE, 5.","DOI":"10.1371\/journal.pone.0013011"},{"key":"ref_29","first-page":"43","article-title":"A hybrid approach to sentiment sentence classification in suicide notes","volume":"5","author":"Sohn","year":"2012","journal-title":"Biomed. Inform. Insights"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"i144","DOI":"10.1136\/amiajnl-2011-000351","article-title":"Drug side effect extraction from clinical narratives of psychiatry and psychology patients","volume":"18","author":"Sohn","year":"2011","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1136\/amiajnl-2014-002945","article-title":"HARVEST, a longitudinal patient record summarizer","volume":"22","author":"Hirsch","year":"2014","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"392","DOI":"10.1197\/jamia.M1552","article-title":"Automated encoding of clinical documents based on natural language processing","volume":"11","author":"Friedman","year":"2004","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"507","DOI":"10.1136\/jamia.2009.001560","article-title":"Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): Architecture, component evaluation and applications","volume":"17","author":"Savova","year":"2010","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_34","unstructured":"Aronson, A.R. (2001, January 3\u20137). Effective mapping of biomedical text to the UMLS Metathesaurus: The MetaMap program. Proceedings of the AMIA Symposium. American Medical Informatics Association, Washington, DC, USA."},{"key":"ref_35","first-page":"349","article-title":"Exploiting semantic relations for literature-based discovery","volume":"Volume 2006","author":"Hristovski","year":"2006","journal-title":"Proceedings of the AMIA Annual Symposium Proceedings"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1197\/jamia.M2444","article-title":"Evaluating the state-of-the-art in automatic de-identification","volume":"14","author":"Uzuner","year":"2007","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"561","DOI":"10.1197\/jamia.M3115","article-title":"Recognizing obesity and comorbidities in sparse data","volume":"16","author":"Uzuner","year":"2009","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"514","DOI":"10.1136\/jamia.2010.003947","article-title":"Extracting medication information from clinical text","volume":"17","author":"Uzuner","year":"2010","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"552","DOI":"10.1136\/amiajnl-2011-000203","article-title":"2010 i2b2\/VA challenge on concepts, assertions, and relations in clinical text","volume":"18","author":"Uzuner","year":"2011","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"786","DOI":"10.1136\/amiajnl-2011-000784","article-title":"Evaluating the state of the art in coreference resolution for electronic medical records","volume":"19","author":"Uzuner","year":"2012","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"S11","DOI":"10.1016\/j.jbi.2015.06.007","article-title":"Automated systems for the de-identification of longitudinal clinical narratives: Overview of 2014 i2b2\/UTHealth shared task Track 1","volume":"58","author":"Stubbs","year":"2015","journal-title":"J. Biomed. Inform."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1006\/jbin.2001.1029","article-title":"A simple algorithm for identifying negated findings and diseases in discharge summaries","volume":"34","author":"Chapman","year":"2001","journal-title":"J. Biomed. Inform."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s00392-016-1025-6","article-title":"Electronic health records to facilitate clinical research","volume":"106","author":"Cowie","year":"2017","journal-title":"Clin. Res. Cardiol."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"1419","DOI":"10.1093\/jamia\/ocy068","article-title":"Opportunities and challenges in developing deep learning models using electronic health records data: A systematic review","volume":"25","author":"Xiao","year":"2018","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"871","DOI":"10.1136\/amiajnl-2014-002694","article-title":"N-gram support vector machines for scalable procedure and diagnosis classification, with applications to clinical free text data from the intensive care unit","volume":"21","author":"Marafino","year":"2014","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"8313454","DOI":"10.1155\/2016\/8313454","article-title":"Natural Language Processing Based Instrument for Classification of Free Text Medical Records","volume":"2016","author":"Khachidze","year":"2016","journal-title":"BioMed Res. Int."},{"key":"ref_47","first-page":"246","article-title":"Medical Text Classification Using Convolutional Neural Networks","volume":"235","author":"Hughes","year":"2017","journal-title":"Stud. Health Technol. Inform."},{"key":"ref_48","unstructured":"Mikolov, T., Sutskever, I., Chen, K., Corrado, G., and Dean, J. Distributed Representations of Words and Phrases and Their Compositionality. Proceedings of the 26th International Conference on Neural Information Processing Systems\u2014Volume 2, NIPS\u201913."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Pennington, J., Socher, R., and Manning, C. (2014). GloVe: Global Vectors for Word Representation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics.","DOI":"10.3115\/v1\/D14-1162"},{"key":"ref_50","unstructured":"Devlin, J., Chang, M.W., Lee, K., and Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Association for Computational Linguistics."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Alsentzer, E., Murphy, J., Boag, W., Weng, W.H., Jindi, D., Naumann, T., and McDermott, M. (2019). Publicly Available Clinical BERT Embeddings. Proceedings of the 2nd Clinical Natural Language Processing Workshop, Association for Computational Linguistics.","DOI":"10.18653\/v1\/W19-1909"},{"key":"ref_52","unstructured":"Huang, K., Altosaar, J., and Ranganath, R. (2019). ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission. arXiv."},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"e14830","DOI":"10.2196\/14830","article-title":"Fine-Tuning Bidirectional Encoder Representations From Transformers (BERT)\u2013Based Models on Large-Scale Electronic Health Record Notes: An Empirical Study","volume":"7","author":"Li","year":"2019","journal-title":"JMIR Med. Inform."},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"1234","DOI":"10.1093\/bioinformatics\/btz682","article-title":"BioBERT: A pre-trained biomedical language representation model for biomedical text mining","volume":"36","author":"Lee","year":"2019","journal-title":"Bioinformatics"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Mascio, A., Kraljevic, Z., Bean, D., Dobson, R., Stewart, R., Bendayan, R., and Roberts, A. (2020). Comparative Analysis of Text Classification Approaches in Electronic Health Records. Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing, Association for Computational Linguistics.","DOI":"10.18653\/v1\/2020.bionlp-1.9"},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"S158","DOI":"10.1016\/j.jbi.2015.09.002","article-title":"An automatic system to identify heart disease risk factors in clinical texts over time","volume":"58","author":"Chen","year":"2015","journal-title":"J. Biomed. Inform."},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"S171","DOI":"10.1016\/j.jbi.2015.09.006","article-title":"A hybrid model for automatic identification of risk factors for heart disease","volume":"58","author":"Yang","year":"2015","journal-title":"J. Biomed. Inform."},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Jonnagaddala, J., Liaw, S.T., Ray, P., Kumar, M., Dai, H.J., and Hsu, C.Y. (2015). Identification and progression of heart disease risk factors in diabetic patients from longitudinal electronic health records. BioMed. Res. Int., 2015.","DOI":"10.1155\/2015\/636371"},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"S203","DOI":"10.1016\/j.jbi.2015.08.003","article-title":"Coronary artery disease risk assessment from unstructured electronic health records using text mining","volume":"58","author":"Jonnagaddala","year":"2015","journal-title":"J. Biomed. Inform."},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"162","DOI":"10.1016\/j.jbi.2015.12.006","article-title":"Electronic health record phenotyping improves detection and screening of type 2 diabetes in the general United States population: A cross-sectional, unselected, retrospective study","volume":"60","author":"Anderson","year":"2016","journal-title":"J. Biomed. Inform."},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"162","DOI":"10.1016\/j.drugalcdep.2015.09.003","article-title":"Substance use and mental diagnoses among adults with and without type 2 diabetes: Results from electronic health records data","volume":"156","author":"Wu","year":"2015","journal-title":"Drug Alcohol Depend."},{"key":"ref_62","first-page":"250","article-title":"Identifying Family History and Substance Use Associations for Adult Epilepsy from the Electronic Health Record","volume":"2016","author":"Wang","year":"2016","journal-title":"AMIA Summits Transl. Sci. Proc."},{"key":"ref_63","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1055\/s-0040-1702214","article-title":"Detecting Social and Behavioral Determinants of Health with Structured and Free-Text Clinical Data","volume":"11","author":"Feller","year":"2020","journal-title":"Appl. Clin. Inform."},{"key":"ref_64","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1197\/jamia.M2408","article-title":"Identifying patient smoking status from medical discharge records","volume":"15","author":"Uzuner","year":"2008","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_65","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1197\/jamia.M2434","article-title":"Five-way smoking status classification using text hot-spot identification and error-correcting output codes","volume":"15","author":"Cohen","year":"2008","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_66","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1197\/jamia.M2440","article-title":"Using implicit information to identify smoking status in smoke-blind medical discharge summaries","volume":"15","author":"Wicentowski","year":"2008","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_67","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1197\/jamia.M2438","article-title":"Medical i2b2 NLP smoking challenge: The A-Life system architecture and methodology","volume":"15","author":"Heinze","year":"2008","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_68","unstructured":"McCormick, P.J., Elhadad, N., and Stetson, P.D. (2008, January 8\u201312). Use of semantic features to classify patient smoking status. Proceedings of the AMIA Annual Symposium, Washington, DC, USA."},{"key":"ref_69","unstructured":"Sohn, S., and Savova, G.K. (2009, January 14\u201318). Mayo clinic smoking status classification system: Extensions and improvements. Proceedings of the AMIA Annual Symposium, San Francisco, CA, USA."},{"key":"ref_70","first-page":"577","article-title":"A study of transportability of an existing smoking status detection module across institutions","volume":"Volume 2012","author":"Liu","year":"2012","journal-title":"Proceedings of the AMIA Annual Symposium Proceedings"},{"key":"ref_71","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1136\/amiajnl-2013-002090","article-title":"Practical implementation of an existing smoking detection pipeline and reduced support vector machine training corpus requirements","volume":"21","author":"Khor","year":"2013","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_72","first-page":"1209","article-title":"Investigating Longitudinal Tobacco Use Information from Social History and Clinical Notes in the Electronic Health Record","volume":"Volume 2016","author":"Wang","year":"2016","journal-title":"Proceedings of the AMIA Annual Symposium Proceedings"},{"key":"ref_73","doi-asserted-by":"crossref","first-page":"237","DOI":"10.4137\/CIN.S40604","article-title":"Comparison of Three Information Sources for Smoking Information in Electronic Health Records","volume":"15","author":"Wang","year":"2016","journal-title":"Cancer Inform."},{"key":"ref_74","doi-asserted-by":"crossref","first-page":"e069","DOI":"10.5210\/ojphi.v9i1.7648","article-title":"Automated Classification of Alcohol Use by Text Mining of Electronic Medical Records","volume":"9","author":"Lix","year":"2017","journal-title":"Online J. Public Health Inform."},{"key":"ref_75","unstructured":"Wang, Y., Chen, E.S., Pakhomov, S., Arsoniadis, E., Carter, E.W., Lindemann, E., Sarkar, I.N., and Melton, G.B. (2015, January 14\u201318). Automated extraction of substance use information from clinical texts. Proceedings of the AMIA Annual Symposium Proceedings, San Francisco, CA, USA."},{"key":"ref_76","doi-asserted-by":"crossref","unstructured":"Yetisgen, M., and Vanderwende, L. (2017, January 21\u201324). Automatic Identification of Substance Abuse from Social History in Clinical Text. Proceedings of the Conference on Artificial Intelligence in Medicine in Europe, Vienna, Austria.","DOI":"10.1007\/978-3-319-59758-4_18"},{"key":"ref_77","unstructured":"(2021, October 20). Brat, by Brat Rapid Annotation Tool. Available online: http:\/\/brat.nlplab.org\/."},{"key":"ref_78","unstructured":"(2021, October 20). MTSamples Collection of Transcribed Medical Transcription Sample Reports and Examples. Available online: https:\/\/www.mtsamples.com\/."},{"key":"ref_79","unstructured":"Melton, G.B., Manaktala, S., Sarkar, I.N., and Chen, E.S. (2012, January 3\u20137). Social and behavioral history information in public health datasets. Proceedings of the AMIA Annual Symposium Proceedings, Chicago, IL, USA."},{"key":"ref_80","doi-asserted-by":"crossref","unstructured":"Elsafoury, F., Katsigiannis, S., Wilson, S.R., and Ramzan, N. (2021, January 11\u201315). Does BERT Pay Attention to Cyberbullying?. Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Virtual.","DOI":"10.1145\/3404835.3463029"},{"key":"ref_81","doi-asserted-by":"crossref","first-page":"103541","DOI":"10.1109\/ACCESS.2021.3098979","article-title":"When the Timeline Meets the Pipeline: A Survey on Automated Cyberbullying Detection","volume":"9","author":"Elsafoury","year":"2021","journal-title":"IEEE Access"},{"key":"ref_82","unstructured":"Pasi, G., Piwowarski, B., Azzopardi, L., and Hanbury, A. (2018). Deep Learning for Detecting Cyberbullying Across Multiple Social Media Platforms. Proceedings of the Advances in Information Retrieval, Springer International Publishing."},{"key":"ref_83","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long Short-Term Memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/24\/9609\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:36:11Z","timestamp":1760146571000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/24\/9609"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,8]]},"references-count":83,"journal-issue":{"issue":"24","published-online":{"date-parts":[[2022,12]]}},"alternative-id":["s22249609"],"URL":"https:\/\/doi.org\/10.3390\/s22249609","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,12,8]]}}}