{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,22]],"date-time":"2026-03-22T07:17:14Z","timestamp":1774163834749,"version":"3.50.1"},"reference-count":59,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2025,3,13]],"date-time":"2025-03-13T00:00:00Z","timestamp":1741824000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Informatics"],"abstract":"<jats:p>Mental health illness is the single biggest cause of inability within the UK, contributing up to 22.8% of the whole burden compared to 15.9% for cancer and 16.2% for cardiovascular disease. The more extensive financial costs of mental ailments in Britain have been evaluated at British Pound Sterling (GBP) 105.2 billion each year. This burden could be decreased with productive forms and utilization of computerized innovations. Electronical health records (EHRs), for instance, could offer an extraordinary opportunity for research and provide improved and optimized care. Consequently, this technological advance would unburden the mental health system and help provide optimized and efficient care to the patients. Using natural language processing methods to explore unstructured EHR text data from mental health services in the National Health Service (NHS) UK brings opportunities and technical challenges in the use of such data and possible solutions. This descriptive study compared technical methods and approaches to leverage large-scale text data in EHRs of mental health service providers in the NHS. We conclude that the method used is suitable for mental health services. However, broader studies including other hospital sites are still needed to validate the method.<\/jats:p>","DOI":"10.3390\/informatics12010028","type":"journal-article","created":{"date-parts":[[2025,3,13]],"date-time":"2025-03-13T10:16:45Z","timestamp":1741861005000},"page":"28","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["A Pilot Study Using Natural Language Processing to Explore Textual Electronic Mental Healthcare Data"],"prefix":"10.3390","volume":"12","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9628-9245","authenticated-orcid":false,"given":"Gayathri","family":"Delanerolle","sequence":"first","affiliation":[{"name":"Research & Innovation Department, Hampshire & Isle of Wight Healthcare NHS Foundation Trust, Southampton SO30 3JB, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8819-9736","authenticated-orcid":false,"given":"Yassine","family":"Bouchareb","sequence":"additional","affiliation":[{"name":"College of Medicine and Health Sciences, Sultan Qaboos University, Muscat 123, Oman"}]},{"given":"Suchith","family":"Shetty","sequence":"additional","affiliation":[{"name":"Research & Innovation Department, Hampshire & Isle of Wight Healthcare NHS Foundation Trust, Southampton SO30 3JB, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2814-6477","authenticated-orcid":false,"given":"Heitor","family":"Cavalini","sequence":"additional","affiliation":[{"name":"Research & Innovation Department, Hampshire & Isle of Wight Healthcare NHS Foundation Trust, Southampton SO30 3JB, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9950-3254","authenticated-orcid":false,"given":"Peter","family":"Phiri","sequence":"additional","affiliation":[{"name":"Research & Innovation Department, Hampshire & Isle of Wight Healthcare NHS Foundation Trust, Southampton SO30 3JB, UK"},{"name":"Psychology Department, University of Southampton, Southampton SO17 1PS, UK"}]}],"member":"1968","published-online":{"date-parts":[[2025,3,13]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1016\/S2215-0366(15)00505-2","article-title":"Estimating the true global burden of mental illness","volume":"3","author":"Vigo","year":"2016","journal-title":"Lancet Psychiatry"},{"key":"ref_2","unstructured":"NHS Digital (2024, October 14). Mental Health Bulletin 2020\u20132021 Annual Report. Available online: https:\/\/digital.nhs.uk\/data-and-information\/publications\/statistical\/mental-health-bulletin\/2020-21-annual-report."},{"key":"ref_3","unstructured":"Dorning, H., Davies, A., and Blunt, I. (2015). Focus on: People with Mental Ill Health and Hospital Use. Exploring Disparities in Hospital Use for Physical Healthcare, The Nuffield Trust."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1586\/erp.13.7","article-title":"Application of electronic medical record data for health outcomes research: A review of recent literature","volume":"13","author":"Lin","year":"2013","journal-title":"Expert Rev. Pharmacoecon. Outcomes Res."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Jensen, K., Soguero-Ruiz, C., Oyvind Mikalsen, K., Lindsetmo, R.O., Kouskoumvekaki, I., Girolami, M., Olav Skrovseth, S., and Augestad, K.M. (2017). Analysis of free text in electronic health records for identification of cancer patient trajectories. Sci. Rep., 7.","DOI":"10.1038\/srep46226"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Cohen, K.B., and Hunter, L. (2004). Natural language processing and systems biology. Artificial Intelligence Methods and Tools for Systems Biology, Springer.","DOI":"10.1007\/978-1-4020-5811-0_9"},{"key":"ref_7","unstructured":"Locke, W.N., and Booth, A.D. (1955). Machine Translation of Languages, John Wiley. Fourteen essays."},{"key":"ref_8","unstructured":"Tenney, I., Xia, P., Chen, B., Wang, A., Poliak, A., McCoy, R.T., Kim, N., Van Durme, B., Bowman, S.R., and Das, D. (2019). What do you learn from context? Probing for sentence structure in contextualized word representations. arXiv."},{"key":"ref_9","first-page":"5998","article-title":"Attention is all you need","volume":"30","author":"Vaswani","year":"2017","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_10","unstructured":"Carlini, N., Tramer, F., Wallace, E., Jagielski, M., Herbert-Voss, A., Lee, K., Roberts, A., Brown, T., Song, D., and Erlingsson, U. (2021, January 11\u201313). Extracting training data from large language models. Proceedings of the 30th USENIX Security Symposium, Vancouver, BC, Canada."},{"key":"ref_11","unstructured":"Lawson, N., Eustice, K., Perkowitz, M., and Yetisgen-Yildiz, M. (2010, January 6). Annotating large email datasets for named entity recognition with mechanical turk. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon\u2019s Mechanical Turk, Los Angeles, CA, USA."},{"key":"ref_12","first-page":"e12047","article-title":"Crowdsourcing for food purchase receipt annotation via amazon mechanical Turk: A feasibility study","volume":"21","author":"Lu","year":"2019","journal-title":"J. Med. Int. Res."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1016\/j.ijmedinf.2017.12.004","article-title":"Public and physician\u2019s expectations and ethical concerns about electronic health record: Benefits outweigh risks except for information security","volume":"110","author":"Entzeridou","year":"2018","journal-title":"Int. J. Med. Inform."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"102086","DOI":"10.1016\/j.artmed.2021.102086","article-title":"Med7: A transferable clinical natural language processing model for electronic health records","volume":"118","author":"Kormilitzin","year":"2021","journal-title":"Artif. Intell. Med."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"e61","DOI":"10.1016\/S2215-0366(20)30375-8","article-title":"Multisite data linkage projects in mental health research","volume":"7","author":"Carson","year":"2020","journal-title":"Lancet Psychiatry"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"e052911","DOI":"10.1136\/bmjopen-2021-052911","article-title":"Can natural language processing models extract and classify instances of interpersonal violence in mental healthcare electronic records: An applied evaluative study","volume":"12","author":"Botelle","year":"2022","journal-title":"BMJ Open"},{"key":"ref_17","first-page":"503","article-title":"An intelligent virtual human system for providing healthcare information and support","volume":"163","author":"Rizzo","year":"2011","journal-title":"Stud. Health Technol. Inform."},{"key":"ref_18","unstructured":"Bhakta, R., Savin-Baden, M., and Tombs, G. (2024, October 14). Sharing Secrets with Robots?. Available online: https:\/\/www.learntechlib.org\/primary\/p\/147797\/."},{"key":"ref_19","first-page":"13","article-title":"SimCoach evaluation: A virtual human intervention to encourage service-member help-seeking for posttraumatic stress disorder and depression","volume":"5","author":"Meeker","year":"2016","journal-title":"Rand Health Q."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"McCoy, T.H., Castro, V.M., Cagan, A., Roberson, A.M., Kohane, I.S., and Perlis, R.H. (2015). Sentiment measured in hospital discharge notes is associated with readmission and mortality risk: An electronic health record study. PLoS ONE, 10.","DOI":"10.1371\/journal.pone.0136341"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1016\/j.artmed.2015.03.006","article-title":"Sentiment analysis in medical settings: New opportunities and challenges","volume":"64","author":"Denecke","year":"2015","journal-title":"Artif. Intell. Med."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"2053","DOI":"10.1016\/S0140-6736(16)00338-X","article-title":"The map is not the territory: Medical records and 21st century practice","volume":"388","author":"Martin","year":"2016","journal-title":"Lancet"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"821","DOI":"10.1108\/AJIM-02-2019-0048","article-title":"Long story short: Finding health advice with informative summaries on health social media","volume":"71","author":"Liu","year":"2019","journal-title":"Aslib J. Inf. Manag."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Yin, Y., Zhang, Y., Liu, X., Zhang, Y., Xing, C., and Chen, H. (2014). HealthQA: A Chinese QA summary system for smart health. Lecture Notes in Computer Science, Springer.","DOI":"10.1007\/978-3-319-08416-9_6"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"2287","DOI":"10.1093\/jamia\/ocab143","article-title":"A systematic review of automatic text summarization for biomedical literature and EHRs","volume":"28","author":"Wang","year":"2021","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1007\/s13755-020-00108-6","article-title":"Keyword extraction and structuralization of medical reports","volume":"8","author":"Wu","year":"2020","journal-title":"Health Inf. Sci. Syst."},{"key":"ref_27","unstructured":"Tang, M., Gandhi, P., Kabir, M.A., Zou, C., Blakey, J., and Luo, X. (2019). Progress notes classification and keyword extraction using attention-based deep learning models with BERT. arXiv."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1109\/TCBB.2018.2849968","article-title":"Natural Language Processing for EHR-Based Computational Phenotyping","volume":"16","author":"Zeng","year":"2019","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Wang, H., Ding, Y., Tang, J., Dong, X., He, B., Qiu, J., and Wild, D.J. (2011). Finding complex biological relationships in recent PubMed articles using Bio-LDA. PLoS ONE, 6.","DOI":"10.1371\/journal.pone.0017243"},{"key":"ref_30","unstructured":"Gu, Y., and Leroy, G. (2020). Large-scale analysis of free-text data for mental health surveillance with topic modelling. 26th Americas Conference on Information Systems, AMCIS 2020, Association for Information Systems."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Fernandes, A.C., Cloete, D., Broadbent, M., Hayes, R.D., Chang, C.K., Jackson, R.G., Roberts, A., Tsang, J., Soncul, M., and Liebscher, J. (2013). Development and evaluation of a de-identification procedure for a case register sourced from mental health electronic records. BMC Med. Inform. Decis. Mak., 13.","DOI":"10.1186\/1472-6947-13-71"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Goodday, S.M., Kormilitzin, A., Vaci, N., Liu, Q., Cipriani, A., Smith, T., and Nevado-Holgado, A. (2020). Maximizing the use of social and behavioural information from secondary care mental health electronic health records. J. Biomed. Inform., 107.","DOI":"10.1016\/j.jbi.2020.103429"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"e005654","DOI":"10.1136\/bmjopen-2014-005654","article-title":"Developing a new model for patient recruitment in mental health services: A cohort study using Electronic Health Records","volume":"4","author":"Callard","year":"2014","journal-title":"BMJ Open"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1136\/ebmental-2019-300116","article-title":"Consent to discuss participation in research: A pilot study","volume":"23","author":"Walker","year":"2020","journal-title":"Evid.-Based Ment. Health"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1192\/bjp.2020.136","article-title":"Real-world effectiveness, its predictors and onset of action of cholinesterase inhibitors and memantine in dementia: Retrospective health record study","volume":"218","author":"Vaci","year":"2021","journal-title":"Br. J. Psychiatry"},{"key":"ref_36","first-page":"729","article-title":"TextHunter\u2014A User Friendly Tool for Extracting Generic Concepts from Free Text in Clinical Research","volume":"2014","author":"Jackson","year":"2014","journal-title":"AMIA Annu. Symp. Proc."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Iqbal, E., Mallah, R., Jackson, R.G., Ball, M., Ibrahim, Z.M., Broadbent, M., Dzahini, O., Stewart, R., Johnston, C., and Dobson, R.J. (2015). Identification of Adverse Drug Events from Free Text Electronic Patient Records and Information in a Large Mental Health Case Register. PLoS ONE, 10.","DOI":"10.1371\/journal.pone.0134208"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Kadra, G., Stewart, R., Shetty, H., Jackson, R.G., Greenwood, M.A., Roberts, A., Chang, C.K., MacCabe, J.H., and Hayes, R.D. (2015). Extracting antipsychotic polypharmacy data from electronic health records: Developing and evaluating a novel process. BMC Psychiatry, 15.","DOI":"10.1186\/s12888-015-0557-z"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"e007619","DOI":"10.1136\/bmjopen-2015-007619","article-title":"Negative symptoms in schizophrenia: A study in a large clinical sample of patients using a novel automated method","volume":"5","author":"Patel","year":"2015","journal-title":"BMJ Open"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"S79","DOI":"10.1016\/S0140-6736(15)60394-4","article-title":"Cannabis use and treatment resistance in first episode psychosis: A natural language processing study","volume":"385","author":"Patel","year":"2015","journal-title":"Lancet"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Taylor, C.L., Stewart, R., Ogden, J., Broadbent, M., Pasupathy, D., and Howard, L.M. (2015). The characteristics and health needs of pregnant women with schizophrenia compared with bipolar disorder and affective psychoses. BMC Psychiatry, 15.","DOI":"10.1186\/s12888-015-0451-8"},{"key":"ref_42","first-page":"641","article-title":"Detection of Suicidality in Adolescents with Autism Spectrum Disorders: Developing a Natural Language Processing Approach for Use in Electronic Health Records","volume":"2017","author":"Downs","year":"2018","journal-title":"AMIA Annu. Symp. Proc."},{"key":"ref_43","first-page":"1877","article-title":"Language models are few-shot learners","volume":"33","author":"Brown","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_44","unstructured":"Devlin, J. (2018). Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"1234","DOI":"10.1093\/bioinformatics\/btz682","article-title":"BioBERT: A pre-trained biomedical language representation model for biomedical text mining","volume":"36","author":"Lee","year":"2020","journal-title":"Bioinformatics"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Gururangan, S., Marasovi\u0107, A., Swayamdipta, S., Lo, K., Beltagy, I., Downey, D., and Smith, N.A. (2020). Don\u2019t stop pretraining: Adapt language models to domains and tasks. arXiv.","DOI":"10.18653\/v1\/2020.acl-main.740"},{"key":"ref_47","first-page":"1432","article-title":"An empirical study of pre-trained transformer models for biomedical text mining","volume":"18","author":"Wang","year":"2021","journal-title":"ACM Trans. Comput. Biol. Bioinform."},{"key":"ref_48","unstructured":"Doshi-Velez, F., and Kim, B. (2017). Towards a rigorous science of interpretable machine learning. arXiv."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Bender, E.M., Gebru, T., McMillan-Major, A., and Shmitchell, S. (2021, January 3\u201310). On the dangers of stochastic parrots: Can language models be too big?. Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency (FAccT\u201921), Athens, Greece.","DOI":"10.1145\/3442188.3445922"},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"206","DOI":"10.1038\/s42256-019-0048-x","article-title":"Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead","volume":"1","author":"Rudin","year":"2019","journal-title":"Nat. Mach. Intell."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1038\/s41591-018-0316-z","article-title":"A guide to deep learning in healthcare","volume":"25","author":"Esteva","year":"2021","journal-title":"Nat. Med."},{"key":"ref_52","first-page":"858","article-title":"Deep learning for healthcare: Review, opportunities, and challenges","volume":"22","author":"Goh","year":"2021","journal-title":"Brief. Bioinform."},{"key":"ref_53","first-page":"e16279","article-title":"Analyzing the potential of ChatGPT-like models in healthcare: Opportunities and challenges","volume":"21","author":"He","year":"2019","journal-title":"J. Med. Internet Res."},{"key":"ref_54","unstructured":"Shen, Y., Zhang, R., Jiang, X., Wang, J., and Liu, Y. (2021). Advances in Natural Language Processing for Clinical Text: Applications and Challenges. J. Biomed. Inform., 118."},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Qin, C., Zhang, Z., Chen, J., Yasunaga, M., and Yang, D. (2023). Is chatgpt a general-purpose natural language processing task solver?. arXiv.","DOI":"10.18653\/v1\/2023.emnlp-main.85"},{"key":"ref_56","unstructured":"Franciscu, S. (2024, October 14). ChatGPT: A Natural Language Generation Model for Chatbots. Available online: https:\/\/doi.org\/10.13140\/RG.2.2.24777.83044."},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"160035","DOI":"10.1038\/sdata.2016.35","article-title":"MIMIC-III, a freely accessible critical care database","volume":"3","author":"Johnson","year":"2016","journal-title":"Sci. Data"},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"552","DOI":"10.1136\/amiajnl-2011-000203","article-title":"2010 i2b2\/VA challenge on concepts, assertions, and relations in clinical text","volume":"18","author":"Uzuner","year":"2011","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Stewart, R., Soremekun, M., Perera, G., Broadbent, M., Callard, F., Denis, M., Hotopf, M., and Lovestone, S. (2009). The South London and Maudsley NHS Foundation Trust Biomedical Research Centre (SLAM BRC) case register: Development and descriptive data. BMC Psychiatry, 9.","DOI":"10.1186\/1471-244X-9-51"}],"container-title":["Informatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2227-9709\/12\/1\/28\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T16:53:07Z","timestamp":1760028787000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2227-9709\/12\/1\/28"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,3,13]]},"references-count":59,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,3]]}},"alternative-id":["informatics12010028"],"URL":"https:\/\/doi.org\/10.3390\/informatics12010028","relation":{},"ISSN":["2227-9709"],"issn-type":[{"value":"2227-9709","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,3,13]]}}}