{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,12]],"date-time":"2026-03-12T06:12:52Z","timestamp":1773295972664,"version":"3.50.1"},"reference-count":27,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2025,7,7]],"date-time":"2025-07-07T00:00:00Z","timestamp":1751846400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Artif. Intell."],"abstract":"<jats:sec><jats:title>Context<\/jats:title><jats:p>Traditional methods such as rule-based systems, word embeddings (e.g. Word2Vec, GloVe) and sequence tagging models such as CRFs and HMMs have difficulty capturing the complex and nuanced context of medical texts, leading to low precision and inflexibility. These methods also struggle with the inherent variability of medical language and often require large and difficult-to-obtain labeled datasets.<\/jats:p><\/jats:sec><jats:sec><jats:title>Objective<\/jats:title><jats:p>We examine the growing importance of Named Entity Recognition (NER) in the analysis of healthcare texts. NER, a fundamental technique in Natural Language Processing (NLP), automatically identifies and categorizes named entities in the text, such as names of people and organizations, in medical texts, medical conditions and drug names. This facilitates better information retrieval, personalized medicine approaches and clinical decision support systems.<\/jats:p><\/jats:sec><jats:sec><jats:title>Methods<\/jats:title><jats:p>A systematic mapping was carried out that focused on advanced language models, specifically transformation-based models such as BERT. These models are known for capturing complex semantic dependencies and linguistic nuances, which are crucial for accurate processing of medical texts. Transformation architectures, unlike traditional techniques such as CNNs and RNNs, are better suited to dealing with the contextual and semantic nature of medical texts due to their ability to manage long sequences and the need for high precision.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>The results indicate that transformation-based models, in particular BERT and its specialized variants (e.g. ClinicalBERT), consistently demonstrate high performance on NER tasks, with F1 scores often exceeding 97%, outperforming traditional and hybrid methods. When examining the geographical distribution of contributions, the research identifies a significant contribution from China, followed by the United States. These findings have crucial implications for the integration of NER technologies into the Brazilian National Health System (SUS).<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>This systematic review contributes to the advancement of NER in health texts by evaluating methods, showing results and highlighting the wider implications for the field. The article is systematically structured into the following sections: Methodology, Bibliometric analysis, Results and discussion, Threats to validity, Future work and Conclusion. This systematic organization provides a comprehensive review of the research, its impact and future directions, highlighting the importance of keeping up to date with advances in the field to increase the relevance of NER applications in healthcare.<\/jats:p><\/jats:sec>","DOI":"10.3389\/frai.2025.1584203","type":"journal-article","created":{"date-parts":[[2025,7,7]],"date-time":"2025-07-07T05:27:17Z","timestamp":1751866037000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Artificial intelligence in healthcare text processing: a review applied to named entity recognition"],"prefix":"10.3389","volume":"8","author":[{"given":"Samuel Santana","family":"de Almeida","sequence":"first","affiliation":[]},{"given":"Raphael","family":"Silva Fontes","sequence":"additional","affiliation":[]},{"given":"Luca","family":"Pareja Credidio Freire Alves","sequence":"additional","affiliation":[]},{"given":"Methanias Cola\u00e7o","family":"J\u00fanior","sequence":"additional","affiliation":[]},{"given":"Gleyson","family":"Jos\u00e9 Pinheiro Caldeira Silva","sequence":"additional","affiliation":[]},{"given":"Lyane","family":"Ramalho Cortez","sequence":"additional","affiliation":[]},{"given":"Antonio Higor Freire","family":"de Morais","sequence":"additional","affiliation":[]},{"given":"Guilherme","family":"Medeiros Machado","sequence":"additional","affiliation":[]},{"given":"Hugo","family":"Gon\u00e7alo Oliveira","sequence":"additional","affiliation":[]},{"given":"Aliete","family":"Cunha-Oliveira","sequence":"additional","affiliation":[]},{"given":"Jo\u00e3o Paulo Queiroz","family":"dos Santos","sequence":"additional","affiliation":[]},{"given":"Ricardo Alexsandro","family":"de Medeiros Valentim","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2025,7,7]]},"reference":[{"key":"ref1","doi-asserted-by":"publisher","first-page":"249","DOI":"10.1590\/S1414-32832012005000014","article-title":"Direito a sa\u00fade e integralidade: uma discuss\u00e3o sobre os desafios e caminhos para sua efetiva\u00e7\u00e3o","volume":"16","author":"Brito-Silva","year":"2012","journal-title":"Interface"},{"key":"ref2","volume-title":"Portal de Peri\u00f3dicos","year":"2021"},{"key":"ref3","doi-asserted-by":"publisher","first-page":"372","DOI":"10.1186\/s12911-021-01717-1","article-title":"Multi-task learning for Chinese clinical named entity recognition with external knowledge","volume":"21","author":"Cheng","year":"2021","journal-title":"BMC Med. Inform. Decis. Mak."},{"key":"ref4","first-page":"278","article-title":"Exploring the use of conditional random field models and HMMS for historical handwritten document recognition","author":"Feng","year":"2006"},{"key":"ref5","first-page":"211","article-title":"Sussurro \u2013 detec\u00e7\u00e3o na web de eventos audit\u00e1veis que representam riscos \u00e0 sa\u00fade p\u00fablica","author":"Fontes","year":"2023"},{"key":"ref6","doi-asserted-by":"publisher","first-page":"144","DOI":"10.14135\/j.cnki.1006-3080.20210909003","article-title":"Chinese medical named entity recognition based on Roberta and adversarial training","volume":"49","author":"Guo","year":"2023","journal-title":"J. East China Univ. Sci. Technol."},{"key":"ref7","doi-asserted-by":"publisher","first-page":"206","DOI":"10.1007\/s41666-024-00162-9","article-title":"Prompt tuning in biomedical relation extraction","volume":"8","author":"He","year":"2024","journal-title":"J. Healthcare Inform. Res."},{"key":"ref8","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1177\/0022242918814254","article-title":"Uncle Sam rising: performance implications of business-to-government relationships","volume":"83","author":"Josephson","year":"2019","journal-title":"J. Mark."},{"key":"ref9","author":"Kitchenham","year":"2007"},{"key":"ref10","doi-asserted-by":"publisher","first-page":"1555","DOI":"10.5281\/zenodo.10443962","article-title":"Performance evaluation of word embedding algorithms","volume":"8","author":"Kulshretha","year":"2023","journal-title":"Int. J. Innovative Sci. Res. Technol."},{"key":"ref11","author":"Lee","year":"2024"},{"key":"ref12","author":"Li","year":"2020"},{"key":"ref13","doi-asserted-by":"publisher","first-page":"12","DOI":"10.1609\/aimag.v27i4.1904","article-title":"A proposal for the Dartmouth summer research project on artificial intelligence, August 31, 1955","volume":"27","author":"McCarthy","year":"2006","journal-title":"AI Mag."},{"key":"ref14","doi-asserted-by":"publisher","first-page":"841","DOI":"10.1590\/S1413-81232009000300019","article-title":"O financiamento do sus sob os \u201cventos\u201d da financeiriza\u00e7ao","volume":"14","author":"Mendes","year":"2009","journal-title":"Ci\u00eancia Sa\u00fade Coletiva"},{"key":"ref15","doi-asserted-by":"publisher","first-page":"e20346","DOI":"10.2196\/20346","article-title":"The effectiveness of artificial intelligence conversational agents in health care: systematic review","volume":"22","author":"Milne-Ives","year":"2020","journal-title":"J. Med. Internet Res."},{"key":"ref16","doi-asserted-by":"publisher","first-page":"144","DOI":"10.1186\/s12859-022-04688-w","article-title":"Benchmarking for biomedical natural language processing tasks with a domain specific ALBERT","volume":"23","author":"Naseem","year":"2022","journal-title":"BMC Bioinf."},{"key":"ref17","year":""},{"key":"ref18","year":""},{"key":"ref19","volume-title":"Before reading: Narrative conventions and the politics of interpretation","author":"Rabinowitz","year":"1987"},{"key":"ref20","doi-asserted-by":"publisher","first-page":"A12","DOI":"10.7326\/ACPJC-1995-123-3-A12","article-title":"The well-built clinical question: a key to evidence-based decisions","volume":"123","author":"Richardson","year":"1995","journal-title":"ACP J. Club"},{"key":"ref21","author":"Rodrigues","year":"2018"},{"key":"ref22","doi-asserted-by":"publisher","first-page":"508","DOI":"10.1590\/S0104-11692007000300023","article-title":"The pico strategy for the research question construction and evidence search","volume":"15","author":"Santos","year":"2007","journal-title":"Rev. Latino-Am. Enfermagem."},{"key":"ref23","doi-asserted-by":"publisher","first-page":"438","DOI":"10.1007\/s41666-023-00155-0","article-title":"Identifying and extracting rare diseases and their phenotypes with large language models","volume":"8","author":"Shyr","year":"2024","journal-title":"J. Healthc. Inform. Res."},{"key":"ref24","doi-asserted-by":"publisher","first-page":"909","DOI":"10.1136\/heartjnl-2021-319769","article-title":"Systematic review of current natural language processing methods and applications in cardiology","volume":"108","author":"Turchioe","year":"2022","journal-title":"Heart"},{"key":"ref25","doi-asserted-by":"publisher","first-page":"7","DOI":"10.3390\/ijerph18157776","article-title":"A weakly-supervised named entity recognition machine learning approach for emergency medical services clinical audit","volume":"18","author":"Wang","year":"2021","journal-title":"Int. J. Environ. Res. Public Health"},{"key":"ref26","author":"Xing","year":"2014"},{"key":"ref27","doi-asserted-by":"publisher","first-page":"212","DOI":"10.1016\/j.procs.2021.03.010","article-title":"Named entity recognition method in health preserving field based on BERT","volume":"183","author":"Zhang","year":"2021","journal-title":"Procedia Comput. Sci."}],"container-title":["Frontiers in Artificial Intelligence"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/frai.2025.1584203\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,7]],"date-time":"2025-07-07T05:27:19Z","timestamp":1751866039000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/frai.2025.1584203\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,7]]},"references-count":27,"alternative-id":["10.3389\/frai.2025.1584203"],"URL":"https:\/\/doi.org\/10.3389\/frai.2025.1584203","relation":{},"ISSN":["2624-8212"],"issn-type":[{"value":"2624-8212","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,7,7]]},"article-number":"1584203"}}