{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,11]],"date-time":"2026-05-11T18:02:05Z","timestamp":1778522525609,"version":"3.51.4"},"reference-count":53,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2025,1,6]],"date-time":"2025-01-06T00:00:00Z","timestamp":1736121600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Imaging"],"abstract":"<jats:p>This study introduces a novel AI-driven approach to support elderly patients in Thailand with medication management, focusing on accurate drug label interpretation. Two model architectures were explored: a Two-Stage Optical Character Recognition (OCR) and Large Language Model (LLM) pipeline combining EasyOCR with Qwen2-72b-instruct and a Uni-Stage Visual Question Answering (VQA) model using Qwen2-72b-VL. Both models operated in a zero-shot capacity, utilizing Retrieval-Augmented Generation (RAG) with DrugBank references to ensure contextual relevance and accuracy. Performance was evaluated on a dataset of 100 diverse prescription labels from Thai healthcare facilities, using RAG Assessment (RAGAs) metrics to assess Context Recall, Factual Correctness, Faithfulness, and Semantic Similarity. The Two-Stage model achieved high accuracy (94%) and strong RAGAs scores, particularly in Context Recall (0.88) and Semantic Similarity (0.91), making it well-suited for complex medication instructions. In contrast, the Uni-Stage model delivered faster response times, making it practical for high-volume environments such as pharmacies. This study demonstrates the potential of zero-shot AI models in addressing medication management challenges for the elderly by providing clear, accurate, and contextually relevant label interpretations. The findings underscore the adaptability of AI in healthcare, balancing accuracy and efficiency to meet various real-world needs.<\/jats:p>","DOI":"10.3390\/jimaging11010011","type":"journal-article","created":{"date-parts":[[2025,1,6]],"date-time":"2025-01-06T06:43:04Z","timestamp":1736145784000},"page":"11","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":10,"title":["Application of Generative Artificial Intelligence Models for Accurate Prescription Label Identification and Information Retrieval for the Elderly in Northern East of Thailand"],"prefix":"10.3390","volume":"11","author":[{"given":"Parinya","family":"Thetbanthad","sequence":"first","affiliation":[{"name":"School of Information Technology, Sripatum University, Bangkok 10900, Thailand"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Benjaporn","family":"Sathanarugsawait","sequence":"additional","affiliation":[{"name":"School of Information Technology, Sripatum University, Bangkok 10900, Thailand"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Prasong","family":"Praneetpolgrang","sequence":"additional","affiliation":[{"name":"School of Information Technology, Sripatum University, Bangkok 10900, Thailand"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2025,1,6]]},"reference":[{"key":"ref_1","unstructured":"Department of Older Persons, Ministry of Social Development and Human Security (2024, July 04). Situation of the Thai Older Persons 2021, Available online: https:\/\/www.dop.go.th\/download\/knowledge\/th1663828576-1747_1.pdf."},{"key":"ref_2","unstructured":"Department of Older Persons (2024, July 04). Statistics of Older Persons June 2023 by Looker Studio, Available online: https:\/\/www.dop.go.th\/th\/know\/side\/1\/1\/2449."},{"key":"ref_3","unstructured":"Economic Research Institute for ASEAN and East Asia (ERIA) (2024, July 04). Population Ageing in Thailand. Available online: https:\/\/www.eria.org\/publications\/population-ageing-in-thailand."},{"key":"ref_4","unstructured":"World Health Organization (2024, July 04). Thailand\u2019s Leadership and Innovations Towards Healthy Ageing. Available online: https:\/\/www.who.int\/southeastasia\/news\/feature-stories\/detail\/thailands-leadership-and-innovation-towards-healthy-ageing."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1007\/s11096-005-4213-4","article-title":"Identification of Potential Drug-Related Problems in the Elderly: The Role of the Community Pharmacist","volume":"28","author":"Vinks","year":"2006","journal-title":"Pharm. World Sci."},{"key":"ref_6","first-page":"129","article-title":"Drug Prescribing Pattern and Prescription Error in Elderly: A Retrospective Study of Inpatient Record","volume":"4","author":"Sapkota","year":"2011","journal-title":"Asian J. Pharm. Clin. Res."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"104154","DOI":"10.1016\/j.ijnurstu.2021.104154","article-title":"Interventions for Improving Medication Adherence in Community-Dwelling Older People with Multimorbidity: A Systematic Review and Meta-Analysis","volume":"126","author":"Yang","year":"2022","journal-title":"Int. J. Nurs. Stud."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"604","DOI":"10.1007\/s00508-014-0589-2","article-title":"Risk Factors for the Prescription of Potentially Inappropriate Medication (PIM) in the Elderly","volume":"126","author":"Stock","year":"2014","journal-title":"Wien. Klin. Wochenschr."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Roux-Marson, C., Baranski, J.B., Fafin, C., Exterman, G., Vigneau, C., Couchoud, C., and Moranne, O. (2020). Medication Burden and Inappropriate Prescription Risk Among Elderly with Advanced Chronic Kidney Disease. BMC Geriatr., 20.","DOI":"10.1186\/s12877-020-1485-4"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"e020927","DOI":"10.1136\/bmjopen-2017-020927","article-title":"Who Cares? The Experiences of Caregivers of Adults Living with Heart Failure, Chronic Obstructive Pulmonary Disease and Coronary Artery Disease: A Mixed Methods Systematic Review","volume":"8","author":"Noonan","year":"2018","journal-title":"BMJ Open"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"508","DOI":"10.1016\/j.jds.2020.06.019","article-title":"Developments, Application, and Performance of Artificial Intelligence in Dentistry\u2014A Systematic Review","volume":"16","author":"Khanagar","year":"2021","journal-title":"J. Dent. Sci."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"107161","DOI":"10.1016\/j.cmpb.2022.107161","article-title":"Application of Explainable Artificial Intelligence for Healthcare: A Systematic Review of the Last Decade (2011\u20132022)","volume":"226","author":"Loh","year":"2022","journal-title":"Comput. Methods Programs Biomed."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"e25759","DOI":"10.2196\/25759","article-title":"Role of Artificial Intelligence Applications in Real-Life Clinical Practice: Systematic Review","volume":"23","author":"Yin","year":"2021","journal-title":"J. Med. Internet Res."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1381","DOI":"10.1016\/j.jiph.2020.06.028","article-title":"Systematic Review of Artificial Intelligence Techniques in the Detection and Classification of COVID-19 Medical Images in Terms of Evaluation and Benchmarking: Taxonomy Analysis, Challenges, Future Solutions and Methodological Aspects","volume":"13","author":"Albahri","year":"2020","journal-title":"J. Infect. Public Health"},{"key":"ref_15","unstructured":"Goodfellow, I., Bengio, Y., and Courville, A. (2016). Deep Learning, MIT Press."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_17","unstructured":"(2024, July 04). Jaided AI, EasyOCR. Available online: https:\/\/github.com\/JaidedAI\/EasyOCR."},{"key":"ref_18","unstructured":"Yang, A., Yang, B., Hui, B., Zheng, B., Yu, B., Zhou, C., and Fan, Z. (2024). Qwen2 Technical Report. arXiv."},{"key":"ref_19","unstructured":"Lewis, P., Oguz, B., Rinott, R., Riedel, S., and Stoyanov, V. (2020, January 6\u201312). Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. Proceedings of the 34th International Conference on Neural Information Processing Systems, Vancouver, BC, Canada."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"D1074","DOI":"10.1093\/nar\/gkx1037","article-title":"DrugBank 5.0: A Major Update DrugBank Database 2018","volume":"46","author":"Wishart","year":"2018","journal-title":"Nucleic Acids Res."},{"key":"ref_21","unstructured":"Es, S., James, J., Espinosa-Anke, L., and Schockaert, S. (2023). RAGAs: Automated Evaluation of Retrieval Augmented Generation. arXiv."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Liu, X., Meehan, J., Tong, W., Wu, L., Xu, X., and Xu, J. (2020). DLI-IT: A Deep Learning Approach to Drug Label Identification through Image and Text Embedding. BMC Med. Inform. Decis. Mak., 20.","DOI":"10.1186\/s12911-020-1078-3"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Tian, Z., Huang, W., He, T., He, P., and Qiao, Y. (2016, January 11\u201314). Detecting Text in Natural Image with Connectionist Text Proposal Network. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46484-8_4"},{"key":"ref_24","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long Short-Term Memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Smith, R. (2007, January 23\u201326). An Overview of the Tesseract OCR Engine. Proceedings of the Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), Curitiba, Brazil.","DOI":"10.1109\/ICDAR.2007.4376991"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2016). YOLO9000: Better, Faster, Stronger. arXiv.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Ting, H.W., Chung, S.L., Chen, C.F., Chiu, H.Y., and Hsieh, Y.W. (2020). A Drug Identification Model Developed Using Deep Learning Technologies: Experience of a Medical Center in Taiwan. BMC Health Serv. Res., 20.","DOI":"10.1186\/s12913-020-05166-w"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"852","DOI":"10.3390\/make4040043","article-title":"Automatic Extraction of Medication Information from Cylindrically Distorted Pill Bottle Labels","volume":"4","author":"Gromova","year":"2022","journal-title":"Mach. Learn. Knowl. Extract."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Szeliski, R. (2022). Computer Vision: Algorithms and Applications, Springer Nature.","DOI":"10.1007\/978-3-030-34372-9"},{"key":"ref_31","unstructured":"Tan, M., and Le, Q.V. (2019, January 9\u201315). EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. Proceedings of the 36th International Conference on Machine Learning, Long Beach, CA, USA."},{"key":"ref_32","unstructured":"Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017). MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"You, Y.S., and Lin, Y.S. (2023). A Novel Two-Stage Induced Deep Learning System for Classifying Similar Drugs with Diverse Packaging. Sensors, 23.","DOI":"10.3390\/s23167275"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"136","DOI":"10.1038\/s41746-020-00341-z","article-title":"Fusion of Medical Imaging and Electronic Health Records Using Deep Learning: A Systematic Review and Implementation Guidelines","volume":"3","author":"Huang","year":"2020","journal-title":"NPJ Digit. Med."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"8459","DOI":"10.1007\/s12652-021-03612-z","article-title":"Artificial Intelligence in Disease Diagnosis: A Systematic Literature Review, Synthesizing Framework and Future Research Agenda","volume":"14","author":"Kumar","year":"2023","journal-title":"J. Ambient Intell. Humaniz. Comput."},{"key":"ref_36","unstructured":"Hu, S., Yu, S., Cheng, C., Shen, X., Zhang, H., Wei, F., Zhou, X., Yu, H., Zhuang, L., and Liu, J. (2024, July 04). MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies. Available online: https:\/\/arxiv.org\/abs\/2404.06395."},{"key":"ref_37","unstructured":"Zhao, W.X., Zhou, K., Li, J., Tang, T., Wang, X., Hou, Y., Min, Y., Zhang, B., Zhang, J., and Dong, Z. (2023). A Survey of Large Language Models. arXiv."},{"key":"ref_38","unstructured":"Harris, J., Laurence, T., Loman, L., Grayson, F., Nonnenmacher, T., Long, H., WalsGriffith, L., Douglas, A., Fountain, H., and Georgiou, S. (2024). Evaluating Large Language Models for Public Health Classification and Extraction Tasks. arXiv."},{"key":"ref_39","unstructured":"Awais, M., Naseer, M., Khan, S., Anwer, R.M., Cholakkal, H., Shah, M., and Khan, F.S. (2023). Foundational Models Defining a New Era in Vision: A Survey and Outlook. arXiv."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Hartsock, I., and Rasool, G. (2024). Vision-Language Models for Medical Report Generation and Visual Question Answering: A Review. arXiv.","DOI":"10.3389\/frai.2024.1430984"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Conneau, A., Khandelwal, K., Goyal, N., Chaudhary, V., Wenzek, G., Guzman, F., and Stoyanov, V. (2020, January 5\u201310). Unsupervised Cross-Lingual Representation Learning at Scale. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Seattle, WA, USA.","DOI":"10.18653\/v1\/2020.acl-main.747"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Xue, L., Constant, N., Roberts, A., Kale, M., Al-Rfou, R., Siddhant, A., Barua, A., and Raffel, C. (2021). mT5: A Massively Multilingual Pre-Trained Text-to-Text Transformer. arXiv.","DOI":"10.18653\/v1\/2021.naacl-main.41"},{"key":"ref_43","unstructured":"Anil, R., Dai, A.M., Firat, O., Johnson, M., Lepikhin, D., Passos, A., and Wu, Y. (2023). PaLM 2 Technical Report. arXiv."},{"key":"ref_44","unstructured":"Jiang, A.Q., Sablayrolles, A., Mensch, A., Bamford, C., Chaplot, D.S., Casas, D.D.L., Bressand, F., Lengyel, G., Lample, G., and Saulnier, L. (2023). Mistral 7B. arXiv."},{"key":"ref_45","first-page":"31809","article-title":"The BigScience Roots Corpus: A 1.6 TB Composite Multilingual Dataset","volume":"35","author":"Saulnier","year":"2022","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_46","unstructured":"OpenAI (2024, July 04). ChatGPT (July 5 Version) [Large Language Model]. Available online: https:\/\/chat.openai.com\/."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3458754","article-title":"Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing","volume":"3","author":"Gu","year":"2021","journal-title":"ACM Trans. Comput. Healthc."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1038\/s41586-023-06291-2","article-title":"Large Language Models Encode Clinical Knowledge","volume":"620","author":"Singhal","year":"2023","journal-title":"Nature"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Johnson, J., Hariharan, B., Van Der Maaten, L., Fei-Fei, L., Lawrence Zitnick, C., and Girshick, R. (2017, January 21\u201326). CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.215"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Hudson, D.A., and Manning, C.D. (2019, January 16\u201320). GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00686"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"180251","DOI":"10.1038\/sdata.2018.251","article-title":"A Dataset of Clinically Generated Visual Questions and Answers About Radiology Images","volume":"5","author":"Lau","year":"2018","journal-title":"Sci. Data"},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"He, X., Zhang, Y., Mou, L., Xing, E., and Xie, P. (2020). PathVQA: 30000+ Questions for Medical Visual Question Answering. arXiv.","DOI":"10.36227\/techrxiv.13127537.v1"},{"key":"ref_53","unstructured":"Radford, A., Kim, J.W., Xu, T., Brockman, G., McLeavey, C., and Sutskever, I. (2022). Robust Speech Recognition via Large-Scale Weak Supervision. arXiv."}],"container-title":["Journal of Imaging"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2313-433X\/11\/1\/11\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T10:23:38Z","timestamp":1759919018000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2313-433X\/11\/1\/11"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,1,6]]},"references-count":53,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,1]]}},"alternative-id":["jimaging11010011"],"URL":"https:\/\/doi.org\/10.3390\/jimaging11010011","relation":{},"ISSN":["2313-433X"],"issn-type":[{"value":"2313-433X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,1,6]]}}}