{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,4]],"date-time":"2026-06-04T22:39:09Z","timestamp":1780612749342,"version":"3.54.1"},"reference-count":110,"publisher":"Association for Computing Machinery (ACM)","issue":"4","funder":[{"name":"National Research Foundation of Korea (NRF) grant funded by the Korean government","award":["RS-2023-NR076833"],"award-info":[{"award-number":["RS-2023-NR076833"]}]},{"name":"Korea government (Ministry of Science and ICT","award":["IITP-2025-RS-2020-II201462, 50%"],"award-info":[{"award-number":["IITP-2025-RS-2020-II201462, 50%"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2026,3,31]]},"abstract":"<jats:p>Optical character recognition (OCR) is a vital process that involves the extraction of handwritten or printed text from scanned or printed images, converting it into a format that can be understood and processed by machines. The automatic extraction of text through OCR plays a crucial role in digitizing documents, enhancing productivity, and preserving historical records. This article offers an exhaustive review of contemporary applications, methodologies, and challenges associated with Arabic OCR. A thorough analysis is conducted on prevailing techniques utilized throughout the OCR process, with a dedicated effort to discern the most efficacious approaches that demonstrate enhanced outcomes. To ensure a thorough evaluation, a meticulous keyword-search methodology is adopted, encompassing a comprehensive analysis of articles relevant to Arabic OCR. In addition to presenting cutting-edge techniques and methods, this article identifies research gaps within the realm of Arabic OCR. We shed light on potential areas for future exploration and development, thereby guiding researchers toward promising avenues in the field of Arabic OCR. The outcomes of this study provide valuable insights for researchers, practitioners, and stakeholders involved in Arabic OCR, ultimately fostering advancements in the field and facilitating the creation of more accurate and efficient OCR systems for the Arabic language.<\/jats:p>","DOI":"10.1145\/3768150","type":"journal-article","created":{"date-parts":[[2025,9,17]],"date-time":"2025-09-17T10:35:02Z","timestamp":1758105302000},"page":"1-37","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["Advancements and Challenges in Arabic Optical Character Recognition: A Comprehensive Survey"],"prefix":"10.1145","volume":"58","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8513-570X","authenticated-orcid":false,"given":"Mahmoud","family":"Salaheldin Kasem","sequence":"first","affiliation":[{"name":"Department of Information and Communication Engineering, Chungbuk National University","place":["Cheongju, Korea (the Republic of)"]},{"name":"Multimedia Department, Faculty of Computer and Information, Assiut University","place":["Cheongju, Korea (the Republic of)"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6764-8969","authenticated-orcid":false,"given":"Mohamed","family":"Mahmoud","sequence":"additional","affiliation":[{"name":"Faculty of Computer and Information, Assiut University","place":["Assiut, Egypt"]},{"name":"College of Electrical and Computer Engineering, Chungbuk National University","place":["Assiut, Egypt"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4333-2852","authenticated-orcid":false,"given":"Hyun-Soo","family":"Kang","sequence":"additional","affiliation":[{"name":"Department of Information and Communication Engineering, Chungbuk National University","place":["Cheongju, Korea (the Republic of)"]}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2025,11,8]]},"reference":[{"key":"e_1_3_2_2_2","unstructured":"Sondos Aabed and Ahmad Khairaldin. 2024. An end-to-end segmentation-free arabic handwritten recognition model on KHATT. arXiv:2406.15329. Retrieved from https:\/\/arxiv.org\/abs\/2406.15329"},{"key":"e_1_3_2_3_2","unstructured":"Abdelrahman Abdallah Mahmoud Abdalla Mahmoud SalahEldin Kasem Mohamed Mahmoud Ibrahim Abdelhalim Mohamed Elkasaby Yasser ElBendary and Adam Jatowt. 2024. CORU: Comprehensive post-OCR parsing and receipt understanding dataset. arXiv:2406.04493. Retrieved from https:\/\/arxiv.org\/abs\/2406.04493"},{"issue":"2","key":"e_1_3_2_4_2","doi-asserted-by":"crossref","first-page":"754","DOI":"10.11591\/ijeecs.v26.i2.pp754-763","article-title":"An approach to analysis of arabic text documents into text lines, words, and characters","volume":"26","author":"Abdo Hakim A.","year":"2022","unstructured":"Hakim A. Abdo, Ahmed Abdu, Ramesh R. Manza, and Shobha Bawiskar. 2022. An approach to analysis of arabic text documents into text lines, words, and characters. Indonesian Journal of Electrical Engineering and Computer Science 26, 2 (2022), 754\u2013763.","journal-title":"Indonesian Journal of Electrical Engineering and Computer Science"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.3390\/e23030340"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","unstructured":"Saad Bin Ahmed Saeeda Naz Salahuddin Swati and Muhammad Imran Razzak. 2019. Handwritten Urdu character recognition using one-dimensional BLSTM classifier. Neural Comput. Appl. 31 4 (2019) 1143\u20131151. DOI:10.1007\/S00521-017-3146-X","DOI":"10.1007\/S00521-017-3146-X"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1145\/3532609"},{"issue":"2","key":"e_1_3_2_8_2","doi-asserted-by":"crossref","first-page":"2219","DOI":"10.1007\/s13369-021-06163-9","article-title":"A novel approach to printed Arabic optical character recognition","volume":"47","author":"Ghamdi Mansoor A. Al","year":"2022","unstructured":"Mansoor A. Al Ghamdi. 2022. A novel approach to printed Arabic optical character recognition. Arabian Journal for Science and Engineering 47, 2 (2022), 2219\u20132237.","journal-title":"Arabian Journal for Science and Engineering"},{"key":"e_1_3_2_9_2","first-page":"485","volume-title":"Proceedings of the 8th International Workshop on Frontiers in Handwriting Recognition","author":"Al-Ma\u2019adeed Somaya","year":"2002","unstructured":"Somaya Al-Ma\u2019adeed, Dave Elliman, and Colin A. Higgins. 2002. A data base for Arabic handwritten text recognition research. In Proceedings of the 8th International Workshop on Frontiers in Handwriting Recognition. IEEE, 485\u2013489."},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1016\/S0031-3203(02)00064-X"},{"issue":"1","key":"e_1_3_2_11_2","doi-asserted-by":"crossref","first-page":"69","DOI":"10.17576\/apjitm-2020-0901-06","article-title":"A review of Arabic text recognition dataset","volume":"9","author":"Al-Sheikh I. Saleh","year":"2020","unstructured":"I. Saleh Al-Sheikh, MASNIZAH Mohd, and L. Warlina. 2020. A review of Arabic text recognition dataset. Asia-Pacific J. Inf. Technol. Multimedia 9, 1 (2020), 69\u201381.","journal-title":"Asia-Pacific J. Inf. Technol. Multimedia"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1007\/s40998-020-00317-5"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1108\/PRR-05-2017-0026"},{"key":"e_1_3_2_14_2","doi-asserted-by":"crossref","unstructured":"Mansoor Alghamdi and William Teahan. 2018. Printed arabic script recognition: A Survey. International Journal of Advanced Computer Science and Applications 9 9 (2018).","DOI":"10.14569\/IJACSA.2018.090953"},{"key":"e_1_3_2_15_2","first-page":"1","volume-title":"Proceedings of the 2016 7th International Conference on Computer Science and Information Technology","author":"Alghamdi Mansoor A.","year":"2016","unstructured":"Mansoor A. Alghamdi, Ibrahim S. Alkhazi, and William J. Teahan. 2016. Arabic OCR evaluation tool. In Proceedings of the 2016 7th International Conference on Computer Science and Information Technology. IEEE, 1\u20136."},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1007\/s44230-023-00024-4"},{"issue":"3","key":"e_1_3_2_17_2","first-page":"78","article-title":"A survey on the existing arabic optical character recognition and future trends","volume":"7","author":"Alhomed Lutfieh S.","year":"2018","unstructured":"Lutfieh S. Alhomed and Kamal M. Jambi. 2018. A survey on the existing arabic optical character recognition and future trends. International Journal of Advanced Research in Computer and Communication Engineering 7, 3 (2018), 78\u201388.","journal-title":"International Journal of Advanced Research in Computer and Communication Engineering"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-022-13831-x"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-021-06423-7"},{"issue":"8","key":"e_1_3_2_20_2","doi-asserted-by":"crossref","first-page":"9943","DOI":"10.1007\/s13369-021-06363-3","article-title":"Arabic handwritten recognition using deep learning: A Survey","volume":"47","author":"Alrobah Naseem","year":"2022","unstructured":"Naseem Alrobah and Saleh Albahli. 2022. Arabic handwritten recognition using deep learning: A Survey. Arabian Journal for Science and Engineering 47, 8 (2022), 9943\u20139963.","journal-title":"Arabian Journal for Science and Engineering"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-024-09674-2"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-020-05070-8"},{"key":"e_1_3_2_23_2","doi-asserted-by":"crossref","unstructured":"Yazan M. Alwaqfi Mumtazimah Mohamad and Ahmad T. Al-Taani. 2022. Generative adversarial network for an improved arabic handwritten characters recognition. International Journal of Advances in Soft Computing & Its Applications 14 1 (2022).","DOI":"10.15849\/IJASCA.220328.12"},{"key":"e_1_3_2_24_2","doi-asserted-by":"crossref","unstructured":"Yazan M. Alwaqfi Mumtazimah Mohamad Ahmad T. Al-Taani and Nazirah Abd Hamid. 2023. A novel hybrid DL model for printed Arabic word recognition based on GAN. International Journal of Advanced Computer Science and Applications 14 1 (2023).","DOI":"10.14569\/IJACSA.2023.0140165"},{"key":"e_1_3_2_25_2","first-page":"95","volume-title":"Proceedings of the 2022 IEEE Canadian Conference on Electrical and Computer Engineering .","author":"Alzrrog Nori","year":"2022","unstructured":"Nori Alzrrog, Jean-Fran\u00e7ois Bousquet, and Idris El-Feghi. 2022. Deep learning application for handwritten arabic word recognition. In Proceedings of the 2022 IEEE Canadian Conference on Electrical and Computer Engineering .IEEE, 95\u2013100."},{"key":"e_1_3_2_26_2","unstructured":"Jason Antonio Aditya Rachman Putra Harits Abdurrohman and Moch Shandy Tsalasa. 2022. A survey on scanned receipts OCR and information extraction. In Proceedings of the International Conference on Document Analysis and Recognit. Jerusalem Israel 29\u201330."},{"key":"e_1_3_2_27_2","first-page":"313","volume-title":"Proceedings of the IEC (Prague)","author":"Asiri Adil M.","year":"2005","unstructured":"Adil M. Asiri and Mohammad S. Khorsheed. 2005. Automatic processing of handwritten arabic forms using neural networks. In Proceedings of the IEC (Prague). 313\u2013317."},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.sigpro.2008.12.022"},{"key":"e_1_3_2_29_2","first-page":"40","volume-title":"Proceedings of the 2019 14th International Conference on Computer Engineering and Systems","author":"Awni Mohamed","year":"2019","unstructured":"Mohamed Awni, Mahmoud I. Khalil, and Hazem M. Abbas. 2019. Deep-learning ensemble for offline Arabic handwritten words recognition. In Proceedings of the 2019 14th International Conference on Computer Engineering and Systems. IEEE, 40\u201345."},{"key":"e_1_3_2_30_2","doi-asserted-by":"crossref","unstructured":"Mahmoud Badry Hesham Hassan Hanaa Bayomi and Hussien Oakasha. 2018. QTID: Quran text image dataset. International Journal of Advanced Computer Science and Applications 9 3 (2018).","DOI":"10.14569\/IJACSA.2018.090351"},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.32604\/cmc.2021.015489"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-022-10313-2"},{"key":"e_1_3_2_33_2","unstructured":"Youssef Bassil and Mohammad Alwani. 2012. Ocr post-processing error correction algorithm using google online spelling suggestion. arXiv:1204.0191. Retrieved from https:\/\/arxiv.org\/abs\/1204.0191"},{"key":"e_1_3_2_34_2","doi-asserted-by":"crossref","unstructured":"Bilal Bataineh. 2017. A printed PAW image database of arabic language for document analysis and recognition. Journal of ICT Research & Applications 11 2 (2017).","DOI":"10.5614\/itbj.ict.res.appl.2017.11.2.6"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.3390\/s22113995"},{"key":"e_1_3_2_36_2","doi-asserted-by":"crossref","unstructured":"Gagan Bhatia El Moatez Billah Nagoudi Fakhraddin Alwajih and Muhammad Abdul-Mageed. 2024. Qalam: A multimodal LLM for arabic optical character and handwriting recognition. arXiv:2407.13559. Retrieved from https:\/\/arxiv.org\/abs\/2407.13559","DOI":"10.18653\/v1\/2024.arabicnlp-1.19"},{"issue":"3","key":"e_1_3_2_37_2","doi-asserted-by":"crossref","first-page":"1624","DOI":"10.3390\/app13031624","article-title":"Recognition and classification of handwritten urdu numerals using deep learning techniques","volume":"13","author":"Bhatti Aamna","year":"2023","unstructured":"Aamna Bhatti, Ameera Arif, Waqar Khalid, Baber Khan, Ahmad Ali, Shehzad Khalid, and Atiq ur Rehman. 2023. Recognition and classification of handwritten urdu numerals using deep learning techniques. Applied Sciences 13, 3 (2023), 1624.","journal-title":"Applied Sciences"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10032-023-00436-9"},{"issue":"3","key":"e_1_3_2_39_2","doi-asserted-by":"crossref","first-page":"1692","DOI":"10.3390\/app13031692","article-title":"Towards accurate children\u2019s Arabic handwriting recognition via deep learning","volume":"13","author":"Durayhim Anfal Bin","year":"2023","unstructured":"Anfal Bin Durayhim, Amani Al-Ajlan, Isra Al-Turaiki, and Najwa Altwaijry. 2023. Towards accurate children\u2019s Arabic handwriting recognition via deep learning. Applied Sciences 13, 3 (2023), 1692.","journal-title":"Applied Sciences"},{"issue":"4","key":"e_1_3_2_40_2","article-title":"Characters segmentation from arabic handwritten document images: Hybrid approach","volume":"13","author":"Boraik Omar Ali","year":"2022","unstructured":"Omar Ali Boraik, M. Ravikumar, and Mufeed Ahmed Naji Saif. 2022. Characters segmentation from arabic handwritten document images: Hybrid approach. International Journal of Advanced Computer Science and Applications 13, 4 (2022).","journal-title":"International Journal of Advanced Computer Science and Applications"},{"key":"e_1_3_2_41_2","doi-asserted-by":"crossref","first-page":"877","DOI":"10.1007\/978-981-33-6893-4_79","volume-title":"WITS 2020: Proceedings of the 6th International Conference on Wireless Technologies, Embedded, and Intelligent Systems","author":"Boualam Manal","year":"2022","unstructured":"Manal Boualam, Youssef Elfakir, Ghizlane Khaissidi, and Mostafa Mrabti. 2022. Arabic handwriting word recognition based on convolutional recurrent neural network. In WITS 2020: Proceedings of the 6th International Conference on Wireless Technologies, Embedded, and Intelligent Systems. Springer, 877\u2013885."},{"key":"e_1_3_2_42_2","first-page":"1","volume-title":"Proceedings of the 2021 International Conference on Recent Advances in Mathematics and Informatics","author":"Bouchakour Lallouani","year":"2021","unstructured":"Lallouani Bouchakour, Fariza Meziani, Houda Latrache, Khadija Ghribi, and Mustapha Yahiaoui. 2021. Printed arabic characters recognition using combined features and CNN classifier. In Proceedings of the 2021 International Conference on Recent Advances in Mathematics and Informatics. IEEE, 1\u20135."},{"key":"e_1_3_2_43_2","first-page":"1","volume-title":"Proceedings of the 2022 4th International Conference on Pattern Analysis and Intelligent Systems","author":"Bouressace Hassina","year":"2022","unstructured":"Hassina Bouressace. 2022. A review of arabic document analysis methods. In Proceedings of the 2022 4th International Conference on Pattern Analysis and Intelligent Systems. IEEE, 1\u20137."},{"key":"e_1_3_2_44_2","doi-asserted-by":"publisher","DOI":"10.3390\/forecast3030033"},{"issue":"16","key":"e_1_3_2_45_2","doi-asserted-by":"crossref","first-page":"5430","DOI":"10.3390\/app10165430","article-title":"Automatic cnn-based Arabic numeral spotting and handwritten digit recognition by using deep transfer learning in Ottoman population registers","volume":"10","author":"Can Yekta Said","year":"2020","unstructured":"Yekta Said Can and M. Erdem Kabaday\u0131. 2020. Automatic cnn-based Arabic numeral spotting and handwritten digit recognition by using deep transfer learning in Ottoman population registers. Applied Sciences 10, 16 (2020), 5430.","journal-title":"Applied Sciences"},{"key":"e_1_3_2_46_2","first-page":"120","volume-title":"Proceedings of the 2016 15th International Conference on Frontiers in Handwriting Recognition","author":"Chabchoub Fatma","year":"2016","unstructured":"Fatma Chabchoub, Yousri Kessentini, Slim Kanoun, Veronique Eglin, and Frank Lebourgeois. 2016. SmartATID: A mobile captured arabic text images dataset for multi-purpose recognition tasks. In Proceedings of the 2016 15th International Conference on Frontiers in Handwriting Recognition. IEEE, 120\u2013125."},{"key":"e_1_3_2_47_2","unstructured":"Adrian Chan Anupam Mijar Mehreen Saeed Chau-Wai Wong and Akram Khater. 2024. HATFormer: Historic handwritten arabic text recognition with transformers. arXiv:2410.02179. Retrieved from https:\/\/arxiv.org\/abs\/2410.02179"},{"issue":"2","key":"e_1_3_2_48_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.4018\/IJAIML.20210701.oa9","article-title":"A survey on arabic handwritten script recognition systems","volume":"11","author":"Djaghbellou Soumia","year":"2021","unstructured":"Soumia Djaghbellou, Abderraouf Bouziane, Abdelouahab Attia, and Zahid Akhtar. 2021. A survey on arabic handwritten script recognition systems. International Journal of Artificial Intelligence and Machine Learning 11, 2 (2021), 1\u201317.","journal-title":"International Journal of Artificial Intelligence and Machine Learning"},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10032-018-0297-y"},{"issue":"2","key":"e_1_3_2_50_2","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1007\/s11063-024-11544-w","article-title":"CNN-based methods for offline arabic handwriting recognition: A review","volume":"56","author":"Khayati Mohsine El","year":"2024","unstructured":"Mohsine El Khayati, Ismail Kich, and Youssef Taouil. 2024. CNN-based methods for offline arabic handwriting recognition: A review. Neural Processing Letters 56, 2 (2024), 115.","journal-title":"Neural Processing Letters"},{"key":"e_1_3_2_51_2","first-page":"237","volume-title":"Proceedings of the Artificial Intelligence and Pattern Recognition","author":"El-Sherif Ezzat Ali","year":"2007","unstructured":"Ezzat Ali El-Sherif and Sherif Abdelazeem. 2007. A two-stage system for arabic handwritten digit recognition tested on a new large database. In Proceedings of the Artificial Intelligence and Pattern Recognition. 237\u2013242."},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","unstructured":"Mohsine Elkhayati Youssfi Elkettani and Mohammed Mourchid. 2022. Segmentation of handwritten Arabic graphemes using a directed convolutional neural network and mathematical morphology operations. Pattern Recognition 122 (2022) 108288. DOI:10.1016\/j.patcog.2021.108288","DOI":"10.1016\/j.patcog.2021.108288"},{"key":"e_1_3_2_53_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-025-11020-z"},{"key":"e_1_3_2_54_2","doi-asserted-by":"publisher","unstructured":"Mohamed Eltay Abdelmalek Zidouri and Irfan Ahmad. 2020. Exploring deep learning approaches to recognize handwritten Arabic texts. IEEE Access 8 (2020) 89882\u201389898. DOI:10.1109\/ACCESS.2020.2994248","DOI":"10.1109\/ACCESS.2020.2994248"},{"key":"e_1_3_2_55_2","doi-asserted-by":"crossref","unstructured":"Mohamed Eltay Abdelmalek Zidouri Irfan Ahmad and Yousef Elarian. 2022. Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition. PeerJ Computer Science 8 (2022) e861.","DOI":"10.7717\/peerj-cs.861"},{"key":"e_1_3_2_56_2","doi-asserted-by":"publisher","DOI":"10.3390\/s18092786"},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-85729-859-1_12"},{"key":"e_1_3_2_58_2","doi-asserted-by":"publisher","unstructured":"Fatemeh Haghighi and Hesam Omranpour. 2021. Stacking ensemble model of deep learning and its application to Persian\/Arabic handwritten digits recognition. Knowledge-Based Systems 220 (2021) 106940. DOI:10.1016\/j.knosys.2021.106940","DOI":"10.1016\/j.knosys.2021.106940"},{"key":"e_1_3_2_59_2","doi-asserted-by":"crossref","unstructured":"Karez Hamad and Mehmet Kaya. 2016. A detailed analysis of optical character recognition technology. International Journal of Applied Mathematics Electronics and Computers Special 1 (2016) 244\u2013249.","DOI":"10.18100\/ijamec.270374"},{"key":"e_1_3_2_60_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-023-15167-6"},{"key":"e_1_3_2_61_2","doi-asserted-by":"publisher","DOI":"10.1145\/3581783.3612076"},{"key":"e_1_3_2_62_2","first-page":"1516","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition","author":"Huang Zheng","year":"2019","unstructured":"Zheng Huang, Kai Chen, Jianhua He, Xiang Bai, Dimosthenis Karatzas, Shijian Lu, and CV Jawahar. 2019. Icdar2019 competition on scanned receipt ocr and information extraction. In Proceedings of the 2019 International Conference on Document Analysis and Recognition. IEEE, 1516\u20131520."},{"key":"e_1_3_2_63_2","unstructured":"Noman Islam Zeeshan Islam and Nazia Noor. 2017. A survey on optical character recognition system. arXiv:1710.05703. Retrieved from https:\/\/arxiv.org\/abs\/1710.05703"},{"key":"e_1_3_2_64_2","doi-asserted-by":"publisher","unstructured":"Sana Khamekhem Jemni Yousri Kessentini and Slim Kanoun. 2019. Out of vocabulary word detection and recovery in Arabic handwritten text recognition. Pattern Recognition 93 (2019) 507\u2013520. DOI:10.1016\/j.patcog.2019.05.003","DOI":"10.1016\/j.patcog.2019.05.003"},{"key":"e_1_3_2_65_2","doi-asserted-by":"publisher","unstructured":"Naila Habib Khan and Awais Adnan. 2018. Urdu optical character recognition systems: Present contributions and future directions. IEEE Access 6 (2018) 46019\u201346046. DOI:10.1109\/ACCESS.2018.2865532","DOI":"10.1109\/ACCESS.2018.2865532"},{"key":"e_1_3_2_66_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2006.12.022"},{"key":"e_1_3_2_67_2","doi-asserted-by":"publisher","DOI":"10.3390\/app122211760"},{"key":"e_1_3_2_68_2","first-page":"255","volume-title":"Proceedings of the European Workshop on Visual Information Processing","author":"Lawgali Ahmed","year":"2013","unstructured":"Ahmed Lawgali, Maia Angelova, and Ahmed Bouridane. 2013. HACDB: Handwritten arabic characters database for automatic character recognition. In Proceedings of the European Workshop on Visual Information Processing. IEEE, 255\u2013259."},{"key":"e_1_3_2_69_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2013.12.012"},{"key":"e_1_3_2_70_2","doi-asserted-by":"crossref","unstructured":"Mohamed G Mahdi Ahmed Sleem and Ibrahim Elhenawy. 2024. Deep learning algorithms for Arabic optical character recognition: A survey. Multicriteria Algorithms with Applications 2 (2024) 65\u201379.","DOI":"10.61356\/j.mawa.2024.26861"},{"key":"e_1_3_2_71_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2013.08.009"},{"key":"e_1_3_2_72_2","unstructured":"Sriparna Majumdar and Aaron Brick. 2022. Recognizing handwriting styles in a historical scanned document using scikit-fuzzy c-means clustering. arXiv:2210.16780. Retrieved from https:\/\/arxiv.org\/abs\/2210.16780"},{"issue":"7","key":"e_1_3_2_73_2","doi-asserted-by":"crossref","first-page":"2909","DOI":"10.1007\/s00371-022-02500-7","article-title":"Handwritten arabic and roman word recognition using holistic approach","volume":"39","author":"Malakar Samir","year":"2023","unstructured":"Samir Malakar, Samanway Sahoo, Anuran Chakraborty, Ram Sarkar, and Mita Nasipuri. 2023. Handwritten arabic and roman word recognition using holistic approach. The Visual Computer 39, 7 (2023), 2909\u20132932.","journal-title":"The Visual Computer"},{"key":"e_1_3_2_74_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2022.103068"},{"key":"e_1_3_2_75_2","doi-asserted-by":"publisher","DOI":"10.3103\/S0146411623030069"},{"key":"e_1_3_2_76_2","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1145\/3322905.3322928","volume-title":"Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage","author":"Milo Thomas","year":"2019","unstructured":"Thomas Milo and Alicia Gonzalez Martinez. 2019. A new strategy for Arabic OCR: Archigraphemes, letter blocks, script grammar, and shape synthesis. In Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage. 93\u201396."},{"key":"e_1_3_2_77_2","doi-asserted-by":"publisher","unstructured":"Seyyed Amir Hadi Minoofam Azam Bastanfard and Mohammad Reza Keyvanpour. 2025. DB-QM: A comparative quality measurement and its prospective on persian\/arabic databases for OCR. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 24 7 (July 2025). DOI:10.1145\/3711714","DOI":"10.1145\/3711714"},{"key":"e_1_3_2_78_2","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1145\/3322905.3322927","volume-title":"Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage","author":"Mohamed Emad","year":"2019","unstructured":"Emad Mohamed and Zeeshan Ali Sayyed. 2019. Arabic-SOS: Segmentation, stemming, and orthography standardization for classical and pre-modern standard Arabic. In Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage. 27\u201332."},{"key":"e_1_3_2_79_2","first-page":"74","volume-title":"Proceedings of the 2023 International Mobile, Intelligent, and Ubiquitous Computing Conference.","author":"Mortadi Ahmad","year":"2023","unstructured":"Ahmad Mortadi, Ahmed Mohamed, Ahmed Talima, Ahmed Alkhattip, Ahmed Ibrahim, Ahmed Osman, and Yasser Hifny. 2023. ALNASIKH: An arabic OCR system based on transformers. In Proceedings of the 2023 International Mobile, Intelligent, and Ubiquitous Computing Conference.IEEE, 74\u201381."},{"key":"e_1_3_2_80_2","doi-asserted-by":"publisher","unstructured":"Lamia Mosbah Ikram Moalla Tarek M. Hamdani Bilel Neji Taha Beyrouthy and Adel M. Alimi. 2024. ADOCRNet: A deep learning OCR for arabic documents recognition. IEEE Access 12 (2024) 55620\u201355631. DOI:10.1109\/ACCESS.2024.3379530","DOI":"10.1109\/ACCESS.2024.3379530"},{"key":"e_1_3_2_81_2","first-page":"182","volume-title":"Proceedings of the 2021 International Mobile, Intelligent, and Ubiquitous Computing Conference","author":"Mostafa Aly","year":"2021","unstructured":"Aly Mostafa, Omar Mohamed, Ali Ashraf, Ahmed Elbehery, Salma Jamal, Ghada Khoriba, and Amr S. Ghoneim. 2021. Ocformer: A transformer-based model for arabic handwritten text recognition. In Proceedings of the 2021 International Mobile, Intelligent, and Ubiquitous Computing Conference. IEEE, 182\u2013186."},{"key":"e_1_3_2_82_2","unstructured":"Aly Mostafa Omar Mohamed Ali Ashraf Ahmed Elbehery Salma Jamal Anas Salah and Amr S. Ghoneim. 2022. An end-to-end ocr framework for robust arabic-handwriting recognition using a novel transformers-based model and an innovative 270 million-words multi-font corpus of classical arabic with diacritics. arXiv:2208.11484. Retrieved from https:\/\/arxiv.org\/abs\/2208.11484"},{"issue":"2","key":"e_1_3_2_83_2","doi-asserted-by":"crossref","first-page":"1309","DOI":"10.1007\/s13369-021-05796-0","article-title":"Conditional deep convolutional generative adversarial networks for isolated handwritten arabic character generation","volume":"47","author":"Mustapha Ismail B.","year":"2022","unstructured":"Ismail B. Mustapha, Shafaatunnur Hasan, Hatem Nabus, and Siti Mariyam Shamsuddin. 2022. Conditional deep convolutional generative adversarial networks for isolated handwritten arabic character generation. Arabian Journal for Science and Engineering 47, 2 (2022), 1309\u20131320.","journal-title":"Arabian Journal for Science and Engineering"},{"key":"e_1_3_2_84_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-021-11593-6"},{"key":"e_1_3_2_85_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10639-015-9377-5"},{"key":"e_1_3_2_86_2","doi-asserted-by":"publisher","DOI":"10.1145\/3476887.3476888"},{"key":"e_1_3_2_87_2","doi-asserted-by":"publisher","DOI":"10.1145\/3453476"},{"key":"e_1_3_2_88_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-023-14880-6"},{"issue":"5","key":"e_1_3_2_89_2","doi-asserted-by":"crossref","first-page":"14047","DOI":"10.1007\/s11042-023-16166-3","article-title":"Challenges and opportunities for Arabic CAPTCHAs","volume":"83","author":"Parvez Mohammad T.","year":"2024","unstructured":"Mohammad T. Parvez and Suliman A. Alsuhibany. 2024. Challenges and opportunities for Arabic CAPTCHAs. Multimedia Tools and Applications 83, 5 (2024), 14047\u201314062.","journal-title":"Multimedia Tools and Applications"},{"key":"e_1_3_2_90_2","doi-asserted-by":"publisher","DOI":"10.1145\/2431211.2431222"},{"key":"e_1_3_2_91_2","first-page":"127","volume-title":"Proceedings of the CIFED","author":"Pechwitz Mario","year":"2002","unstructured":"Mario Pechwitz, S. Snoussi Maddouri, Volker M\u00e4rgner, Noureddine Ellouze, Hamid Amiri, et\u00a0al. 2002. IFN\/ENIT-database of handwritten Arabic words. In Proceedings of the CIFED. Citeseer, 127\u2013136."},{"key":"e_1_3_2_92_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jksuci.2022.04.021"},{"key":"e_1_3_2_93_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jksuci.2019.08.013"},{"key":"e_1_3_2_94_2","doi-asserted-by":"publisher","unstructured":"Jabril Ramdan Khairuddin Omar Mohammad Faidzul and Ali Mady. 2013. Arabic handwriting data base for text recognition. Procedia Technology 11 (2013) 580\u2013584. DOI:10.1016\/j.protcy.2013.12.231","DOI":"10.1016\/j.protcy.2013.12.231"},{"key":"e_1_3_2_95_2","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1007\/978-3-031-07012-9_33","volume-title":"Emerging Technologies in Computer Engineering: Cognitive Computing and Intelligent IoT: 5th International Conference, ICETCE 2022, Jaipur, India, February 4\u20135, 2022, Revised Selected Papers","author":"Rashid Dhuha","year":"2022","unstructured":"Dhuha Rashid and Naveen Kumar Gondhi. 2022. Scrutinization of urdu handwritten text recognition with machine learning approach. In Emerging Technologies in Computer Engineering: Cognitive Computing and Intelligent IoT: 5th International Conference, ICETCE 2022, Jaipur, India, February 4\u20135, 2022, Revised Selected Papers. Springer, 383\u2013394."},{"key":"e_1_3_2_96_2","doi-asserted-by":"publisher","DOI":"10.3390\/app9224853"},{"key":"e_1_3_2_97_2","first-page":"215","volume-title":"Proceedings of the Document Recognition and Retrieval XX","author":"Sabbour Nazly","year":"2013","unstructured":"Nazly Sabbour and Faisal Shafait. 2013. A segmentation-free approach to arabic and urdu OCR. In Proceedings of the Document Recognition and Retrieval XX. SPIE, 215\u2013226."},{"key":"e_1_3_2_98_2","doi-asserted-by":"crossref","first-page":"540","DOI":"10.1007\/978-3-031-29860-8_55","volume-title":"Proceedings of the International Conference on Digital Technologies and Applications","author":"Sabri My Abdelouahed","year":"2023","unstructured":"My Abdelouahed Sabri, Assia Ennouni, and Abdellah Aarab. 2023. A robust approach for arabic document images segmentation and indexation. In Proceedings of the International Conference on Digital Technologies and Applications. Springer, 540\u2013549."},{"key":"e_1_3_2_99_2","first-page":"56","volume-title":"Proceedings of the 2015 3rd International Conference on Image Information Processing","author":"Saddami Khairun","year":"2015","unstructured":"Khairun Saddami, Khairul Munadi, and Fitri Arnia. 2015. A database of printed jawi character image. In Proceedings of the 2015 3rd International Conference on Image Information Processing. IEEE, 56\u201359."},{"key":"e_1_3_2_100_2","first-page":"39","volume-title":"Proceedings of the 2023 15th International Conference on Developments in eSystems Engineering","author":"Salman Ghufran Jafar","year":"2023","unstructured":"Ghufran Jafar Salman and Mohammed Sahib Mahdi Altaei. 2023. Proposed deep learning system for arabic text detection and recognition. In Proceedings of the 2023 15th International Conference on Developments in eSystems Engineering. IEEE, 39\u201344."},{"issue":"3","key":"e_1_3_2_101_2","doi-asserted-by":"crossref","first-page":"314","DOI":"10.7763\/IJMLC.2012.V2.137","article-title":"A survey of OCR applications","volume":"2","author":"Singh Amarjot","year":"2012","unstructured":"Amarjot Singh, Ketan Bacchuwar, and Akshay Bhasin. 2012. A survey of OCR applications. International Journal of Machine Learning and Computing 2, 3 (2012), 314.","journal-title":"International Journal of Machine Learning and Computing"},{"key":"e_1_3_2_102_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-022-08045-z"},{"key":"e_1_3_2_103_2","unstructured":"Fouad Slimane Rolf Ingold Slim Kanoun Adel M. Alimi and Jean Hennebert. 2009. Database and evaluation protocols for Arabic printed text recognition. DIUF-University of Fribourg-Switzerland 1 (2009)."},{"key":"e_1_3_2_104_2","first-page":"1","volume-title":"Proceedings of the 2017 6th International Conference on Electrical Engineering and Informatics","author":"Sulaiman Alaa","year":"2017","unstructured":"Alaa Sulaiman, Khairuddin Omar, and Mohammad F. Nasrudin. 2017. A database for degraded Arabic historical manuscripts. In Proceedings of the 2017 6th International Conference on Electrical Engineering and Informatics. IEEE, 1\u20136."},{"key":"e_1_3_2_105_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-50835-1_66"},{"key":"e_1_3_2_106_2","doi-asserted-by":"publisher","unstructured":"Moeen Tayyab Ayyaz Hussain Mohammed Ali Alshara Shakir Khan Reemiah Muneer Alotaibi and Abdul Rauf Baig. 2022. Recognition of visual arabic scripting news ticker from broadcast stream. IEEE Access 10 (2022) 59189\u201359204. DOI:10.1109\/ACCESS.2022.3179366","DOI":"10.1109\/ACCESS.2022.3179366"},{"key":"e_1_3_2_107_2","first-page":"1","article-title":"Deep learning adversarial attacks and defenses on license plate recognition system","author":"Vizcarra Conrado","year":"2024","unstructured":"Conrado Vizcarra, Shadan Alhamed, Abdulelah Algosaibi, Mohammed Alnaeem, Adel Aldalbahi, Nura Aljaafari, Ahmad Sawalmeh, Mahmoud Nazzal, Abdallah Khreishah, Abdulaziz Alhumam, and Muhammad Anan. 2024. Deep learning adversarial attacks and defenses on license plate recognition system. Cluster Computing (2024), 1\u201318.","journal-title":"Cluster Computing"},{"key":"e_1_3_2_108_2","unstructured":"Alhossien Waly Bassant Tarek Ali Feteha Rewan Yehia Gasser Amr Walid Gomaa and Ahmed Fares. 2025. Invizo: Arabic handwritten document optical character recognition solution. arXiv:2502.05277. Retrieved from https:\/\/arxiv.org\/abs\/2502.05277"},{"key":"e_1_3_2_109_2","first-page":"1221","volume-title":"Proceedings of the 2015 13th International Conference on Document Analysis and Recognition .","author":"Yousfi Sonia","year":"2015","unstructured":"Sonia Yousfi, Sid-Ahmed Berrani, and Christophe Garcia. 2015. ALIF: A dataset for arabic embedded text recognition in TV broadcast. In Proceedings of the 2015 13th International Conference on Document Analysis and Recognition .IEEE, 1221\u20131225."},{"key":"e_1_3_2_110_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2015.7333911"},{"key":"e_1_3_2_111_2","doi-asserted-by":"publisher","DOI":"10.3390\/jimaging4020032"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3768150","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,8]],"date-time":"2025-11-08T13:56:50Z","timestamp":1762610210000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3768150"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,8]]},"references-count":110,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2026,3,31]]}},"alternative-id":["10.1145\/3768150"],"URL":"https:\/\/doi.org\/10.1145\/3768150","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,11,8]]},"assertion":[{"value":"2023-12-20","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-08-12","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-11-08","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}