{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T01:14:39Z","timestamp":1760058879679,"version":"build-2065373602"},"reference-count":134,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2025,5,5]],"date-time":"2025-05-05T00:00:00Z","timestamp":1746403200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"King Abdulaziz City for Science and Technology"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Computation"],"abstract":"<jats:p>Assessing text readability is important for helping language learners and readers select texts that match their proficiency levels. Research in cognitive psychology, which uses behavioral data such as eye-tracking and electroencephalogram signals, has shown its effectiveness in detecting cognitive activities that correlate with text difficulty during reading. However, Arabic, with its distinctive linguistic characteristics, presents unique challenges in readability assessment using cognitive data. While behavioral data have been employed in readability assessments, their full potential, particularly in Arabic contexts, remains underexplored. This paper presents the development of the first Arabic eye-tracking corpus, comprising eye movement data collected from Arabic-speaking participants, with a total of 57,617 words. 
Subsequently, this corpus can be utilized to evaluate a broad spectrum of text-based and gaze-based features, employing machine learning and deep learning methods to improve Arabic readability assessments by integrating cognitive data into the readability assessment process.<\/jats:p>","DOI":"10.3390\/computation13050108","type":"journal-article","created":{"date-parts":[[2025,5,5]],"date-time":"2025-05-05T21:42:09Z","timestamp":1746481329000},"page":"108","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["AraEyebility: Eye-Tracking Data for Arabic Text Readability"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7531-0666","authenticated-orcid":false,"given":"Ibtehal","family":"Baazeem","sequence":"first","affiliation":[{"name":"Artificial Intelligence and Robotics Institute, King Abdulaziz City for Science and Technology, Riyadh 13523, Saudi Arabia"},{"name":"College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7328-4935","authenticated-orcid":false,"given":"Hend","family":"Al-Khalifa","sequence":"additional","affiliation":[{"name":"College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5874-2611","authenticated-orcid":false,"given":"Abdulmalik","family":"Al-Salman","sequence":"additional","affiliation":[{"name":"College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia"}]}],"member":"1968","published-online":{"date-parts":[[2025,5,5]]},"reference":[{"key":"ref_1","unstructured":"Balyan, R., McCarthy, K.S., and McNamara, D.S. (2018, January 21\u201323). Comparing Machine Learning Classification Approaches for Predicting Expository Text Difficulty. 
Proceedings of the Thirty-First International FLAIRS Conference, Melbourne, FL, USA."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1075\/itl.165.2.01col","article-title":"Computational assessment of text readability: A survey of current and future research","volume":"165","year":"2014","journal-title":"ITL Int. J. Appl. Linguist."},{"key":"ref_3","first-page":"103","article-title":"Automatic readability measurements of the Arabic text: An exploratory study","volume":"35","year":"2010","journal-title":"Arab. J. Sci. Eng."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Nassiri, N., Lakhouaja, A., and Cavalli-Sforza, V. (2017, January 11\u201312). Modern Standard Arabic Readability Prediction. Proceedings of the Arabic Language Processing: From Theory to Practice (ICALP 2017), Fez, Morocco.","DOI":"10.1007\/978-3-319-73500-9_9"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Al-Ajlan, A.A., Al-Khalifa, H.S., and Al-Salman, A.S. (2008, January 13\u201316). Towards the development of an automatic readability measurements for Arabic language. Proceedings of the Third International Conference on Digital Information Management, London, UK.","DOI":"10.1109\/ICDIM.2008.4746711"},{"key":"ref_6","first-page":"19","article-title":"The Concept of Readability","volume":"26","author":"Dale","year":"1949","journal-title":"Elem. Engl."},{"key":"ref_7","first-page":"370","article-title":"AARI: Automatic Arabic readability index","volume":"11","author":"Jaradat","year":"2014","journal-title":"Int. Arab J. Inf. Technol."},{"key":"ref_8","unstructured":"Baazeem, I. (2015). Analysing the Effects of Latent Semantic Analysis Parameters on Plain Language Visualisation. [Master\u2019s Thesis, Queensland University]."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Mesgar, M., and Strube, M. (2015, January 4\u20135). Graph-based coherence modeling for assessing readability. 
Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics, Denver, CO, USA.","DOI":"10.18653\/v1\/S15-1036"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1016\/j.procs.2018.10.459","article-title":"Arabic Readability Research: Current State and Future Directions","volume":"142","author":"Saddiki","year":"2018","journal-title":"Procedia Comp. Sci."},{"key":"ref_11","unstructured":"Feng, L., Elhadad, N.M., and Huenerfauth, M. (April, January 30). Cognitively motivated features for readability assessment. Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, Athens, Greece."},{"key":"ref_12","unstructured":"Balakrishna, S.V. (2015). Analyzing Text Complexity and Text Simplification: Connecting Linguistics, Processing and Educational Applications. [Ph.D. Thesis, der Eberhard Karls Universit\u00e4t T\u00fcbingen]."},{"key":"ref_13","unstructured":"Vajjala, S., Meurers, D., Eitel, A., and Scheiter, K. (2016, January 11). Towards grounding computational linguistic approaches to readability: Modeling reader-text interaction for easy and difficult texts. Proceedings of the Workshop on Computational Linguistics for Linguistic Complexity (CL4LC), Osaka, Japan."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Vajjala, S., and Lucic, I. (2019, January 2). On understanding the relation between expert annotations of text readability and target reader comprehension. Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, Florence, Italy.","DOI":"10.18653\/v1\/W19-4437"},{"key":"ref_15","unstructured":"Mathias, S., Kanojia, D., Mishra, A., and Bhattacharyya, P. (2021, January 7\u201315). A Survey on Using Gaze Behaviour for Natural Language Processing. 
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI-20) Survey Track, Yokohama, Japan."},{"key":"ref_16","unstructured":"Singh, A.D., Mehta, P., Husain, S., and Rajkumar, R. (2016, January 11). Quantifying sentence complexity based on eye-tracking measures. Proceedings of the Workshop on Computational Linguistics for Linguistic Complexity (CL4LC), Osaka, Japan."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Copeland, L., Gedeon, T., and Caldwell, S. (2015, January 19\u201321). Effects of text difficulty and readers on predicting reading comprehension from eye movements. Proceedings of the 6th IEEE International Conference on Cognitive Infocommunications (CogInfoCom), Gyor, Hungary.","DOI":"10.1109\/CogInfoCom.2015.7390628"},{"key":"ref_18","unstructured":"Kennedy, A., Hill, R., and Pynte, J.E. (2003, January 20\u201324). The Dundee Corpus. Proceedings of the 12th European Conference on Eye Movements, Dundee, Scotland."},{"key":"ref_19","unstructured":"Hollenstein, N. (2021). Leveraging Cognitive Processing Signals for Natural Language Understanding. [Ph.D. Thesis, ETH Zurich]."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"602","DOI":"10.3758\/s13428-016-0734-0","article-title":"Presenting GECO: An eyetracking corpus of monolingual and bilingual sentence reading","volume":"49","author":"Cop","year":"2017","journal-title":"Behav. Res. Methods"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Mathias, S., Murthy, R., Kanojia, D., Mishra, A., and Bhattacharyya, P. (2020, January 4\u20137). Happy are those who grade without seeing: A multi-task learning approach to grade essays using gaze behaviour. 
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, Suzhou, China.","DOI":"10.18653\/v1\/2020.aacl-main.86"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1038\/sdata.2018.291","article-title":"ZuCo, a simultaneous EEG and eye-tracking resource for natural sentence reading","volume":"5","author":"Hollenstein","year":"2018","journal-title":"Sci. Data"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"494","DOI":"10.1037\/xhp0000032","article-title":"Processing of Arabic diacritical marks: Phonological\u2013syntactic disambiguation of homographic verbs and visual crowding effects","volume":"41","author":"Hermena","year":"2015","journal-title":"J. Exp. Psychol. Hum. Percept. Perform."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"1189","DOI":"10.1007\/s12144-019-00493-6","article-title":"Reading text with and without diacritics alters brain activation: The case of Arabic","volume":"39","author":"Sarsam","year":"2020","journal-title":"Curr. Psychol."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1443","DOI":"10.3758\/s13423-015-0809-4","article-title":"Effects of word length on eye movement control: The evidence from Arabic","volume":"22","author":"Paterson","year":"2015","journal-title":"Psychon. Bull. Rev."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Baazeem, I., Al-Khalifa, H., and Al-Salman, A. (2021). Cognitively Driven Arabic Text Readability Assessment Using Eye-Tracking. Appl. Sci., 11.","DOI":"10.3390\/app11188607"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"549","DOI":"10.1007\/s10579-014-9274-3","article-title":"Creating language resources for under-resourced languages: Methodologies, and experiments with Arabic","volume":"49","author":"Kruschwitz","year":"2015","journal-title":"Lang. Resour. 
Eval."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1016\/j.procs.2017.10.106","article-title":"Automatic minimal diacritization of Arabic texts","volume":"117","author":"Alnefaie","year":"2017","journal-title":"Procedia Comput. Sci."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Hermena, E.W., Bouamama, S., Liversedge, S.P., and Drieghe, D. (2021). Does diacritics-based lexical disambiguation modulate word frequency, length, and predictability effects? An eye-movements investigation of processing Arabic diacritics. PLoS ONE, 16.","DOI":"10.1371\/journal.pone.0259987"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1075\/sal.10.03alj","article-title":"Eye movements in Arabic reading","volume":"10","author":"AlJassmi","year":"2021","journal-title":"Exp. Arab. Linguist."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Roman, G., and Pavard, B. (1987). A comparative study: How we read in Arabic and French. Eye Movements from Physiology to Cognition, Elsevier.","DOI":"10.1016\/B978-0-444-70113-8.50064-3"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3486675","article-title":"Light Diacritic Restoration to Disambiguate Homographs in Modern Arabic Texts","volume":"21","author":"Azmi","year":"2021","journal-title":"ACM Trans. Asian Low-Resour. Lang. Inf. Process."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Bouamor, H., Zaghouani, W., Diab, M., Obeid, O., Oflazer, K., Ghoneim, M., and Hawwari, A. (2015, January 26\u201331). A pilot study on Arabic multi-genre corpus diacritization. 
Proceedings of the Second Workshop on Arabic Natural Language Processing, Beijing, China.","DOI":"10.18653\/v1\/W15-3209"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"114037","DOI":"10.1016\/j.eswa.2020.114037","article-title":"Eye tracking algorithms, techniques, tools, and applications with an emphasis on machine learning and Internet of Things technologies","volume":"166","author":"Klaib","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_35","first-page":"1","article-title":"Human eye tracking and related issues: A review","volume":"2","author":"Singh","year":"2012","journal-title":"Int. J. Sci. Res. Publ."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"453","DOI":"10.1177\/0267658316637401","article-title":"Using eye-tracking in applied linguistics and second language research","volume":"32","author":"Conklin","year":"2016","journal-title":"Sec. Lang. Res."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"329","DOI":"10.1037\/0033-295X.87.4.329","article-title":"A theory of reading: From eye fixations to comprehension","volume":"87","author":"Just","year":"1980","journal-title":"Psychol. Rev."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Grabar, N., Farce, E., and Sparrow, L. (2018, January 8). Study of readability of health documents with eye-tracking approaches. Proceedings of the 1st Workshop on Automatic Text Adaptation (ATA), Tilburg, The Netherlands.","DOI":"10.18653\/v1\/W18-7003"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Mathias, S., Kanojia, D., Patel, K., Agarwal, S., Mishra, A., and Bhattacharyya, P. (2018, January 15\u201320). Eyes are the windows to the soul: Predicting the rating of text quality using gaze behaviour. 
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Long Papers), Melbourne, Australia.","DOI":"10.18653\/v1\/P18-1219"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Gonzalez-Garduno, A.V., and S\u00f8gaard, A. (2017, January 8). Using gaze to predict text readability. Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, Copenhagen, Denmark.","DOI":"10.18653\/v1\/W17-5050"},{"key":"ref_41","unstructured":"Tobii Technology AB (2017). Tobii Studio User\u2019s Manual (Version 3.4.8), Tobii Technology AB."},{"key":"ref_42","unstructured":"Tobii Technology AB (2023, July 07). Fundamentals. Available online: https:\/\/developer.tobii.com\/xr\/learn\/analytics\/fundamentals\/."},{"key":"ref_43","unstructured":"Tobii Technology AB (2019). Tobii Pro Lab User\u2019s Manual (Version 1.130), Tobii Technology AB."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Wu, X., Xue, C., and Zhou, F. (2015, January 2\u20137). An Experimental Study on Visual Search Factors of Information Features in a Task Monitoring Interface. Proceedings of the Human-Computer Interaction: Users and Contexts, Los Angeles, CA, USA.","DOI":"10.1007\/978-3-319-21006-3_50"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Al-Wabil, A., and Al-Sheaha, M. (2010, January 14\u201316). Towards an interactive screening program for developmental dyslexia: Eye movement analysis in reading Arabic texts. Proceedings of the 12th International Conference on Computers Helping People with Special Needs, Vienna, Austria.","DOI":"10.1007\/978-3-642-14100-3_5"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Al-Edaily, A., Al-Wabil, A., and Al-Ohali, Y. (2013, January 1\u20133). Dyslexia Explorer: A Screening System for Learning Difficulties in the Arabic Language Using Eye Tracking. 
Proceedings of the Human Factors in Computing and Informatics, Maribor, Slovenia.","DOI":"10.1007\/978-3-642-39062-3_63"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Al-Edaily, A., Al-Wabil, A., and Al-Ohali, Y. (2013, January 21\u201326). Interactive Screening for Learning Difficulties: Analyzing Visual Patterns of Reading Arabic Scripts with Eye Tracking. Proceedings of the HCI International 2013\u2014Posters\u2019 Extended Abstracts, Las Vegas, NV, USA.","DOI":"10.1007\/978-3-642-39476-8_1"},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"404","DOI":"10.1006\/brcg.1997.0917","article-title":"Inversion errors in Arabic number reading: Is there a nonsemantic route?","volume":"34","author":"Blanken","year":"1997","journal-title":"Brain Cogn."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Naz, S., Razzak, M.I., Hayat, K., Anwar, M.W., and Khan, S.Z. (2013, January 7\u20139). Challenges in baseline detection of Arabic script based languages. Proceedings of the Intelligent Systems for Science and Information: Extended and Selected Results from the Science and Information Conference, London, UK.","DOI":"10.1007\/978-3-319-04702-7_11"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Wiechmann, D., Qiao, Y., Kerz, E., and Mattern, J. (2022, January 22\u201327). Measuring the impact of (psycho-) linguistic and readability features and their spill over effects on the prediction of eye movement patterns. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, Dublin, Ireland.","DOI":"10.18653\/v1\/2022.acl-long.362"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"178","DOI":"10.1016\/0010-0285(82)90008-1","article-title":"Making and correcting errors during sentence comprehension: Eye movements in the analysis of structurally ambiguous sentences","volume":"14","author":"Frazier","year":"1982","journal-title":"Cogn. 
Psychol."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1207\/s1532799xssr1003_3","article-title":"Eye movements as reflections of comprehension processes in reading","volume":"10","author":"Rayner","year":"2006","journal-title":"Sci. Stud. Read."},{"key":"ref_53","unstructured":"Underwood, G. (1998). Chapter 3\u2014Eye Movements and Measures of Reading Time. Eye Guidance in Reading and Scene Perception, Elsevier Science Ltd."},{"key":"ref_54","first-page":"50780","article-title":"Using eye movements to evaluate the cognitive processes involved in text comprehension","volume":"10","author":"Raney","year":"2014","journal-title":"J. Vis. Exp."},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"500","DOI":"10.1080\/20445911.2015.1046877","article-title":"Developmental eye-tracking research in reading: Introduction to the special issue","volume":"27","author":"Schroeder","year":"2015","journal-title":"J. Cogn. Psychol."},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"477","DOI":"10.1016\/j.procs.2017.01.162","article-title":"Eye movement analyses for obtaining Readability Formula for Latvian texts for primary school","volume":"104","author":"Atvars","year":"2017","journal-title":"Procedia Comp. Sci."},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"8127","DOI":"10.1109\/JSEN.2019.2917834","article-title":"Readability analysis based on cognitive assessment using physiological sensing","volume":"19","author":"Sinha","year":"2019","journal-title":"IEEE Sens. J."},{"key":"ref_58","unstructured":"Zubov, V.I., and Petrova, T.E. (2020, January 16\u201318). Lexically or grammatically adapted texts: What is easier to process for secondary school children? 
Proceedings of the 24th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems, Petersburg, Russia."},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3571510","article-title":"Approaches, Methods, and Resources for Assessing the Readability of Arabic Texts","volume":"22","author":"Nassiri","year":"2023","journal-title":"ACM Trans. Asian Low Resour. Lang. Inf. Process."},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"263","DOI":"10.4236\/wjns.2013.34035","article-title":"Exploration of Arabic reading, in terms of the vocalization of the text form by registering the eyes movements of pupils","volume":"3","author":"Bensoltana","year":"2013","journal-title":"World J. Neurosci."},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1080\/17586801.2020.1798327","article-title":"Are alternative meanings of an Arabic homograph activated even when it is disambiguated by vowel diacritics?","volume":"11","author":"Maroun","year":"2020","journal-title":"Writ. Syst. Res."},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Awadh, F.H., Zoubrinetzky, R., Zaher, A., and Valdois, S. (2022). Visual attention span as a predictor of reading fluency and reading comprehension in Arabic. Front. Psychol., 13.","DOI":"10.3389\/fpsyg.2022.868530"},{"key":"ref_63","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1007\/s11145-020-10040-6","article-title":"Parsing written language with non-standard grammar: An eye-tracking study of case marking in Arabic","volume":"34","author":"Hallberg","year":"2021","journal-title":"Read. Writ."},{"key":"ref_64","unstructured":"Leung, T., Boush, F., Chen, Q., and Al Kaabi, M. (2021, January 26\u201329). Eye movements when reading spaced and unspaced texts in Arabic. 
Proceedings of the Annual Meeting of the Cognitive Science Society, Vienna, Austria."},{"key":"ref_65","doi-asserted-by":"crossref","first-page":"1030","DOI":"10.17507\/tpls.1206.02","article-title":"Role of morphology in visual word recognition: A parafoveal preview study in Arabic using eye-tracking","volume":"12","author":"Khateb","year":"2022","journal-title":"Theory Pract. Lang. Stud."},{"key":"ref_66","doi-asserted-by":"crossref","unstructured":"Hermena, E.W., Juma, E.J., and AlJassmi, M. (2021). Parafoveal processing of orthographic, morphological, and semantic information during reading Arabic: A boundary paradigm investigation. PLoS ONE, 16.","DOI":"10.1371\/journal.pone.0254745"},{"key":"ref_67","doi-asserted-by":"crossref","first-page":"2150003","DOI":"10.1142\/S271755452150003X","article-title":"Reading Process of Arab Children: An Eye-Tracking Study on Saudi Elementary Students","volume":"31","year":"2021","journal-title":"Int. J. Asian Lang. Process."},{"key":"ref_68","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/lnc3.12400","article-title":"Insights from the study of Arabic reading","volume":"14","author":"Hermena","year":"2020","journal-title":"Lang. Linguist. Compass."},{"key":"ref_69","doi-asserted-by":"crossref","first-page":"1079","DOI":"10.1007\/s11145-023-10424-4","article-title":"Eye-movement patterns in skilled Arabic readers: Effects of specific features of Arabic versus universal factors","volume":"37","author":"Lahoud","year":"2023","journal-title":"Read. Writ."},{"key":"ref_70","first-page":"75","article-title":"Efficient measuring of readability to improve documents accessibility for arabic language learners","volume":"19","author":"Bessou","year":"2021","journal-title":"J. Digit. Inf. Manag."},{"key":"ref_71","doi-asserted-by":"crossref","unstructured":"Nassiri, N., Cavalli-Sforza, V., and Lakhouaja, A. (2019, January 23\u201324). MoSAR: Modern Standard Arabic Readability Corpus for L1 Learners. 
Proceedings of the 4th International Conference on Big Data and Internet of Things (BDIoT\u201919), Rabat, Morocco.","DOI":"10.1145\/3372938.3372961"},{"key":"ref_72","unstructured":"El-Haj, M., and Rayson, P. (2016, January 23\u201328). OSMAN\u2015A Novel Arabic Readability Metric. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC\u201916), Portoro\u017e, Slovenia."},{"key":"ref_73","first-page":"168","article-title":"A corpus-based readability formula for estimate of Arabic texts reading difficulty","volume":"21","author":"Daud","year":"2013","journal-title":"World Appl. Sci. J."},{"key":"ref_74","doi-asserted-by":"crossref","unstructured":"Al Aqeel, S., Abanmy, N., Aldayel, A., Al-Khalifa, H., Al-Yahya, M., and Diab, M. (2018). Readability of written medicine information materials in Arabic language: Expert and consumer evaluation. BMC Health Serv. Res., 18.","DOI":"10.1186\/s12913-018-2944-x"},{"key":"ref_75","doi-asserted-by":"crossref","first-page":"122","DOI":"10.1016\/j.procs.2016.04.017","article-title":"Readability of Arabic Medicine Information Leaflets: A Machine Learning Approach","volume":"82","author":"Alotaibi","year":"2016","journal-title":"Procedia Comp. Sci."},{"key":"ref_76","unstructured":"Nassiri, N., Lakhouaja, A., and Cavalli-Sforza, V. (2020, January 21\u201326). Combining Classical and Non-classical Features to Improve Readability Measures for Arabic First Language Texts. Proceedings of the International Conference on Advanced Intelligent Systems for Sustainable Development, Tangier, Morocco."},{"key":"ref_77","unstructured":"Barrett, M. (2018). Improving Natural Language Processing with Human Data: Eye Tracking and Other Data Sources Reflecting Cognitive Text Processing. [Ph.D. 
Thesis, University of Copenhagen]."},{"key":"ref_78","doi-asserted-by":"crossref","first-page":"826","DOI":"10.3758\/s13428-017-0908-4","article-title":"The Provo Corpus: A large eye-tracking corpus with predictability norms","volume":"50","author":"Luke","year":"2018","journal-title":"Behav. Res. Methods."},{"key":"ref_79","doi-asserted-by":"crossref","unstructured":"Salicchi, L., Chersoni, E., and Lenci, A. (2023). A study on surprisal and semantic relatedness for eye-tracking data prediction. Front. Psychol., 14.","DOI":"10.3389\/fpsyg.2023.1112365"},{"key":"ref_80","unstructured":"Hollenstein, N., Troendle, M., Zhang, C., and Langer, N. (2020, January 11\u201316). ZuCo 2.0: A dataset of physiological recordings during natural reading and annotation. Proceedings of the Twelfth Language Resources and Evaluation Conference, Marseille, France."},{"key":"ref_81","unstructured":"Leal, S.E., Duran, M.S., and Alu\u00edsio, S. (2018, January 20\u201326). A nontrivial sentence corpus for the task of sentence readability assessment in Portuguese. Proceedings of the 27th International Conference on Computational Linguistics, Santa Fe, NM, USA."},{"key":"ref_82","doi-asserted-by":"crossref","first-page":"1333","DOI":"10.1007\/s10579-022-09609-0","article-title":"RastrOS Project: Natural Language Processing contributions to the development of an eye-tracking corpus with predictability norms for Brazilian Portuguese","volume":"56","author":"Leal","year":"2022","journal-title":"Lang. Resour. Eval."},{"key":"ref_83","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1038\/s41597-022-01464-6","article-title":"The database of eye-movement measures on words in Chinese reading","volume":"9","author":"Zhang","year":"2022","journal-title":"Sci. Data"},{"key":"ref_84","unstructured":"Hollenstein, N., Barrett, M., and Bj\u00f6rnsd\u00f3ttir, M. (2022, January 20\u201325). The Copenhagen Corpus of Eye Tracking Recordings from Natural Reading of Danish Texts. 
Proceedings of the Thirteenth Language Resources and Evaluation Conference, Marseille, France."},{"key":"ref_85","doi-asserted-by":"crossref","unstructured":"Kazai, G., Kamps, J., Koolen, M., and Milic-Frayling, N. (2011, January 24\u201328). Crowdsourcing for book search evaluation: Impact of hit design on comparative system ranking. Proceedings of the 34th International ACM SIGIR Conference on Research and development in Information Retrieval, Beijing, China.","DOI":"10.1145\/2009916.2009947"},{"key":"ref_86","unstructured":"Aker, A., El-Haj, M., Albakour, M.-D., and Kruschwitz, U. (2012, January 21\u201327). Assessing Crowdsourcing Quality through Objective Tasks. Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC\u201912), Istanbul, Turkey."},{"key":"ref_87","unstructured":"Vajjala, S., Majumder, B., Gupta, A., and Surana, H. (2020). Practical Natural Language Processing: A Comprehensive Guide to Building Real-World NLP Systems, O\u2019Reilly Media Inc."},{"key":"ref_88","unstructured":"(2021, June 27). Hindawi Foundation. Available online: http:\/\/www.hindawi.org\/."},{"key":"ref_89","unstructured":"Alrabiah, M., Alsalman, A., and Atwell, E. (2013, January 22). The design and construction of the 50 million words KSUCCA King Saud University Corpus of Classical Arabic. Proceedings of the WACL\u20192 Second Workshop on Arabic Corpus Linguistics, Lancaster University, UK."},{"key":"ref_90","first-page":"29","article-title":"MLAR: Machine Learning based System for Measuring the Readability of Online Arabic News","volume":"154","author":"Fouad","year":"2016","journal-title":"Int. J. Comput. Appl."},{"key":"ref_91","doi-asserted-by":"crossref","first-page":"857","DOI":"10.1007\/s11145-016-9704-2","article-title":"Validating self-paced sentence-by-sentence reading: Story comprehension, recall, and narrative transportation","volume":"30","author":"Peterson","year":"2017","journal-title":"Read. 
Writ."},{"key":"ref_92","unstructured":"Aldayel, A., Al-Khalifa, H., Alaqeel, S., Abanmy, N., Al-Yahya, M., and Diab, M. (2018, January 8). ARC-WMI: Towards Building Arabic Readability Corpus for Written Medicine Information. Proceedings of the 3rd Workshop on Open-Source Arabic Corpora and Processing Tools, Miyazaki, Japan."},{"key":"ref_93","unstructured":"Al Khalil, M., Habash, N., and Jiang, Z. (2020, January 11\u201316). A large-scale leveled readability lexicon for Standard Arabic. Proceedings of the 12th Language Resources and Evaluation Conference, Marseille, France."},{"key":"ref_94","unstructured":"Nielsen, J. (2022, December 15). Thinking Aloud: The #1 Usability Tool. Available online: https:\/\/www.nngroup.com\/articles\/thinking-aloud-the-1-usability-tool\/."},{"key":"ref_95","unstructured":"Leal, S.E., Vieira, J.M.M., dos Santos Rodrigues, E., Teixeira, E.N., and Alu\u00edsio, S. (2020, January 8\u201313). Using eye-tracking data to predict the readability of Brazilian Portuguese sentences in single-task, multi-task and sequential transfer learning approaches. Proceedings of the 28th International Conference on Computational Linguistics, Barcelona, Spain."},{"key":"ref_96","doi-asserted-by":"crossref","unstructured":"Callison-Burch, C. (2009, January 6\u20137). Fast, cheap, and creative: Evaluating translation quality using Amazon\u2019s Mechanical Turk. Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore.","DOI":"10.3115\/1699510.1699548"},{"key":"ref_97","unstructured":"Chen, Y., Zhang, W., Song, D., Zhang, P., Ren, Q., and Hou, Y. (2015, January 2). Inferring Document Readability by Integrating Text and Eye Movement Features. Proceedings of the SIGIR2015 Workshop on Neuro-Physiological Methods in IR Research, Santiago, Chile."},{"key":"ref_98","doi-asserted-by":"crossref","unstructured":"Biedert, R., Dengel, A., Elshamy, M., and Buscher, G. (2012, January 28\u201330). 
Towards robust gaze-based objective quality measures for text. Proceedings of the Symposium on Eye Tracking Research and Applications, Santa Barbara, CA, USA.","DOI":"10.1145\/2168556.2168593"},{"key":"ref_99","unstructured":"SR Research Ltd. (2020). SR Research Experiment Builder User Manual (Version 2.3.1), SR Research Ltd."},{"key":"ref_100","unstructured":"Bhandari, P. (2022, December 28). Central Tendency|Understanding the Mean, Median & Mode. Available online: https:\/\/www.scribbr.com\/statistics\/central-tendency\/."},{"key":"ref_101","unstructured":"(2023, August 20). Measures of Central Tendency. Available online: https:\/\/statistics.laerd.com\/statistical-guides\/measures-central-tendency-mean-mode-median.php."},{"key":"ref_102","doi-asserted-by":"crossref","first-page":"763","DOI":"10.1007\/978-981-16-3637-0_54","article-title":"Evaluating the Impact of Oversampling on Arabic L1 and L2 Readability Prediction Performances","volume":"Volume 237","author":"Nassiri","year":"2022","journal-title":"Networking, Intelligent Systems and Security"},{"key":"ref_103","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1155\/2022\/6774320","article-title":"English Text Readability Measurement Based on Convolutional Neural Network: A Hybrid Network Model","volume":"2022","author":"Jian","year":"2022","journal-title":"Comput. Intell. Neurosci."},{"key":"ref_104","doi-asserted-by":"crossref","first-page":"3789","DOI":"10.1016\/j.jksuci.2020.12.021","article-title":"Arabic L2 readability assessment: Dimensionality reduction study","volume":"34","author":"Nassiri","year":"2022","journal-title":"J. King Saud Univ. Comput. Inf. Sci."},{"key":"ref_105","unstructured":"Al Khalil, M., Saddiki, H., Habash, N., and Alfalasi, L. (2018, January 7\u201312). A leveled reading corpus of modern standard Arabic. 
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan."},{"key":"ref_106","unstructured":"Cavalli-Sforza, V., El Mezouar, M., and Saddiki, H. (2014, January 26\u201327). Matching an Arabic text to a learners\u2019 curriculum. Proceedings of the 2014 Fifth International Conference on Arabic Language Processing (CITALA 2014), Oujda, Morocco."},{"key":"ref_107","doi-asserted-by":"crossref","unstructured":"Salesky, E., and Shen, W. (2014, January 16). Exploiting Morphological, Grammatical, and Semantic Correlates for Improved Text Difficulty Assessment. Proceedings of the Ninth Workshop on Innovative Use of NLP for Building Educational Applications, Baltimore, MD, USA.","DOI":"10.3115\/v1\/W14-1819"},{"key":"ref_108","doi-asserted-by":"crossref","unstructured":"Saddiki, H., Bouzoubaa, K., and Cavalli-Sforza, V. (2015, January 17\u201320). Text readability for Arabic as a foreign language. Proceedings of the 2015 IEEE\/ACS 12th International Conference of Computer Systems and Applications (AICCSA), Marrakech, Morocco.","DOI":"10.1109\/AICCSA.2015.7507232"},{"key":"ref_109","unstructured":"Al Jarrah, E.Q. (2017). Using Language Features to Enhance Measuring the Readability of Arabic Text. [Master\u2019s Thesis, Yarmouk University]."},{"key":"ref_110","doi-asserted-by":"crossref","unstructured":"Mishra, A., and Bhattacharyya, P. (2017, January 4\u20139). Scanpath Complexity: Modeling Reading Effort Using Gaze Information. Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, San Francisco, CA, USA.","DOI":"10.1609\/aaai.v31i1.11159"},{"key":"ref_111","unstructured":"SR Research Ltd (2017). EyeLink Data Viewer User\u2019s Manual (Version 3.1.97), SR Research Ltd."},{"key":"ref_112","unstructured":"Scikit-Learn (2023, October 05). Scikit-Learn Machine Learning in Python. Available online: https:\/\/scikit-learn.org\/stable\/."},{"key":"ref_113","unstructured":"Brownlee, J. 
(2022, December 20). Ordinal and One-Hot Encodings for Categorical Data. Available online: https:\/\/machinelearningmastery.com\/one-hot-encoding-for-categorical-data\/."},{"key":"ref_114","doi-asserted-by":"crossref","first-page":"372","DOI":"10.1037\/0033-2909.124.3.372","article-title":"Eye movements in reading and information processing: 20 years of research","volume":"124","author":"Rayner","year":"1998","journal-title":"Psychol. Bull."},{"key":"ref_115","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1109\/TPAMI.2023.3321337","article-title":"Automatic Gaze Analysis: A Survey of Deep Learning Based Approaches","volume":"46","author":"Ghosh","year":"2024","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_116","doi-asserted-by":"crossref","unstructured":"Makowski, S., J\u00e4ger, L.A., Abdelwahab, A., Landwehr, N., and Scheffer, T. (2018, January 10\u201314). A discriminative model for identifying readers and assessing text comprehension from eye movements. Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases, Dublin, Ireland.","DOI":"10.1007\/978-3-030-10925-7_13"},{"key":"ref_117","unstructured":"Caruso, M., Peacock, C.E., Southwell, R., Zhou, G., and D'Mello, S.K. (2022, January 24\u201327). Going Deep and Far: Gaze-Based Models Predict Multiple Depths of Comprehension during and One Week Following Reading. Proceedings of the 15th International Conference on Educational Data Mining, International Educational Data Mining Society, Durham, UK."},{"key":"ref_118","doi-asserted-by":"crossref","unstructured":"Copeland, L., and Gedeon, T. (2013, January 2\u20135). Measuring reading comprehension using eye movements. 
Proceedings of the IEEE 4th International Conference on Cognitive Infocommunications (CogInfoCom), Budapest, Hungary.","DOI":"10.1109\/CogInfoCom.2013.6719207"},{"key":"ref_119","doi-asserted-by":"crossref","first-page":"35","DOI":"10.5430\/air.v3n3p35","article-title":"Predicting reading comprehension scores from eye movements using artificial neural networks and fuzzy output error","volume":"3","author":"Copeland","year":"2014","journal-title":"Artif. Intell. Res."},{"key":"ref_120","doi-asserted-by":"crossref","unstructured":"Sanches, C.L., Augereau, O., and Kise, K. (2017, January 9\u201315). Using the Eye Gaze to Predict Document Reading Subjective Understanding. Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Kyoto, Japan.","DOI":"10.1109\/ICDAR.2017.377"},{"key":"ref_121","doi-asserted-by":"crossref","unstructured":"Gonzalez-Garduno, A., and S\u00f8gaard, A. (2018, January 2\u20137). Learning to predict readability using eye-movement data from natives and learners. Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, LA, USA.","DOI":"10.1609\/aaai.v32i1.11978"},{"key":"ref_122","doi-asserted-by":"crossref","unstructured":"Sarti, G., Brunato, D., and Dell\u2019Orletta, F. (2021, January 10). That looks hard: Characterizing linguistic complexity in humans and language models. Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, Virtual.","DOI":"10.18653\/v1\/2021.cmcl-1.5"},{"key":"ref_123","unstructured":"Khallaf, N., and Sharoff, S. (2021, January 19). Automatic Difficulty Classification of Arabic Sentences. Proceedings of the Sixth Arabic Natural Language Processing Workshop (WANLP), Virtual, Kyiv, Ukraine."},{"key":"ref_124","doi-asserted-by":"crossref","unstructured":"Berrichi, S., Nassiri, N., Mazroui, A., and Lakhouaja, A. (2024). Exploring the Impact of Deep Learning Techniques on Evaluating Arabic L1 Readability. 
Artificial Intelligence, Data Science and Applications, Springer.","DOI":"10.1007\/978-3-031-48573-2_1"},{"key":"ref_125","doi-asserted-by":"crossref","unstructured":"Shen, W., Williams, J., Marius, T., and Salesky, E. (2013, January 8). A language-independent approach to automatic text difficulty assessment for second-language learners. Proceedings of the 2nd Workshop on Predicting and Improving Text Readability for Target Reader Populations, Sofia, Bulgaria.","DOI":"10.21236\/ADA595522"},{"key":"ref_126","doi-asserted-by":"crossref","unstructured":"Saddiki, H., Habash, N., Cavalli-Sforza, V., and Al Khalil, M. (2018, January 19). Feature optimization for predicting readability of Arabic L1 and L2. Proceedings of the 5th Workshop on Natural Language Processing Techniques for Educational Applications, Melbourne, Australia.","DOI":"10.18653\/v1\/W18-3703"},{"key":"ref_127","unstructured":"Forsyth, J.N. (2014). Automatic Readability Detection for Modern Standard Arabic. [Master\u2019s Thesis, Department of Linguistics and English Language, Brigham Young University]."},{"key":"ref_128","first-page":"36","article-title":"Interpreting the Relevance of Readability Prediction Features","volume":"9","author":"Berrichi","year":"2023","journal-title":"Jordanian J. Comput. Inf. 
Technol."},{"key":"ref_129","doi-asserted-by":"crossref","first-page":"504","DOI":"10.1007\/978-3-031-26254-8_73","article-title":"Impact of Feature Vectorization Methods on Arabic Text Readability Assessment","volume":"Volume 635","author":"Berrichi","year":"2023","journal-title":"Artificial Intelligence and Smart Environment (ICAISE 2022)"},{"key":"ref_130","first-page":"2041","article-title":"Developing Readability Computational Formula for Arabic Reading Materials Among Non-native Students in Malaysia","volume":"Volume 194","author":"Ghani","year":"2021","journal-title":"The Importance of New Technologies and Entrepreneurship in Business Development: In The Context of Economic Diversity in Developing Countries: The Impact of New Technologies and Entrepreneurship on Business Development"},{"key":"ref_131","unstructured":"Brooke, J., Tsang, V., Jacob, D., Shein, F., and Hirst, G. (2012, January 7). Building readability lexicons with unannotated corpora. Proceedings of the First Workshop on Predicting and Improving Text Readability for target reader populations, Montr\u00e9al, Canada."},{"key":"ref_132","doi-asserted-by":"crossref","unstructured":"Mahanama, B., Jayawardana, Y., Rengarajan, S., Jayawardena, G., Chukoskie, L., Snider, J., and Jayarathna, S. (2022). Eye movement and pupil measures: A review. Front. Comput. Sci., 3.","DOI":"10.3389\/fcomp.2021.733531"},{"key":"ref_133","doi-asserted-by":"crossref","unstructured":"Sharafi, Z., Shaffer, T., Sharif, B., and Gu\u00e9h\u00e9neuc, Y.-G. (2015, January 1\u20134). Eye-tracking metrics in software engineering. Proceedings of the 2015 Asia-Pacific Software Engineering Conference (APSEC), New Delhi, India.","DOI":"10.1109\/APSEC.2015.53"},{"key":"ref_134","doi-asserted-by":"crossref","unstructured":"Mohamad Shahimin, M., and Razali, A. (2019). An eye tracking analysis on diagnostic performance of digital fundus photography images between ophthalmologists and optometrists. Int. J. Environ. Res. 
Public Health, 17.","DOI":"10.3390\/ijerph17010030"}],"container-title":["Computation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2079-3197\/13\/5\/108\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T17:27:20Z","timestamp":1760030840000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2079-3197\/13\/5\/108"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,5,5]]},"references-count":134,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2025,5]]}},"alternative-id":["computation13050108"],"URL":"https:\/\/doi.org\/10.3390\/computation13050108","relation":{},"ISSN":["2079-3197"],"issn-type":[{"type":"electronic","value":"2079-3197"}],"subject":[],"published":{"date-parts":[[2025,5,5]]}}}