{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T14:27:46Z","timestamp":1763389666546,"version":"3.41.0"},"reference-count":88,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2023,3,25]],"date-time":"2023-03-25T00:00:00Z","timestamp":1679702400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2023,4,30]]},"abstract":"<jats:p>Text readability assessment is a well-known problem that has acquired even more importance in today\u2019s information-rich world. In this article, we survey various approaches to measuring and assessing the readability of texts. Our specific goal is to provide a perspective on the state-of-the-art in readability assessment research for Arabic, which differs significantly from other languages on which readability studies have tended to focus. We provide background on readability assessment research and tools for English, for which readability studies are the most advanced. We then survey approaches adopted for Arabic, both classical formula-based approaches and studies that combine Machine Learning (ML) with Natural Language Processing (NLP) techniques. The works we cover target text corpora for different audiences: school-age first language readers (L1), foreign language learners (L2), and adult readers in non-academic contexts. Therefore, we explore differences between reading in L1 and L2 and consider how they play out specifically in Arabic after describing language characteristics that may impact readability. Finally, we highlight challenges for Arabic readability research and propose multiple future directions to improve readability assessment and related applications that would benefit from more attention.<\/jats:p>","DOI":"10.1145\/3571510","type":"journal-article","created":{"date-parts":[[2022,11,17]],"date-time":"2022-11-17T15:05:16Z","timestamp":1668697516000},"page":"1-30","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Approaches, Methods, and Resources for Assessing the Readability of Arabic Texts"],"prefix":"10.1145","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8299-9422","authenticated-orcid":false,"given":"Naoual","family":"Nassiri","sequence":"first","affiliation":[{"name":"Department of Computer Science, Faculty of Sciences, University Mohamed First, Oujda, Morocco"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9877-0008","authenticated-orcid":false,"given":"Violetta","family":"Cavalli-Sforza","sequence":"additional","affiliation":[{"name":"School of Science and Engineering, AI Akhawayn University, Ifrane, Morocco"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9626-5377","authenticated-orcid":false,"given":"Abdelhak","family":"Lakhouaja","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Faculty of Sciences, University Mohamed First, Oujda, Morocco"}]}],"member":"320","published-online":{"date-parts":[[2023,3,25]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-015-9302-8"},{"key":"e_1_3_2_3_2","first-page":"1856","volume-title":"Proceedings of the 9th International Conference on Language Resources and Evaluation","author":"Abdelali Ahmed","year":"2014","unstructured":"Ahmed Abdelali, Francisco Guzman, Hassan Sajjad, and Stephan Vogel. 2014. The AMARA corpus: Building parallel language resources for the educational domain. In Proceedings of the 9th International Conference on Language Resources and Evaluation. European Language Resources Association (ELRA), Reykjavik, Iceland, 1856\u20131862. Retrieved from http:\/\/www.lrec-conf.org\/proceedings\/lrec2014\/pdf\/877_Paper.pdf."},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDIM.2008.4746711"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1186\/s12913-018-2944-x"},{"issue":"2","key":"e_1_3_2_6_2","first-page":"139","article-title":"Teachers\u2019 perceptions on the effectiveness of using arabic language teaching software in omani basic education","volume":"12","author":"Al-Busaidi Fatma","year":"2016","unstructured":"Fatma Al-Busaidi, Abdullah Alhashmi, Ali S. Al-Musawi, and Ali Kadhim. 2016. Teachers\u2019 perceptions on the effectiveness of using arabic language teaching software in omani basic education. International Journal of Education and Development Using Information and Communication Technology 12, 2 (2016), 139\u2013157.","journal-title":"International Journal of Education and Development Using Information and Communication Technology"},{"key":"e_1_3_2_7_2","first-page":"103","article-title":"Automatic readability measurements of the arabic text: An exploratory study","volume":"35","author":"Al-Khalifa Hend","year":"2010","unstructured":"Hend Al-Khalifa and Amani Al-Ajlan. 2010. Automatic readability measurements of the arabic text: An exploratory study. Arabian Journal for Science and Engineering 35, 2010 (2010), No. 2c, 103\u2013124.","journal-title":"Arabian Journal for Science and Engineering"},{"key":"e_1_3_2_8_2","first-page":"3053","volume-title":"Proceedings of the 12th Language Resources and Evaluation Conference","author":"Khalil Muhamed Al","year":"2020","unstructured":"Muhamed Al Khalil, Nizar Habash, and Zhengyang Jiang. 2020. A large-scale leveled readability lexicon for standard arabic. In Proceedings of the 12th Language Resources and Evaluation Conference. European Language Resources Association, Marseille, France, 3053\u20133062. Retrieved from https:\/\/www.aclweb.org\/anthology\/2020.lrec-1.373."},{"key":"e_1_3_2_9_2","volume-title":"Proceedings of the 11th International Conference on Language Resources and Evaluation","author":"Khalil Muhamed Al","year":"2018","unstructured":"Muhamed Al Khalil, Hind Saddiki, Nizar Habash, and Latifa Alfalasi. 2018. A leveled reading corpus of modern standard arabic. In Proceedings of the 11th International Conference on Language Resources and Evaluation. European Language Resources Association (ELRA), Miyazaki, Japan. Retrieved from https:\/\/www.aclweb.org\/anthology\/L18-1366."},{"key":"e_1_3_2_10_2","first-page":"1808","volume-title":"Proceedings of the 10th International Conference on Language Resources and Evaluation","author":"Al-Sulaiti Latifa","year":"2016","unstructured":"Latifa Al-Sulaiti, Noorhan Abbas, Claire Brierley, Eric Atwell, and Ayman Alghamdi. 2016. Compilation of an arabic children\u2019s corpus. In Proceedings of the 10th International Conference on Language Resources and Evaluation. European Language Resources Association (ELRA), Portoro\u017e, Slovenia, 1808\u20131812. Retrieved from https:\/\/www.aclweb.org\/anthology\/L16-1285."},{"key":"e_1_3_2_11_2","first-page":"370","article-title":"AARI: Automatic arabic readability index","volume":"11","author":"Al-Tamimi Abdel-Karim","year":"2014","unstructured":"Abdel-Karim Al-Tamimi, Manar Jaradat, Nuha Aljarrah, and Sahar Ghanim. 2014. AARI: Automatic arabic readability index. International Arab Journal of Information Technology 11, 4 (2014), 370\u2013378.","journal-title":"International Arab Journal of Information Technology"},{"key":"e_1_3_2_12_2","unstructured":"Sultan Almujaiwel. 2016. Free\/open KACSTAC and its processing tools: Lexical resources for arabic lexicogrammatical microstructures based on collocational indicators. In Proceedings of the Input a Word Analyze the World: Selected Approaches to Corpus Linguistics. Newcastle Upon Tyne: Cambridge Scholars Publishing (2016) 153\u2013170."},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2016.04.017"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.4103\/1658-631X.112937"},{"issue":"9","key":"e_1_3_2_15_2","doi-asserted-by":"crossref","first-page":"28","DOI":"10.5539\/elt.v9n9p28","article-title":"The effects of L2 reading skills on L1 reading skills through transfer.","volume":"9","author":"Altmisdort Gonca","year":"2016","unstructured":"Gonca Altmisdort. 2016. The effects of L2 reading skills on L1 reading skills through transfer. English Language Teaching 9, 9 (2016), 28\u201335.","journal-title":"English Language Teaching"},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.5555\/1866795.1866796"},{"issue":"6","key":"e_1_3_2_17_2","first-page":"490","article-title":"Lix and rix: Variations on a little-known readability index","volume":"26","author":"Anderson Jonathan","year":"1983","unstructured":"Jonathan Anderson. 1983. Lix and rix: Variations on a little-known readability index. Journal of Reading 26, 6 (1983), 490\u2013496. Retrieved from http:\/\/www.jstor.org\/stable\/40031755.","journal-title":"Journal of Reading"},{"issue":"2","key":"e_1_3_2_18_2","first-page":"37","article-title":"A comparative study on strategies of the children for L1 and L2 reading comprehension in K12","volume":"4","author":"Belet Dilek","year":"2008","unstructured":"Dilek Belet, Esim Gursoy, et\u00a0al. 2008. A comparative study on strategies of the children for L1 and L2 reading comprehension in K12. College Teaching Methods and Styles Journal 4, 2 (2008), 37\u201348.","journal-title":"College Teaching Methods and Styles Journal"},{"issue":"6","key":"e_1_3_2_19_2","article-title":"Decision tree analysis on j48 algorithm for data mining","volume":"3","author":"Bhargava Neeraj","year":"2013","unstructured":"Neeraj Bhargava, Girja Sharma, Ritu Bhargava, and Manish Mathuria. 2013. Decision tree analysis on j48 algorithm for data mining. In Proceedings of The International Journal of Advanced Research in Computer Science and Software Engineering 3, 6 (2013).","journal-title":"Proceedings of The International Journal of Advanced Research in Computer Science and Software Engineering"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.1109\/ISACV.2018.8354062"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-018-9528-3"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"issue":"1","key":"e_1_3_2_23_2","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1109\/TPC.1981.6447826","article-title":"Why readability formulas fail","author":"Bruce Bertram","year":"1981","unstructured":"Bertram Bruce, Andee Rubin, and Kathleen Starr. 1981. Why readability formulas fail. IEEE Transactions on Professional Communication1 (1981), 50\u201352.","journal-title":"IEEE Transactions on Professional Communication"},{"key":"e_1_3_2_24_2","doi-asserted-by":"crossref","DOI":"10.4324\/9780203883280","volume-title":"A Frequency Dictionary of Arabic: Core Vocabulary for Learners","author":"Buckwalter Tim","year":"2014","unstructured":"Tim Buckwalter and Dilworth Parkinson. 2014. A Frequency Dictionary of Arabic: Core Vocabulary for Learners. Routledge."},{"key":"e_1_3_2_25_2","article-title":"Matching an arabic text to a learners\u2019 curriculum","author":"Cavalli-Sforza Violetta","year":"2014","unstructured":"Violetta Cavalli-Sforza and Mariam El Mezouar. 2014. Matching an arabic text to a learners\u2019 curriculum. In Proceedings of the 5th International Conference on Arabic Language Processing.","journal-title":"In Proceedings of the 5th International Conference on Arabic Language Processing."},{"issue":"2","key":"e_1_3_2_26_2","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1017\/S0272263100007270","article-title":"The FSI\/ILR\/ACTFL proficiency scales and testing techniques: Development, current status, and needed research","volume":"10","author":"Clark John L. D.","year":"1988","unstructured":"John L. D. Clark and Ray T. Clifford. 1988. The FSI\/ILR\/ACTFL proficiency scales and testing techniques: Development, current status, and needed research. Studies in Second Language Acquisition 10, 2 (1988), 129\u2013147.","journal-title":"Studies in Second Language Acquisition"},{"issue":"1","key":"e_1_3_2_27_2","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1007\/s11845-010-0624-x","article-title":"Readability level of patient information leaflets for older people","volume":"180","author":"Cronin M.","year":"2011","unstructured":"M. Cronin, S. O\u2019Hanlon, and M. O\u2019Connor. 2011. Readability level of patient information leaflets for older people. Irish Journal of Medical Science 180, 1 (2011), 139\u2013142.","journal-title":"Irish Journal of Medical Science"},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.3758\/s13428-015-0651-7"},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.1080\/0163853X.2017.1296264"},{"key":"e_1_3_2_30_2","first-page":"37","article-title":"A formula for predicting readability: Instructions","author":"Dale Edgar","year":"1948","unstructured":"Edgar Dale and Jeanne S. Chall. 1948. A formula for predicting readability: Instructions. Educational Research Bulletin (1948), 37\u201354.","journal-title":"Educational Research Bulletin"},{"key":"e_1_3_2_31_2","first-page":"251","volume-title":"Proceedings of the Tutoring and Intelligent Tutoring Systems","author":"Dascalu Mihai","year":"2018","unstructured":"Mihai Dascalu, Scott A. Crossley, Danielle S. McNamara, Philippe Dessus, and Stefan Trausan-Matu. 2018. Please readerbench this text: A multi-dimensional textual complexity assessment framework. In Proceedings of the Tutoring and Intelligent Tutoring Systems. Nova Science Publishers, Inc., 251\u2013271."},{"key":"e_1_3_2_32_2","first-page":"168","article-title":"A corpus-based readability formula for estimate of arabic texts reading difficulty","volume":"21","author":"Daud Nuraihan Mat","year":"2013","unstructured":"Nuraihan Mat Daud, Haslina Hassan, and Normaziah Abdul Aziz. 2013. A corpus-based readability formula for estimate of arabic texts reading difficulty. World Applied Sciences Journal 21 (2013), 168\u2013173.","journal-title":"World Applied Sciences Journal"},{"key":"e_1_3_2_33_2","unstructured":"BAK Dawood. 1977. The relationship between readability and selected language variables. Tesis Unpublished Master Thesis (In Arabic) Iraq Baghdad University."},{"issue":"2","key":"e_1_3_2_34_2","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1075\/itl.165.2.03del","article-title":"Assessing document and sentence readability in less resourced languages and across textual genres","volume":"165","author":"Dell\u2019Orletta Felice","year":"2014","unstructured":"Felice Dell\u2019Orletta, Simonetta Montemagni, and Giulia Venturi. 2014. Assessing document and sentence readability in less resourced languages and across textual genres. ITL-International Journal of Applied Linguistics 165, 2 (2014), 163\u2013193.","journal-title":"ITL-International Journal of Applied Linguistics"},{"key":"e_1_3_2_35_2","volume-title":"Proceedings of the 15th Workshop on Innovative Use of NLP for Building Educational Applications","author":"Deutsch Tovly","year":"2020","unstructured":"Tovly Deutsch, Masoud Jasbi, and Stuart Shieber. 2020. Linguistic features for readability assessment. In Proceedings of the 15th Workshop on Innovative Use of NLP for Building Educational Applications."},{"issue":"12","key":"e_1_3_2_36_2","doi-asserted-by":"crossref","first-page":"1610","DOI":"10.1001\/jamaophthalmol.2013.5521","article-title":"Readability assessment of online ophthalmic patient information","volume":"131","author":"Edmunds Matthew R.","year":"2013","unstructured":"Matthew R. Edmunds, Robert J. Barry, and Alastair K. Denniston. 2013. Readability assessment of online ophthalmic patient information. JAMA Ophthalmology 131, 12 (2013), 1610\u20131616.","journal-title":"JAMA Ophthalmology"},{"key":"e_1_3_2_37_2","unstructured":"Mahmoud El-Haj and Paul Edward Rayson. 2016. OSMAN: A novel arabic readability metric. (2016). Retrieved from http:\/\/eprints.lancs.ac.uk\/78553\/."},{"key":"e_1_3_2_38_2","first-page":"229","volume-title":"Proceedings of the 12th Conference of the European Chapter of the ACL","author":"Feng Lijun","year":"2009","unstructured":"Lijun Feng, No\u00e9mie Elhadad, and Matt Huenerfauth. 2009. Cognitively motivated features for readability assessment. In Proceedings of the 12th Conference of the European Chapter of the ACL. 229\u2013237."},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.5555\/1944566.1944598"},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1037\/h0057532"},{"key":"e_1_3_2_41_2","article-title":"Automatic readability detection for modern standard arabic","author":"Forsyth Jonathan","year":"2014","unstructured":"Jonathan Forsyth. 2014. Automatic readability detection for modern standard arabic. Thesis Dissertation, Brigham Young University - Provo (2014). Retrieved from http:\/\/scholarsarchive.byu.edu\/etd\/3983.","journal-title":"Thesis Dissertation, Brigham Young University - Provo"},{"key":"e_1_3_2_42_2","first-page":"49","volume-title":"Proceedings of the 1st Workshop on Predicting and Improving Text Readability for Target Reader Populations","author":"Fran\u00e7ois Thomas","year":"2012","unstructured":"Thomas Fran\u00e7ois and Eleni Miltsakaki. 2012. Do NLP and machine learning improve traditional readability formulas?. In Proceedings of the 1st Workshop on Predicting and Improving Text Readability for Target Reader Populations. Association for Computational Linguistics, 49\u201357."},{"issue":"3","key":"e_1_3_2_43_2","first-page":"319","article-title":"Linguistic features for development of arabic text readability formula in malaysia: A preliminary study","volume":"19","author":"Ghani Kamarulzaman Abdul","year":"2014","unstructured":"Kamarulzaman Abdul Ghani, Ahmad Sabri Noh, and Nik Mohd Rahimi Yusoff. 2014. Linguistic features for development of arabic text readability formula in malaysia: A preliminary study. Middle-East Journal of Scientific Research 19, 3 (2014), 319\u2013331.","journal-title":"Middle-East Journal of Scientific Research"},{"key":"e_1_3_2_44_2","first-page":"334","volume-title":"Proceedings of the COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers","author":"Gonzalez-Dios Itziar","year":"2014","unstructured":"Itziar Gonzalez-Dios, Mar\u00eda Jes\u00fas Aranzabe, Arantza D\u00edaz de Ilarraza, and Haritz Salaberri. 2014. Simple or complex? Assessing the readability of basque texts. In Proceedings of the COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers. 334\u2013344."},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.2307\/3586977"},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.3758\/BF03195563"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.3758\/BF03195564"},{"key":"e_1_3_2_48_2","doi-asserted-by":"publisher","DOI":"10.1177\/002194366900600202"},{"key":"e_1_3_2_49_2","volume-title":"Proceedings of the Qatar Annual Research Conference","author":"Habash Nizar","year":"2013","unstructured":"Nizar Habash, Behrang Mohit, Ossama Obeid, Kemal Oflazer, Nadi Tomeh, and Wajdi Zaghouani. 2013. QALB: Qatar arabic language bank. In Proceedings of the Qatar Annual Research Conference. Doha, Qatar."},{"key":"e_1_3_2_50_2","unstructured":"Nizar Habash Owen Rambow and Ryan Roth. 2009. MADA+TOKAN: A toolkit for arabic tokenization diacritization morphological disambiguation POS tagging stemming and lemmatization. In Proceedings of the 2nd International Conference on Arabic Language Resources and Tools (MEDAR\u201909) . Cairo 62."},{"issue":"1","key":"e_1_3_2_51_2","first-page":"1","article-title":"Quality and readability of web-based arabic health information on COVID-19: An infodemiological study","volume":"21","author":"Halboub Esam","year":"2021","unstructured":"Esam Halboub, Mohammed Sultan Al-Ak\u2019hali, Hesham M. Al-Mekhlafi, and Mohammed Nasser Alhajj. 2021. Quality and readability of web-based arabic health information on COVID-19: An infodemiological study. BMC Public Health 21, 1 (2021), 1\u20137.","journal-title":"BMC Public Health"},{"key":"e_1_3_2_52_2","volume-title":"Proceedings of the 9th International Conference on Spoken Language Processing","author":"Heilman Michael","year":"2006","unstructured":"Michael Heilman, Kevyn Collins-Thompson, Jamie Callan, and Maxine Eskenazi. 2006. Classroom success of an intelligent tutoring system for lexical practice and reading comprehension. In Proceedings of the 9th International Conference on Spoken Language Processing."},{"key":"e_1_3_2_53_2","first-page":"611","volume-title":"Proceedings of the International Conference on Recent Advances in Natural Language Processing","author":"Imperial Joseph Marvin","year":"2021","unstructured":"Joseph Marvin Imperial. 2021. BERT embeddings for automatic readability assessment. In Proceedings of the International Conference on Recent Advances in Natural Language Processing. 611\u2013618."},{"key":"e_1_3_2_54_2","article-title":"USAID (2015) research on reading in morocco: Analysis of the national education curriculum and textbooks. final report","author":"International RTI","year":"2015","unstructured":"RTI International and Al Akhawayn University in Ifrane. 2015. USAID (2015) research on reading in morocco: Analysis of the national education curriculum and textbooks. final report. USAID. Part 1 (Curriculum Analysis), Part 2 (Textbook Analysis, Parts A and B) (2015).","journal-title":"USAID. Part 1 (Curriculum Analysis), Part 2 (Textbook Analysis, Parts A and B)"},{"issue":"1","key":"e_1_3_2_55_2","first-page":"1","article-title":"Evaluating breast cancer websites targeting arabic speakers: Empirical investigation of popularity, availability, accessibility, readability, and quality","volume":"22","author":"Jasem Zahraa","year":"2022","unstructured":"Zahraa Jasem, Zainab AlMeraj, and Dari Alhuwail. 2022. Evaluating breast cancer websites targeting arabic speakers: Empirical investigation of popularity, availability, accessibility, readability, and quality. BMC Medical Informatics and Decision Making 22, 1 (2022), 1\u201315.","journal-title":"BMC Medical Informatics and Decision Making"},{"key":"e_1_3_2_56_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.coling-demos.11"},{"key":"e_1_3_2_57_2","volume-title":"Proceedings of the 53rd Hawaii International Conference on System Sciences","author":"Kauchak David","year":"2020","unstructured":"David Kauchak and Gondy Leroy. 2020. A web-based medical text simplification tool. In Proceedings of the 53rd Hawaii International Conference on System Sciences."},{"key":"e_1_3_2_58_2","unstructured":"Nouran Khallaf and Serge Sharoff. 2021. Automatic difficulty classification of arabic sentences. arXiv:2103.04386. Retrieved from https:\/\/arxiv.org\/abs\/2103.04386."},{"key":"e_1_3_2_59_2","doi-asserted-by":"crossref","unstructured":"J. Peter Kincaid Robert P. Fishburne Jr Richard L. Rogers and Brad S. Chissom. 1975. Derivation of new readability formulas (automated readability index fog count and flesch reading ease formula) for navy enlisted personnel. Naval Technical Training Command Millington TN Research Branch.","DOI":"10.21236\/ADA006655"},{"key":"e_1_3_2_60_2","unstructured":"George Roger Klare et\u00a0al. 1963. Measurement of readability. (1963)."},{"key":"e_1_3_2_61_2","unstructured":"Bruce W. Lee Yoo Sung Jang and Jason Hyung-Jong Lee. 2021. Pushing on text readability assessment: A transformer meets handcrafted linguistic features. arXiv:2109.12258. Retrieved from https:\/\/arxiv.org\/abs\/2109.12258."},{"key":"e_1_3_2_62_2","unstructured":"Louis Martin Angela Fan \u00c9ric de la Clergerie Antoine Bordes and Beno\u00eet Sagot. 2020. Multilingual unsupervised sentence simplification. arXiv:2005.00352. Retrieved from https:\/\/arxiv.org\/abs\/2005.00352."},{"key":"e_1_3_2_63_2","doi-asserted-by":"publisher","DOI":"10.1162\/coli_a_00398"},{"issue":"8","key":"e_1_3_2_64_2","first-page":"639","article-title":"SMOG grading-a new readability formula","volume":"12","author":"Laughlin G. Harry Mc","year":"1969","unstructured":"G. Harry Mc Laughlin. 1969. SMOG grading-a new readability formula. Journal of Reading 12, 8 (1969), 639\u2013646.","journal-title":"Journal of Reading"},{"key":"e_1_3_2_65_2","doi-asserted-by":"crossref","first-page":"188","DOI":"10.4018\/978-1-60960-741-8.ch011","volume-title":"Proceedings of the Applied Natural Language Processing: Identification, Investigation, and Resolution","author":"McNamara Danielle S.","year":"2012","unstructured":"Danielle S. McNamara and Arthur C. Graesser. 2012. Coh-metrix: An automated tool for theoretical and applied natural language processing. In Proceedings of the Applied Natural Language Processing: Identification, Investigation, and Resolution. IGI Global, 188\u2013205."},{"key":"e_1_3_2_66_2","doi-asserted-by":"publisher","DOI":"10.5555\/2655323"},{"key":"e_1_3_2_67_2","unstructured":"Hamid Mohammadi and Seyed Hossein Khasteh. 2019. Text as environment: A deep reinforcement learning text readability assessment model. arXiv:1912.05957. Retrieved from https:\/\/arxiv.org\/abs\/1912.05957."},{"key":"e_1_3_2_68_2","first-page":"1","volume-title":"Proceedings of the 4th International Conference on Big Data and Internet of Things","author":"Nassiri Naoual","year":"2019","unstructured":"Naoual Nassiri, Violetta Cavalli-Sforza, and Abdelhak Lakhouaja. 2019. MoSAR: Modern standard arabic readability corpus for L1 learners. In Proceedings of the 4th International Conference on Big Data and Internet of Things. 1\u20137."},{"key":"e_1_3_2_69_2","doi-asserted-by":"crossref","first-page":"480","DOI":"10.1007\/978-3-319-91947-8_49","volume-title":"Proceedings of the Natural Language Processing and Information Systems","author":"Nassiri Naoual","year":"2018","unstructured":"Naoual Nassiri, Abdelhak Lakhouaja, and Violetta Cavalli-Sforza. 2018. Arabic readability assessment for foreign language learners. In Proceedings of the Natural Language Processing and Information Systems. Max Silberztein, Faten Atigui, Elena Kornyshova, Elisabeth M\u00e9tais, and Farid Meziane (Eds.), Springer International Publishing, 480\u2013488."},{"key":"e_1_3_2_70_2","first-page":"120","volume-title":"Proceedings of the Arabic Language Processing: From Theory to Practice","author":"Nassiri Naoual","year":"2018","unstructured":"Naoual Nassiri, Abdelhak Lakhouaja, and Violetta Cavalli-Sforza. 2018. Modern standard arabic readability prediction. In Proceedings of the Arabic Language Processing: From Theory to Practice. Abdelmonaime Lachkar, Karim Bouzoubaa, Azzedine Mazroui, Abdelfettah Hamdani, and Abdelhak Lekhouaja (Eds.), Springer International Publishing, 120\u2013133."},{"key":"e_1_3_2_71_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jksuci.2020.12.021"},{"key":"e_1_3_2_72_2","first-page":"463","volume-title":"Proceedings of the Advanced Intelligent Systems for Sustainable Development.","author":"Nassiri Naoual","year":"2022","unstructured":"Naoual Nassiri, Abdelhak Lakhouaja, and Violetta Cavalli-Sforza. 2022. Combining classical and non-classical features to improve readability measures for arabic first language texts. In Proceedings of the Advanced Intelligent Systems for Sustainable Development.Janusz Kacprzyk, Valentina E. Balas, and Mostafa Ezziyyani (Eds.), Springer International Publishing, Cham, 463\u2013470."},{"key":"e_1_3_2_73_2","doi-asserted-by":"publisher","DOI":"10.1007\/s40593-014-0029-5"},{"key":"e_1_3_2_74_2","first-page":"1094","volume-title":"Proceedings of the LREC","author":"Pasha Arfath","year":"2014","unstructured":"Arfath Pasha, Mohamed Al-Badrashiny, Mona T. Diab, Ahmed El Kholy, Ramy Eskander, Nizar Habash, Manoj Pooleery, Owen Rambow, and Ryan Roth. 2014. MADAMIRA: A fast, comprehensive tool for morphological analysis and disambiguation of arabic. In Proceedings of the LREC. 1094\u20131101."},{"key":"e_1_3_2_75_2","unstructured":"Xinying Qiu Yuan Chen Hanwu Chen Jian-Yun Nie Yuming Shen and Dawei Lu. 2021. Learning syntactic dense embedding with correlation graph for automatic readability assessment. arXiv:2107.04268. Retrieved from https:\/\/arxiv.org\/abs\/2107.04268."},{"key":"e_1_3_2_76_2","doi-asserted-by":"publisher","DOI":"10.1145\/344599.344637"},{"key":"e_1_3_2_77_2","doi-asserted-by":"publisher","DOI":"10.1109\/AICCSA.2015.7507232"},{"key":"e_1_3_2_78_2","doi-asserted-by":"crossref","first-page":"20","DOI":"10.18653\/v1\/W18-3703","volume-title":"Proceedings of the 5th Workshop on Natural Language Processing Techniques for Educational Applications","author":"Saddiki Hind","year":"2018","unstructured":"Hind Saddiki, Nizar Habash, Violetta Cavalli-Sforza, and Muhamed Al Khalil. 2018. Feature optimization for predicting readability of arabic L1 and L2. In Proceedings of the 5th Workshop on Natural Language Processing Techniques for Educational Applications. Association for Computational Linguistics, Melbourne, Australia, 20\u201329. Retrieved from http:\/\/aclweb.org\/anthology\/W18-3703."},{"key":"e_1_3_2_79_2","first-page":"1404","volume-title":"Proceedings of the 12th Language Resources and Evaluation Conference","author":"Santos Roney","year":"2020","unstructured":"Roney Santos, Gabriela Pedro, Sidney Leal, Oto Vale, Thiago Pardo, Kalina Bontcheva, and Carolina Scarton. 2020. Measuring the impact of readability features in fake news detection. In Proceedings of the 12th Language Resources and Evaluation Conference. European Language Resources Association, Marseille, France, 1404\u20131413."},{"key":"e_1_3_2_80_2","volume-title":"Automated Readability Index","author":"Senter R. J.","year":"1967","unstructured":"R. J. Senter and Edgar A. Smith. 1967. Automated Readability Index. Technical Report. CINCINNATI UNIV OH."},{"key":"e_1_3_2_81_2","doi-asserted-by":"publisher","DOI":"10.1109\/72.870050"},{"issue":"10","key":"e_1_3_2_82_2","first-page":"4","article-title":"A comparison of L1 and L2 reading: Cultural differences and schema","volume":"4","author":"Singhal Meena","year":"1998","unstructured":"Meena Singhal. 1998. A comparison of L1 and L2 reading: Cultural differences and schema. The Internet TESL Journal 4, 10 (1998), 4\u201310.","journal-title":"The Internet TESL Journal"},{"issue":"1","key":"e_1_3_2_83_2","first-page":"i\u201324","article-title":"Identification Of Sub-skills Of Reading Comprehension By Maximum Likelihood Factor Analysis 1","volume":"1972","author":"Spearritt Donald","year":"1972","unstructured":"Donald Spearritt. 1972. Identification Of Sub-skills Of Reading Comprehension By Maximum Likelihood Factor Analysis 1. ETS Research Bulletin Series 1972, 1 (1972), i\u201324.","journal-title":"ETS Research Bulletin Series"},{"key":"e_1_3_2_84_2","unstructured":"Edward L. Thorndike. 1921. The teacher\u2019s word book. (1921)."},{"key":"e_1_3_2_85_2","unstructured":"Sowmya Vajjala. 2021. Trends limitations and open challenges in automatic readability assessment research. arXiv:2105.00973. Retrieved from https:\/\/arxiv.org\/abs\/2105.00973."},{"key":"e_1_3_2_86_2","first-page":"163","volume-title":"Proceedings of the 7th Workshop on Building Educational Applications Using NLP","author":"Vajjala Sowmya","year":"2012","unstructured":"Sowmya Vajjala and Detmar Meurers. 2012. On improving the accuracy of readability classification using insights from second language acquisition. In Proceedings of the 7th Workshop on Building Educational Applications Using NLP. Association for Computational Linguistics, 163\u2013173."},{"key":"e_1_3_2_87_2","volume-title":"Learning Purpose and Language use","author":"Widdowson Henry George","year":"1983","unstructured":"Henry George Widdowson. 1983. Learning Purpose and Language use. Oxford University Press."},{"key":"e_1_3_2_88_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.dib.2017.01.011"},{"key":"e_1_3_2_89_2","unstructured":"George Kingsley Zipf. 1949. Human behavior and the principle of least effort. Ravenio Books 2016."}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3571510","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3571510","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:50:54Z","timestamp":1750182654000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3571510"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3,25]]},"references-count":88,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2023,4,30]]}},"alternative-id":["10.1145\/3571510"],"URL":"https:\/\/doi.org\/10.1145\/3571510","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"type":"print","value":"2375-4699"},{"type":"electronic","value":"2375-4702"}],"subject":[],"published":{"date-parts":[[2023,3,25]]},"assertion":[{"value":"2021-09-13","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-11-06","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-03-25","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}