{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T08:19:55Z","timestamp":1774945195686,"version":"3.50.1"},"reference-count":190,"publisher":"Association for Computing Machinery (ACM)","issue":"8","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2025,8,31]]},"abstract":"<jats:p>Advances in language models have enabled significant strides in developing language technologies tailored for analyzing and processing Dialectical Arabic (DA), which exhibits unique linguistic features and variations compared to standard Arabic. This progress has sparked a surge of interest in various research tasks within the Arabic Natural Language Processing (ANLP) domain, encompassing areas such as sentiment analysis, dialect identification, normalization and classification, fake news detection, and part-of-speech tagging. The primary objective of this survey paper is to provide a comprehensive overview of the advancements made in dialectical ANLP from 2014 to 2024. A thorough analysis is undertaken, covering a corpus of approximately 200 research papers, to offer insights into the latest developments, resources, and applications concerning dialectical Arabic. By identifying and discussing the challenges and opportunities for future research, this study aspires to serve as a valuable reference for researchers, practitioners, and enthusiasts interested in the subject matter. Central to the investigation are the recent strides in natural language processing techniques that pertain to dialectical Arabic, namely DA sentiment analysis, DA identification, DA classification, DA normalization, DA part-of-speech tagging, and the role of DA in fake news detection, among other applications. Each research category is meticulously examined, providing a comprehensive understanding of their respective contributions, significance, encountered challenges, and the availability of pertinent datasets. This exhaustive survey paper encompasses existing studies within dialectical Arabic research categories. As a result, readers are presented with a detailed reference source in pursuing advancements and innovations within this field.<\/jats:p>","DOI":"10.1145\/3747290","type":"journal-article","created":{"date-parts":[[2025,7,3]],"date-time":"2025-07-03T07:22:43Z","timestamp":1751527363000},"page":"1-45","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["A Survey on Dialect Arabic Processing and Analysis: Recent Advances and Future Trends"],"prefix":"10.1145","volume":"24","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4561-2185","authenticated-orcid":false,"given":"Abdelghani","family":"Dahou","sequence":"first","affiliation":[{"name":"Zhejiang Normal University","place":["Jinhua, China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8793-2465","authenticated-orcid":false,"given":"Abdelhalim Hafedh","family":"Dahou","sequence":"additional","affiliation":[{"name":"GESIS - Leibniz Institute for the Social Sciences","place":["Mannheim, Germany"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2760-5547","authenticated-orcid":false,"given":"Mohamed Amine","family":"Cheragui","sequence":"additional","affiliation":[{"name":"Universit\u00e9 Africaine Ahmed Draia Adrar","place":["Adrar, Algeria"]}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-8903-3137","authenticated-orcid":false,"given":"Amin","family":"Abdedaiem","sequence":"additional","affiliation":[{"name":"Universit\u00e9 Africaine Ahmed Draia Adrar","place":["Adrar, Algeria"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6956-7641","authenticated-orcid":false,"given":"Mohammed A. A.","family":"Al-Qaness","sequence":"additional","affiliation":[{"name":"Zhejiang Normal University","place":["Jinhua, China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7682-6269","authenticated-orcid":false,"given":"Mohamed","family":"Abd Elaziz","sequence":"additional","affiliation":[{"name":"Zagazig University","place":["Zagazig, Egypt"]},{"name":"Faculty of computer science and engineering, galala university","place":["Zagazig, Egypt"]},{"name":"Artificial Intelligence Research Center (AIRC), College of Engineering and Information Technology, Ajman University","place":["Zagazig, Egypt"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0666-7055","authenticated-orcid":false,"given":"Ahmed A.","family":"Ewees","sequence":"additional","affiliation":[{"name":"Damietta University","place":["New Damietta, Egypt"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5271-9215","authenticated-orcid":false,"given":"Zhonglong","family":"Zheng","sequence":"additional","affiliation":[{"name":"Zhejiang Normal University","place":["Jinhua, China"]},{"name":"Zhejiang Key Laboratory of Intelligent Education Technology and Application, Zhejiang Normal University","place":["Jinhua, China"]}]}],"member":"320","published-online":{"date-parts":[[2025,8,21]]},"reference":[{"issue":"2","key":"e_1_3_2_2_2","doi-asserted-by":"crossref","first-page":"419","DOI":"10.1007\/s10579-019-09454-8","article-title":"Dzdc12: A new multipurpose parallel algerian arabizi\u2013french code-switched corpus","volume":"54","author":"Abainia Kheireddine","year":"2020","unstructured":"Kheireddine Abainia. 2020. Dzdc12: A new multipurpose parallel algerian arabizi\u2013french code-switched corpus. Language Resources and Evaluation 54, 2 (2020), 419\u2013455.","journal-title":"Language Resources and Evaluation"},{"key":"e_1_3_2_3_2","article-title":"Quantum artificial hummingbird algorithm for feature selection of social IoT","author":"Elaziz Mohamed Abd","year":"2023","unstructured":"Mohamed Abd Elaziz, Abdelghani Dahou, Mohammed Azmi Al-Betar, Shaker El-Sappagh, Diego Oliva, and Ahmad O Aseeri. 2023. Quantum artificial hummingbird algorithm for feature selection of social IoT. IEEE Access 11 (2023), 66257\u201366278.","journal-title":"IEEE Access"},{"issue":"2","key":"e_1_3_2_4_2","doi-asserted-by":"crossref","first-page":"258","DOI":"10.3390\/math11020258","article-title":"A hybrid multitask learning framework with a fire hawk optimizer for arabic fake news detection","volume":"11","author":"Elaziz Mohamed Abd","year":"2023","unstructured":"Mohamed Abd Elaziz, Abdelghani Dahou, Dina Ahmed Orabi, Samah Alshathri, Eman M Soliman, and Ahmed A Ewees. 2023. A hybrid multitask learning framework with a fire hawk optimizer for arabic fake news detection. Mathematics 11, 2 (2023), 258.","journal-title":"Mathematics"},{"key":"e_1_3_2_5_2","unstructured":"Amine Abdaoui Mohamed Berrimi Mourad Oussalah and Abdelouahab Moussaoui. 2021. DziriBERT: A Pre-trained language model for the algerian dialect. arXiv preprint arXiv:2109.12346 (2021)."},{"issue":"72","key":"e_1_3_2_6_2","doi-asserted-by":"crossref","first-page":"178","DOI":"10.4114\/intartif.vol26iss72pp178-201","article-title":"Fake news detection in low resource languages using setfit framework","volume":"26","author":"Abdedaiem Amin","year":"2023","unstructured":"Amin Abdedaiem, Abdelhalim Hafedh Dahou, and Mohamed Amine Cheragui. 2023. Fake news detection in low resource languages using setfit framework. Inteligencia Artificial 26, 72 (2023), 178\u2013201.","journal-title":"Inteligencia Artificial"},{"key":"e_1_3_2_7_2","first-page":"11","volume-title":"Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations","author":"Abdelali Ahmed","year":"2016","unstructured":"Ahmed Abdelali, Kareem Darwish, Nadir Durrani, and Hamdy Mubarak. 2016. Farasa: A fast and furious segmenter for arabic. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations. 11\u201316."},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","unstructured":"Ahmed Abdelali Hamdy Mubarak Shammur Chowdhury Maram Hasanain Basel Mousi Sabri Boughorbel Samir Abdaljalil Yassine El Kheir Daniel Izham Fahim Dalvi Majd Hawasly Nizi Nazar Youssef Elshahawy Ahmed Ali Nadir Durrani Natasa Milic-Frayling and Firoj Alam. 2024. LAraBench: Benchmarking arabic AI with large language models. In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers) Yvette Graham and Matthew Purver (Eds.). Association for Computational Linguistics St. Julian\u2019s Malta 487\u2013520. DOI:10.18653\/v1\/2024.eacl-long.30","DOI":"10.18653\/v1\/2024.eacl-long.30"},{"key":"e_1_3_2_9_2","first-page":"1","volume-title":"Proceedings of the Sixth Arabic Natural Language Processing Workshop","author":"Abdelali Ahmed","year":"2021","unstructured":"Ahmed Abdelali, Hamdy Mubarak, Younes Samih, Sabit Hassan, and Kareem Darwish. 2021. QADI: Arabic dialect identification in the wild. In Proceedings of the Sixth Arabic Natural Language Processing Workshop. 1\u201310."},{"key":"e_1_3_2_10_2","doi-asserted-by":"crossref","unstructured":"Abdelhalim Hafedh Dahou and Mohamed Amine Cheragui. 2023. Dzner: A large algerian named entity recognition dataset. Natural Language Processing Journal 3 (2023) 100005.","DOI":"10.1016\/j.nlp.2023.100005"},{"issue":"3","key":"e_1_3_2_11_2","first-page":"777","article-title":"Using tweets and emojis to build tead: An arabic dataset for sentiment analysis","volume":"22","author":"Abdellaoui Houssem","year":"2018","unstructured":"Houssem Abdellaoui and Mounir Zrigui. 2018. Using tweets and emojis to build tead: An arabic dataset for sentiment analysis. Computaci\u00f3n y Sistemas 22, 3 (2018), 777\u2013786.","journal-title":"Computaci\u00f3n y Sistemas"},{"key":"e_1_3_2_12_2","first-page":"1","volume-title":"2019 International Conference on Intelligent Systems and Advanced Computing Sciences (ISACS)","author":"Abdelli Adel","year":"2019","unstructured":"Adel Abdelli, Fay\u00e7al Guerrouf, Okba Tibermacine, and Belkacem Abdelli. 2019. Sentiment analysis of arabic algerian dialect using a supervised method. In 2019 International Conference on Intelligent Systems and Advanced Computing Sciences (ISACS). IEEE, 1\u20136."},{"key":"e_1_3_2_13_2","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1109\/MIUCC52538.2021.9447621","volume-title":"2021 International Mobile, Intelligent, and Ubiquitous Computing Conference (MIUCC)","author":"AbdElminaam Diaa Salama","year":"2021","unstructured":"Diaa Salama AbdElminaam, Nabil Neggaz, Ibrahim Abd Elatif Gomaa, Fatma Helmy Ismail, and Ahmed Elsawy. 2021. Aom-mpa: Arabic opinion mining using marine predators algorithm based feature selection. In 2021 International Mobile, Intelligent, and Ubiquitous Computing Conference (MIUCC). IEEE, 395\u2013402."},{"key":"e_1_3_2_14_2","volume-title":"Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)","author":"Abdul-Mageed Muhammad","year":"2018","unstructured":"Muhammad Abdul-Mageed, Hassan Alhuzali, and Mohamed Elaraby. 2018. You tweet what you speak: A city-level dataset of arabic dialects. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)."},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","unstructured":"Muhammad Abdul-Mageed AbdelRahim Elmadany and El Moatez Billah Nagoudi. 2021. ARBERT & MARBERT: Deep bidirectional transformers for arabic. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) Chengqing Zong Fei Xia Wenjie Li and Navigli Roberto (Eds.). Association for Computational Linguistics Online 7088\u20137105. DOI:10.18653\/v1\/2021.acl-long.551","DOI":"10.18653\/v1\/2021.acl-long.551"},{"key":"e_1_3_2_16_2","first-page":"97","volume-title":"Proceedings of the Fifth Arabic Natural Language Processing Workshop","author":"Abdul-Mageed Muhammad","year":"2020","unstructured":"Muhammad Abdul-Mageed, Chiyu Zhang, Houda Bouamor, and Nizar Habash. 2020. NADI 2020: The first nuanced arabic dialect identification shared task. In Proceedings of the Fifth Arabic Natural Language Processing Workshop. 97\u2013110."},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","unstructured":"Muhammad Abdul-Mageed Chiyu Zhang AbdelRahim Elmadany and Lyle Ungar. 2020. Toward micro-dialect identification in diaglossic and code-switched environments. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) Bonnie Webber Trevor Cohn Yulan He and Yang Liu (Eds.). Association for Computational Linguistics Online 5855\u20135876. DOI:10.18653\/v1\/2020.emnlp-main.472","DOI":"10.18653\/v1\/2020.emnlp-main.472"},{"issue":"2","key":"e_1_3_2_18_2","first-page":"403","article-title":"Comparative analysis of ML POS on arabic tweets","volume":"95","author":"Abdulkareem Mustafa","year":"2017","unstructured":"Mustafa Abdulkareem and Sabrina Tiun. 2017. Comparative analysis of ML POS on arabic tweets. Journal of Theoretical and Applied Information Technology 95, 2 (2017), 403.","journal-title":"Journal of Theoretical and Applied Information Technology"},{"key":"e_1_3_2_19_2","doi-asserted-by":"crossref","first-page":"835","DOI":"10.1109\/ICMLA.2018.00134","volume-title":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","author":"Abdullah Malak","year":"2018","unstructured":"Malak Abdullah, Mirsad Hadzikadicy, and Samira Shaikhz. 2018. SEDAT: Sentiment and emotion detection in Arabic text using CNN-LSTM deep learning. In 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA). IEEE, 835\u2013840."},{"issue":"12","key":"e_1_3_2_20_2","doi-asserted-by":"crossref","first-page":"345","DOI":"10.3390\/a13120345","article-title":"Nature-inspired optimization algorithms for text document clustering-a comprehensive analysis","volume":"13","author":"Abualigah Laith","year":"2020","unstructured":"Laith Abualigah, Amir H Gandomi, Mohamed Abd Elaziz, Abdelazim G Hussien, Ahmad M Khasawneh, Mohammad Alshinwan, and Essam H Houssein. 2020. Nature-inspired optimization algorithms for text document clustering-a comprehensive analysis. Algorithms 13, 12 (2020), 345.","journal-title":"Algorithms"},{"key":"e_1_3_2_21_2","first-page":"1","volume-title":"2020 International Conference on Computer, Control, Electrical, and Electronics Engineering (ICCCEEE)","author":"Abuuznien Shahad","year":"2021","unstructured":"Shahad Abuuznien, Zena Abdelmohsin, Ehsan Abdu, and Izzeldein Amin. 2021. Sentiment analysis for sudanese arabic dialect using comparative supervised learning approach. In 2020 International Conference on Computer, Control, Electrical, and Electronics Engineering (ICCCEEE). IEEE, 1\u20136."},{"key":"e_1_3_2_22_2","first-page":"312","volume-title":"Proceedings of the Sixth Arabic Natural Language Processing Workshop","author":"Abuzayed Abeer","year":"2021","unstructured":"Abeer Abuzayed and Hend Al-Khalifa. 2021. Sarcasm and sentiment detection in arabic tweets using BERT-based models and data augmentation. In Proceedings of the Sixth Arabic Natural Language Processing Workshop. 312\u2013317."},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","unstructured":"Murtadha Ahmed Saghir Alfasly Bo Wen Jamal Addeen Mohammed Ahmed and Yunfeng Liu. 2024. AlclaM: Arabic Dialect Language Model. In Proceedings of the Second Arabic Natural Language Processing Conference Nizar Habash Houda Bouamor Ramy Eskander Nadi Tomeh Ibrahim Abu Farha Ahmed Abdelali Samia Touileb Injy Hamed Yaser Onaizan Bashar Alhafni Wissam Antoun Salam Khalifa Hatem Haddad Imed Zitouni Badr AlKhamissi Rawan Almatham and Khalil Mrini (Eds.). Association for Computational Linguistics Bangkok Thailand 153\u2013159. DOI:10.18653\/v1\/2024.arabicnlp-1.14","DOI":"10.18653\/v1\/2024.arabicnlp-1.14"},{"issue":"2","key":"e_1_3_2_24_2","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1007\/s42979-022-01499-x","article-title":"Automatic building of a large arabic spelling error corpus","volume":"4","author":"Aichaoui Shaimaa Ben","year":"2022","unstructured":"Shaimaa Ben Aichaoui, Nawel Hiri, Abdelhalim Hafedh Dahou, and Mohamed Amine Cheragui. 2022. Automatic building of a large arabic spelling error corpus. SN Computer Science 4, 2 (2022), 108.","journal-title":"SN Computer Science"},{"key":"e_1_3_2_25_2","first-page":"1","volume-title":"2014 5th International Conference on Information and Communication Systems (ICICS)","author":"Al-Ayyoub Mahmoud","year":"2014","unstructured":"Mahmoud Al-Ayyoub, Marwan K Rihani, Nidal I Dalgamoni, and Nawaf A Abdulla. 2014. Spoken arabic dialects identification: The case of egyptian and jordanian dialects. In 2014 5th International Conference on Information and Communication Systems (ICICS). IEEE, 1\u20136."},{"key":"e_1_3_2_26_2","first-page":"1","volume-title":"2024 International Joint Conference on Neural Networks (IJCNN)","author":"Al-Azani Sadam","year":"2024","unstructured":"Sadam Al-Azani, Nora Alturayeif, Haneen Abouelresh, and Alhanoof Alhunief. 2024. A comprehensive framework and empirical analysis for evaluating large language models in arabic dialect identification. In 2024 International Joint Conference on Neural Networks (IJCNN). IEEE, 1\u20137."},{"key":"e_1_3_2_27_2","first-page":"1","volume-title":"2018 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT)","author":"Al-Azani Sadam","year":"2018","unstructured":"Sadam Al-Azani and El-Sayed M El-Alfy. 2018. Detection of arabic spam tweets using word embedding and machine learning. In 2018 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT). IEEE, 1\u20135."},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","unstructured":"Saja Al-Dabet Sara Tedmori and Mohammad AL-Smadi. 2021. Enhancing Arabic aspect-based sentiment analysis using deep learning models. Computer Speech and Language 69 (September 2021). DOI:10.1016\/j.csl.2021.101224","DOI":"10.1016\/j.csl.2021.101224"},{"issue":"1","key":"e_1_3_2_29_2","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1007\/s12559-018-9592-7","article-title":"A study of arabic social media users-posting behavior and author\u2019s gender prediction","volume":"11","author":"Al-Ghadir Abdulrahman I","year":"2019","unstructured":"Abdulrahman I Al-Ghadir and Aqil M Azmi. 2019. A study of arabic social media users-posting behavior and author\u2019s gender prediction. Cognitive Computation 11, 1 (2019), 71\u201386.","journal-title":"Cognitive Computation"},{"key":"e_1_3_2_30_2","doi-asserted-by":"crossref","unstructured":"Abdulmohsen Al-Thubaity Qubayl Alqahtani and Abdulaziz Aljandal. 2018. Sentiment lexicon for sentiment analysis of Saudi dialect tweets. Procedia Computer Science 142 (2018) 301\u2013307.","DOI":"10.1016\/j.procs.2018.10.494"},{"key":"e_1_3_2_31_2","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1016\/j.procs.2017.10.094","article-title":"Arasenti-tweet: A corpus for arabic sentiment analysis of saudi tweets","volume":"117","author":"Al-Twairesh Nora","year":"2017","unstructured":"Nora Al-Twairesh, Hend Al-Khalifa, AbdulMalik Al-Salman, and Yousef Al-Ohali. 2017. Arasenti-tweet: A corpus for arabic sentiment analysis of saudi tweets. Procedia Computer Science 117 (2017), 63\u201372.","journal-title":"Procedia Computer Science"},{"key":"e_1_3_2_32_2","first-page":"69","volume-title":"Informatics","author":"Aldjanabi Wassen","year":"2021","unstructured":"Wassen Aldjanabi, Abdelghani Dahou, Mohammed AA Al-qaness, Mohamed Abd Elaziz, Ahmed Mohamed Helmi, and Robertas Dama\u0161evi\u010dius. 2021. Arabic offensive and hate speech detection using a cross-corpora multi-task learning model. In Informatics, Vol. 8. Multidisciplinary Digital Publishing Institute, 69."},{"key":"e_1_3_2_33_2","volume-title":"Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)","author":"Alharbi Randah","year":"2018","unstructured":"Randah Alharbi, Walid Magdy, Kareem Darwish, Ahmed Abdelali, and Hamdy Mubarak. 2018. Part-of-speech tagging for arabic gulf dialect using Bi-LSTM. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)."},{"issue":"3","key":"e_1_3_2_34_2","doi-asserted-by":"crossref","first-page":"439","DOI":"10.3390\/pr10030439","article-title":"A combined text-based and metadata-based deep-learning framework for the detection of spam accounts on the social media platform twitter","volume":"10","author":"Alhassun Atheer S","year":"2022","unstructured":"Atheer S Alhassun and Murad A Rassam. 2022. A combined text-based and metadata-based deep-learning framework for the detection of spam accounts on the social media platform twitter. Processes 10, 3 (2022), 439.","journal-title":"Processes"},{"key":"e_1_3_2_35_2","first-page":"1","volume-title":"2020 International Joint Conference on Neural Networks (IJCNN)","author":"Ali Abbas Raza","year":"2020","unstructured":"Abbas Raza Ali. 2020. Multi-dialect arabic speech recognition. In 2020 International Joint Conference on Neural Networks (IJCNN). IEEE, 1\u20137."},{"key":"e_1_3_2_36_2","first-page":"12","volume-title":"Proceedinsg of the 5th Workshop on Open-Source Arabic Corpora and Processing Tools with Shared Tasks on Qur\u2019an QA and Fine-Grained Hate Speech Detection","author":"Ali Zien Sheikh","year":"2022","unstructured":"Zien Sheikh Ali, Abdulaziz Al-Ali, and Tamer Elsayed. 2022. Detecting users prone to spread fake news on arabic twitter. In Proceedinsg of the 5th Workshop on Open-Source Arabic Corpora and Processing Tools with Shared Tasks on Qur\u2019an QA and Fine-Grained Hate Speech Detection. 12\u201322."},{"key":"e_1_3_2_37_2","first-page":"302","volume-title":"Proceedings of the Fifth Arabic Natural Language Processing Workshop","author":"Aliwy Ahmed","year":"2020","unstructured":"Ahmed Aliwy, Hawraa Taher, and Zena AboAltaheen. 2020. Arabic dialects identification for all arabic countries. In Proceedings of the Fifth Arabic Natural Language Processing Workshop. 302\u2013307."},{"key":"e_1_3_2_38_2","first-page":"119","volume-title":"Proceedings of the Fifth Arabic Natural Language Processing Workshop","author":"Alkaoud Mohamed","year":"2020","unstructured":"Mohamed Alkaoud and Mairaj Syed. 2020. On the importance of tokenization in arabic embedding models. In Proceedings of the Fifth Arabic Natural Language Processing Workshop. 119\u2013129."},{"key":"e_1_3_2_39_2","doi-asserted-by":"crossref","first-page":"101138","DOI":"10.1016\/j.csl.2020.101138","article-title":"Part-of-speech tagging for arabic tweets using CRF and Bi-LSTM","volume":"65","author":"AlKhwiter Wasan","year":"2021","unstructured":"Wasan AlKhwiter and Nora Al-Twairesh. 2021. Part-of-speech tagging for arabic tweets using CRF and Bi-LSTM. Computer Speech & Language 65 (2021), 101138.","journal-title":"Computer Speech & Language"},{"key":"e_1_3_2_40_2","first-page":"602","volume-title":"International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems","author":"Alomari Khaled Mohammad","year":"2017","unstructured":"Khaled Mohammad Alomari, Hatem M ElSherif, and Khaled Shaalan. 2017. Arabic tweets sentimental analysis using machine learning. In International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems. Springer, 602\u2013610."},{"key":"e_1_3_2_41_2","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1109\/ISTAS.2018.8638164","volume-title":"2018 IEEE International Symposium on Technology and Society (ISTAS)","author":"Alorini Dema","year":"2018","unstructured":"Dema Alorini and Danda B Rawat. 2018. Bayesian reasoning based malicious data discovery on gulf-dialectical arabic tweets. In 2018 IEEE International Symposium on Technology and Society (ISTAS). IEEE, 133\u2013138."},{"key":"e_1_3_2_42_2","doi-asserted-by":"crossref","first-page":"448","DOI":"10.1109\/ICCNC.2019.8685659","volume-title":"2019 International Conference on Computing, Networking and Communications (ICNC)","author":"Alorini Dema","year":"2019","unstructured":"Dema Alorini and Danda B Rawat. 2019. Automatic spam detection on gulf dialectical arabic tweets. In 2019 International Conference on Computing, Networking and Communications (ICNC). IEEE, 448\u2013452."},{"key":"e_1_3_2_43_2","article-title":"Mining arabic twitter conversations on health care: A new approach to analysing arabic language on social media","author":"Alqtati Nael","year":"2021","unstructured":"Nael Alqtati, Jonathan AJ Wilson, and Varuna De Silva. 2021. Mining arabic twitter conversations on health care: A new approach to analysing arabic language on social media. Journal of Islamic Marketing 13, 12 (2021), 2649\u20132671.","journal-title":"Journal of Islamic Marketing"},{"key":"e_1_3_2_44_2","unstructured":"Sarah Alqurashi Btool Hamoui Abdulaziz Alashaikh Ahmad Alhindi and Eisa Alanazi. 2021. Eating garlic prevents COVID-19 infection: Detecting misinformation on the Arabic content of Twitter. arXiv preprint arXiv:2101.05626 (2021)."},{"key":"e_1_3_2_45_2","first-page":"282","volume-title":"Proceedings of the Fifth Arabic Natural Language Processing Workshop","author":"AlShenaifi Nouf","year":"2020","unstructured":"Nouf AlShenaifi and Aqil Azmi. 2020. Faheem at NADI shared task: Identifying the dialect of arabic tweet. In Proceedings of the Fifth Arabic Natural Language Processing Workshop. 282\u2013287."},{"key":"e_1_3_2_46_2","unstructured":"Maha J. Althobaiti. 2020. Automatic Arabic dialect identification systems for written texts: A survey. arXiv preprint arXiv:2009.12622 (2020)."},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","unstructured":"Fakhraddin Alwajih Gagan Bhatia and Muhammad Abdul-Mageed. 2024. Dallah: A dialect-aware multimodal large language model for arabic. In Proceedings of the Second Arabic Natural Language Processing Conference Nizar Habash Houda Bouamor Ramy Eskander Nadi Tomeh Ibrahim Abu Farha Ahmed Abdelali Samia Touileb Injy Hamed Yaser Onaizan Bashar Alhafni Wissam Antoun Salam Khalifa Hatem Haddad Imed Zitouni Badr AlKhamissi Rawan Almatham and Khalil Mrini (Eds.). Association for Computational Linguistics Bangkok Thailand 320\u2013336. DOI:10.18653\/v1\/2024.arabicnlp-1.27","DOI":"10.18653\/v1\/2024.arabicnlp-1.27"},{"key":"e_1_3_2_48_2","first-page":"1","volume-title":"2020 3rd International Conference on Computer Applications & Information Security (ICCAIS)","author":"AlYami Reem","year":"2020","unstructured":"Reem AlYami and Rabeah AlZaidy. 2020. Arabic dialect identification in social media. In 2020 3rd International Conference on Computer Applications & Information Security (ICCAIS). IEEE, 1\u20132."},{"key":"e_1_3_2_49_2","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1109\/ICICS49469.2020.239507","volume-title":"2020 11th International Conference on Information and Communication Systems (ICICS)","author":"Alzaqebah Abdullah","year":"2020","unstructured":"Abdullah Alzaqebah, Bushra Smadi, and Bassam H Hammo. 2020. Arabic sentiment analysis based on salp swarm algorithm with s-shaped transfer functions. In 2020 11th International Conference on Information and Communication Systems (ICICS). IEEE, 179\u2013184."},{"key":"e_1_3_2_50_2","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1109\/ICICS52457.2021.9464600","volume-title":"2021 12th International Conference on Information and Communication Systems (ICICS)","author":"Alzyout Moath","year":"2021","unstructured":"Moath Alzyout, Emran AL Bashabsheh, Hassan Najadat, and Ahmad Alaiad. 2021. Sentiment analysis of arabic tweets about violence against women using machine learning. In 2021 12th International Conference on Information and Communication Systems (ICICS). IEEE, 171\u2013176."},{"issue":"12","key":"e_1_3_2_51_2","doi-asserted-by":"crossref","first-page":"12511","DOI":"10.1016\/j.aej.2022.05.029","article-title":"Arabic rumor detection: A comparative study","volume":"61","author":"Amoudi Ghada","year":"2022","unstructured":"Ghada Amoudi, Rasha Albalawi, Fatimah Baothman, Amani Jamal, Hanan Alghamdi, and Areej Alhothali. 2022. Arabic rumor detection: A comparative study. Alexandria Engineering Journal 61, 12 (2022), 12511\u201312523.","journal-title":"Alexandria Engineering Journal"},{"key":"e_1_3_2_52_2","unstructured":"Wissam Antoun Fady Baly and Hazem Hajj. 2020. AraBERT: Transformer-based model for arabic language understanding. In Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools with a Shared Task on Offensive Language Detection Hend Al-Khalifa Walid Magdy Kareem Darwish Tamer Elsayed and Hamdy Mubarak (Eds.). European Language Resource Association Marseille France 9\u201315. Retrieved from https:\/\/aclanthology.org\/2020.osact-1.2\/"},{"key":"e_1_3_2_53_2","unstructured":"Wissam Antoun Fady Baly and Hazem Hajj. 2021. AraELECTRA: Pre-training text discriminators for arabic language understanding. In Proceedings of the Sixth Arabic Natural Language Processing Workshop Nizar Habash Houda Bouamor Hazem Hajj Walid Magdy Wajdi Zaghouani Fethi Bougares Nadi Tomeh Ibrahim Abu Farha and Samia Touileb (Eds.). Association for Computational Linguistics Kyiv Ukraine (Virtual) 191\u2013195. Retrieved from https:\/\/aclanthology.org\/2021.wanlp-1.20\/"},{"key":"e_1_3_2_54_2","first-page":"196","volume-title":"Proceedings of the Sixth Arabic Natural Language Processing Workshop","author":"Antoun Wissam","year":"2021","unstructured":"Wissam Antoun, Fady Baly, and Hazem Hajj. 2021. AraGPT2: Pre-trained transformer for arabic language generation. In Proceedings of the Sixth Arabic Natural Language Processing Workshop. Association for Computational Linguistics, Kyiv, Ukraine (Virtual), 196\u2013207."},{"key":"e_1_3_2_55_2","doi-asserted-by":"crossref","first-page":"384","DOI":"10.1109\/MIUCC55081.2022.9781707","volume-title":"2022 2nd International Mobile, Intelligent, and Ubiquitous Computing Conference (MIUCC)","author":"Ashraf Nsrin","year":"2022","unstructured":"Nsrin Ashraf, Hamada Nayel, and Mohamed Taha. 2022. A comparative study of machine learning approaches for rumors detection in Covid-19 tweets. In 2022 2nd International Mobile, Intelligent, and Ubiquitous Computing Conference (MIUCC). IEEE, 384\u2013387."},{"key":"e_1_3_2_56_2","first-page":"1","volume-title":"2021 IEEE 15th International Conference on Application of Information and Communication Technologies (AICT)","author":"Assaf Rasha","year":"2021","unstructured":"Rasha Assaf and Mahmoud Saheb. 2021. Dataset for arabic fake news. In 2021 IEEE 15th International Conference on Application of Information and Communication Technologies (AICT). IEEE, 1\u20134."},{"issue":"2","key":"e_1_3_2_57_2","first-page":"256","article-title":"Sentiment analysis of arabic jordanian dialect tweets","volume":"10","author":"Atoum Jalal Omer","year":"2019","unstructured":"Jalal Omer Atoum and Mais Nouman. 2019. Sentiment analysis of arabic jordanian dialect tweets. Int. J. Adv. Comput. Sci. Appl 10, 2 (2019), 256\u2013262.","journal-title":"Int. J. Adv. Comput. Sci. Appl"},{"key":"e_1_3_2_58_2","unstructured":"Ramy Baly Alaa Khaddaj Hazem Hajj Wassim El-Hajj and Khaled Bashir Shaban. 2019. Arsentd-lev: A multi-topic corpus for target-based sentiment analysis in arabic levantine tweets. arXiv preprint arXiv:1906.01830 (2019)."},{"issue":"12","key":"e_1_3_2_59_2","doi-asserted-by":"crossref","first-page":"2502","DOI":"10.3390\/app8122502","article-title":"A multitask-based neural machine translation model with part-of-speech tags integration for arabic dialects","volume":"8","author":"Baniata Laith H","year":"2018","unstructured":"Laith H Baniata, Seyoung Park, and Seong-Bae Park. 2018. A multitask-based neural machine translation model with part-of-speech tags integration for arabic dialects. Applied Sciences 8, 12 (2018), 2502.","journal-title":"Applied Sciences"},{"issue":"11","key":"e_1_3_2_60_2","article-title":"SDCT: Multi-dialects corpus classification for saudi tweets","volume":"11","author":"Bayazed Afnan","year":"2020","unstructured":"Afnan Bayazed, Ola Torabah, Redha AlSulami, Dimah Alahmadi, Amal Babour, and Kawther Saeedi. 2020. SDCT: Multi-dialects corpus classification for saudi tweets. Methodology 11, 11 (2020), 216\u2013233.","journal-title":"Methodology"},{"issue":"2","key":"e_1_3_2_61_2","doi-asserted-by":"crossref","first-page":"1485","DOI":"10.11591\/ijece.v11i2.pp1485-1497","article-title":"New approach for arabic named entity recognition on social media based on feature selection using genetic algorithm","volume":"11","author":"Benali Brahim Ait","year":"2021","unstructured":"Brahim Ait Benali, Soukaina Mihi, Ismail El Bazi, and Nabil Laachfoubi. 2021. New approach for arabic named entity recognition on social media based on feature selection using genetic algorithm. International Journal of Electrical and Computer Engineering (IJECE) 11, 2 (2021), 1485\u20131497.","journal-title":"International Journal of Electrical and Computer Engineering (IJECE)"},{"key":"e_1_3_2_62_2","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1007\/978-981-16-3637-0_7","volume-title":"Networking, Intelligent Systems and Security","author":"Bensalah Nouhaila","year":"2022","unstructured":"Nouhaila Bensalah, Habib Ayad, Abdellah Adib, and Abdelhamid Ibn El Farouk. 2022. CRAN: An hybrid CNN-RNN attention-based model for arabic machine translation. In Networking, Intelligent Systems and Security. Springer, 87\u2013102."},{"key":"e_1_3_2_63_2","doi-asserted-by":"publisher","unstructured":"Gagan Bhatia El Moatez Billah Nagoudi Fakhraddin Alwajih and Muhammad Abdul-Mageed. 2024. Qalam: A multimodal LLM for arabic optical character and handwriting recognition. In Proceedings of the Second Arabic Natural Language Processing Conference Nizar Habash Houda Bouamor Ramy Eskander Nadi Tomeh Ibrahim Abu Farha Ahmed Abdelali Samia Touileb Injy Hamed Yaser Onaizan Bashar Alhafni Wissam Antoun Salam Khalifa Hatem Haddad Imed Zitouni Badr AlKhamissi Rawan Almatham and Khalil Mrini (Eds.). Association for Computational Linguistics Bangkok Thailand 210\u2013224. DOI:10.18653\/v1\/2024.arabicnlp-1.19","DOI":"10.18653\/v1\/2024.arabicnlp-1.19"},{"key":"e_1_3_2_64_2","article-title":"iCompass at CheckThat! 2022: ARBERT and AraBERT for arabic checkworthy tweet identification","author":"Bilel T","year":"2022","unstructured":"T Bilel, BN Mohamed Aziz, and H Haddad. 2022. iCompass at CheckThat! 2022: ARBERT and AraBERT for arabic checkworthy tweet identification. Working Notes of CLEF 3180 (2022), 694\u2013701.","journal-title":"Working Notes of CLEF"},{"key":"e_1_3_2_65_2","volume-title":"LREC","author":"Bouamor Houda","year":"2018","unstructured":"Houda Bouamor, Nizar Habash, Mohammad Salameh, Wajdi Zaghouani, Owen Rambow, Dana Abdulrahim, Ossama Obeid, Salam Khalifa, Fadhl Eryani, Alexander Erdmann, et\u00a0al. 2018. The MADAR arabic dialect corpus and lexicon.. In LREC."},{"key":"e_1_3_2_66_2","doi-asserted-by":"crossref","first-page":"199","DOI":"10.18653\/v1\/W19-4622","volume-title":"Proceedings of the Fourth Arabic Natural Language Processing Workshop","author":"Bouamor Houda","year":"2019","unstructured":"Houda Bouamor, Sabit Hassan, and Nizar Habash. 2019. The MADAR shared task on arabic fine-grained dialect identification. In Proceedings of the Fourth Arabic Natural Language Processing Workshop. 199\u2013207."},{"key":"e_1_3_2_67_2","unstructured":"Soumia Bougrine Hadda Cherroun and Djelloul Ziadi. 2017. Hierarchical classification for spoken Arabic dialect identification using prosody: Case of algerian dialects. arXiv preprint arXiv:1703.10065 (2017)."},{"key":"e_1_3_2_68_2","unstructured":"Rahma Boujelbane Mariem Ellouze Fr\u00e9d\u00e9ric B\u00e9chet and Lamia Belguith. 2014. De l\u2019arabe standard vers l\u2019arabe dialectal: Projection de corpus et ressources linguistiques en vue du traitement automatique de l\u2019oral dans les m\u00e9dias tunisiens [From Modern Standard Arabic to Tunisian dialect: Corpus projection and linguistic resources towards the automatic processing of speech in the Tunisian media]. Traitement Automatique des Langues 55 2 (2014) 73\u201396. Retrieved from https:\/\/aclanthology.org\/2014.tal-2.4\/"},{"key":"e_1_3_2_69_2","doi-asserted-by":"crossref","first-page":"665","DOI":"10.1016\/j.procs.2022.03.088","article-title":"Building an optimal dataset for arabic fake news detection","volume":"201","author":"Bsoul Mohammad A","year":"2022","unstructured":"Mohammad A Bsoul, Abdallah Qusef, and Saleh Abu-Soud. 2022. Building an optimal dataset for arabic fake news detection. Procedia Computer Science 201 (2022), 665\u2013672.","journal-title":"Procedia Computer Science"},{"key":"e_1_3_2_70_2","first-page":"3521","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"34","author":"Chen Minghao","year":"2020","unstructured":"Minghao Chen, Shuai Zhao, Haifeng Liu, and Deng Cai. 2020. Adversarial-learned loss for domain adaptation. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 3521\u20133528."},{"issue":"3","key":"e_1_3_2_71_2","doi-asserted-by":"crossref","first-page":"388","DOI":"10.3390\/knowledge2030022","article-title":"Arabic aspect-based sentiment classification using Seq2Seq dialect normalization and transformers","volume":"2","author":"Chennafi Mohammed ElAmine","year":"2022","unstructured":"Mohammed ElAmine Chennafi, Hanane Bedlaoui, Abdelghani Dahou, and Mohammed AA Al-qaness. 2022. Arabic aspect-based sentiment classification using Seq2Seq dialect normalization and transformers. Knowledge 2, 3 (2022), 388\u2013401.","journal-title":"Knowledge"},{"key":"e_1_3_2_72_2","doi-asserted-by":"crossref","first-page":"122546","DOI":"10.1016\/j.techfore.2023.122546","article-title":"A social media event detection framework based on transformers and swarm optimization for public notification of crises and emergency management","volume":"192","author":"Dahou Abdelghani","year":"2023","unstructured":"Abdelghani Dahou, Alhassan Mabrouk, Ahmed A Ewees, Marwa A Gaheen, and Mohamed Abd Elaziz. 2023. A social media event detection framework based on transformers and swarm optimization for public notification of crises and emergency management. Technological Forecasting and Social Change 192 (2023), 122546.","journal-title":"Technological Forecasting and Social Change"},{"key":"e_1_3_2_73_2","unstructured":"Mohamed Amine Cheragui Abdelhalim Hafedh Dahou and Mohamed Abdelmoazz. 2021. A3C: Arabic anaphora annotated corpus. In Proceedings of the 4th International Conference on Natural Language and Speech Processing (ICNLSP 2021) Mourad Abbas and Abed Alhakim Freihat (Eds.). Association for Computational Linguistics Trento Italy 147\u2013155. Retrieved from https:\/\/aclanthology.org\/2021.icnlsp-1.17\/"},{"key":"e_1_3_2_74_2","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1007\/978-3-031-18516-8_18","volume-title":"International Symposium on Modelling and Implementation of Complex Systems","author":"Dahou Abdelhalim Hafedh","year":"2023","unstructured":"Abdelhalim Hafedh Dahou and Mohamed Amine Cheragui. 2023. Impact of normalization and data augmentation in NER for algerian arabic dialect. In International Symposium on Modelling and Implementation of Complex Systems. Springer, 249\u2013262."},{"key":"e_1_3_2_75_2","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1007\/978-3-031-25344-7_13","volume-title":"12th International Conference on Information Systems and Advanced Technologies \u201cICISAT 2022\u201d Intelligent Information, Data Science and Decision Support System","author":"Dahou Abdelhalim Hafedh","year":"2023","unstructured":"Abdelhalim Hafedh Dahou and Mohamed Amine Cheragui. 2023. Named entity recognition for algerian arabic dialect in social media. In 12th International Conference on Information Systems and Advanced Technologies \u201cICISAT 2022\u201d Intelligent Information, Data Science and Decision Support System. Springer, 135\u2013145."},{"key":"e_1_3_2_76_2","first-page":"458","volume-title":"Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing","author":"Dahou Abdelhalim Hafedh","year":"2023","unstructured":"Abdelhalim Hafedh Dahou, Mohamed Amine Cheragui, and Ahmed Abdelali. 2023. Performance analysis of arabic pre-trained models on named entity recognition task. In Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing. 458\u2013467."},{"issue":"6","key":"e_1_3_2_77_2","doi-asserted-by":"crossref","first-page":"677","DOI":"10.1017\/S1351324920000078","article-title":"Effective multi-dialectal arabic POS tagging","volume":"26","author":"Darwish Kareem","year":"2020","unstructured":"Kareem Darwish, Mohammed Attia, Hamdy Mubarak, Younes Samih, Ahmed Abdelali, Llu\u00eds M\u00e0rquez, Mohamed Eldesouki, and Laura Kallmeyer. 2020. Effective multi-dialectal arabic POS tagging. Natural Language Engineering 26, 6 (2020), 677\u2013690.","journal-title":"Natural Language Engineering"},{"key":"e_1_3_2_78_2","doi-asserted-by":"crossref","first-page":"130","DOI":"10.18653\/v1\/W17-1316","volume-title":"Proceedings of the Third Arabic Natural Language Processing Workshop","author":"Darwish Kareem","year":"2017","unstructured":"Kareem Darwish, Hamdy Mubarak, Ahmed Abdelali, and Mohamed Eldesouki. 2017. Arabic pos tagging: Don\u2019t abandon feature engineering just yet. In Proceedings of the Third Arabic Natural Language Processing Workshop. 130\u2013137."},{"key":"e_1_3_2_79_2","volume-title":"Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)","author":"Darwish Kareem","year":"2018","unstructured":"Kareem Darwish, Hamdy Mubarak, Ahmed Abdelali, Mohamed Eldesouki, Younes Samih, Randah Alharbi, Mohammed Attia, Walid Magdy, and Laura Kallmeyer. 2018. Multi-dialect arabic POS tagging: A CRF approach. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)."},{"key":"e_1_3_2_80_2","doi-asserted-by":"publisher","unstructured":"Jacob Devlin Ming-Wei Chang Kenton Lee and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies Volume 1 (Long and Short Papers) Jill Burstein Christy Doran and Thamar Solorio (Eds.). Association for Computational Linguistics Minneapolis Minnesota 4171\u20134186. DOI:10.18653\/v1\/N19-1423","DOI":"10.18653\/v1\/N19-1423"},{"issue":"4","key":"e_1_3_2_81_2","doi-asserted-by":"crossref","first-page":"501","DOI":"10.1177\/0165551514534143","article-title":"A study of the effects of preprocessing strategies on sentiment analysis for Arabic text","volume":"40","author":"Duwairi Rehab","year":"2014","unstructured":"Rehab Duwairi and Mahmoud El-Orfali. 2014. A study of the effects of preprocessing strategies on sentiment analysis for Arabic text. Journal of Information Science 40, 4 (2014), 501\u2013513.","journal-title":"Journal of Information Science"},{"issue":"4","key":"e_1_3_2_82_2","doi-asserted-by":"crossref","first-page":"4001","DOI":"10.1007\/s13369-021-05383-3","article-title":"A deep learning framework for automatic detection of hate speech embedded in arabic tweets","volume":"46","author":"Duwairi Rehab","year":"2021","unstructured":"Rehab Duwairi, Amena Hayajneh, and Muhannad Quwaider. 2021. A deep learning framework for automatic detection of hate speech embedded in arabic tweets. Arabian Journal for Science and Engineering 46, 4 (2021), 4001\u20134014.","journal-title":"Arabian Journal for Science and Engineering"},{"key":"e_1_3_2_83_2","first-page":"1","volume-title":"2014 5th International Conference on Information and Communication Systems (ICICS)","author":"Duwairi Rehab M","year":"2014","unstructured":"Rehab M Duwairi, Raed Marji, Narmeen Sha\u2019ban, and Sally Rushaidat. 2014. Sentiment analysis in arabic tweets. In 2014 5th International Conference on Information and Communication Systems (ICICS). IEEE, 1\u20136."},{"key":"e_1_3_2_84_2","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1016\/j.procs.2017.10.092","article-title":"A web-based tool for arabic sentiment analysis","volume":"117","author":"El-Masri Mazen","year":"2017","unstructured":"Mazen El-Masri, Nabeela Altrabsheh, Hanady Mansour, and Allan Ramsay. 2017. A web-based tool for arabic sentiment analysis. Procedia Computer Science 117 (2017), 38\u201345.","journal-title":"Procedia Computer Science"},{"key":"e_1_3_2_85_2","first-page":"2824","volume-title":"Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Mekki Abdellah El","year":"2021","unstructured":"Abdellah El Mekki, Abdelkader El Mahdaouy, Ismail Berrada, and Ahmed Khoumsi. 2021. Domain adaptation for arabic cross-domain and cross-dialect sentiment analysis from contextualized word embedding. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2824\u20132837."},{"key":"e_1_3_2_86_2","first-page":"256","volume-title":"International Conference on Intelligent Networking and Collaborative Systems","author":"Elhadad Mohamed K","year":"2020","unstructured":"Mohamed K Elhadad, Kin Fun Li, and Fayez Gebali. 2020. COVID-19-FAKES: A twitter (arabic\/english) dataset for detecting misleading information on COVID-19. In International Conference on Intelligent Networking and Collaborative Systems. Springer, 256\u2013268."},{"key":"e_1_3_2_87_2","doi-asserted-by":"crossref","first-page":"256","DOI":"10.1007\/978-3-030-57796-4_25","volume-title":"Advances in Intelligent Networking and Collaborative Systems: The 12th International Conference on Intelligent Networking and Collaborative Systems (INCoS-2020) 12","author":"Elhadad Mohamed K","year":"2021","unstructured":"Mohamed K Elhadad, Kin Fun Li, and Fayez Gebali. 2021. COVID-19-FAKES: A twitter (arabic\/english) dataset for detecting misleading information on COVID-19. In Advances in Intelligent Networking and Collaborative Systems: The 12th International Conference on Intelligent Networking and Collaborative Systems (INCoS-2020) 12. Springer, 256\u2013268."},{"key":"e_1_3_2_88_2","first-page":"20","article-title":"Arsas: An arabic speech-act and sentiment corpus of tweets","volume":"3","author":"Elmadany AbdelRahim","year":"2018","unstructured":"AbdelRahim Elmadany, Hamdy Mubarak, and Walid Magdy. 2018. Arsas: An arabic speech-act and sentiment corpus of tweets. OSACT 3 (2018), 20.","journal-title":"OSACT"},{"key":"e_1_3_2_89_2","doi-asserted-by":"publisher","unstructured":"AbdelRahim Elmadany ElMoatez Billah Nagoudi and Muhammad Abdul-Mageed. 2023. ORCA: A challenging benchmark for arabic language understanding. In Findings of the Association for Computational Linguistics: ACL 2023 Anna Rogers Jordan Boyd-Graber and Naoaki Okazaki (Eds.). Association for Computational Linguistics Toronto Canada 9559\u20139586. DOI:10.18653\/v1\/2023.findings-acl.609","DOI":"10.18653\/v1\/2023.findings-acl.609"},{"key":"e_1_3_2_90_2","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1007\/978-3-319-66854-3_20","volume-title":"International Conference on Model and Data Engineering","author":"Elouardighi Abdeljalil","year":"2017","unstructured":"Abdeljalil Elouardighi, Mohcine Maghfour, and Hafdalla Hammia. 2017. Collecting and processing arabic facebook comments for sentiment analysis. In International Conference on Model and Data Engineering. Springer, 262\u2013274."},{"key":"e_1_3_2_91_2","first-page":"23","volume-title":"Computational Linguistics and Intelligent Text Processing: 16th International Conference, CICLing 2015, Cairo, Egypt, April 14-20, 2015, Proceedings, Part II 16","author":"ElSahar Hady","year":"2015","unstructured":"Hady ElSahar and Samhaa R El-Beltagy. 2015. Building large arabic multi-domain resources for sentiment analysis. In Computational Linguistics and Intelligent Text Processing: 16th International Conference, CICLing 2015, Cairo, Egypt, April 14-20, 2015, Proceedings, Part II 16. Springer, 23\u201334."},{"issue":"3","key":"e_1_3_2_92_2","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1016\/j.eij.2020.04.001","article-title":"Gender identification for egyptian arabic dialect in twitter using deep learning models","volume":"21","author":"ElSayed Shereen","year":"2020","unstructured":"Shereen ElSayed and Mona Farouk. 2020. Gender identification for egyptian arabic dialect in twitter using deep learning models. Egyptian Informatics Journal 21, 3 (2020), 159\u2013167.","journal-title":"Egyptian Informatics Journal"},{"key":"e_1_3_2_93_2","first-page":"4130","volume-title":"Proceedings of the 12th Language Resources and Evaluation Conference","author":"Eryani Fadhl","year":"2020","unstructured":"Fadhl Eryani, Nizar Habash, Houda Bouamor, and Salam Khalifa. 2020. A spelling correction corpus for multiple arabic dialects. In Proceedings of the 12th Language Resources and Evaluation Conference. 4130\u20134138."},{"key":"e_1_3_2_94_2","doi-asserted-by":"publisher","unstructured":"Fahim Faisal Orevaoghene Ahia Aarohi Srivastava Kabir Ahuja David Chiang Yulia Tsvetkov and Antonios Anastasopoulos. 2024. DIALECTBENCH: An NLP benchmark for dialects varieties and closely-related languages. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Lun-Wei Ku Andre Martins and Vivek Srikumar (Eds.). Association for Computational Linguistics Bangkok Thailand 14412\u201314454. DOI:10.18653\/v1\/2024.acl-long.777","DOI":"10.18653\/v1\/2024.acl-long.777"},{"key":"e_1_3_2_95_2","doi-asserted-by":"crossref","first-page":"192","DOI":"10.18653\/v1\/W19-4621","volume-title":"Proceedings of the Fourth Arabic Natural Language Processing Workshop","author":"Farha Ibrahim Abu","year":"2019","unstructured":"Ibrahim Abu Farha and Walid Magdy. 2019. Mazajak: An online arabic sentiment analyser. In Proceedings of the Fourth Arabic Natural Language Processing Workshop. 192\u2013198."},{"issue":"4","key":"e_1_3_2_96_2","doi-asserted-by":"crossref","first-page":"1811","DOI":"10.1007\/s12652-021-02948-w","article-title":"Classification of arabic healthcare questions based on word embeddings learned from massive consultations: A deep learning approach","volume":"13","author":"Faris Hossam","year":"2022","unstructured":"Hossam Faris, Maria Habib, Mohammad Faris, Alaa Alomari, Pedro A Castillo, and Manal Alomari. 2022. Classification of arabic healthcare questions based on word embeddings learned from massive consultations: A deep learning approach. Journal of Ambient Intelligence and Humanized Computing 13, 4 (2022), 1811\u20131827.","journal-title":"Journal of Ambient Intelligence and Humanized Computing"},{"key":"e_1_3_2_97_2","first-page":"1","article-title":"Darijabert: A step forward in nlp for the written moroccan dialect","author":"Gaanoun Kamel","year":"2024","unstructured":"Kamel Gaanoun, Abdou Mohamed Naira, Anass Allak, and Imade Benelallam. 2024. Darijabert: A step forward in nlp for the written moroccan dialect. International Journal of Data Science and Analytics (2024), 1\u201313.","journal-title":"International Journal of Data Science and Analytics"},{"key":"e_1_3_2_98_2","doi-asserted-by":"crossref","first-page":"332","DOI":"10.1016\/j.procs.2019.06.048","article-title":"Implementation of machine learning algorithms in arabic sentiment analysis using N-Gram features","volume":"154","author":"Gamal Donia","year":"2019","unstructured":"Donia Gamal, Marco Alfonse, El-Sayed M El-Horbaty, and Abdel-Badeeh M Salem. 2019. Implementation of machine learning algorithms in arabic sentiment analysis using N-Gram features. Procedia Computer Science 154 (2019), 332\u2013340.","journal-title":"Procedia Computer Science"},{"issue":"1","key":"e_1_3_2_99_2","doi-asserted-by":"crossref","first-page":"33","DOI":"10.5815\/ijmecs.2019.01.04","article-title":"Twitter benchmark dataset for arabic sentiment analysis","volume":"11","author":"Gamal Donia","year":"2019","unstructured":"Donia Gamal, Marco Alfonse, El-Sayed M El-Horbaty, and Abdel-Badeeh M Salem. 2019. Twitter benchmark dataset for arabic sentiment analysis. Int J Mod Educ Comput Sci 11, 1 (2019), 33.","journal-title":"Int J Mod Educ Comput Sci"},{"key":"e_1_3_2_100_2","doi-asserted-by":"crossref","first-page":"597","DOI":"10.1007\/978-3-030-73882-2_54","volume-title":"International Conference on Digital Technologies and Applications","author":"Garouani Moncef","year":"2021","unstructured":"Moncef Garouani, Hanae Chrita, and Jamal Kharroubi. 2021. Sentiment analysis of moroccan tweets using text mining. In International Conference on Digital Technologies and Applications. Springer, 597\u2013608."},{"key":"e_1_3_2_101_2","doi-asserted-by":"publisher","unstructured":"Abbas Ghaddar Yimeng Wu Sunyam Bagga Ahmad Rashid Khalil Bibi Mehdi Rezagholizadeh Chao Xing Yasheng Wang Xinyu Duan Zhefeng Wang Baoxing Huai Xin Jiang Qun Liu and Phillippe Langlais. 2022. Revisiting Pre-trained language models and their evaluation for arabic natural language processing. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing Yoav Goldberg Zornitsa Kozareva and Yue Zhang (Eds.). Association for Computational Linguistics Abu Dhabi United Arab Emirates 3135\u20133151. DOI:10.18653\/v1\/2022.emnlp-main.205","DOI":"10.18653\/v1\/2022.emnlp-main.205"},{"key":"e_1_3_2_102_2","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1109\/ICICS55353.2022.9811124","volume-title":"2022 13th International Conference on Information and Communication Systems (ICICS)","author":"Gharaibeh Maram","year":"2022","unstructured":"Maram Gharaibeh, Rasha Obeidat, Malak Abdullah, and Yara Al-Harahsheh. 2022. Datasets and approaches of COVID-19 misinformation detection: A survey. In 2022 13th International Conference on Information and Communication Systems (ICICS). IEEE, 337\u2013345."},{"key":"e_1_3_2_103_2","first-page":"203","volume-title":"Handbook of Statistics","author":"Gudivada Venkat N","year":"2015","unstructured":"Venkat N Gudivada, Dhana Rao, and Vijay V Raghavan. 2015. Big data driven natural language processing research and applications. In Handbook of Statistics. Vol. 33. Elsevier, 203\u2013238."},{"issue":"2","key":"e_1_3_2_104_2","first-page":"1","article-title":"A semi-supervised approach for sentiment analysis of arab (ic+ izi) messages: Application to the algerian dialect","volume":"2","author":"Guellil Imane","year":"2021","unstructured":"Imane Guellil, Ahsan Adeel, Faical Azouaou, Fodil Benali, Ala-Eddine Hachani, Kia Dashtipour, Mandar Gogate, Cosimo Ieracitano, Reza Kashani, and Amir Hussain. 2021. A semi-supervised approach for sentiment analysis of arab (ic+ izi) messages: Application to the algerian dialect. SN Computer Science 2, 2 (2021), 1\u201318.","journal-title":"SN Computer Science"},{"key":"e_1_3_2_105_2","first-page":"557","volume-title":"International Conference on Brain Inspired Cognitive Systems","author":"Guellil Imane","year":"2018","unstructured":"Imane Guellil, Ahsan Adeel, Faical Azouaou, and Amir Hussain. 2018. Sentialg: Automated corpus annotation for algerian sentiment analysis. In International Conference on Brain Inspired Cognitive Systems. Springer, 557\u2013567."},{"issue":"65","key":"e_1_3_2_106_2","doi-asserted-by":"crossref","first-page":"124","DOI":"10.4114\/intartif.vol23iss65pp124-135","article-title":"Arabic dialect sentiment analysis with ZERO effort. \\(\\backslash\\)  \\(\\backslash\\) Case study: Algerian dialect","volume":"23","author":"Guellil Imane","year":"2020","unstructured":"Imane Guellil, Marcelo Mendoza, and Faical Azouaou. 2020. Arabic dialect sentiment analysis with ZERO effort. \\(\\backslash\\) \\(\\backslash\\) Case study: Algerian dialect. Inteligencia Artificial 23, 65 (2020), 124\u2013135.","journal-title":"Inteligencia Artificial"},{"key":"e_1_3_2_107_2","first-page":"1","volume-title":"Hybrid Computational Intelligence","author":"Gupta Neha","year":"2020","unstructured":"Neha Gupta and Rashmi Agrawal. 2020. Application and techniques of opinion mining. In Hybrid Computational Intelligence. Elsevier, 1\u201323."},{"key":"e_1_3_2_108_2","volume-title":"Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)","author":"Habash Nizar","year":"2018","unstructured":"Nizar Habash, Fadhl Eryani, Salam Khalifa, Owen Rambow, Dana Abdulrahim, Alexander Erdmann, Reem Faraj, Wajdi Zaghouani, Houda Bouamor, Nasser Zalmout, et\u00a0al. 2018. Unified guidelines and resources for arabic dialect orthography. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)."},{"issue":"1","key":"e_1_3_2_109_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/978-3-031-02139-8","article-title":"Introduction to arabic natural language processing","volume":"3","author":"Habash Nizar Y","year":"2010","unstructured":"Nizar Y Habash. 2010. Introduction to arabic natural language processing. Synthesis Lectures on Human Language Technologies 3, 1 (2010), 1\u2013187.","journal-title":"Synthesis Lectures on Human Language Technologies"},{"issue":"2","key":"e_1_3_2_110_2","doi-asserted-by":"crossref","first-page":"194","DOI":"10.1007\/s42979-022-01541-y","article-title":"TunBERT: Pretraining BERT for tunisian dialect understanding","volume":"4","author":"Haddad Hatem","year":"2023","unstructured":"Hatem Haddad, Ahmed Cheikh Rouhou, Abir Messaoudi, Abir Korched, Chayma Fourati, Amel Sellami, Moez Ben HajHmida, and Faten Ghriss. 2023. TunBERT: Pretraining BERT for tunisian dialect understanding. SN Computer Science 4, 2 (2023), 194.","journal-title":"SN Computer Science"},{"issue":"6","key":"e_1_3_2_111_2","doi-asserted-by":"crossref","first-page":"275","DOI":"10.25046\/aj020634","article-title":"A multilingual system for cyberbullying detection: Arabic content detection using machine learning","volume":"2","author":"Haidar Batoul","year":"2017","unstructured":"Batoul Haidar, Maroun Chamoun, and Ahmed Serhrouchni. 2017. A multilingual system for cyberbullying detection: Arabic content detection using machine learning. Advances in Science, Technology and Engineering Systems Journal 2, 6 (2017), 275\u2013284.","journal-title":"Advances in Science, Technology and Engineering Systems Journal"},{"key":"e_1_3_2_112_2","doi-asserted-by":"crossref","first-page":"59","DOI":"10.18653\/v1\/W15-3207","volume-title":"Workshop on Arabic Natural Language Processing","author":"Hamdi Ahmed","year":"2015","unstructured":"Ahmed Hamdi, Alexis Nasr, Nizar Habash, and N\u00faria Gala. 2015. POS-tagging of tunisian dialect using standard Arabic resources and tools. In Workshop on Arabic Natural Language Processing. 59\u201368."},{"key":"e_1_3_2_113_2","first-page":"79","volume-title":"International Conference on Arabic Language Processing","author":"Harrat Salima","year":"2019","unstructured":"Salima Harrat, Karima Meftouh, Karima Abidi, and Kamel Sma\u00efli. 2019. Automatic identification methods on a corpus of twenty five fine-grained arabic dialects. In International Conference on Arabic Language Processing. Springer, 79\u201392."},{"key":"e_1_3_2_114_2","first-page":"113","volume-title":"Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations","author":"Hassan Sabit","year":"2021","unstructured":"Sabit Hassan, Hamdy Mubarak, Ahmed Abdelali, and Kareem Darwish. 2021. Asad: Arabic social media analytics and understanding. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations. 113\u2013118."},{"key":"e_1_3_2_115_2","first-page":"1","volume-title":"2016 4th Saudi International Conference on Information Technology (Big Data Analysis)(KACSTIT)","author":"Hathlian Nourah F Bin","year":"2016","unstructured":"Nourah F Bin Hathlian and Alaaeldin M Hafezs. 2016. Sentiment-subjective analysis framework for arabic social media posts. In 2016 4th Saudi International Conference on Information Technology (Big Data Analysis)(KACSTIT). IEEE, 1\u20136."},{"issue":"4","key":"e_1_3_2_116_2","doi-asserted-by":"crossref","first-page":"53","DOI":"10.3390\/bdcc3040053","article-title":"Semantic ontology-based approach to enhance arabic text classification","volume":"3","author":"Hawalah Ahmad","year":"2019","unstructured":"Ahmad Hawalah. 2019. Semantic ontology-based approach to enhance arabic text classification. Big Data and Cognitive Computing 3, 4 (2019), 53.","journal-title":"Big Data and Cognitive Computing"},{"issue":"8","key":"e_1_3_2_117_2","doi-asserted-by":"crossref","first-page":"10453","DOI":"10.1007\/s13369-021-06449-y","article-title":"Arabic fake news detection based on textual analysis","volume":"47","author":"Himdi Hanen","year":"2022","unstructured":"Hanen Himdi, George Weir, Fatmah Assiri, and Hassanin Al-Barhamtoshy. 2022. Arabic fake news detection based on textual analysis. Arabian Journal for Science and Engineering 47, 8 (2022), 10453\u201310469.","journal-title":"Arabian Journal for Science and Engineering"},{"key":"e_1_3_2_118_2","doi-asserted-by":"crossref","first-page":"609","DOI":"10.1109\/ICDAR.2017.105","volume-title":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"1","author":"Hkiri Emna","year":"2017","unstructured":"Emna Hkiri, Souheyl Mallat, and Mounir Zrigui. 2017. Integrating bilingual named entities lexicon with conditional random fields model for Arabic named entities recognition. In 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Vol. 1. IEEE, 609\u2013614."},{"key":"e_1_3_2_119_2","doi-asserted-by":"crossref","first-page":"353","DOI":"10.1109\/ReTIS.2015.7232904","volume-title":"2015 IEEE 2nd International Conference on Recent Trends in Information Systems (ReTIS)","author":"Ibrahim Hossam S","year":"2015","unstructured":"Hossam S Ibrahim, Sherif M Abdou, and Mervat Gheith. 2015. MIKA: A tagged corpus for modern standard Arabic and colloquial sentiment analysis. In 2015 IEEE 2nd International Conference on Recent Trends in Information Systems (ReTIS). IEEE, 353\u2013358."},{"key":"e_1_3_2_120_2","doi-asserted-by":"publisher","unstructured":"Go Inoue Salam Khalifa and Nizar Habash. 2022. Morphosyntactic tagging with pre-trained language models for arabic and its dialects. In Findings of the Association for Computational Linguistics: ACL 2022 Smaranda Muresan Preslav Nakov and Aline Villavicencio (Eds.). Association for Computational Linguistics Dublin Ireland 1708\u20131719. DOI:10.18653\/v1\/2022.findings-acl.135","DOI":"10.18653\/v1\/2022.findings-acl.135"},{"key":"e_1_3_2_121_2","doi-asserted-by":"crossref","first-page":"421","DOI":"10.18653\/v1\/K17-1042","volume-title":"Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017)","author":"Inoue Go","year":"2017","unstructured":"Go Inoue, Hiroyuki Shindo, and Yuji Matsumoto. 2017. Joint prediction of morphosyntactic categories for fine-grained Arabic part-of-speech tagging exploiting tag dictionary information. In Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017). 421\u2013431."},{"key":"e_1_3_2_122_2","first-page":"276","volume-title":"Proceedings of the Sixth Arabic Natural Language Processing Workshop","author":"Issa Elsayed","year":"2021","unstructured":"Elsayed Issa, Mohammed AlShakhori, Reda Al-Bahrani, and Gus Hahn-Powell. 2021. Country-level arabic dialect identification using RNNs with and without linguistic features. In Proceedings of the Sixth Arabic Natural Language Processing Workshop. 276\u2013281."},{"key":"e_1_3_2_123_2","doi-asserted-by":"crossref","first-page":"596","DOI":"10.1109\/JEEIT.2019.8717386","volume-title":"2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT)","author":"Jardaneh Ghaith","year":"2019","unstructured":"Ghaith Jardaneh, Hamed Abdelhaq, Momen Buzz, and Douglas Johnson. 2019. Classifying arabic tweets based on credibility using content and user features. In 2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT). IEEE, 596\u2013601."},{"key":"e_1_3_2_124_2","doi-asserted-by":"crossref","first-page":"18","DOI":"10.3115\/v1\/W14-3603","volume-title":"Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing (ANLP)","author":"Jarrar Mustafa","year":"2014","unstructured":"Mustafa Jarrar, Nizar Habash, Diyam Akra, and Nasser Zalmout. 2014. Building a corpus for palestinian arabic: A preliminary study. In Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing (ANLP). 18\u201327."},{"key":"e_1_3_2_125_2","doi-asserted-by":"crossref","DOI":"10.4135\/9788132108498","volume-title":"Research Methods for Graduate Business and Social Science Students","author":"John Adams","year":"2007","unstructured":"Adams John. 2007. Research Methods for Graduate Business and Social Science Students. ohn Adams, Hafiz TA Khan, Robert Raeside and David White, 2007."},{"issue":"6","key":"e_1_3_2_126_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3712060","article-title":"Natural language processing for dialects of a language: A survey","volume":"57","author":"Joshi Aditya","year":"2025","unstructured":"Aditya Joshi, Raj Dabre, Diptesh Kanojia, Zhuang Li, Haolan Zhan, Gholamreza Haffari, and Doris Dippold. 2025. Natural language processing for dialects of a language: A survey. Comput. Surveys 57, 6 (2025), 1\u201337.","journal-title":"Comput. Surveys"},{"issue":"5","key":"e_1_3_2_127_2","first-page":"1409","article-title":"Cyber-bullying and cyber-harassment detection using supervised machine learning techniques in arabic social media contents","volume":"21","author":"Kanan Tarek","year":"2020","unstructured":"Tarek Kanan, Amal Aldaaja, and Bilal Hawashin. 2020. Cyber-bullying and cyber-harassment detection using supervised machine learning techniques in arabic social media contents. Journal of Internet Technology 21, 5 (2020), 1409\u20131421.","journal-title":"Journal of Internet Technology"},{"key":"e_1_3_2_128_2","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1016\/j.procs.2017.10.105","article-title":"Soukhria: Towards an irony detection system for arabic in social media","volume":"117","author":"Karoui Jihen","year":"2017","unstructured":"Jihen Karoui, Farah Banamara Zitoune, and Veronique Moriceau. 2017. Soukhria: Towards an irony detection system for arabic in social media. Procedia Computer Science 117 (2017), 161\u2013168.","journal-title":"Procedia Computer Science"},{"key":"e_1_3_2_129_2","volume-title":"Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)","author":"Khalifa Salam","year":"2018","unstructured":"Salam Khalifa, Nizar Habash, Fadhl Eryani, Ossama Obeid, Dana Abdulrahim, and Meera Al Kaabi. 2018. A morphologically annotated corpus of Emirati Arabic. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)."},{"key":"e_1_3_2_130_2","doi-asserted-by":"crossref","first-page":"418","DOI":"10.1007\/978-3-030-69717-4_40","volume-title":"International Conference on Advanced Machine Learning Technologies and Applications","author":"Khalifa Yasmin","year":"2021","unstructured":"Yasmin Khalifa and Ashraf Elnagar. 2021. Sentiment analysis of colloquial arabic tweets with emojis. In International Conference on Advanced Machine Learning Technologies and Applications. Springer, 418\u2013430."},{"key":"e_1_3_2_131_2","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1109\/IDSTA53674.2021.9660811","volume-title":"2021 Second International Conference on Intelligent Data Science Technologies and Applications (IDSTA)","author":"Khalil Ashwaq","year":"2021","unstructured":"Ashwaq Khalil, Moath Jarrah, Monther Aldwairi, and Yaser Jararweh. 2021. Detecting arabic fake news using machine learning. In 2021 Second International Conference on Intelligent Data Science Technologies and Applications (IDSTA). IEEE, 171\u2013177."},{"key":"e_1_3_2_132_2","doi-asserted-by":"publisher","unstructured":"Md Tawkat Islam Khondaker Abdul Waheed El Moatez Billah Nagoudi and Muhammad Abdul-Mageed. 2023. GPTAraEval: A comprehensive evaluation of ChatGPT on arabic NLP. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing Houda Bouamor Juan Pino and Kalika Bali (Eds.). Association for Computational Linguistics Singapore 220\u2013247. DOI:10.18653\/v1\/2023.emnlp-main.16","DOI":"10.18653\/v1\/2023.emnlp-main.16"},{"key":"e_1_3_2_133_2","doi-asserted-by":"crossref","first-page":"8","DOI":"10.18653\/v1\/2020.fever-1.2","volume-title":"Proceedings of the Third Workshop on Fact Extraction and VERification (FEVER)","author":"Khouja Jude","year":"2020","unstructured":"Jude Khouja. 2020. Stance prediction and claim verification: An arabic perspective. In Proceedings of the Third Workshop on Fact Extraction and VERification (FEVER). 8\u201317."},{"key":"e_1_3_2_134_2","unstructured":"Nikita Kitaev Lukasz Kaiser and Anselm Levskaya. 2020. Reformer: The efficient transformer. In 8th International Conference on Learning Representations ICLR 2020 Addis Ababa Ethiopia April 26-30 2020 OpenReview.net. Retrieved from https:\/\/openreview.net\/forum?id=rkgNKkHtvB"},{"issue":"2","key":"e_1_3_2_135_2","doi-asserted-by":"crossref","first-page":"66","DOI":"10.4018\/IJTD.2020040105","article-title":"Ooredoo rayek: A business decision support system based on multi-language sentiment analysis of algerian operator telephones","volume":"11","author":"Klouche Badia","year":"2020","unstructured":"Badia Klouche, Sidi Mohamed Benslimane, and Sakina Rim Bennabi. 2020. Ooredoo rayek: A business decision support system based on multi-language sentiment analysis of algerian operator telephones. International Journal of Technology Diffusion (IJTD) 11, 2 (2020), 66\u201381.","journal-title":"International Journal of Technology Diffusion (IJTD)"},{"key":"e_1_3_2_136_2","doi-asserted-by":"crossref","unstructured":"Anis Koubaa Adel Ammar Lahouari Ghouti Omar Najar and Serry Sibaee. 2024. Arabiangpt: Native arabic gpt-based large language. arXiv preprint arXiv:2402.15313 (2024).","DOI":"10.20944\/preprints202402.1409.v1"},{"key":"e_1_3_2_137_2","doi-asserted-by":"publisher","unstructured":"Wang Ling Chris Dyer Alan W. Black Isabel Trancoso Ram\u00f3n Fermandez Silvio Amir Lu\u00eds Marujo and Tiago Lu\u00eds. 2015. Finding function in form: Compositional character models for open vocabulary word representation. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing Llu\u00eds M\u00e0rquez Chris Callison-Burch and Jian Su (Eds.). Association for Computational Linguistics Lisbon Portugal 1520\u20131530. DOI:10.18653\/v1\/D15-1176","DOI":"10.18653\/v1\/D15-1176"},{"key":"e_1_3_2_138_2","first-page":"2348","volume-title":"LREC","author":"Maamouri Mohamed","year":"2014","unstructured":"Mohamed Maamouri, Ann Bies, Seth Kulick, Michael Ciul, Nizar Habash, and Ramy Eskander. 2014. Developing an egyptian arabic treebank: Impact of dialectal morphology on annotation and tool development.. In LREC. 2348\u20132354."},{"key":"e_1_3_2_139_2","unstructured":"Mohamed Maamouri Ann Bies Seth Kulick Dalila Tabessi and Sondos Krouna. 2012. Egyptian Arabic Treebank DF Parts 1-8 V2. 0-LDC catalog numbers LDC2012E93. LDC2012E98 LDC2012E89 LDC2012E99 LDC2012E107 LDC2012E125 LDC2013E12 LDC2013E21 (2012)."},{"key":"e_1_3_2_140_2","doi-asserted-by":"crossref","first-page":"104266","DOI":"10.1016\/j.rinp.2021.104266","article-title":"Using artificial intelligence techniques for detecting Covid-19 epidemic fake news in moroccan tweets","volume":"25","author":"Madani Youness","year":"2021","unstructured":"Youness Madani, Mohammed Erritali, and Belaid Bouikhalene. 2021. Using artificial intelligence techniques for detecting Covid-19 epidemic fake news in moroccan tweets. Results in Physics 25 (2021), 104266.","journal-title":"Results in Physics"},{"key":"e_1_3_2_141_2","doi-asserted-by":"crossref","first-page":"282","DOI":"10.1007\/978-3-030-00856-7_18","volume-title":"International Conference on Model and Data Engineering","author":"Maghfour Mohcine","year":"2018","unstructured":"Mohcine Maghfour and Abdeljalil Elouardighi. 2018. Standard and dialectal arabic text classification for sentiment analysis. In International Conference on Model and Data Engineering. Springer, 282\u2013291."},{"issue":"6","key":"e_1_3_2_142_2","article-title":"Fake news detection in arabic tweets during the COVID-19 pandemic","volume":"12","author":"Mahlous Ahmed Redha","year":"2021","unstructured":"Ahmed Redha Mahlous and Ali Al-Laith. 2021. Fake news detection in arabic tweets during the COVID-19 pandemic. International Journal of Advanced Computer Science and Applications 12, 6 (2021), 778\u2013788.","journal-title":"International Journal of Advanced Computer Science and Applications"},{"key":"e_1_3_2_143_2","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1007\/978-981-10-0515-2_3","volume-title":"Computational Linguistics: 14th International Conference of the Pacific Association for Computational Linguistics, PACLING 2015, Bali, Indonesia, May 19-21, 2015, Revised Selected Papers","author":"Malmasi Shervin","year":"2016","unstructured":"Shervin Malmasi, Eshrag Refaee, and Mark Dras. 2016. Arabic dialect identification using a parallel multidialectal corpus. In Computational Linguistics: 14th International Conference of the Pacific Association for Computational Linguistics, PACLING 2015, Bali, Indonesia, May 19-21, 2015, Revised Selected Papers. Springer, 35\u201353."},{"key":"e_1_3_2_144_2","unstructured":"Malak Mashaabi Shahad Al-Khalifa and Hend Al-Khalifa. 2024. A survey of large language models for arabic language and its dialects. arXiv preprint arXiv:2410.20238 (2024)."},{"issue":"1","key":"e_1_3_2_145_2","first-page":"129","article-title":"Deep learning for sentiment analysis of tunisian dialect","volume":"25","author":"Masmoudi Abir","year":"2021","unstructured":"Abir Masmoudi, Jamila Hamdi, and Lamia Hadrich Belguith. 2021. Deep learning for sentiment analysis of tunisian dialect. Computaci\u00f3n y Sistemas 25, 1 (2021), 129\u2013148.","journal-title":"Computaci\u00f3n y Sistemas"},{"issue":"2","key":"e_1_3_2_146_2","first-page":"1","article-title":"Transliteration of arabizi into arabic script for tunisian dialect","volume":"19","author":"Masmoudi Abir","year":"2019","unstructured":"Abir Masmoudi, Mariem Ellouze Khmekhem, Mourad Khrouf, and Lamia Hadrich Belguith. 2019. Transliteration of arabizi into arabic script for tunisian dialect. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) 19, 2 (2019), 1\u201321.","journal-title":"ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP)"},{"key":"e_1_3_2_147_2","doi-asserted-by":"crossref","first-page":"55","DOI":"10.18653\/v1\/W17-1307","volume-title":"Third Arabic Natural Language Processing Workshop (WANLP)","author":"Mdhaffar Salima","year":"2017","unstructured":"Salima Mdhaffar, Fethi Bougares, Yannick Esteve, and Lamia Hadrich-Belguith. 2017. Sentiment analysis of tunisian dialects: Linguistic ressources and experiments. In Third Arabic Natural Language Processing Workshop (WANLP). 55\u201361."},{"key":"e_1_3_2_148_2","first-page":"26","volume-title":"Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation","author":"Meftouh Karima","year":"2015","unstructured":"Karima Meftouh, Salima Harrat, Salma Jamoussi, Mourad Abbas, and Kamel Smaili. 2015. Machine translation experiments on PADIC: A parallel Arabic dialect corpus. In Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation. 26\u201334."},{"key":"e_1_3_2_149_2","doi-asserted-by":"crossref","first-page":"234","DOI":"10.18653\/v1\/W19-4628","volume-title":"Proceedings of the Fourth Arabic Natural Language Processing Workshop","author":"Mishra Pruthwik","year":"2019","unstructured":"Pruthwik Mishra and Vandan Mujadia. 2019. Arabic dialect identification for travel and twitter text. In Proceedings of the Fourth Arabic Natural Language Processing Workshop. 234\u2013238."},{"key":"e_1_3_2_150_2","doi-asserted-by":"crossref","unstructured":"Hassoun Al-Jawad MM H. Alharbi AF Almukhtar and AA Alnawas. 2022. Constructing Twitter Corpus of Iraqi Arabic Dialect (CIAD) for Sentiment Analysis. Scientific and Technical Journal of Information Technologies Mechanics and Optics 22 2 (2022) 308\u2013316.","DOI":"10.17586\/2226-1494-2022-22-2-308-316"},{"key":"e_1_3_2_151_2","first-page":"1","volume-title":"2021 International Conference on Recent Advances in Mathematics and Informatics (ICRAMI)","author":"Mohdeb Djamila","year":"2021","unstructured":"Djamila Mohdeb, Meriem Laifa, and Miloud Naidja. 2021. An arabic corpus for Covid-19 related fake news. In 2021 International Conference on Recent Advances in Mathematics and Informatics (ICRAMI). IEEE, 1\u20135."},{"key":"e_1_3_2_152_2","unstructured":"Basel Mousi Nadir Durrani Fatema Ahmad Md. Arid Hasan Maram Hasanain Tameem Kabbani Fahim Dalvi Shammur Absar Chowdhury and Firoj Alam. 2025. AraDiCE: Benchmarks for dialectal and cultural capabilities in LLMs. In Proceedings of the 31st International Conference on Computational Linguistics Owen Rambow Leo Wanner Marianna Apidianaki Hend Al-Khalifa Barbara Di Eugenio and Steven Schockaert (Eds.). Association for Computational Linguistics Abu Dhabi UAE 4186\u20134218. Retrieved from https:\/\/aclanthology.org\/2025.coling-main.283\/"},{"key":"e_1_3_2_153_2","unstructured":"Hamdy Mubarak and Sabit Hassan. 2021. ArCorona: Analyzing arabic tweets in the early days of coronavirus (COVID-19) pandemic. In Proceedings of the 12th International Workshop on Health Text Mining and Information Analysis Eben Holderness Jimeno Yepes Antonio Lavelli Alberto Minard Anne-Lyse James Pustejovsky and Fabio Rinaldi (Eds.). Association for Computational Linguistics online 1\u20136. Retrieved from https:\/\/aclanthology.org\/2021.louhi-1.1\/"},{"issue":"5","key":"e_1_3_2_154_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.5121\/ijaia.2017.8501","article-title":"An enhanced approach for arabic sentiment analysis","volume":"8","author":"Mustafa Haidy H","year":"2017","unstructured":"Haidy H Mustafa, Ammar Mohamed, and Doaa S Elzanfaly. 2017. An enhanced approach for arabic sentiment analysis. International Journal of Artificial Intelligence and Applications (IJAIA) 8, 5 (2017), 1\u201314.","journal-title":"International Journal of Artificial Intelligence and Applications (IJAIA)"},{"key":"e_1_3_2_155_2","doi-asserted-by":"crossref","first-page":"2515","DOI":"10.18653\/v1\/D15-1299","volume-title":"Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing","author":"Nabil Mahmoud","year":"2015","unstructured":"Mahmoud Nabil, Mohamed Aly, and Amir Atiya. 2015. Astd: Arabic sentiment tweets dataset. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. 2515\u20132519."},{"key":"e_1_3_2_156_2","first-page":"6","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (Demonstrations)","author":"Obeid Ossama","year":"2019","unstructured":"Ossama Obeid, Mohammad Salameh, Houda Bouamor, and Nizar Habash. 2019. ADIDA: Automatic dialect identification for arabic. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (Demonstrations). 6\u201311."},{"key":"e_1_3_2_157_2","first-page":"1094","volume-title":"Lrec","author":"Pasha Arfath","year":"2014","unstructured":"Arfath Pasha, Mohamed Al-Badrashiny, Mona T Diab, Ahmed El Kholy, Ramy Eskander, Nizar Habash, Manoj Pooleery, Owen Rambow, and Ryan Roth. 2014. Madamira: A fast, comprehensive tool for morphological analysis and disambiguation of arabic.. In Lrec, Vol. 14. Citeseer, 1094\u20131101."},{"key":"e_1_3_2_158_2","unstructured":"Zhaozhi Qian Faroq Altam Muhammad Alqurishi and Riad Souissi. 2024. Cameleval: Advancing culturally aligned arabic language models and benchmarks. arXiv preprint arXiv:2409.12623 (2024)."},{"key":"e_1_3_2_159_2","doi-asserted-by":"crossref","first-page":"244","DOI":"10.18653\/v1\/W19-4630","volume-title":"Proceedings of the Fourth Arabic Natural Language Processing Workshop","author":"Ragab Ahmad","year":"2019","unstructured":"Ahmad Ragab, Haitham Seelawi, Mostafa Samir, Abdelrahman Mattar, Hesham Al-Bataineh, Mohammad Zaghloul, Ahmad Mustafa, Bashar Talafha, Abed Alhakim Freihat, and Hussein Al-Natsheh. 2019. Mawdoo3 ai at madar shared task: Arabic fine-grained dialect identification with ensemble learning. In Proceedings of the Fourth Arabic Natural Language Processing Workshop. 244\u2013248."},{"key":"e_1_3_2_160_2","article-title":"SANA: Sentiment analysis on newspapers comments in algeria","author":"Rahab Hichem","year":"2019","unstructured":"Hichem Rahab, Abdelhafid Zitouni, and Mahieddine Djoudi. 2019. SANA: Sentiment analysis on newspapers comments in algeria. Journal of King Saud University-Computer and Information Sciences (2019).","journal-title":"Journal of King Saud University-Computer and Information Sciences"},{"key":"e_1_3_2_161_2","unstructured":"Tharindu Ranasinghe Hadeel Saadany Alistair Plum Salim Mandhari Emad Mohamed Constantin Orasan and Ruslan Mitkov. 2019. RGCL at IDAT: Deep learning models for irony detection in arabic language. (2019) 416\u2013425."},{"key":"e_1_3_2_162_2","first-page":"2268","volume-title":"LREC","author":"Refaee Eshrag","year":"2014","unstructured":"Eshrag Refaee and Verena Rieser. 2014. An arabic twitter corpus for subjectivity and sentiment analysis. In LREC. 2268\u20132273."},{"key":"e_1_3_2_163_2","first-page":"1","volume-title":"2022 International Conference on Advanced Aspects of Software Engineering (ICAASE)","author":"Righi Mohammed El Manar","year":"2022","unstructured":"Mohammed El Manar Righi, Djallel Eddine Boussahel, Djamila Mohdeb, Meriem Laifa, and Messaoud Bendiaf. 2022. Rumor stance classification: A case study on the propagation of political rumors on the algerian online social space. In 2022 International Conference on Advanced Aspects of Software Engineering (ICAASE). IEEE, 1\u20136."},{"key":"e_1_3_2_164_2","doi-asserted-by":"crossref","first-page":"502","DOI":"10.18653\/v1\/S17-2088","volume-title":"Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017)","author":"Rosenthal Sara","year":"2017","unstructured":"Sara Rosenthal, Noura Farra, and Preslav Nakov. 2017. SemEval-2017 task 4: Sentiment analysis in twitter. In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017). 502\u2013518."},{"issue":"4","key":"e_1_3_2_165_2","doi-asserted-by":"crossref","first-page":"e12275","DOI":"10.1111\/lnc3.12275","article-title":"A survey on author profiling, deception, and irony detection for the arabic language","volume":"12","author":"Rosso Paolo","year":"2018","unstructured":"Paolo Rosso, Francisco Rangel, Irazu Hern\u00e1ndez Far\u00edas, Leticia Cagnina, Wajdi Zaghouani, and Anis Charfi. 2018. A survey on author profiling, deception, and irony detection for the arabic language. Language and Linguistics Compass 12, 4 (2018), e12275.","journal-title":"Language and Linguistics Compass"},{"key":"e_1_3_2_166_2","doi-asserted-by":"crossref","first-page":"69","DOI":"10.18653\/v1\/W15-3208","volume-title":"Proceedings of the Second Workshop on Arabic Natural Language Processing","author":"Saadane Houda","year":"2015","unstructured":"Houda Saadane and Nizar Habash. 2015. A conventional orthography for algerian arabic. In Proceedings of the Second Workshop on Arabic Natural Language Processing. 69\u201379."},{"key":"e_1_3_2_167_2","first-page":"70","volume-title":"Proceedings of the 3rd International Workshop on Rumours and Deception in Social Media (RDSM)","author":"Saadany Hadeel","year":"2020","unstructured":"Hadeel Saadany, Constantin Or\u01cesan, and Emad Mohamed. 2020. Fake or real? A study of arabic satirical fake news. In Proceedings of the 3rd International Workshop on Rumours and Deception in Social Media (RDSM). 70\u201380."},{"key":"e_1_3_2_168_2","doi-asserted-by":"crossref","first-page":"22","DOI":"10.3115\/v1\/W14-5904","volume-title":"Proceedings of the Second Workshop on Natural Language Processing for Social Media (SocialNLP)","author":"Sadat Fatiha","year":"2014","unstructured":"Fatiha Sadat, Farzindar Kazemi, and Atefeh Farzindar. 2014. Automatic identification of arabic language varieties and dialects in social media. In Proceedings of the Second Workshop on Natural Language Processing for Social Media (SocialNLP). 22\u201327."},{"issue":"1","key":"e_1_3_2_169_2","doi-asserted-by":"crossref","first-page":"1407","DOI":"10.1016\/j.jksuci.2019.10.002","article-title":"An ensemble approach for spam detection in arabic opinion texts","volume":"34","author":"Saeed Radwa MK","year":"2022","unstructured":"Radwa MK Saeed, Sherine Rady, and Tarek F Gharib. 2022. An ensemble approach for spam detection in arabic opinion texts. Journal of King Saud University-Computer and Information Sciences 34, 1 (2022), 1407\u20131416.","journal-title":"Journal of King Saud University-Computer and Information Sciences"},{"key":"e_1_3_2_170_2","doi-asserted-by":"publisher","unstructured":"Ali Safaya Moutasem Abdullatif and Deniz Yuret. 2020. KUISAIL at SemEval-2020 Task 12: BERT-CNN for Offensive Speech Identification in Social Media. In Proceedings of the Fourteenth Workshop on Semantic Evaluation International Committee for Computational Linguistics Barcelona (online) Aurelie Herbelot Xiaodan Zhu Alexis Palmer Nathan Schneider Jonathan May and Ekaterina Shutova (Eds.). 2054\u20132059. DOI:10.18653\/v1\/2020.semeval-1.271","DOI":"10.18653\/v1\/2020.semeval-1.271"},{"key":"e_1_3_2_171_2","doi-asserted-by":"crossref","first-page":"5094","DOI":"10.18653\/v1\/2020.coling-main.447","volume-title":"Proceedings of the 28th International Conference on Computational Linguistics","author":"Sajjad Hassan","year":"2020","unstructured":"Hassan Sajjad, Ahmed Abdelali, Nadir Durrani, and Fahim Dalvi. 2020. Arabench: Benchmarking dialectal arabic-english machine translation. In Proceedings of the 28th International Conference on Computational Linguistics. 5094\u20135107."},{"key":"e_1_3_2_172_2","first-page":"1332","volume-title":"Proceedings of the 27th International Conference on Computational Linguistics","author":"Salameh Mohammad","year":"2018","unstructured":"Mohammad Salameh, Houda Bouamor, and Nizar Habash. 2018. Fine-grained arabic dialect identification. In Proceedings of the 27th International Conference on Computational Linguistics. 1332\u20131344."},{"key":"e_1_3_2_173_2","doi-asserted-by":"crossref","first-page":"432","DOI":"10.18653\/v1\/K17-1043","volume-title":"Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017)","author":"Samih Younes","year":"2017","unstructured":"Younes Samih, Mohamed Eldesouki, Mohammed Attia, Kareem Darwish, Ahmed Abdelali, Hamdy Mubarak, and Laura Kallmeyer. 2017. Learning from relatives: Unified dialectal arabic segmentation. In Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017). 432\u2013441."},{"key":"e_1_3_2_174_2","first-page":"1","volume-title":"2021 IEEE 15th International Conference on Application of Information and Communication Technologies (AICT)","author":"Sawan Aktham","year":"2021","unstructured":"Aktham Sawan, Thaer Thaher, et\u00a0al. 2021. Sentiment analysis model for fake news identification in arabic tweets. In 2021 IEEE 15th International Conference on Application of Information and Communication Technologies (AICT). IEEE, 1\u20136."},{"key":"e_1_3_2_175_2","first-page":"173","volume-title":"Proceedings of the Sixth Arabic Natural Language Processing Workshop","author":"Seelawi Haitham","year":"2021","unstructured":"Haitham Seelawi, Ibraheem Tuffaha, Mahmoud Gzawi, Wael Farhan, Bashar Talafha, Riham Badawi, Zyad Sober, Oday Al-Dweik, Abed Alhakim Freihat, and Hussein Al-Natsheh. 2021. ALUE: Arabic language understanding evaluation. In Proceedings of the Sixth Arabic Natural Language Processing Workshop. 173\u2013184."},{"key":"e_1_3_2_176_2","doi-asserted-by":"publisher","unstructured":"Fatima Shannag Bassam H. Hammo and Hossam Faris. 2022. The design construction and evaluation of annotated Arabic cyberbullying corpus. Education and Information Technologies 27 8 (September 2022) 10977\u201311023. DOI:10.1007\/s10639-022-11056-x","DOI":"10.1007\/s10639-022-11056-x"},{"key":"e_1_3_2_177_2","doi-asserted-by":"crossref","first-page":"71951","DOI":"10.1109\/ACCESS.2022.3185083","article-title":"JointBert for detecting arabic fake news","volume":"10","author":"Shishah Wesam","year":"2022","unstructured":"Wesam Shishah. 2022. JointBert for detecting arabic fake news. IEEE Access 10 (2022), 71951\u201371960.","journal-title":"IEEE Access"},{"key":"e_1_3_2_178_2","first-page":"264","volume-title":"International Conference on Arabic Language Processing","author":"Tachicart Ridouane","year":"2019","unstructured":"Ridouane Tachicart and Karim Bouzoubaa. 2019. Towards automatic normalization of the moroccan dialectal arabic user generated text. In International Conference on Arabic Language Processing. Springer, 264\u2013275."},{"key":"e_1_3_2_179_2","first-page":"216","volume-title":"5th International Conference on Arabic Language Processing","author":"Tachicart Ridouane","year":"2014","unstructured":"Ridouane Tachicart, Karim Bouzoubaa, and Hamid Jaafar. 2014. Building a moroccan dialect electronic dictionary (MDED). In 5th International Conference on Arabic Language Processing. 216\u2013221."},{"key":"e_1_3_2_180_2","unstructured":"Bashar Talafha Mohammad Ali Muhy Eddin Za\u2019ter Haitham Seelawi Ibraheem Tuffaha Mostafa Samir Wael Farhan and Hussein Al-Natsheh. 2020. Multi-dialect Arabic BERT for Country-level Dialect Identification. In Proceedings of the Fifth Arabic Natural Language Processing Workshop Imed Zitouni Muhammad Abdul-Mageed Houda Bouamor Fethi Bougares Mahmoud El-Haj Nadi Tomeh and Wajdi Zaghouani (Eds.). Association for Computational Linguistics Barcelona Spain (Online) 111\u2013118. Retrieved from https:\/\/aclanthology.org\/2020.wanlp-1.10\/"},{"key":"e_1_3_2_181_2","first-page":"313","volume-title":"Proceedings of the Fifth Arabic Natural Language Processing Workshop","author":"Touileb Samia","year":"2020","unstructured":"Samia Touileb. 2020. LTG-ST at NADI shared task 1: Arabic dialect identification using a stacking classifier. In Proceedings of the Fifth Arabic Natural Language Processing Workshop. 313\u2013319."},{"key":"e_1_3_2_182_2","first-page":"95","volume-title":"Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing","author":"Touileb Samia","year":"2022","unstructured":"Samia Touileb. 2022. NERDz: A preliminary dataset of named entities for algerian. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing. 95\u2013101."},{"key":"e_1_3_2_183_2","doi-asserted-by":"publisher","unstructured":"Samia Touileb and Jeremy Barnes. 2021. The interplay between language similarity and script on a novel multi-layer Algerian dialect corpus. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 Chengqing Zong Fei Xia Wenjie Li and Roberto Navigli (Eds.). Association for Computational Linguistics Online 3700\u20133712. DOI:10.18653\/v1\/2021.findings-acl.324","DOI":"10.18653\/v1\/2021.findings-acl.324"},{"key":"e_1_3_2_184_2","volume-title":"Proceedings of the International Conference on Language Resources and Evaluation (LREC), Portoroz, Slovenia","author":"Turki Houcemeddine","year":"2016","unstructured":"Houcemeddine Turki, Emad Adel, Tariq Daouda, and Nassim Regragui. 2016. A conventional orthography for maghrebi arabic. In Proceedings of the International Conference on Language Resources and Evaluation (LREC), Portoroz, Slovenia."},{"key":"e_1_3_2_185_2","unstructured":"Anshul Wadhawan. 2021. Dialect identification in nuanced arabic Tweets using farasa segmentation and AraBERT. In Proceedings of the Sixth Arabic Natural Language Processing Workshop Association for Computational Linguistics Nizar Habash Houda Bouamor Hazem Hajj Walid Magdy Wajdi Zaghouani Fethi Bougares Nadi Tomeh Ibrahim Abu Farha and Samia Touileb (Eds.). Kyiv Ukraine (Virtual) 291\u2013295. Retrieved from https:\/\/aclanthology.org\/2021.wanlp-1.35\/"},{"issue":"2","key":"e_1_3_2_186_2","doi-asserted-by":"crossref","first-page":"1545","DOI":"10.1007\/s00500-023-08384-6","article-title":"A systematic review of arabic text classification: Areas, applications, and future directions","volume":"28","author":"Wahdan Ahlam","year":"2024","unstructured":"Ahlam Wahdan, Mostafa Al-Emran, and Khaled Shaalan. 2024. A systematic review of arabic text classification: Areas, applications, and future directions. Soft Computing 28, 2 (2024), 1545\u20131566.","journal-title":"Soft Computing"},{"key":"e_1_3_2_187_2","first-page":"3","volume-title":"International Conference on Web Engineering","author":"Younes Jihen","year":"2015","unstructured":"Jihen Younes, Hadhemi Achour, and Emna Souissi. 2015. Constructing linguistic resources for the tunisian dialect using textual user-generated contents on the social web. In International Conference on Web Engineering. Springer, 3\u201314."},{"key":"e_1_3_2_188_2","article-title":"Romanized tunisian dialect transliteration using sequence labelling techniques","author":"Younes Jihene","year":"2020","unstructured":"Jihene Younes, Hadhemi Achour, Emna Souissi, and Ahmed Ferchichi. 2020. Romanized tunisian dialect transliteration using sequence labelling techniques. Journal of King Saud University-Computer and Information Sciences 34, 3 (2020), 982\u2013992.","journal-title":"Journal of King Saud University-Computer and Information Sciences"},{"key":"e_1_3_2_189_2","volume-title":"Proceedings of the 2nd International Conference on Arabic Computational Linguistics","author":"Younes Jihene","year":"2016","unstructured":"Jihene Younes, Emna Souissi, and Hadhemi Achour. 2016. A hidden markov model for the automatic transliteration of romanized tunisian dialect. In Proceedings of the 2nd International Conference on Arabic Computational Linguistics."},{"issue":"1","key":"e_1_3_2_190_2","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1162\/COLI_a_00169","article-title":"Arabic dialect identification","volume":"40","author":"Zaidan Omar F","year":"2014","unstructured":"Omar F Zaidan and Chris Callison-Burch. 2014. Arabic dialect identification. Computational Linguistics 40, 1 (2014), 171\u2013202.","journal-title":"Computational Linguistics"},{"key":"e_1_3_2_191_2","unstructured":"Randa Zarnoufi Hamid Jaafar Walid Bachri and Mounia Abik. 2020. MANorm: A normalization dictionary for Moroccan Arabic Dialect written in Latin script. In Proceedings of the Fifth Arabic Natural Language Processing Workshop Imed Zitouni Muhammad Abdul-Mageed Houda Bouamor Fethi Bougares Mahmoud El-Haj Nadi Tomeh and Wajdi Zaghouani (Eds.). Association for Computational Linguistics Barcelona Spain (Online) 155\u2013166. Retrieved from https:\/\/aclanthology.org\/2020.wanlp-1.14\/"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3747290","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,21]],"date-time":"2025-08-21T13:27:28Z","timestamp":1755782848000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3747290"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,21]]},"references-count":190,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2025,8,31]]}},"alternative-id":["10.1145\/3747290"],"URL":"https:\/\/doi.org\/10.1145\/3747290","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,8,21]]},"assertion":[{"value":"2024-02-04","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-06-28","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-08-21","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}