{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,10]],"date-time":"2025-12-10T14:28:18Z","timestamp":1765376898945,"version":"3.46.0"},"reference-count":97,"publisher":"Association for Computing Machinery (ACM)","issue":"12","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2025,12,31]]},"abstract":"<jats:p>\n                    Multimodal sentiment analysis and emotion recognition have attracted significant interest in multimodal learning. Naturally, humans express their feelings and emotions through nuanced expressions across various verbal and non-verbal modalities. Despite this, there remains a critical gap in publicly accessible multimodal datasets for the Arabic language. To address this issue, we posited that creating a large and high-quality Arabic multimodal dataset would significantly improve sentiment analysis and emotion recognition in Arabic contexts. We aimed to develop a large, high-quality Arabic Multimodal Sentiment Analysis and Emotion Recognition (A\n                    <jats:sc>md<\/jats:sc>\n                    \u2019S\n                    <jats:sc>a<\/jats:sc>\n                    E\n                    <jats:sc>r<\/jats:sc>\n                    ) dataset by building upon our AMSA dataset, increasing its size to 1,037 samples, and adding emotional labels. Leveraging a novel methodology, we carefully selected and annotated data across audio, text, and visual modalities, and proposed a hybrid inter-annotator agreement strategy. Extensive analyses were conducted to validate the robustness of the dataset. We experimented with the A\n                    <jats:sc>md<\/jats:sc>\n                    \u2019S\n                    <jats:sc>a<\/jats:sc>\n                    E\n                    <jats:sc>r<\/jats:sc>\n                    dataset using a customized MERBench framework, which demonstrated the dataset\u2019s efficacy and reliability. Our findings indicate the high quality of the dataset and underscore the importance of multimodal context for accurate sentiment analysis and emotion recognition in Arabic. We recommend further research and application of the A\n                    <jats:sc>md<\/jats:sc>\n                    \u2019S\n                    <jats:sc>a<\/jats:sc>\n                    E\n                    <jats:sc>r<\/jats:sc>\n                    dataset in broader Arabic contexts, as it provides a valuable resource for advancing multimodal analysis in this language.\n                  <\/jats:p>","DOI":"10.1145\/3774880","type":"journal-article","created":{"date-parts":[[2025,11,5]],"date-time":"2025-11-05T11:45:18Z","timestamp":1762343118000},"page":"1-28","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Amd'SaEr: Arabic Multimodal Dataset for Sentiment Analysis and Emotion Recognition"],"prefix":"10.1145","volume":"24","author":[{"ORCID":"https:\/\/orcid.org\/0009-0008-0820-5381","authenticated-orcid":false,"given":"Abdelhamid","family":"Haouhat","sequence":"first","affiliation":[{"name":"University of Amar Telidji Laghouat","place":["Laghouat, Algeria"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8357-5501","authenticated-orcid":false,"given":"Slimane","family":"Bellaouar","sequence":"additional","affiliation":[{"name":"Mathematics and Computer Science, Universit\u00e8 de Ghardaia","place":["Ghardaia, Algeria"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3245-6913","authenticated-orcid":false,"given":"Attia","family":"Nehar","sequence":"additional","affiliation":[{"name":"Ziane Achour University of Djelfa","place":["Djelfa, Algeria"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5117-0320","authenticated-orcid":false,"given":"Hadda","family":"Cherroun","sequence":"additional","affiliation":[{"name":"University of Amar Telidji Laghouat","place":["Laghouat, Algeria"]}]}],"member":"320","published-online":{"date-parts":[[2025,12,10]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-58130-9_6"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2021.06.003"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2017.07.004"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.551"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2023.103005"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.heliyon.2024.e39786"},{"issue":"1","key":"e_1_3_2_8_2","doi-asserted-by":"crossref","first-page":"389","DOI":"10.18280\/isi.290138","article-title":"Sentiment analysis methods for Arabic content on social media: A systematic review","volume":"29","author":"AlMotairi Reem K","year":"2024","unstructured":"Reem K AlMotairi and Mohammed Hadwan. 2024. Sentiment analysis methods for Arabic content on social media: A systematic review. Ing\u00e9nierie des Syst\u00e8mes d\u2019Information 29, 1 (Feb.2024), 389\u2013396.","journal-title":"Ing\u00e9nierie des Syst\u00e8mes d\u2019Information"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-10-6571-2_290"},{"key":"e_1_3_2_10_2","first-page":"9","volume-title":"Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection","author":"Antoun Wissam","year":"2020","unstructured":"Wissam Antoun, Fady Baly, and Hazem Hajj. 2020. AraBERT: Transformer-based model for arabic language understanding. In Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection, Hend Al-Khalifa, Walid Magdy, Kareem Darwish, Tamer Elsayed, and Hamdy Mubarak (Eds.). European Language Resource Association, Marseille, France, 9\u201315. Retrieved from https:\/\/aclanthology.org\/2020.osact-1.2\/"},{"key":"e_1_3_2_11_2","first-page":"555","volume-title":"Proceedings of the Handbook of Linguistic Annotation","author":"Artstein Ron","year":"2008","unstructured":"Ron Artstein and Massimo Poesio. 2008. Inter-annotator agreement: The kappa statistic. In Proceedings of the Handbook of Linguistic Annotation. Oxford University Press, 555\u2013596."},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1145\/3295662"},{"key":"e_1_3_2_13_2","series-title":"NIPS \u201920","volume-title":"Proceedings of the 34th International Conference on Neural Information Processing Systems","author":"Baevski Alexei","year":"2020","unstructured":"Alexei Baevski, Henry Zhou, Abdelrahman Mohamed, and Michael Auli. 2020. wav2vec 2.0: A framework for self-supervised learning of speech representations. In Proceedings of the 34th International Conference on Neural Information Processing Systems (Vancouver, BC, Canada) (NIPS \u201920). Curran Associates Inc., Red Hook, NY, USA, Article 1044, 12 pages."},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1208"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2018.2798607"},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/WACV.2016.7477553"},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10579-008-9076-6"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2007.905145"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D17-1105"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.189"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1023\/A:1022627411411"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.1967.1053964"},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1145\/3586075"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-1423"},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-021-11759-2"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007413511361"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2015.7353446"},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1080\/02699939208411068"},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.1146\/annurev-psych-032720-035940"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/WF-IoT54382.2022.10152117"},{"key":"e_1_3_2_31_2","unstructured":"Explosion AI. 2017\u2013present. Prodigy: A modern annotation tool for AI Machine Learning & NLP. Retrieved from https:\/\/prodi.gy"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1109\/TAFFC.2015.2457417"},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1145\/1873951.1874246"},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.1037\/h0031619"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.14711\/thesis-991012662968603412"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1145\/965400.965451"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.1155\/2020\/7403128"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/WACV45572.2020.9093414"},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10710-017-9314-z"},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-02139-8"},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.723"},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/IDSTA58916.2023.10317847"},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.arabicnlp-1.48"},{"key":"e_1_3_2_44_2","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3413678"},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2017.7952132"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2021.3122291"},{"key":"e_1_3_2_48_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00745"},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2005.12.126"},{"key":"e_1_3_2_50_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2006.10.019"},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-55190-2_1"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1145\/2522848.2531745"},{"key":"e_1_3_2_53_2","doi-asserted-by":"publisher","DOI":"10.21437\/Eurospeech.2001-354"},{"key":"e_1_3_2_54_2","doi-asserted-by":"publisher","DOI":"10.1145\/3581783.3612836"},{"key":"e_1_3_2_55_2","unstructured":"Zheng Lian Licai Sun Yong Ren Hao Gu Haiyang Sun Lan Chen Bin Liu and Jianhua Tao. 2024. MERBench: A Unified Evaluation Benchmark for Multimodal Emotion Recognition. arxiv:2401.03429. Retrieved from https:\/\/arxiv.org\/abs\/2401.03429"},{"key":"e_1_3_2_56_2","article-title":"Sentiment analysis and opinion mining","volume":"168","author":"Liu Bing","year":"2012","unstructured":"Bing Liu. 2012. Sentiment analysis and opinion mining. Morgan & Claypool Publishers 168 (2012).","journal-title":"Morgan & Claypool Publishers"},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1209"},{"key":"e_1_3_2_58_2","unstructured":"Annette Markham Elizabeth Buchanan with feedback from the AOIR Ethics Working Committee et\u00a0al. 2012. Ethical decision-making and internet research: Recommendations from the AOIR ethics working committee. Version 2.0 https:\/\/www.aoir.org\/reports\/ethics2.pdf"},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.naacl-tutorials.5"},{"key":"e_1_3_2_60_2","doi-asserted-by":"publisher","DOI":"10.1145\/2070481.2070509"},{"key":"e_1_3_2_61_2","doi-asserted-by":"publisher","DOI":"10.18178\/joig.6.1.39-43"},{"key":"e_1_3_2_62_2","unstructured":"O. Najar S. Sibaee L. Ghouti and A. Koubaa. 2023. Aranizer 0.1.8. Retrieved July 4 2023 from https:\/\/pypi.org\/project\/aranizer"},{"key":"e_1_3_2_63_2","first-page":"689","volume-title":"Proceedings of the 28th International Conference on Machine Learning (ICML-11)","author":"Ngiam Jiquan","year":"2011","unstructured":"Jiquan Ngiam, Aditya Khosla, Mingyu Kim, Juhan Nam, Honglak Lee, and Andrew Y Ng. 2011. Multimodal deep learning. In Proceedings of the 28th International Conference on Machine Learning (ICML-11). 689\u2013696."},{"key":"e_1_3_2_64_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2020.05.034"},{"key":"e_1_3_2_65_2","doi-asserted-by":"publisher","DOI":"10.3115\/1218955.1218990"},{"key":"e_1_3_2_66_2","first-page":"973","volume-title":"Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)","author":"P\u00e9rez-Rosas Ver\u00f3nica","year":"2013","unstructured":"Ver\u00f3nica P\u00e9rez-Rosas, Rada Mihalcea, and Louis-Philippe Morency. 2013. Utterance-level multimodal sentiment analysis. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Hinrich Schuetze, Pascale Fung, and Massimo Poesio (Eds.). Association for Computational Linguistics, Sofia, Bulgaria, 973\u2013982. Retrieved from https:\/\/aclanthology.org\/P13-1096"},{"key":"e_1_3_2_67_2","unstructured":"Paul Pokotylo. 2025. Measuring Inter-Annotator Agreement: Building Trustworthy Datasets. Retrieved April 1 2025 from https:\/\/keymakr.com\/blog\/measuring-inter-annotator-agreement-building-trustworthy-datasets\/"},{"key":"e_1_3_2_68_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2015.01.095"},{"key":"e_1_3_2_69_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1050"},{"key":"e_1_3_2_70_2","doi-asserted-by":"publisher","DOI":"10.1023\/A:1022643204877"},{"key":"e_1_3_2_71_2","doi-asserted-by":"publisher","DOI":"10.1109\/5.18626"},{"key":"e_1_3_2_72_2","doi-asserted-by":"publisher","DOI":"10.1162\/coli.2008.34.3.319"},{"key":"e_1_3_2_73_2","volume-title":"Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC\u201806)","author":"Reidsma Dennis","year":"2006","unstructured":"Dennis Reidsma, Dirk Heylen, and Roeland Ordelman. 2006. Annotating emotions in meetings. In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC\u201806), Nicoletta Calzolari, Khalid Choukri, Aldo Gangemi, Bente Maegaard, Joseph Mariani, Jan Odijk, and Daniel Tapias (Eds.). European Language Resources Association (ELRA), Genoa, Italy. Retrieved from https:\/\/aclanthology.org\/L06-1384\/"},{"key":"e_1_3_2_74_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-022-07451-7"},{"key":"e_1_3_2_75_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICIEV.2014.6850685"},{"key":"e_1_3_2_76_2","doi-asserted-by":"publisher","unstructured":"Bj\u00f6rn Schuller Stefan Steidl and Anton Batliner. 2009. The interspeech 2009 emotion challenge. DOI: 10.21437\/Interspeech.2009-103","DOI":"10.21437\/Interspeech.2009-103"},{"key":"e_1_3_2_77_2","doi-asserted-by":"publisher","DOI":"10.5120\/20790-3435"},{"key":"e_1_3_2_78_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2017.08.003"},{"key":"e_1_3_2_79_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-020-10037-x"},{"key":"e_1_3_2_80_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1656"},{"key":"e_1_3_2_81_2","unstructured":"Yao-Hung Hubert Tsai Paul Pu Liang Amir Zadeh Louis-Philippe Morency and Ruslan Salakhutdinov. 2019. Learning Factorized Multimodal Representations. arxiv:1806.06176. Retrieved from https:\/\/arxiv.org\/abs\/1806.06176"},{"key":"e_1_3_2_82_2","doi-asserted-by":"publisher","DOI":"10.5555\/3295222.3295349"},{"key":"e_1_3_2_83_2","article-title":"Attention is all you need","volume":"30","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30 (2017), 6000\u20136010.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_84_2","article-title":"A theoretical foundation for inductive transfer","volume":"19","author":"Ventura Dan","year":"2007","unstructured":"Dan Ventura and Sean Warnick. 2007. A theoretical foundation for inductive transfer. Brigham Young University, College of Physical and Mathematical Sciences 19, 8 (2007).","journal-title":"Brigham Young University, College of Physical and Mathematical Sciences"},{"issue":"6","key":"e_1_3_2_85_2","first-page":"282","article-title":"Sentiment analysis and opinion mining: A survey","volume":"2","author":"Vinodhini G","year":"2012","unstructured":"G Vinodhini and RM Chandrasekaran. 2012. Sentiment analysis and opinion mining: A survey. International Journal of Advanced Research in Computer Science and Software Engineering 2, 6 (2012), 282\u2013292.","journal-title":"International Journal of Advanced Research in Computer Science and Software Engineering"},{"key":"e_1_3_2_86_2","volume-title":"Data Mining, Fourth Edition: Practical Machine Learning Tools and Techniques (4th ed.)","author":"Witten Ian H.","year":"2016","unstructured":"Ian H. Witten, Eibe Frank, Mark A. Hall, and Christopher J. Pal. 2016. Data Mining, Fourth Edition: Practical Machine Learning Tools and Techniques (4th ed.). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA."},{"key":"e_1_3_2_87_2","doi-asserted-by":"publisher","DOI":"10.1109\/JSTSP.2010.2057200"},{"key":"e_1_3_2_88_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2023.3275156"},{"key":"e_1_3_2_89_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-018-1236-4"},{"key":"e_1_3_2_90_2","doi-asserted-by":"publisher","DOI":"10.1145\/2818346.2823317"},{"key":"e_1_3_2_91_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D17-1115"},{"key":"e_1_3_2_92_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.12021"},{"key":"e_1_3_2_93_2","unstructured":"Amir Zadeh Rowan Zellers Eli Pincus and Louis-Philippe Morency. 2016. Mosi: Multimodal corpus of sentiment intensity and subjectivity analysis in online opinion videos. arXiv:1606.06259. Retrieved from https:\/\/arxiv.org\/abs\/1606.06259 (2016)."},{"key":"e_1_3_2_94_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1208"},{"key":"e_1_3_2_95_2","doi-asserted-by":"publisher","DOI":"10.1145\/3664647.3680777"},{"key":"e_1_3_2_96_2","doi-asserted-by":"publisher","DOI":"10.3390\/s24123714"},{"key":"e_1_3_2_97_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3093397"},{"key":"e_1_3_2_98_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10676-010-9227-5"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3774880","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,10]],"date-time":"2025-12-10T14:25:23Z","timestamp":1765376723000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3774880"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,12,10]]},"references-count":97,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2025,12,31]]}},"alternative-id":["10.1145\/3774880"],"URL":"https:\/\/doi.org\/10.1145\/3774880","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"type":"print","value":"2375-4699"},{"type":"electronic","value":"2375-4702"}],"subject":[],"published":{"date-parts":[[2025,12,10]]},"assertion":[{"value":"2024-08-21","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-11-01","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-12-10","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}