{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,19]],"date-time":"2025-11-19T09:38:14Z","timestamp":1763545094338,"version":"3.41.0"},"reference-count":38,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2021,12,24]],"date-time":"2021-12-24T00:00:00Z","timestamp":1640304000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Deanship of Research at the Jordan University of Science and Technology","award":["#20180193"],"award-info":[{"award-number":["#20180193"]}]},{"name":"NVIDIA Corporation"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2022,1,31]]},"abstract":"<jats:p>\n            In this work, we present several deep learning models for the automatic diacritization of Arabic text. Our models are built using two main approaches, viz. Feed-Forward Neural Network (FFNN) and Recurrent Neural Network (RNN), with several enhancements such as 100-hot encoding, embeddings, Conditional Random Field (CRF), and Block-Normalized Gradient (BNG). The models are tested on the only freely available benchmark dataset and the results show that our models are either better or on par with other models even those requiring human-crafted language-dependent post-processing steps, unlike ours. Moreover, we show how diacritics in Arabic can be used to enhance the models of downstream NLP tasks such as Machine Translation (MT) and Sentiment Analysis (SA) by proposing novel\n            <jats:italic>Translation over Diacritization<\/jats:italic>\n            (ToD) and\n            <jats:italic>Sentiment over Diacritization<\/jats:italic>\n            (SoD) approaches.\n          <\/jats:p>","DOI":"10.1145\/3470849","type":"journal-article","created":{"date-parts":[[2021,12,24]],"date-time":"2021-12-24T09:44:18Z","timestamp":1640339058000},"page":"1-25","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Neural Arabic Text Diacritization: State-of-the-Art Results and a Novel Approach for Arabic NLP Downstream Tasks"],"prefix":"10.1145","volume":"21","author":[{"given":"Ali","family":"Fadel","sequence":"first","affiliation":[{"name":"Jordan University of Science and Technology, Irbid, Jordan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ibraheem","family":"Tuffaha","sequence":"additional","affiliation":[{"name":"Jordan University of Science and Technology, Irbid, Jordan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9372-9076","authenticated-orcid":false,"given":"Mahmoud","family":"Al-Ayyoub","sequence":"additional","affiliation":[{"name":"Jordan University of Science and Technology, Irbid, Jordan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,12,24]]},"reference":[{"issue":"2","key":"e_1_3_3_2_2","first-page":"103","article-title":"Accurate and fast recurrent neural network solution for the automatic diacritization of Arabic text","volume":"6","author":"Abandah Gheith","year":"2020","unstructured":"Gheith Abandah and Asma Abdel-Karim. 2020. Accurate and fast recurrent neural network solution for the automatic diacritization of Arabic text. Jordanian Journal of Computers and Information Technology 6, 2 (2020), 103\u2013121.","journal-title":"Jordanian Journal of Computers and Information Technology"},{"key":"e_1_3_3_3_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10032-015-0242-2"},{"key":"e_1_3_3_4_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jksuci.2020.12.002"},{"key":"e_1_3_3_5_2","doi-asserted-by":"publisher","DOI":"10.1007\/s13042-018-0799-4"},{"key":"e_1_3_3_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.3018885"},{"key":"e_1_3_3_7_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2017.10.106"},{"key":"e_1_3_3_8_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W19-4606"},{"key":"e_1_3_3_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/AEECT.2017.8257765"},{"key":"e_1_3_3_10_2","article-title":"Optimizing performance of recurrent neural networks on gpus","author":"Appleyard Jeremy","year":"2016","unstructured":"Jeremy Appleyard, Tomas Kocisky, and Phil Blunsom. 2016. Optimizing performance of recurrent neural networks on gpus. arXiv:1604.01946 (2016). Retrieved from https:\/\/arxiv.org\/abs\/1604.01946.","journal-title":"arXiv:1604.01946"},{"key":"e_1_3_3_11_2","doi-asserted-by":"publisher","DOI":"10.1017\/S1351324913000284"},{"key":"e_1_3_3_12_2","unstructured":"Dzmitry Bahdanau Kyunghyun Cho and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. In Proceedings of the 3rd International Conference on Learning Representations (ICLR\u201915) San Diego CA USA . http:\/\/arxiv.org\/abs\/1409.0473"},{"key":"e_1_3_3_13_2","unstructured":"Ahmad Barqawi and Taha Zerrouki. 2017. Shakkala Arabic Text Vocalization. Retrieved February 20 2021 from https:\/\/github.com\/Barqawiz\/Shakkala."},{"key":"e_1_3_3_14_2","article-title":"Hybrid approaches for automatic vowelization of Arabic texts","author":"Bebah Mohamed","year":"2014","unstructured":"Mohamed Bebah, Chennoufi Amine, Mazroui Azzeddine, and Lakhouaja Abdelhak. 2014. Hybrid approaches for automatic vowelization of Arabic texts. arXiv:1410.2646. Retrieved from https:\/\/arxiv.org\/abs\/1410.2646.","journal-title":"arXiv:1410.2646"},{"key":"e_1_3_3_15_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1274"},{"key":"e_1_3_3_16_2","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00051"},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2018.2874767"},{"key":"e_1_3_3_18_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jksuci.2016.06.004"},{"key":"e_1_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-1302"},{"key":"e_1_3_3_20_2","article-title":"LDC Arabic treebanks and associated corpora: Data divisions manual","author":"Diab Mona","year":"2013","unstructured":"Mona Diab, Nizar Habash, Owen Rambow, and Ryan Roth. 2013. LDC Arabic treebanks and associated corpora: Data divisions manual. arXiv:1309.5652 (2013). Retrieved from https:\/\/arxiv.org\/abs\/1309.5652.","journal-title":"arXiv:1309.5652"},{"issue":"61","key":"e_1_3_3_21_2","first-page":"2121","article-title":"Adaptive subgradient methods for online learning and stochastic optimization","volume":"12","author":"Duchi John","year":"2011","unstructured":"John Duchi, Elad Hazan, and Yoram Singer. 2011. Adaptive subgradient methods for online learning and stochastic optimization. Journal of Machine Learning Research 12, 61 (2011), 2121\u20132159. http:\/\/jmlr.org\/papers\/v12\/duchi11a.html.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_3_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/AICCSA.2016.7945800"},{"key":"e_1_3_3_23_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-67056-0_3"},{"key":"e_1_3_3_24_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-18117-2_2"},{"key":"e_1_3_3_25_2","doi-asserted-by":"crossref","unstructured":"Ali Fadel Ibraheem Tuffaha Bara\u2019 Al-Jawarneh and Mahmoud Al-Ayyoub. 2019. Arabic text diacritization using deep neural networks. In Proceedings of the 2nd International Conference on Computer Applications & and Amp; Information Security .","DOI":"10.1109\/CAIS.2019.8769512"},{"key":"e_1_3_3_26_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-1311"},{"key":"e_1_3_3_27_2","article-title":"Bidirectional LSTM-CRF models for sequence tagging","author":"Huang Zhiheng","year":"2015","unstructured":"Zhiheng Huang, Wei Xu, and Kai Yu. 2015. Bidirectional LSTM-CRF models for sequence tagging. arXiv:1508.01991. Retrieved from https:\/\/arxiv.org\/abs\/1508.01991.","journal-title":"arXiv:1508.01991"},{"key":"e_1_3_3_28_2","unstructured":"Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. In Proceedings of the 3rd International Conference on Learning Representations (ICLR\u201915) San Diego CA USA . http:\/\/arxiv.org\/abs\/1412.6980."},{"issue":"86","key":"e_1_3_3_29_2","first-page":"2579","article-title":"Visualizing data using t-SNE","volume":"9","author":"Maaten Laurens van der","year":"2008","unstructured":"Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of Machine Learning Research 9, 86 (2008), 2579\u20132605. http:\/\/jmlr.org\/papers\/v9\/vandermaaten08a.html.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_3_30_2","doi-asserted-by":"publisher","DOI":"10.1145\/3297278"},{"key":"e_1_3_3_31_2","doi-asserted-by":"publisher","DOI":"10.14569\/IJACSA.2018.091150"},{"key":"e_1_3_3_32_2","first-page":"2390","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)","author":"Mubarak Hamdy","year":"2019","unstructured":"Hamdy Mubarak, Ahmed Abdelali, Hassan Sajjad, Younes Samih, and Kareem Darwish. 2019. Highly effective Arabic diacritization using sequence to sequence modeling. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 2390\u20132395."},{"key":"e_1_3_3_33_2","first-page":"1094","volume-title":"Proceedings of the 9th International Conference on Language Resources and Evaluation","volume":"14","author":"Pasha Arfath","year":"2014","unstructured":"Arfath Pasha, Mohamed Al-Badrashiny, Mona T. Diab, Ahmed El Kholy, Ramy Eskander, Nizar Habash, Manoj Pooleery, Owen Rambow, and Ryan Roth. 2014. Madamira: A fast, comprehensive tool for morphological analysis and disambiguation of arabic. In Proceedings of the 9th International Conference on Language Resources and Evaluation, Vol. 14. 1094\u20131101."},{"key":"e_1_3_3_34_2","doi-asserted-by":"crossref","unstructured":"Rico Sennrich Barry Haddow and Alexandra Birch. 2015. Neural machine translation of rare words with subword units. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) . Association for Computational Linguistics 1715\u20131725. https:\/\/aclanthology.org\/P16-1162.","DOI":"10.18653\/v1\/P16-1162"},{"key":"e_1_3_3_35_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1152"},{"key":"e_1_3_3_36_2","first-page":"2214","volume-title":"Proceedings of the 8th International Conference on Language Resources and Evaluation","volume":"2012","author":"Tiedemann J\u00f6rg","year":"2012","unstructured":"J\u00f6rg Tiedemann. 2012. Parallel data, tools and interfaces in OPUS. In Proceedings of the 8th International Conference on Language Resources and Evaluation, Vol. 2012. 2214\u20132218."},{"key":"e_1_3_3_37_2","article-title":"Block-normalized gradient method: An empirical study for training deep neural network","author":"Yu Adams Wei","year":"2017","unstructured":"Adams Wei Yu, Lei Huang, Qihang Lin, Ruslan Salakhutdinov, and Jaime Carbonell. 2017. Block-normalized gradient method: An empirical study for training deep neural network. arXiv:1707.04822. Retrieved from https:\/\/arxiv.org\/abs\/1707.04822.","journal-title":"arXiv:1707.04822"},{"key":"e_1_3_3_38_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.dib.2017.01.011"},{"key":"e_1_3_3_39_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2008.06.001"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3470849","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3470849","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:18:55Z","timestamp":1750191535000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3470849"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,12,24]]},"references-count":38,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,1,31]]}},"alternative-id":["10.1145\/3470849"],"URL":"https:\/\/doi.org\/10.1145\/3470849","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"type":"print","value":"2375-4699"},{"type":"electronic","value":"2375-4702"}],"subject":[],"published":{"date-parts":[[2021,12,24]]},"assertion":[{"value":"2020-08-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-06-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-12-24","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}