{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,2]],"date-time":"2025-10-02T05:50:15Z","timestamp":1759384215108,"version":"3.41.0"},"reference-count":65,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2021,11,18]],"date-time":"2021-11-18T00:00:00Z","timestamp":1637193600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Scientific Research and Innovation Support Fund\u2013Ministry of Higher Education\u2013Jordan","award":["ICT\/2\/5\/2016"],"award-info":[{"award-number":["ICT\/2\/5\/2016"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2022,3,31]]},"abstract":"<jats:p>Treebanks are valuable linguistic resources that include the syntactic structure of a language sentence in addition to part-of-speech tags and morphological features. They are mainly utilized in modeling statistical parsers. Although the statistical natural language parser has recently become more accurate for languages such as English, those for the Arabic language still have low accuracy. The purpose of this article is to construct a new Arabic dependency treebank based on the traditional Arabic grammatical theory and the characteristics of the Arabic language, to investigate their effects on the accuracy of statistical parsers. The proposed Arabic dependency treebank, called I3rab, contrasts with existing Arabic dependency treebanks in two main concepts. The first concept is the approach of determining the main word of the sentence, and the second concept is the representation of the joined and covert pronouns. To evaluate I3rab, we compared its performance against a subset of Prague Arabic Dependency Treebank that shares a comparable level of details. The conducted experiments show that the percentage improvement reached up to 10.24% in UAS and 18.42% in LAS.<\/jats:p>","DOI":"10.1145\/3472295","type":"journal-article","created":{"date-parts":[[2021,11,18]],"date-time":"2021-11-18T20:41:51Z","timestamp":1637268111000},"page":"1-32","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["I3rab: A New Arabic Dependency Treebank Based on Arabic Grammatical Theory"],"prefix":"10.1145","volume":"21","author":[{"given":"Dana","family":"Halabi","sequence":"first","affiliation":[{"name":"Princess Sumaya University for Technology, Amman, Jordan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ebaa","family":"Fayyoumi","sequence":"additional","affiliation":[{"name":"Computer Science Department, Hashemite University, Zarqa, Jordan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Arafat","family":"Awajan","sequence":"additional","affiliation":[{"name":"Princess Sumaya University for Technology, Amman, Jordan; Mutah University, Karak, Jordan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,11,18]]},"reference":[{"key":"e_1_3_3_2_1","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511840722"},{"key":"e_1_3_3_3_1","first-page":"595","volume-title":"Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation","author":"Alotaiby F.","year":"2010","unstructured":"F. Alotaiby, S. Foda, and I. Alkharashi. 2010. Clitics in Arabic language: A statistical study. In Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation. 595\u2013601."},{"key":"e_1_3_3_4_1","doi-asserted-by":"publisher","DOI":"10.1002\/asi.10368"},{"key":"e_1_3_3_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSC50631.2021.00041"},{"key":"e_1_3_3_6_1","doi-asserted-by":"publisher","DOI":"10.5555\/1654576.1654588"},{"key":"e_1_3_3_7_1","volume-title":"Handling Arabic Morphological and Syntactic Ambiguity within the LFG Framework with a View to Machine Translation","author":"Attia M. A.","year":"2008","unstructured":"M. A. Attia. 2008. Handling Arabic Morphological and Syntactic Ambiguity within the LFG Framework with a View to Machine Translation. The University of Manchester, UK."},{"issue":"4","key":"e_1_3_3_8_1","first-page":"179","article-title":"Arabic text preprocessing for the natural language processing applications","volume":"25","author":"Awajan A.","year":"2007","unstructured":"A. Awajan. 2007. Arabic text preprocessing for the natural language processing applications. Arab Gulf J. Sci. Res. 25, 4 (2007), 179\u2013189.","journal-title":"Arab Gulf J. Sci. Res."},{"key":"e_1_3_3_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2665077"},{"key":"e_1_3_3_10_1","doi-asserted-by":"publisher","DOI":"10.5555\/1087081"},{"key":"e_1_3_3_11_1","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1007\/978-94-010-0201-1_7","volume-title":"Treebanks","author":"B\u00f6hmov\u00e1 A.","year":"2003","unstructured":"A. B\u00f6hmov\u00e1, J. Haji\u010d, E. Haji\u010dov\u00e1, and B. Hladk\u00e1. 2003. The Prague dependency treebank. In Treebanks. Springer, Dordrecht, 103\u2013127."},{"key":"e_1_3_3_12_1","doi-asserted-by":"publisher","DOI":"10.5555\/1596276.1596305"},{"key":"e_1_3_3_13_1","volume-title":"Syntactic Structures","author":"Chomsky N.","year":"2009","unstructured":"N. Chomsky. 2009. Syntactic Structures. De Gruyter Mouton, Berlin."},{"key":"e_1_3_3_14_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11168-004-7429-x"},{"key":"e_1_3_3_15_1","volume-title":"Proceedings of the 3rd Workshop on Treebanks and Linguistic Theories","author":"Civit M.","year":"2004","unstructured":"M. Civit, N. Buf\u00ed, and P. Valverde. 2004. Cat3LB: A treebank for Catalan with word sense annotation. In Proceedings of the 3rd Workshop on Treebanks and Linguistic Theories. Tuebingen, Germany."},{"key":"e_1_3_3_16_1","doi-asserted-by":"publisher","DOI":"10.1162\/089120103322753356"},{"key":"e_1_3_3_17_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2010-398"},{"key":"e_1_3_3_18_1","first-page":"1","volume-title":"Proceedings of the 7th International Conference on Informatics and Systems (INFOS\u201910)","author":"Dukes K.","year":"2010","unstructured":"K. Dukes and T. Buckwalter. 2010. A dependency treebank of the Quran using traditional Arabic grammar. In Proceedings of the 7th International Conference on Informatics and Systems (INFOS\u201910). IEEE, 1\u20137."},{"key":"e_1_3_3_19_1","doi-asserted-by":"publisher","DOI":"10.5555\/2206329.2206341"},{"key":"e_1_3_3_20_1","unstructured":"K. Dukes. 2015. Statistical parsing by machine learning from a classical Arabic treebank. Retrieved from https:\/\/arXiv:1510.07193."},{"key":"e_1_3_3_21_1","doi-asserted-by":"publisher","DOI":"10.3390\/data6060067"},{"issue":"1","key":"e_1_3_3_22_1","article-title":"Treebanks: Linking linguistic theory to computational linguistics","volume":"7","author":"Frank A.","year":"2012","unstructured":"A. Frank, A. Zaenen, and E. Hinrichs. 2012. Treebanks: Linking linguistic theory to computational linguistics. Linguist. Iss. Lang. Technol. 7, 1 (2012).","journal-title":"Linguist. Iss. Lang. Technol."},{"key":"e_1_3_3_23_1","doi-asserted-by":"publisher","DOI":"10.5555\/1690219.1690255"},{"key":"e_1_3_3_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6638271"},{"key":"e_1_3_3_25_1","unstructured":"A. Bharati V. Chaitanya and R. Sangal. 1995. Natural language processing: A Paninian perspective. Indian Institute of Technology Kanpur. New Delhi: Prentice-Hall of India."},{"key":"e_1_3_3_26_1","doi-asserted-by":"publisher","DOI":"10.5555\/1667583.1667651"},{"key":"e_1_3_3_27_1","first-page":"110","volume-title":"Proceedings of the NEMLAR International Conference on Arabic Language Resources and Tools","author":"Hajic J.","year":"2004","unstructured":"J. Hajic, O. Smrz, P. Zem\u00e1nek, J. \u0160naidauf, and E. Be\u0161ka. 2004. Prague Arabic dependency treebank: Development in data and tools. In Proceedings of the NEMLAR International Conference on Arabic Language Resources and Tools. 110\u2013117."},{"key":"e_1_3_3_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICTCS.2017.49"},{"key":"e_1_3_3_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACIT50332.2020.9300100"},{"issue":"1","key":"e_1_3_3_30_1","article-title":"Syntactic annotation in the I3rab dependency treebank","volume":"18","author":"Halabi D.","year":"2021","unstructured":"D. Halabi, A. Awajan, and E. Fayyoumi. 2021. Syntactic annotation in the I3rab dependency treebank. Int. Arab. J. Inf. Technol 18, 1 (2021).","journal-title":"Int. Arab. J. Inf. Technol"},{"key":"e_1_3_3_31_1","unstructured":"J. Hall J. Nilsson and J. Nivre. 2013. Malteval. Retrieved from http:\/\/www.maltparser.org\/malteval.html."},{"key":"e_1_3_3_32_1","first-page":"69","volume-title":"Proceedings of the Korean Society for Language and Information Conference","author":"Han C. H.","year":"2002","unstructured":"C. H. Han, N. R. Han, E. S. Ko, P. Martha, and Y. Heejong. 2002. Penn Korean treebank: Development and evaluation. In Proceedings of the Korean Society for Language and Information Conference. Korean Society for Language and Information, 69\u201378."},{"key":"e_1_3_3_33_1","doi-asserted-by":"publisher","DOI":"10.5555\/2145432.2145454"},{"key":"e_1_3_3_34_1","doi-asserted-by":"publisher","DOI":"10.5555\/1538443"},{"key":"e_1_3_3_35_1","first-page":"31","volume-title":"Proceedings of the Treebanks and Linguistic Theories Conference","author":"Kulick S.","year":"2006","unstructured":"S. Kulick, R. Gabbard, and M. Marcus. 2006. Parsing the Arabic treebank: Analysis and improvements. In Proceedings of the Treebanks and Linguistic Theories Conference. 31\u201342."},{"key":"e_1_3_3_36_1","unstructured":"LDC. 2004a. Buckwalter Arabic morphological analyzer version. Retrieved from https:\/\/catalog.ldc.upenn.edu\/LDC2004L02."},{"key":"e_1_3_3_37_1","unstructured":"LDC. 2004b. Prague Arabic dependency treebank 1.0. Retrieved from https:\/\/catalog.ldc.upenn.edu\/docs\/LDC2004T23\/."},{"key":"e_1_3_3_38_1","unstructured":"LDC. 2007. Prague Arabic dependency treebank ++. Retrieved from http:\/\/padt-online.blogspot.com\/2007\/01\/conll-shared-task-2007.html."},{"key":"e_1_3_3_39_1","unstructured":"LDC. 2018. 2007 CoNLL shared task\u2014Arabic and English. Retrieved from https:\/\/catalog.ldc.upenn.edu\/LDC2018T08."},{"key":"e_1_3_3_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCIA.2016.10"},{"key":"e_1_3_3_42_1","first-page":"466","volume-title":"Proceedings of the NEMLAR Conference on Arabic Language Resources and Tools","author":"Maamouri M.","year":"2004","unstructured":"M. Maamouri, A. Bies, T. Buckwalter, and W. Mekki. 2004. The Penn Arabic treebank: Building a large-scale annotated Arabic corpus. In Proceedings of the NEMLAR Conference on Arabic Language Resources and Tools. 466\u2013467."},{"key":"e_1_3_3_43_1","volume-title":"Proceedings of the Linguistic Data Consortium","author":"Maamouri M.","year":"2009","unstructured":"M. Maamouri, A. Bies, S. Krouna, F. Gaddeche, and B. Bouziri. 2009. Penn Arabic treebank guidelines. In Proceedings of the Linguistic Data Consortium."},{"key":"e_1_3_3_44_1","doi-asserted-by":"publisher","DOI":"10.5555\/972470.972475"},{"key":"e_1_3_3_45_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W16-2340"},{"key":"e_1_3_3_46_1","volume-title":"Proceedings of the International Conference on Language Resources and Evaluation (LREC\u201908)","author":"Nilsson J.","year":"2008","unstructured":"J. Nilsson and J. Nivre. 2008. MaltEval: An Evaluation and visualization tool for dependency parsing. In Proceedings of the International Conference on Language Resources and Evaluation (LREC\u201908)."},{"key":"e_1_3_3_47_1","first-page":"1","article-title":"Dependency grammar and dependency parsing","volume":"5133","author":"Nivre J.","year":"2005","unstructured":"J. Nivre. 2005. Dependency grammar and dependency parsing. MSI Report 5133, (1959) 1\u201332.","journal-title":"MSI Report"},{"key":"e_1_3_3_48_1","first-page":"12","volume-title":"Proceedings of the ICON09 NLP Tools Contest: Indian Language Dependency Parsing","author":"Nivre J.","year":"2009","unstructured":"J. Nivre. 2009. Parsing Indian languages with maltparser. Proceedings of the ICON09 NLP Tools Contest: Indian Language Dependency Parsing. 12\u201318."},{"key":"e_1_3_3_49_1","first-page":"2216","volume-title":"Proceedings of the International Conference on Language Resources and Evaluation (LREC\u201906","author":"Nivre J.","year":"2006","unstructured":"J. Nivre, J. Hall, and J. Nilsson. 2006. Maltparser: A data-driven parser-generator for dependency parsing. In Proceedings of the International Conference on Language Resources and Evaluation (LREC\u201906). 2216\u20132219."},{"key":"e_1_3_3_50_1","first-page":"915","volume-title":"Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL\u201907)","author":"Nivre J.","year":"2007","unstructured":"J. Nivre, J. Hall, S. K\u00fcbler, R. McDonald, J. Nilsson, S. Riedel, and D. Yuret. 2007. The CoNLL 2007 shared task on dependency parsing. In Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL\u201907). 915\u2013932."},{"key":"e_1_3_3_51_1","doi-asserted-by":"publisher","DOI":"10.1017\/S1351324906004505"},{"key":"e_1_3_3_52_1","unstructured":"Nivre. 2018. MaltParser. Retrieved from http:\/\/www.maltparser.org\/."},{"key":"e_1_3_3_53_1","doi-asserted-by":"publisher","DOI":"10.5555\/1626281.1626292"},{"key":"e_1_3_3_54_1","doi-asserted-by":"publisher","DOI":"10.1075\/sihols.45"},{"key":"e_1_3_3_55_1","doi-asserted-by":"publisher","DOI":"10.1075\/sihols.53"},{"key":"e_1_3_3_56_1","doi-asserted-by":"crossref","unstructured":"A. Renduchintala and A. Williams. 2021. Investigating failures of automatic translation in the case of unambiguous gender. Retrieved from https:\/\/arXiv:2104.07838.","DOI":"10.18653\/v1\/2022.acl-long.243"},{"key":"e_1_3_3_57_1","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511486975"},{"issue":"2","key":"e_1_3_3_58_1","first-page":"247","article-title":"Building a treebank of modern Hebrew text","volume":"42","author":"Sima'an K.","year":"2001","unstructured":"K. Sima'an, A. Itai, Y. Winter, A. Altman, and N. Nativ. 2001. Building a treebank of modern Hebrew text. Traitement Automatique des Langues 42, 2 (2001), 247\u2013380.","journal-title":"Traitement Automatique des Langues"},{"key":"e_1_3_3_59_1","first-page":"38","volume-title":"Proceedings of the NEMLAR International Conference on Arabic Language Resources and Tools","author":"Smrz O.","year":"2004","unstructured":"O. Smrz and P. Pajas. 2004. Morphotrees of Arabic and their annotation in the TrEd environment. In Proceedings of the NEMLAR International Conference on Arabic Language Resources and Tools. 38\u201341."},{"key":"e_1_3_3_60_1","volume-title":"Proceedings of the Workshop on Arabic and Local Languages (LREC 2008), Marrakech, Morocco","author":"Smrz O.","year":"2008","unstructured":"O. Smrz, V. Bielicky, and J. Hajic. 2008. Prague Arabic dependency treebank: A word on the million words. In Proceedings of the Workshop on Arabic and Local Languages (LREC 2008), Marrakech, Morocco. 16--23."},{"key":"e_1_3_3_61_1","first-page":"147","volume-title":"Proceedings of the International Symposium on Processing of Arabic","author":"Smrz O.","year":"2002","unstructured":"O. Smrz, J. \u0160naidauf, and P. Zem\u00e1nek. 2002. Prague dependency treebank for Arabic: Multi-level annotation of Arabic corpus. In Proceedings of the International Symposium on Processing of Arabic. 147\u2013155."},{"key":"e_1_3_3_62_1","volume-title":"Proceedings of the International Conference on Language Resources and Evaluation (LREC)","author":"Solberg P. E.","year":"2014","unstructured":"P. E. Solberg, A. Skj\u00e6rholt, L. \u00d8vrelid, K. Hagen, and J. B. Johannessen. 2014. The Norwegian dependency treebank. In Proceedings of the International Conference on Language Resources and Evaluation (LREC)."},{"key":"e_1_3_3_63_1","doi-asserted-by":"publisher","DOI":"10.21236\/AD1003943"},{"key":"e_1_3_3_64_1","first-page":"143","article-title":"Teaching treebanking","author":"Volk M.","year":"2005","unstructured":"M. Volk, S. Gustafson-Capkov\u00e1, D. Hagstrand, and H. Uibo. 2005. Teaching treebanking. Nordisk Sprogteknologi 2004: 2004: Aarbog for Nordisk Sprogteknologisk Forskningsprogram 2000\u20132004, 143.","journal-title":"Nordisk Sprogteknologi 2004: 2004: Aarbog for Nordisk Sprogteknologisk Forskningsprogram 2000\u20132004"},{"key":"e_1_3_3_65_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072133.1072147"},{"key":"e_1_3_3_66_1","volume-title":"Proceedings of the Linguistic Data Consortium","author":"Xue N.","year":"2013","unstructured":"N. Xue, X. Zhang, Z. Jiang, M. Palmer, F. Xia, F. D. Chiou and M. Chang. 2013. Chinese Treebank 8.0 LDC2013T21. In Proceedings of the Linguistic Data Consortium."},{"key":"e_1_3_3_67_1","unstructured":"H. Yu X. Wu W. Jiang Q. Liu and S. Lin. 2015. An automatic machine translation evaluation metric based on dependency parsing model. Retrieved from https:\/\/arXiv:1508.01996."}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472295","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3472295","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:10:01Z","timestamp":1750183801000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472295"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,18]]},"references-count":65,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2022,3,31]]}},"alternative-id":["10.1145\/3472295"],"URL":"https:\/\/doi.org\/10.1145\/3472295","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"type":"print","value":"2375-4699"},{"type":"electronic","value":"2375-4702"}],"subject":[],"published":{"date-parts":[[2021,11,18]]},"assertion":[{"value":"2020-04-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-06-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-11-18","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}