{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,27]],"date-time":"2025-08-27T16:18:40Z","timestamp":1756311520139,"version":"3.41.0"},"reference-count":58,"publisher":"Association for Computing Machinery (ACM)","issue":"10","license":[{"start":{"date-parts":[[2023,10,13]],"date-time":"2023-10-13T00:00:00Z","timestamp":1697155200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2023,10,31]]},"abstract":"<jats:p>The specificities of Arabic parsing, such as agglutination, vocalization, and the relatively order-free words in Arabic sentences, remain major issues to consider. To promote its robustness, such parseing should define different types of constraints. Property Grammar (PG) formalism verifies the satisfiability of the constraints directly on the units of the structure, thanks to its properties (or relations). In this context, we propose to build a probabilistic parser with syntactic properties, using a PG, and we measure the production rules in terms of different implicit information and in particular the syntactic properties. We experimented with our parser on the treebank ATB, using the parsing algorithm CYK, and we obtained encouraging results. Our method is also automatic for implementation of most property types. Its generalization for other languages or corpus domains (using treebanks) could be a good perspective. Its combination with pre-trained models of BERT may also make our parser faster.<\/jats:p>","DOI":"10.1145\/3612921","type":"journal-article","created":{"date-parts":[[2023,9,19]],"date-time":"2023-09-19T11:36:08Z","timestamp":1695123368000},"page":"1-25","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["An Arabic Probabilistic Parser Based on a Property Grammar"],"prefix":"10.1145","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1201-6790","authenticated-orcid":false,"given":"Raja","family":"Bensalem","sequence":"first","affiliation":[{"name":"Faculty of Economic Sciences and Management of Sfax, Computer Sciences Department, University of Sfax, Tunisia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5972-9694","authenticated-orcid":false,"given":"Kais","family":"Haddar","sequence":"additional","affiliation":[{"name":"Faculty of Sciences of Sfax, Computer Sciences Department, Tunisia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5216-9591","authenticated-orcid":false,"given":"Philippe","family":"Blache","sequence":"additional","affiliation":[{"name":"Laboratoire Parole et Langage (LPL), Aix en Provence, Provence-Alpes-C\u00f4te d'Azur, France"}]}],"member":"320","published-online":{"date-parts":[[2023,10,13]]},"reference":[{"issue":"8","key":"e_1_3_3_2_1","first-page":"11 pages","article-title":"Parsing Arabic nominal sentences using context free grammar and fundamental rules of classical grammar","volume":"9","author":"Ababou N.","year":"2017","unstructured":"N. Ababou, A. Mazroui, and R. Belehbib. 2017. Parsing Arabic nominal sentences using context free grammar and fundamental rules of classical grammar. Int. J. Intell. Syst. Appl. 9, 8 (2017), 11 pages.","journal-title":"Int. J. Intell. Syst. Appl"},{"issue":"3","key":"e_1_3_3_3_1","first-page":"567","article-title":"A machine learning system for distinguishing nominal and verbal Arabic sentences","volume":"15","author":"Abdelrazaq D.","year":"2018","unstructured":"D. Abdelrazaq, S. Abu-Soud, A. Awajan, and Arafat. 2018. A machine learning system for distinguishing nominal and verbal Arabic sentences. Int. Arab J. Info. Technol. 15, 3 (2018), 567\u2013584.","journal-title":"Int. Arab J. Info. Technol."},{"key":"e_1_3_3_4_1","first-page":"169","volume-title":"Proceedings of European Language Resources Association (ELRA\u201919)","author":"Adebara I.","year":"2019","unstructured":"I. Adebara. 2019. Womb grammars: A constraint solving model for learning the grammar of Yoruba. In Proceedings of European Language Resources Association (ELRA\u201919). 169\u2013172."},{"key":"e_1_3_3_5_1","first-page":"1","volume-title":"Proceedings of the 6th International Conference on Dependency Linguistics (SyntaxFest\u201921)","author":"Al-Ghamdi S.","year":"2021","unstructured":"S. Al-Ghamdi, H. Al-Khalifa, and A. Al-Salman. 2021. A dependency treebank for classical Arabic poetry. In Proceedings of the 6th International Conference on Dependency Linguistics (SyntaxFest\u201921). Association for Computational Linguistics, 1\u20139."},{"key":"e_1_3_3_6_1","doi-asserted-by":"publisher","DOI":"10.3390\/app13074225"},{"key":"e_1_3_3_7_1","unstructured":"W. Antoun F. Baly and H. Hajj. 2021. Arabert: Transformer-based model for Arabic language understanding. Retrieved from https:\/\/ArXiv:2003.00104"},{"key":"e_1_3_3_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/SDS.2019.8768587"},{"key":"e_1_3_3_9_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl a 00288"},{"key":"e_1_3_3_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-32959-4_16"},{"key":"e_1_3_3_11_1","doi-asserted-by":"publisher","DOI":"10.5220\/0005617001080117"},{"key":"e_1_3_3_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-52758-1 17"},{"key":"e_1_3_3_13_1","first-page":"240","volume-title":"Proceedings of the 17th International Conference on Text, Speech and Dialogue (TSD\u201914)","volume":"8655","author":"Bensalem R. B.","year":"2014","unstructured":"R. B. Bensalem, M. Elkarwi, K. Haddar, and P. Blache. 2014. Building an Arabic linguistic resource from a treebank: the Case of Property Grammar. In Proceedings of the 17th International Conference on Text, Speech and Dialogue (TSD\u201914), Vol. 8655, Springer, 240\u2013246."},{"key":"e_1_3_3_14_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-77113-7 14"},{"key":"e_1_3_3_15_1","first-page":"415","volume-title":"M\u00e9canismes de contr\u00f4le pour l'analyse en Grammaires de Propri\u00e9t\u00e9s","author":"Blache P.","year":"2006","unstructured":"P. Blache and S. Rauzy. 2006. M\u00e9canismes de contr\u00f4le pour l'analyse en Grammaires de Propri\u00e9t\u00e9s. P. Mertens, C. Fairon, A. Dister, et P. Watrin (eds.). 415\u2013424."},{"key":"e_1_3_3_16_1","first-page":"307","article-title":"Enrichissement du FTB: Un treebank hybride constituants\/propri\u00e9t\u00e9s","volume":"2","author":"Blache P.","year":"2012","unstructured":"P. Blache and S. Rauzy. 2012. Enrichissement du FTB: Un treebank hybride constituants\/propri\u00e9t\u00e9s. In Proceedings of the Conference on Automatic Natural Language Processing (TALN\u201912), Vol. 2, 307\u2013320.","journal-title":"In Proceedings of the Conference on Automatic Natural Language Processing (TALN\u201912)"},{"key":"e_1_3_3_17_1","first-page":"6","volume-title":"Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC\u201912)","author":"Blache P.","year":"2012","unstructured":"P. Blache and S. Rauzy. 2012. Hybridization and Treebank Enrichment with Constraint-based Representations. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC\u201912), 6\u201313."},{"key":"e_1_3_3_18_1","volume-title":"Proceedings of the 2nd Asia Pacific Corpus Linguistics Conference (APCLC\u201914)","author":"Blache P.","year":"2014","unstructured":"P. Blache and S. Rauzy. 2014. A chinese constraint grammar extracted from the chinese treebank. In Proceedings of the 2nd Asia Pacific Corpus Linguistics Conference (APCLC\u201914), Hong Kong."},{"key":"e_1_3_3_19_1","first-page":"2336","volume-title":"Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC\u201916)","author":"Blache P.","year":"2016","unstructured":"P. Blache, S. Rauzy, and G. Montcheuil. 2016. MarsaGram: An excursion in the forests of parsing trees. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC\u201916), European Language Resources Association (ELRA), 2336\u20132342."},{"key":"e_1_3_3_20_1","doi-asserted-by":"publisher","DOI":"10.15398\/jlm.v4i2.129"},{"key":"e_1_3_3_21_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/K18-2005"},{"key":"e_1_3_3_22_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1082"},{"key":"e_1_3_3_23_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.tcs.2015.01.043"},{"key":"e_1_3_3_24_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-1423"},{"volume-title":"Proceedings of 7th International Conference on Electrical, Electronic and Computing Engineering IcETRAN","author":"Dordevic T.","key":"e_1_3_3_25_1","unstructured":"T. Dordevic and S. Stojkovic. 2020. Different approaches in serbian language parsing using context-free grammars. In Proceedings of 7th International Conference on Electrical, Electronic and Computing Engineering IcETRAN, Etno-Selo Stani\u0161i\u0107i, Bosnia and Herzegovina (Online conference), 588--591."},{"key":"e_1_3_3_26_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/K17-3002"},{"key":"e_1_3_3_27_1","first-page":"101","volume-title":"Proceedings of the Huiti\u00e8mes Journ\u00e9es Francophones de Programmation par Contraintes (JFPC\u201912)","author":"Duchier D.","year":"2012","unstructured":"D. Duchier, T.-B.-H. Dao, and Y. Parmentier. 2012. Analyse Syntaxique par Contraintes pour les Grammaires de Propri\u00e9t\u00e9s \u00e0 traits. In Proceedings of the Huiti\u00e8mes Journ\u00e9es Francophones de Programmation par Contraintes (JFPC\u201912). 101\u2013106."},{"key":"e_1_3_3_28_1","first-page":"123","volume-title":"Proceedings of the Sixi\u00e8mes Journ\u00e9es Francophones de Programmation par Contraintes (JFPC\u201910)","author":"Duchier D.","year":"2010","unstructured":"D. Duchier, T.-B.-H. Dao, Y. Parmentier, and W. Lesaint. 2010. Une mod\u00e9lisation en CSP des grammaires de propri\u00e9t\u00e9s. In Proceedings of the Sixi\u00e8mes Journ\u00e9es Francophones de Programmation par Contraintes (JFPC\u201910). 123\u2013132."},{"key":"e_1_3_3_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/362007.362035"},{"key":"e_1_3_3_30_1","doi-asserted-by":"crossref","unstructured":"J. M. Eisenschlos S. Ruder P. Czapla M. Kardas S. Gugger and J. Howard. 2019. Multit: Efficient multi-lingual language model fine-tuning. Retrieved from. https:\/\/ArXiv:1909.04761","DOI":"10.18653\/v1\/D19-1572"},{"key":"e_1_3_3_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-019-04153-6"},{"journal-title":"Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Habash N.","key":"e_1_3_3_32_1","unstructured":"N. Habash, R. Roth, O. Rambow, R. Eskander, and N. Tomeh. 2013. Morphological analysis and disambiguation for dialectal Arabic. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Atlanta, Georgia, 426--432."},{"key":"e_1_3_3_33_1","first-page":"2672","volume-title":"Proceedings of the 13th Language Resources and Evaluation Conference","author":"Habash N.","year":"2022","unstructured":"N. Habash, M. AbuOdeh, D. Taji, R. Faraj, J. El Gizuli, and O. Kallas. 2022. Camel Treebank: An open multi-genre Arabic dependency treebank. In Proceedings of the 13th Language Resources and Evaluation Conference. European Language Resources Association, 2672\u20132681."},{"key":"e_1_3_3_34_1","volume-title":"Proceedings of the International Conference on Arabic Language Resources and Tools (MEDAR\u201909)","author":"Habash N.","year":"2009","unstructured":"N. Habash, R. Faraj, and R. Roth. 2009. Syntactic annotation in the Columbia Arabic Treebank. In Proceedings of the International Conference on Arabic Language Resources and Tools (MEDAR\u201909)."},{"key":"e_1_3_3_35_1","doi-asserted-by":"crossref","unstructured":"J. Howard and S. Ruder. 2018. Universal language model fine-tuning for text classification. Retrieved from https:\/\/arxiv.org\/abs\/1801.06146v5","DOI":"10.18653\/v1\/P18-1031"},{"key":"e_1_3_3_36_1","unstructured":"G. Inoue B. Alhafni N. Baimukan H. Bouamor and N. Habash. 2021. The Interplay of Variant Size and Task Type in Arabic Pre-trained Language Models."},{"key":"e_1_3_3_37_1","unstructured":"T. Kasami. 1965. An efficient recognition and syntax analysis algorithm for context-free languages (AFCRL-65-758)."},{"journal-title":"Proceedings of the 12th Language Resources and Evaluation Conference","author":"Khalifa S.","key":"e_1_3_3_38_1","unstructured":"S. Khalifa, N. Zalmout, and N. Habash. 2020. Morphological analysis and disambiguation for gulf Arabic: The interplay between resources and methods. In Proceedings of the 12th Language Resources and Evaluation Conference, European Language Resources Association, Marseille, 3895--3904."},{"key":"e_1_3_3_39_1","doi-asserted-by":"publisher","DOI":"10.4236\/ojs.2021.111006"},{"key":"e_1_3_3_40_1","doi-asserted-by":"crossref","unstructured":"N. Kitaev and D. Klein. 2018. Constituency parsing with a self-attentive encoder. Retrieved from https:\/\/ArXiv:1805.01052","DOI":"10.18653\/v1\/P18-1249"},{"key":"e_1_3_3_41_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1340"},{"key":"e_1_3_3_42_1","volume-title":"Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC'10) European Language Resources Association (ELRA\u201910)","author":"Kulick S.","year":"2010","unstructured":"S. Kulick, A. Bies, and M. Maamouri. 2010. Consistent and flexible integration of morphological annotation in the Arabic treebank. In Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC'10) European Language Resources Association (ELRA\u201910)."},{"volume-title":"International Conference on Learning Representations","author":"Li Z.","key":"e_1_3_3_43_1","unstructured":"Z. Li, R. Wang, K. Chen, M. Utiyama, E. Sumita, Z. Zhang, and H. Zhao. 2020. Data-dependent gaussian prior objective for language generation. In International Conference on Learning Representations."},{"key":"e_1_3_3_44_1","first-page":"74","volume-title":"Proceedings of the Tunisian-Algerian Joint Conference on Applied Computing (TACC\u201921)","author":"Maalej R.","year":"2021","unstructured":"R. Maalej, N. Khoufi, and C. Aloulou. 2021. Parsing Arabic using deep learning technology. In Proceedings of the Tunisian-Algerian Joint Conference on Applied Computing (TACC\u201921). 74\u201380."},{"key":"e_1_3_3_45_1","volume-title":"Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC\u201908)","author":"Maamouri M.","year":"2008","unstructured":"M. Maamouri, A. Bies, and S. Kulick. 2008. Enhancing the Arabic Treebank: A collaborative effort toward new annotation guidelines. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC\u201908), European Language Resources Association (ELRA)."},{"key":"e_1_3_3_46_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-70296-0_4"},{"key":"e_1_3_3_47_1","doi-asserted-by":"publisher","DOI":"10.2139\/ssrn.3599719"},{"key":"e_1_3_3_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/SPW53761.2021.00030"},{"key":"e_1_3_3_49_1","first-page":"1659","volume-title":"Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC\u201916)","author":"Nivre J.","year":"2016","unstructured":"J. Nivre, M.-C. de Marnee, F. Ginter, Y. Goldberg, J. Hajic, C. D. Manning, R. McDonald, S. Petrov, S. Pyysalo, N. Silveira, R. Tsarfaty, and D. Zeman. 2016. Universal Dependencies v1: A multilingual treebank collection. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC\u201916), European Language Resources Association (ELRA), 1659\u20131666."},{"key":"e_1_3_3_50_1","doi-asserted-by":"crossref","unstructured":"M. E. Peters M. Neumann M. Iyyer M. Gardner C. Clark K. Lee and L. Zettlemoyer. 2018. Deep contextualized word representations. Retrieved from https:\/\/ArXiv:1802.05365","DOI":"10.18653\/v1\/N18-1202"},{"key":"e_1_3_3_51_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jksuci.2022.02.007"},{"key":"e_1_3_3_52_1","first-page":"2054","article-title":"KUISAIL at SemEval-2020 task 12: BERT-CNN for offensive speech identification in social media","author":"Safaya A.","year":"2020","unstructured":"A. Safaya, M. Abdullatif, and D. Yuret. 2020. KUISAIL at SemEval-2020 task 12: BERT-CNN for offensive speech identification in social media. In Proceedings of the 14th Workshop on Semantic Evaluation, International Committee for Computational Linguistics, Barcelona (online), 2054\u20132059.","journal-title":"Proceedings of the 14th Workshop on Semantic Evaluation, International Committee for Computational Linguistics"},{"journal-title":"Findings of the Association for Computational Linguistics: (ACL-IJCNLP'21)","author":"Sahay A.","key":"e_1_3_3_53_1","unstructured":"A. Sahay, A. Nasery, A. Maheshwari, G. Ramakrishnan, and R. Iyer. 2021. Rule augmented unsupervised constituency parsing. In Findings of the Association for Computational Linguistics: (ACL-IJCNLP'21), 4923--4932."},{"key":"e_1_3_3_54_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2017.10.117"},{"journal-title":"Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology","author":"Taji D.","key":"e_1_3_3_55_1","unstructured":"D. Taji, S. Khalifa, O. Obeid, F. Eryani, and N. Habash. 2018. An Arabic morphological analyzer and generator with copious features. In Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology, Association for Computational Linguistics, Brussels, Belgium, 140--150."},{"key":"e_1_3_3_56_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0019 -9958(67)80007-X"},{"journal-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP)","author":"Zanzotto F. M.","key":"e_1_3_3_57_1","unstructured":"F. M. Zanzotto, A. Santilli, L. Ranaldi, D. Onorati, P. Tommasino, and F. Fallucchi. 2020. KERMIT: Complementing transformer architectures with encoders of explicit syntactic interpretations. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 256--267."},{"key":"e_1_3_3_58_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/K18-2001"},{"volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"Zhang Y.","key":"e_1_3_3_59_1","unstructured":"Y. Zhang, Z. Li, and M. Zhang. 2020. Efficient second-order treeCRF for neural dependency parsing. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 3295--3305."}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3612921","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3612921","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T22:29:18Z","timestamp":1750285758000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3612921"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,13]]},"references-count":58,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2023,10,31]]}},"alternative-id":["10.1145\/3612921"],"URL":"https:\/\/doi.org\/10.1145\/3612921","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"type":"print","value":"2375-4699"},{"type":"electronic","value":"2375-4702"}],"subject":[],"published":{"date-parts":[[2023,10,13]]},"assertion":[{"value":"2022-10-25","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-07-23","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-10-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}