{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,15]],"date-time":"2026-04-15T09:02:49Z","timestamp":1776243769927,"version":"3.50.1"},"reference-count":50,"publisher":"World Scientific Pub Co Pte Ltd","issue":"03","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Info. Know. Mgmt."],"published-print":{"date-parts":[[2026,3]]},"abstract":"<jats:p>Sentence similarity assesses how two sentences are similar to each other in terms of their lexical and semantic similarities. However, the Arabic language presents a challenge due to the richness of its features and analysis ambiguities. In this context, we describe syntactic\u2013semantic similarity based on chunks analysis for Arabic paraphrases using partial tree kernels and attention constituency vector tree (ACVT) kernel. For each sentence, a parsing tree is created to extract the main chunks. Then, chunks embeddings are generated using Aravec pre-trained embeddings. Finally, the similarity is measured using the updated ACVT kernel formula to consider the semantic information and the weights provided with the main chunks in the constituent tree instead of being with the leaves of the tree (i.e. words). The proposed method is evaluated using an Arabic paraphrasing benchmark and compared with [Formula: see text]-gram chunks similarity. Several experiments are conducted to exploit the effect of the decay factors in the tree kernel function. The proposed method achieves better correlation than the [Formula: see text]-gram chunk method. Furthermore, the method can be applied to paraphrase detection and achieves a recall of 0.70 and a precision of 0.757, with a similarity threshold of 0.5, while the results of paraphrase detection with a threshold of 0.3 show a better recall of 0.94 and a precision of 0.751.<\/jats:p>","DOI":"10.1142\/s0219649225500923","type":"journal-article","created":{"date-parts":[[2025,9,24]],"date-time":"2025-09-24T01:08:54Z","timestamp":1758676134000},"source":"Crossref","is-referenced-by-count":0,"title":["Arabic Sentence Similarity Based on Attention Constituency Vector Tree"],"prefix":"10.1142","volume":"25","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6358-059X","authenticated-orcid":false,"given":"Marwah","family":"Alian","sequence":"first","affiliation":[{"name":"Computer Science Department, Faculty of Information Technology, The World Islamic Sciences and Education University, Amman, Jordan"},{"name":"Basic Sciences Department, Faculty of Science, The Hashemite University, Zarqa, Jordan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7067-5658","authenticated-orcid":false,"given":"Arafat","family":"Awajan","sequence":"additional","affiliation":[{"name":"Computer Science Department, King Hussein School of Computing Sciences, Princess Sumaya University for Technology, Amman, Jordan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2025,9,23]]},"reference":[{"key":"S0219649225500923BIB001","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2024.10.188"},{"key":"S0219649225500923BIB002","first-page":"1","volume-title":"2018 International Arab Conference on Information Technology (ACIT2018)","author":"Alian M","year":"2018"},{"key":"S0219649225500923BIB003","doi-asserted-by":"publisher","DOI":"10.1109\/ICICS49469.2020.239485"},{"issue":"3","key":"S0219649225500923BIB004","first-page":"446","volume":"18","author":"Alian M","year":"2021","journal-title":"The International Arab Journal of Information Technology (IAJIT)"},{"key":"S0219649225500923BIB005","doi-asserted-by":"publisher","DOI":"10.1007\/s41870-022-01147-w"},{"key":"S0219649225500923BIB006","doi-asserted-by":"publisher","DOI":"10.1007\/s13369-023-07694-z"},{"key":"S0219649225500923BIB007","doi-asserted-by":"publisher","DOI":"10.1145\/3446770"},{"key":"S0219649225500923BIB008","volume-title":"Transformation Rules for Arabic Language","author":"Al-Kholi M","year":"1999"},{"key":"S0219649225500923BIB009","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2023.122959"},{"key":"S0219649225500923BIB010","doi-asserted-by":"publisher","DOI":"10.1109\/SMC.2013.92"},{"key":"S0219649225500923BIB011","first-page":"595","volume-title":"the 24th Pacific Asia Conference on Language, Information and Computation","author":"Alotaiby F","year":"2010"},{"key":"S0219649225500923BIB012","first-page":"1","volume-title":"5th International Conference on Learning Representations (ICLR 2017)","author":"Arora S","year":"2017"},{"key":"S0219649225500923BIB013","unstructured":"Attia, M and H Somers (2008). Handling Arabic morphological and syntactic ambiguity within the LFG framework with a view to machine translation. Ph.D. Thesis, University of Manchester, Manchester."},{"key":"S0219649225500923BIB014","volume-title":"the iConference 2017","author":"Beel J","year":"2017"},{"key":"S0219649225500923BIB015","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/S17-2001"},{"key":"S0219649225500923BIB016","first-page":"807","volume-title":"Encyclopedia of Research Methods for the Sciences","author":"Chen PY","year":"2004"},{"key":"S0219649225500923BIB017","doi-asserted-by":"publisher","DOI":"10.1515\/9783112316009"},{"key":"S0219649225500923BIB018","doi-asserted-by":"publisher","DOI":"10.1145\/3447735"},{"key":"S0219649225500923BIB019","first-page":"45","volume-title":"the 11th Annual Research Colloquium of the UK Special Interest Group for Computational Linguistics","author":"Fernando S","year":"2008"},{"key":"S0219649225500923BIB020","doi-asserted-by":"publisher","DOI":"10.1002\/9781118712696"},{"key":"S0219649225500923BIB021","doi-asserted-by":"publisher","DOI":"10.1108\/eb026526"},{"issue":"1","key":"S0219649225500923BIB022","first-page":"152","volume":"58","author":"Kadhem SM","year":"2017","journal-title":"Iraqi Journal of Science"},{"key":"S0219649225500923BIB023","doi-asserted-by":"crossref","unstructured":"Kenter, T, A Borisov and M de Rijke (2016). Siamese cbow: Optimizing word embeddings for sentence representations. arXiv:1606.04640.","DOI":"10.18653\/v1\/P16-1089"},{"key":"S0219649225500923BIB024","unstructured":"Kiros\n                      et al.\n                      (2015): Skip-thought vectors, NIPS\u201915: Proceedings of the 29th International Conference on Neural Information Processing Systems, MIT Press, United States, 2, pp. 3294\u20133302."},{"key":"S0219649225500923BIB025","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2018\/575"},{"key":"S0219649225500923BIB026","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-23281-8_1"},{"key":"S0219649225500923BIB027","doi-asserted-by":"publisher","DOI":"10.4018\/IJCINI.2020010103"},{"key":"S0219649225500923BIB028","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-77116-8_25"},{"key":"S0219649225500923BIB029","doi-asserted-by":"publisher","DOI":"10.1109\/ICEMIS.2016.7745319"},{"key":"S0219649225500923BIB030","volume-title":"Int. Conf. Learning Representations-ICLR Workshop","author":"Mikolov T","year":"2013"},{"key":"S0219649225500923BIB031","first-page":"1","volume-title":"Neural Information Processing Systems (NIPS)","author":"Mikolov T","year":"2013"},{"key":"S0219649225500923BIB032","unstructured":"Mishra, MK and J Viradiya (2019). Survey of Sentence Embedding Methods. Parul Institute of Engineering and Technology, doi:10.13140\/RG.2.2.21861.45289."},{"key":"S0219649225500923BIB033","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2017.10.117"},{"key":"S0219649225500923BIB034","doi-asserted-by":"publisher","DOI":"10.1007\/11871842_32"},{"key":"S0219649225500923BIB035","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-1303"},{"key":"S0219649225500923BIB036","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-73500-9_2"},{"key":"S0219649225500923BIB037","doi-asserted-by":"publisher","DOI":"10.1075\/sihols.45"},{"key":"S0219649225500923BIB038","doi-asserted-by":"crossref","unstructured":"Pagliardini, M, P Gupta and M Jaggi (2017). Unsupervised learning of sentence embeddings using compositional n-gram features. arXiv:1703.02507.","DOI":"10.18653\/v1\/N18-1049"},{"key":"S0219649225500923BIB039","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1162"},{"key":"S0219649225500923BIB040","unstructured":"Rong, X (2016). word2vec Parameter Learning Explained, arXiv:1411.2738, v4, Jun 2016."},{"key":"S0219649225500923BIB041","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511486975"},{"key":"S0219649225500923BIB042","doi-asserted-by":"publisher","DOI":"10.5220\/0011624800003393"},{"key":"S0219649225500923BIB043","volume-title":"Annual Conference of the North American Chapter of the Association for Computational Linguistics (ACL 2013)","author":"Socher R","year":"2013"},{"key":"S0219649225500923BIB044","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-54906-9_36"},{"key":"S0219649225500923BIB045","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-387-30164-8_652"},{"key":"S0219649225500923BIB046","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W15-1702"},{"key":"S0219649225500923BIB047","doi-asserted-by":"publisher","DOI":"10.1007\/s40595-016-0080-2"},{"key":"S0219649225500923BIB048","unstructured":"Wang, R (2006). Advanced techniques for Chinese chunk segmentation and similarity measure of Chinese sentences. Ph.D. Thesis, The Hong Kong Polytechnic University, Hong Kong."},{"key":"S0219649225500923BIB049","first-page":"1","volume-title":"ICLR","author":"Wieting J","year":"2016"},{"key":"S0219649225500923BIB050","doi-asserted-by":"publisher","DOI":"10.13053\/rcs-110-1-2"}],"container-title":["Journal of Information &amp; Knowledge Management"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0219649225500923","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,15]],"date-time":"2026-04-15T08:18:48Z","timestamp":1776241128000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0219649225500923"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,23]]},"references-count":50,"journal-issue":{"issue":"03","published-print":{"date-parts":[[2026,3]]}},"alternative-id":["10.1142\/S0219649225500923"],"URL":"https:\/\/doi.org\/10.1142\/s0219649225500923","relation":{},"ISSN":["0219-6492","1793-6926"],"issn-type":[{"value":"0219-6492","type":"print"},{"value":"1793-6926","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,9,23]]},"article-number":"2550092"}}