{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,25]],"date-time":"2026-04-25T01:48:39Z","timestamp":1777081719179,"version":"3.51.4"},"reference-count":65,"publisher":"World Scientific Pub Co Pte Ltd","issue":"04","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Info. Know. Mgmt."],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:p>Semantic similarity is the task of measuring relations between sentences or words to determine the degree of similarity or resemblance. Several applications of natural language processing require semantic similarity measurement to achieve good results; these applications include plagiarism detection, text entailment, text summarisation, paraphrasing identification, and information extraction. Many researchers have proposed new methods to measure the semantic similarity of Arabic and English texts. In this research, these methods are reviewed and compared. Results show that the precision of the corpus-based approach exceeds 0.70. The precision of the descriptive feature-based technique is between 0.670 and 0.86, with a Pearson correlation coefficient of over 0.70. Meanwhile, the word embedding technique has a correlation of 0.67, and its accuracy is in the range 0.76\u20130.80. The best results are achieved by the feature-based approach.<\/jats:p>","DOI":"10.1142\/s0219649220500331","type":"journal-article","created":{"date-parts":[[2020,12,2]],"date-time":"2020-12-02T08:22:28Z","timestamp":1606897348000},"page":"2050033","source":"Crossref","is-referenced-by-count":10,"title":["Semantic Similarity for English and Arabic Texts: A Review"],"prefix":"10.1142","volume":"19","author":[{"given":"Marwah","family":"Alian","sequence":"first","affiliation":[{"name":"Princess Sumaya University for Technology, Amman, Jordan"},{"name":"Hashemite University, Zarqa, Jordan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Arafat","family":"Awajan","sequence":"additional","affiliation":[{"name":"Princess Sumaya University for Technology, Amman, Jordan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2020,12,2]]},"reference":[{"key":"S0219649220500331BIB001","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"crossref","first-page":"305","DOI":"10.1007\/978-3-540-85836-2_29","volume-title":"Data Warehousing and Knowledge Discovery","volume":"5182","author":"Achananuparp P","year":"2008"},{"key":"S0219649220500331BIB002","doi-asserted-by":"crossref","first-page":"252","DOI":"10.18653\/v1\/S15-2045","volume-title":"Proc. 9th Int. Workshop on Semantic Evaluation (SemEval 2015)","author":"Agirrea E","year":"2015"},{"key":"S0219649220500331BIB003","first-page":"155","volume-title":"Proc. 11th Int. Conf. Information and Communication Systems","author":"Alian M","year":"2020"},{"key":"S0219649220500331BIB004","doi-asserted-by":"crossref","first-page":"504","DOI":"10.1109\/SMC.2013.92","volume-title":"2013 IEEE Int. Conf. Systems, Man and Cybernetics (SMC)","author":"Almarsoomi FA","year":"2013"},{"key":"S0219649220500331BIB005","first-page":"87","volume-title":"ICALLL (WASET)","volume":"70","author":"Almarsoomi FO","year":"2012"},{"issue":"1","key":"S0219649220500331BIB006","first-page":"85","volume":"21","author":"Al-Ramahi MA","year":"2012","journal-title":"Abhath AL-Yarmouk: Basic Sciences & Engineering"},{"issue":"1","key":"S0219649220500331BIB007","doi-asserted-by":"crossref","first-page":"1","DOI":"10.3844\/jcssp.2016.1.18","volume":"12","author":"Alzahrani S","year":"2016","journal-title":"Journal of Computer Sciences"},{"key":"S0219649220500331BIB008","first-page":"1","volume-title":"2015 IEEE\/ACS 12th Int. Conf. Computer Systems and Applications (AICCSA)","author":"Aouicha MB","year":"2015"},{"issue":"2","key":"S0219649220500331BIB009","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1007\/s10772-015-9284-6","volume":"19","author":"Awajan A","year":"2016","journal-title":"International Journal of Speech Technology"},{"key":"S0219649220500331BIB010","first-page":"1083","volume-title":"Proc. National Conf. Undergraduate Research (NCUR) 2014","author":"Boling C","year":"2014"},{"key":"S0219649220500331BIB011","first-page":"1","volume-title":"The 11th Int. Workshop on Semantic Evaluation (SemEval-2017)","author":"CER DD-G","year":"2017"},{"issue":"4","key":"S0219649220500331BIB012","first-page":"300","volume":"1","author":"Cha S","year":"2007","journal-title":"International Journal of Mathematical Models and Methods in Applied Sciences"},{"key":"S0219649220500331BIB013","first-page":"1009","volume-title":"21st Int. Conf. Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics","author":"Chen HH","year":"2006"},{"key":"S0219649220500331BIB014","first-page":"121","volume-title":"The Int. World Wide Web Conf. Committee (IW3C2)","author":"Chim H","year":"2007"},{"key":"S0219649220500331BIB015","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1007\/11736790_9","volume-title":"Machine Learning Challenges. Evaluating Predictive Uncertainty, Visual Object Classification, and Recognising Tectual Entailment. MLCW 2005","volume":"3944","author":"Dagan I","year":"2005"},{"key":"S0219649220500331BIB016","volume-title":"Microsoft Research Paraphrase Corpus","author":"Dolan BB","year":"2005"},{"key":"S0219649220500331BIB017","first-page":"228","volume-title":"Proc. 2007 Joint Conf. Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)","author":"Erkan G","year":"2007"},{"issue":"1","key":"S0219649220500331BIB018","doi-asserted-by":"crossref","first-page":"116","DOI":"10.1145\/503104.503110","volume":"20","author":"Finkelstein LG","year":"2002","journal-title":"ACM Transactions on Information Systems"},{"key":"S0219649220500331BIB019","doi-asserted-by":"crossref","DOI":"10.1002\/9781118712696","volume-title":"LMF Lexical Markup Framework","author":"Francopoulo G","year":"2013"},{"key":"S0219649220500331BIB020","first-page":"1","volume-title":"5th Int. Symp. I\/V Communications and Mobile Network","author":"Froud H","year":"2010"},{"key":"S0219649220500331BIB021","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1109\/CIST.2012.6388065","volume-title":"2012 Colloquium in Information Science and Technology","author":"Froud H","year":"2012"},{"issue":"13","key":"S0219649220500331BIB022","first-page":"0975","volume":"68","author":"Gomaa WF","year":"2013","journal-title":"Journal of Computer Applications"},{"key":"S0219649220500331BIB023","volume-title":"The Joint SIGDAT Conf. Empirical Methods in Natural Language Processing and Very Large Corpora.","author":"Hatzivassiloglou V","year":"1999"},{"key":"S0219649220500331BIB024","doi-asserted-by":"crossref","first-page":"1576","DOI":"10.18653\/v1\/D15-1181","volume-title":"Proc. 2015 Conf. Empirical Methods in Natural Language Processing","author":"He H","year":"2015"},{"key":"S0219649220500331BIB025","first-page":"192","volume-title":"The 17th Annual Int. ACM SIGIR Conf. Research and Development in Information Retrieval","author":"Hersh WB","year":"1994"},{"issue":"6","key":"S0219649220500331BIB026","doi-asserted-by":"crossref","first-page":"1098","DOI":"10.1109\/TFUZZ.2010.2065811","volume":"18","author":"Huang HH","year":"2010","journal-title":"IEEE Transactions on Fuzzy Systems"},{"key":"S0219649220500331BIB027","first-page":"49","volume-title":"Proc. New Zealand Computer Science Research Student Conf. 2008","author":"Huang M","year":"2008"},{"key":"S0219649220500331BIB028","first-page":"269","volume-title":"SAI Computing Conf.","author":"Hussein AS","year":"2016"},{"issue":"2","key":"S0219649220500331BIB029","first-page":"25","volume":"2","author":"Islam A","year":"2005","journal-title":"ACM Transactions on Knowledge Discovery from Data"},{"key":"S0219649220500331BIB030","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1016\/j.ipm.2015.01.001","volume":"51","author":"Jiang Y","year":"2015","journal-title":"Information Processing and Management"},{"issue":"1","key":"S0219649220500331BIB031","first-page":"152","volume":"58","author":"Kadhem SM","year":"2017","journal-title":"Iraqi Journal of Science"},{"key":"S0219649220500331BIB032","first-page":"1411","volume-title":"24th ACM Int. Conf. Information and Knowledge Management (CIKM \u201915)","author":"Kenter T","year":"2015"},{"issue":"2","key":"S0219649220500331BIB033","first-page":"285","volume":"25","author":"Kintsch W","year":"1998","journal-title":"Discourse Processes"},{"key":"S0219649220500331BIB034","first-page":"412","volume-title":"19th Annual Meeting of the Cognitive Science Society","author":"Landauer TK","year":"1997"},{"key":"S0219649220500331BIB035","first-page":"4137","volume-title":"Proc. Twenty-Seventh Int. Joint Conf. Artificial Intelligence (IJCAI-18)","author":"Le Y","year":"2018"},{"issue":"2","key":"S0219649220500331BIB036","doi-asserted-by":"crossref","first-page":"265","DOI":"10.7551\/mitpress\/7287.003.0018","volume":"49","author":"Leacock C","year":"1998","journal-title":"WordNet: An Electronic Lexical Database"},{"issue":"4","key":"S0219649220500331BIB037","doi-asserted-by":"crossref","first-page":"871","DOI":"10.1109\/TKDE.2003.1209005","volume":"15","author":"Li Y","year":"2003","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"issue":"4","key":"S0219649220500331BIB038","doi-asserted-by":"crossref","first-page":"871","DOI":"10.1109\/TKDE.2003.1209005","volume":"15","author":"Li Y","year":"2003","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"issue":"8","key":"S0219649220500331BIB039","doi-asserted-by":"crossref","first-page":"1138","DOI":"10.1109\/TKDE.2006.130","volume":"18","author":"Li Y","year":"2006","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"S0219649220500331BIB041","first-page":"129","volume-title":"24th Int. Symp. Computer and Information Sciences","author":"Madylova A","year":"2009"},{"key":"S0219649220500331BIB042","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1007\/978-3-030-23281-8_1","volume-title":"Natural Language Processing and Information Systems. NLDB 2019.","volume":"11608","author":"Mahmoud AZM","year":"2019"},{"key":"S0219649220500331BIB043","doi-asserted-by":"crossref","first-page":"9263","DOI":"10.1007\/s13369-019-04039-7","volume":"44","author":"Mahmoud AZ","year":"2019","journal-title":"Arabian Journal for Science and Engineering"},{"issue":"3","key":"S0219649220500331BIB044","first-page":"81","volume":"5","author":"Meng L","year":"2012","journal-title":"International Journal of Grid and Distributed Computing"},{"key":"S0219649220500331BIB047","series-title":"Communications in Computer and Information Science","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1007\/978-3-319-73500-9_2","volume-title":"Arabic Language Processing: From Theory to Practice. ICALP 2017","volume":"782","author":"Nagoudi EMB","year":"2018"},{"key":"S0219649220500331BIB048","doi-asserted-by":"crossref","first-page":"18","DOI":"10.18653\/v1\/W17-1303","volume-title":"Proc. Third Arabic Natural Language Processing Workshop (WANLP)","author":"Nagoudi EMB","year":"2017"},{"key":"S0219649220500331BIB049","first-page":"528","volume-title":"Proc. 2018 Conf. North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Pagliardini M","year":"2018"},{"key":"S0219649220500331BIB051","first-page":"1341","volume-title":"Proc. 51st Annual Meeting of the Association for Computational Linguistics","author":"Pilehvar MT","year":"2013"},{"key":"S0219649220500331BIB052","first-page":"74","volume-title":"Artificial Intelligence and Natural Language and Information Extraction, Social Media and Web Search FRUCT Conf. (AINL-ISMW FRUCT)","author":"Pronoza E","year":"2015"},{"key":"S0219649220500331BIB053","first-page":"54","volume-title":"Proc. 2009 Workshop on Text and Citation Analysis for Scholarly Digital Libraries. Association for Computational Linguistics","author":"Radev DR","year":"2009"},{"key":"S0219649220500331BIB054","first-page":"337","volume-title":"The 20th Int. Conf. World Wide Web (WWW \u201811)","author":"Radinsky KA","year":"2011"},{"key":"S0219649220500331BIB055","first-page":"448","volume-title":"Proc. 14th Int. Joint Conf. Artificial Intelligence","author":"Resnik P","year":"1995"},{"issue":"10","key":"S0219649220500331BIB056","doi-asserted-by":"crossref","first-page":"627","DOI":"10.1145\/365628.365657","volume":"8","author":"Rubenstein HG","year":"1965","journal-title":"Communications of the ACM"},{"key":"S0219649220500331BIB057","first-page":"377","volume-title":"Proc. 15th Int. World Wide Web Conf.","author":"Sahami M","year":"2006"},{"key":"S0219649220500331BIB058","volume-title":"CS224d: Deep Learning for Natural Language Processing","author":"Sanborn AA","year":"2015"},{"key":"S0219649220500331BIB059","first-page":"460","volume-title":"Proc. Int. Conf. Computer and Communication Engineering 2008","author":"Selamat A","year":"2008"},{"key":"S0219649220500331BIB060","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1007\/978-3-642-41968-3_22","volume":"282","author":"Soori H","year":"2013","journal-title":"Lecture Notes in Electrical Engineering"},{"key":"S0219649220500331BIB061","doi-asserted-by":"crossref","first-page":"241","DOI":"10.3115\/v1\/S14-2039","volume-title":"Proc. 8th Int. Workshop on Semantic Evaluation (SemEval 2014)","author":"Sultan Md A","year":"2014"},{"key":"S0219649220500331BIB062","doi-asserted-by":"crossref","first-page":"260","DOI":"10.1016\/j.knosys.2013.06.015","volume":"50","author":"Taieb MAH","year":"2013","journal-title":"Knowledge-Based Systems"},{"key":"S0219649220500331BIB063","doi-asserted-by":"crossref","first-page":"515","DOI":"10.1007\/978-3-319-19644-2_43","volume-title":"Int. Conf. Hybrid Artificial Intelligence Systems","author":"Taieb MAH","year":"2015"},{"key":"S0219649220500331BIB064","first-page":"1136","volume-title":"Proc. Nineteenth Int. Joint Conf. Artificial Intelligence (IJCAI)","author":"Turney P","year":"2005"},{"key":"S0219649220500331BIB065","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1007\/s40595-016-0080-2","volume":"4","author":"Wali W","year":"2017","journal-title":"Vietnam Journal of Computer Science"},{"issue":"4","key":"S0219649220500331BIB066","first-page":"627","volume":"21","author":"Wali W","year":"2017","journal-title":"Computaci\u00f3n y Sistemas"},{"key":"S0219649220500331BIB067","first-page":"131","volume-title":"Proc. Australasian Language Technology Workshop","author":"Wan S","year":"2006"},{"key":"S0219649220500331BIB068","first-page":"133","volume-title":"Proc. Annual Meeting of the Association for Computational Linguistics","author":"Wu Z","year":"1994"},{"key":"S0219649220500331BIB069","first-page":"256","volume-title":"3rd Int. Conf. Intelligent System and Knowledge Engineering (ISKE)","volume":"1","author":"Zhou ZW","year":"2008"}],"container-title":["Journal of Information &amp; Knowledge Management"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0219649220500331","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,18]],"date-time":"2024-08-18T11:33:09Z","timestamp":1723980789000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0219649220500331"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,12]]},"references-count":65,"journal-issue":{"issue":"04","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["10.1142\/S0219649220500331"],"URL":"https:\/\/doi.org\/10.1142\/s0219649220500331","relation":{},"ISSN":["0219-6492","1793-6926"],"issn-type":[{"value":"0219-6492","type":"print"},{"value":"1793-6926","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,12]]}}}