{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,4,1]],"date-time":"2022-04-01T22:52:08Z","timestamp":1648853528196},"reference-count":34,"publisher":"Cambridge University Press (CUP)","issue":"4","license":[{"start":{"date-parts":[[2012,2,10]],"date-time":"2012-02-10T00:00:00Z","timestamp":1328832000000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/www.cambridge.org\/core\/terms"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Nat. Lang. Eng."],"published-print":{"date-parts":[[2013,10]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In this paper, we discuss the results of a new unsupervised and computationally lightweight scoring of how two words are morphologically related to each other. This measure is meant to be an alternative to stemming, radicals (root) extraction, and morphological analysis in a wide range of applications; especially information extraction related ones. Compared to light stemming, which seems to be the most convenient approach for systems with efficiency concerns, our measure does not neglect unconditionally a prefix or a suffix as the light stemming does. Instead, our measure takes into account all letters of the word but with different weights. This prevents the missing of a significant letter. Compared to heavy stemming, morphological analysis, or radicals extraction, which rely on dictionaries and compatibility databases, our measure does not rely on any language-specific morphology knowledge. This makes our approach unsupervised and theoretically language independent and computationally much lighter. Our tests targeted Arabic: a Semitic language recognized to have a complex morphology due to its highly inflectional lexicon.<\/jats:p>","DOI":"10.1017\/s1351324912000071","type":"journal-article","created":{"date-parts":[[2012,2,10]],"date-time":"2012-02-10T13:23:05Z","timestamp":1328880185000},"page":"537-555","source":"Crossref","is-referenced-by-count":5,"title":["On morphological relatedness"],"prefix":"10.1017","volume":"19","author":[{"given":"AHMED","family":"KHORSI","sequence":"first","affiliation":[]}],"member":"56","published-online":{"date-parts":[[2012,2,10]]},"reference":[{"key":"S1351324912000071_ref20","doi-asserted-by":"crossref","first-page":"275","DOI":"10.1145\/564376.564425","volume-title":"Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval","author":"Larkey","year":"2002"},{"key":"S1351324912000071_ref29","first-page":"1","volume-title":"Semitic '07: Proceedings of the 2007 Workshop on Computational Approaches to Semitic Languages","author":"Smr\u017e","year":"2007"},{"key":"S1351324912000071_ref18","first-page":"60","volume-title":"Proceedings of the 8th Workshop of the ACL Special Interest Group in Computational Phonology (SIGPHON)","author":"Karagol-Ayan","year":"2006"},{"key":"S1351324912000071_ref23","doi-asserted-by":"publisher","DOI":"10.1075\/sspcl.5"},{"key":"S1351324912000071_ref13","doi-asserted-by":"crossref","DOI":"10.1016\/0020-0271(74)90044-8","volume-title":"Word Segmentation by Letter Successor Varieties","author":"Hafer","year":"1974"},{"key":"S1351324912000071_ref16","volume-title":"Handbook of the International Phonetic Association: A Guide to the Use of the International Phonetic Alphabet","year":"1999"},{"key":"S1351324912000071_ref6","first-page":"631","volume-title":"Proceedings of TREC 2002","author":"Chen","year":"2002"},{"key":"S1351324912000071_ref9","doi-asserted-by":"publisher","DOI":"10.1162\/coli.2008.07-002-R1-06-30"},{"key":"S1351324912000071_ref14","doi-asserted-by":"publisher","DOI":"10.2307\/411036"},{"key":"S1351324912000071_ref19","article-title":"Effective unsupervised Arabic word stemming: towards an unsupervised radicals extraction","volume":"9","author":"Khorsi","year":"2012","journal-title":"IAJIT"},{"key":"S1351324912000071_ref1","first-page":"664","volume-title":"Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management","author":"Aslam","year":"2005"},{"key":"S1351324912000071_ref2","doi-asserted-by":"publisher","DOI":"10.3115\/1118647.1118653"},{"key":"S1351324912000071_ref7","doi-asserted-by":"publisher","DOI":"10.3115\/1118647.1118650"},{"key":"S1351324912000071_ref10","first-page":"199","volume-title":"Proceedings of the 38th Annual Meeting on Association for Computational Linguistics","author":"de Roeck","year":"2000"},{"key":"S1351324912000071_ref15","doi-asserted-by":"publisher","DOI":"10.1016\/0022-0000(84)90025-4"},{"key":"S1351324912000071_ref24","first-page":"52","volume-title":"Proceedings of the 7th Workshop of the ACL Special Interest Group in Computational Phonology (SIGPHON)","author":"Monson","year":"2004"},{"key":"S1351324912000071_ref3","first-page":"91","article-title":"A Markovian approach for Arabic root extraction","volume":"8","author":"Boudlal","year":"2011","journal-title":"International Arab Journal of Information Technology"},{"key":"S1351324912000071_ref4","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1145\/345508.345543","volume-title":"Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval","author":"Buckley","year":"2000"},{"key":"S1351324912000071_ref27","first-page":"67","volume-title":"Proceedings of the Fourth Conference on Computational Natural Language Learning and of the Second Learning Language in Logic Workshop","author":"Schone","year":"2000"},{"key":"S1351324912000071_ref26","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2006.07.020"},{"key":"S1351324912000071_ref28","first-page":"1","volume-title":"Proceedings of the 6th Workshop of the ACL Special Interest Group in Computational Phonology (SIGPHON)","author":"Sharma","year":"2002"},{"key":"S1351324912000071_ref30","doi-asserted-by":"publisher","DOI":"10.3115\/1118647.1118649"},{"key":"S1351324912000071_ref31","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4020-6046-5"},{"key":"S1351324912000071_ref32","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/4775.001.0001","volume-title":"Morphology and Computation","author":"Sproat","year":"1992"},{"key":"S1351324912000071_ref22","first-page":"399","volume-title":"Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1","author":"Lee","year":"2003"},{"key":"S1351324912000071_ref12","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/4643.001.0001","volume-title":"The minimum description length principle","author":"Grnwald","year":"2007"},{"key":"S1351324912000071_ref34","unstructured":". 1965. . ."},{"key":"S1351324912000071_ref33","doi-asserted-by":"publisher","DOI":"10.1145\/267954.267957"},{"key":"S1351324912000071_ref17","volume-title":"Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition","author":"Jurafsky","year":"2009"},{"key":"S1351324912000071_ref5","first-page":"31","volume-title":"Proceedings of the Workshop on Computational Approaches to Arabic Script-Based Languages","author":"Buckwalter","year":"2004"},{"key":"S1351324912000071_ref8","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511546853"},{"key":"S1351324912000071_ref21","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4020-6046-5_12"},{"key":"S1351324912000071_ref11","doi-asserted-by":"publisher","DOI":"10.1162\/089120101750300490"},{"key":"S1351324912000071_ref25","first-page":"391","volume-title":"Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1","author":"Rogati","year":"2003"}],"container-title":["Natural Language Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.cambridge.org\/core\/services\/aop-cambridge-core\/content\/view\/S1351324912000071","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,12,28]],"date-time":"2021-12-28T18:26:10Z","timestamp":1640715970000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.cambridge.org\/core\/product\/identifier\/S1351324912000071\/type\/journal_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,2,10]]},"references-count":34,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2013,10]]}},"alternative-id":["S1351324912000071"],"URL":"https:\/\/doi.org\/10.1017\/s1351324912000071","relation":{},"ISSN":["1351-3249","1469-8110"],"issn-type":[{"value":"1351-3249","type":"print"},{"value":"1469-8110","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,2,10]]}}}