{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,3,2]],"date-time":"2024-03-02T15:24:16Z","timestamp":1709393056692},"reference-count":32,"publisher":"IGI Global","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2013,10,1]]},"abstract":"
In this paper, the authors propose a method for lexical enrichment of Arabic queries in order to improve the performance of the information retrieval systems SRI. This method has two types of enrichment: linguistic and contextual. The first one is based on the linguistic analysis (lemmatization, morphological, syntactic and semantic analysis), whose goal is to generate a descriptive list (list-desc). This list contains a set of linguistic lexicon assigned to each significant term in the query. The second enrichment consists in integrating contextual information derived from the corpus documents. It is based on statistical analysis using Salton weighting functions: TF-IDF and TF-IEF. The TF-IDF function is applied on the list-desc and documents in the corpus in order to identify relevant documents. TF-IEF function is made between the list-desc and sentences belonging to the relevant documents to identify relevant sentences. Then, terms in these sentences are weighted, and those with highest weights are considered rich in terms of informative and contextual importance are added to the original query. The authors' lexical enrichment method was evaluated on a corpus of documents belonging to a specialized domain and results show its interest in terms of precision and recall.<\/p>","DOI":"10.4018\/ijirr.2013100103","type":"journal-article","created":{"date-parts":[[2014,6,17]],"date-time":"2014-06-17T15:27:21Z","timestamp":1403018841000},"page":"35-51","source":"Crossref","is-referenced-by-count":10,"title":["Method of Lexical Enrichment in Information Retrieval System in Arabic"],"prefix":"10.4018","volume":"3","author":[{"given":"Souheyl","family":"Mallat","sequence":"first","affiliation":[{"name":"Department of Computer Sciences, University of Monastir, Monastir, Tunisia"}]},{"given":"Anis","family":"Zouaghi","sequence":"additional","affiliation":[{"name":"Department of Computer Sciences, Higher Institute of Applied Science and Technologies Sousse, Sousse University, Sousse, Tunisia"}]},{"given":"Emna","family":"Hkiri","sequence":"additional","affiliation":[{"name":"Department of Computer Sciences, University of Monastir, Monastir, Tunisia"}]},{"given":"Mounir","family":"Zrigui","sequence":"additional","affiliation":[{"name":"Department of Computer Sciences, University of Monastir, Monastir, Tunisia"}]}],"member":"2432","reference":[{"key":"ijirr.2013100103-0","unstructured":"Aliane, H., Alimazighi, Z., & Boughacha, R. (2004). Un Syst\u00e8me de reformulation de requ\u00eates pour la recherche d\u2019information. La revue de l'Information Scientifique et Technique (RIST), 14(1). Ann\u00e9e 2004. Retrieved from http:\/\/www.webreview.dz\/IMG\/pdf\/h.aliane.pdf"},{"key":"ijirr.2013100103-1","unstructured":"Andreewsky, A., Binquet, J. P., Debili, F., Fluhr, C., & Pouderoux, B. (1981). Le traitement linguistique et statistique des textes et son application dans la documentation juridique. Actes du Sixi\u00e8me Symposium sur l'Informatique Juridique en Europe, Thessaloniki, Gr\u00e8ce."},{"key":"ijirr.2013100103-2","doi-asserted-by":"publisher","DOI":"10.1145\/322017.322021"},{"key":"ijirr.2013100103-3","unstructured":"Baeza, R., & Ribeiro, B. (1999).Modern information retrieval. ACM Press Books, Addison-Wesley Edition."},{"key":"ijirr.2013100103-4","unstructured":"Bessou, S., Saadi, A., & Touahria, M. (2008). Vers une recherche d'information plus intelligente application \u00e0 la langue arabe. In Proceedings of the 1\u00e8re Conf\u00e9rence Internationale Syst\u00e8mes d\u2019Information et Intelligence Economique (SIIE\u20192008), Hammamet, Tunisie."},{"key":"ijirr.2013100103-5","author":"R.Blachere","year":"1975","journal-title":"Grammaires de l\u2019arabe classique morphologie et syntaxe. G.P"},{"key":"ijirr.2013100103-6","unstructured":"Boulaknadel, S. (2006). Utilisation des syntagmes nominaux dans un syst\u00e8me de recherche d\u2019information en langue arabe. Actes des 1\u00e8res Rencontres Jeunes Chercheurs en Recherche d\u2019Information. Retrieved from http:\/\/www.asso-aria.org\/coria\/2006\/341.pdf"},{"key":"ijirr.2013100103-7","unstructured":"Boulaknadel, & Daille. (2008). Utilisation des termes complexes dans un syst\u00e8me de recherche d\u2019information en langue arabe. In Proceedings of the 9es Journ\u00e9es internationales d\u2019Analyse statistique des Donn\u00e9es Textuelles (JADT 2008). Retrieved from http:\/\/lexicometrica.univ-paris3.fr\/jadt\/jadt2008\/pdf\/boulaknadel-daille-aboutajdibe.pdf"},{"key":"ijirr.2013100103-8","unstructured":"Boulaknadel, S. (2008). Traitement automatique des langues et recherche d\u2019information en langue arabe dans domaine de sp\u00e9cialit\u00e9."},{"key":"ijirr.2013100103-9","first-page":"59","author":"C.Buckley","year":"1992","journal-title":"Automatic retrieval with locality information using smart"},{"key":"ijirr.2013100103-10","doi-asserted-by":"crossref","unstructured":"Diab, M., Hacioglu, K., & Jurafsky, D. (2004). Automatic tagging of arabic text: From raw text to base phrase chunks. In Proceedings of NAACL-HLT, Boston, MA (pp. 149\u2013152).","DOI":"10.3115\/1613984.1614022"},{"key":"ijirr.2013100103-11","unstructured":"Farag, A., & Andreas, N. (2008). Improving Arabic text retrieval via detection of word form variations. In Proceedings of the 1\u00e8re Conf\u00e9rence Internationale Syst\u00e8mes d\u2019Information et Intelligence Economique (SIIE\u20192008), Hammamet \u2013 Tunisie."},{"key":"ijirr.2013100103-12","author":"S.Gauch","year":"1992","journal-title":"An expert system for automatic query reformulation. Technical report"},{"key":"ijirr.2013100103-13","unstructured":"Gaussier, E., Grefenstette, G., & Schulze, M. (1997). Traitement du langage naturel et recherche d\u2019informations: quelques exp\u00e9riences sur le fran\u00e7ais. In Proceedings of 1\u00e8res journ\u00e9es scientifiques et techniques du r\u00e9seau francophone de l\u2019ing\u00e9nierie de la langue de l\u2019AUPELF-UREF, Avignon, France (pp. 33\u201345)."},{"key":"ijirr.2013100103-14","unstructured":"Grabs, T., & Scheck, H. (2002). Flexible information retrieval from XML with PowerDB XML. In Proceedings in the First INEX Workshop (pp. 26-32)."},{"key":"ijirr.2013100103-15","unstructured":"Hadouche. (2005). Int\u00e9gration d\u2019information syntaxico-s\u00e9mantiques dans les bases de donn\u00e9es terminologiques: M\u00e9thodologie d\u2019annotation et perspective d\u2019automatisation."},{"key":"ijirr.2013100103-16","unstructured":"Ihadjaden, M. (1994). Conception, r\u00e9alisation et \u00e9valuation d\u2019un syst\u00e8me de recherche et de cat\u00e9gorisation automatique d\u2019information textuelle sur Internet. Th\u00e8se de l\u2019universit\u00e9 ParisIV."},{"key":"ijirr.2013100103-17","unstructured":"Khoja, S., & Garside, S. (1999). Stemming Arabic text. Technical report, Computing department, Lancaster University, Lancaster, UK. Retrieved fro. http:\/\/www.comp.lancs.ac.uk\/computing\/users\/khoja\/stemmer.ps"},{"key":"ijirr.2013100103-18","doi-asserted-by":"crossref","unstructured":"Koolen, M., Kazai, G., & Craswell, N. (2009). Wikipedia pages as entry points for book search. In Proceedings of the Second ACM International Conference on Web Search and Data Mining (WSDM \u201909) (pp. 44\u201353). New York, NY: ACM.","DOI":"10.1145\/1498759.1498807"},{"key":"ijirr.2013100103-19","unstructured":"Laib, M., Semmar, N., & Fluhr, C. (2006). Utilisation d\u2019une approche linguistique pour l\u2019indexation et l\u2019interrogation en langage naturel de bases de donn\u00e9es textuelles multilingues. Actes du Premier Colloque International sur le Traitement Automatique de la Langue Arabe."},{"key":"ijirr.2013100103-20","doi-asserted-by":"crossref","unstructured":"Li, Y., Luk, W., Ho, K., & Chung, F. (2007). Improving weak ad-hoc queries using Wikipedia as external corpus. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR\u201907) (pp. 797\u2013798). New York, NY: ACM.","DOI":"10.1145\/1277741.1277914"},{"key":"ijirr.2013100103-21","doi-asserted-by":"crossref","unstructured":"Milne, D., Witten, I., & Nichols, D. (2007). A knowledge-based search engine powered by wikipedia. In Proceedings of the Sixteenth ACM Conference on Conference on Information and Knowledge Management (CIKM \u201907) (pp. 445\u2013454). New York, NY: ACM.","DOI":"10.1145\/1321440.1321504"},{"key":"ijirr.2013100103-22","unstructured":"Mitra, M., Buckley, C., Singhal, A., & Cardi, C. (1997). In analysis of statistical and syntactic phrases. In Proceedings of the 5\u00e8me Conf\u00e9rence de Recherche d\u2019Information Assist\u00e9e par Ordinateur (RIAO), Montreal, Canada (pp. 200-214)."},{"key":"ijirr.2013100103-23","first-page":"129","volume":"27","author":"S.Robertson","year":"1976","journal-title":"Relevance Weighting of Search Terms"},{"issue":"4","key":"ijirr.2013100103-24","first-page":"6","article-title":"Text and web mining approaches in order to build specialized ontologies.","volume":"10","author":"M.Roche","year":"2009","journal-title":"Journal of Digital Information"},{"key":"ijirr.2013100103-25","unstructured":"Salton, G., Fox, E., & Wu, H. (2008). Extended Boolean information retrieval."},{"key":"ijirr.2013100103-26","unstructured":"Salton & McGill. M. (1986). Introduction to modern information retrieval. New York, NY: McGraw-Hill Inc."},{"key":"ijirr.2013100103-27","doi-asserted-by":"crossref","unstructured":"Salton. & Wong. (1975). A vector space model for automatic indexing. Commun of the ACM, 18(11), 613\u2013620.","DOI":"10.1145\/361219.361220"},{"key":"ijirr.2013100103-28","author":"M.Silberztein","year":"1993","journal-title":"Dictionnaires \u00e9lectroniques et analyse automatique de textes, Le syst\u00e8me INTEX"},{"key":"ijirr.2013100103-29","doi-asserted-by":"crossref","unstructured":"Silberztein & Intex. (1999). A finite state transducer toolbox. Theorical compter Scinece, 231(1), 33-46.","DOI":"10.1016\/S0304-3975(99)00015-8"},{"key":"ijirr.2013100103-30","doi-asserted-by":"crossref","unstructured":"Xu, Y., Jones, G., & Wang, B. (2009). Query dependent pseudo-relevance feedback based on Wikipedia. In Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR \u201909) (pp. 59\u201366). New York, NY: ACM.","DOI":"10.1145\/1571941.1571954"},{"key":"ijirr.2013100103-31","unstructured":"Yannick, P. (2000). Mod\u00e9lisation de documents audiovisuels en Strates Interconnect\u00e9es par les Annotations pour l'exploitation contextuelle. Th\u00e8se disponible sur l\u2019url. Retrieved from http:\/\/lisi.insa-lyon.fr\/~yprie\/these\/node1.html"}],"container-title":["International Journal of Information Retrieval Research"],"original-title":[],"language":"ng","link":[{"URL":"https:\/\/www.igi-global.com\/viewtitle.aspx?TitleId=109661","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,6,1]],"date-time":"2022-06-01T17:33:41Z","timestamp":1654104821000},"score":1,"resource":{"primary":{"URL":"https:\/\/services.igi-global.com\/resolvedoi\/resolve.aspx?doi=10.4018\/ijirr.2013100103"}},"subtitle":[""],"short-title":[],"issued":{"date-parts":[[2013,10,1]]},"references-count":32,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2013,10]]}},"URL":"http:\/\/dx.doi.org\/10.4018\/ijirr.2013100103","relation":{},"ISSN":["2155-6377","2155-6385"],"issn-type":[{"value":"2155-6377","type":"print"},{"value":"2155-6385","type":"electronic"}],"subject":["General Medicine"],"published":{"date-parts":[[2013,10,1]]}}}