{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,25]],"date-time":"2026-01-25T04:05:53Z","timestamp":1769313953872,"version":"3.49.0"},"reference-count":56,"publisher":"SAGE Publications","issue":"3","license":[{"start":{"date-parts":[[2023,7,22]],"date-time":"2023-07-22T00:00:00Z","timestamp":1689984000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Information Science"],"published-print":{"date-parts":[[2025,6]]},"abstract":"<jats:p>\n            In this article, a pseudo-relevance feedback (PRF)\u2013based framework is presented for effective query expansion (QE). As candidate expansion terms, the proposed PRF framework considers the terms that are different morphological variants of the original query terms and are semantically close to them. This strategy of selecting expansion terms is expected to preserve the query intent after expansion. While judging the suitability of an expansion term with respect to a base query, two aspects of relation of the term with the query are considered. The first aspect probes to what extent the candidate term is semantically\n            <jats:italic>linked<\/jats:italic>\n            to the original query and the second one checks the extent to which the candidate term can\n            <jats:italic>supplement<\/jats:italic>\n            the base query terms. The semantic relationship between a query and expansion terms is modelled using bidirectional encoder representations from transformers (BERT). The degree of similarity is used to estimate the relative importance of the expansion terms with respect to the query. The quantified relative importance is used to assign weights of the expansion terms in the final query. Finally, the expansion terms are grouped into semantic clusters to strengthen the original query intent. A set of experiments was performed on three different Text REtrieval Conference (TREC) collections to experimentally validate the effectiveness of the proposed QE algorithm. The results show that the proposed QE approach yields competitive retrieval effectiveness over the existing state-of-the-art PRF methods in terms of the mean average precision (MAP) and precision P at position 10 (P@10).\n          <\/jats:p>","DOI":"10.1177\/01655515231184831","type":"journal-article","created":{"date-parts":[[2023,7,22]],"date-time":"2023-07-22T12:34:46Z","timestamp":1690029286000},"page":"604-622","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":2,"title":["Semantics-aware query expansion using pseudo-relevance feedback"],"prefix":"10.1177","volume":"51","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3163-4367","authenticated-orcid":false,"given":"Pankaj","family":"Singh","sequence":"first","affiliation":[{"name":"Indian Institute of Technology Kharagpur, India"},{"name":"Indian Institute of Technology Kharagpur, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6573-0093","authenticated-orcid":false,"given":"Plaban Kumar","family":"Bhowmick","sequence":"additional","affiliation":[{"name":"Indian Institute of Technology Kharagpur, India"}]}],"member":"179","published-online":{"date-parts":[[2023,7,22]]},"reference":[{"key":"e_1_3_3_2_2","doi-asserted-by":"publisher","DOI":"10.1145\/2744199"},{"key":"e_1_3_3_3_2","first-page":"2011","volume-title":"Proceedings of the 18th ACM conference on information and knowledge management, CIKM \u201909","author":"He B","unstructured":"He B, Ounis I. Finding good feedback documents. In: Proceedings of the 18th ACM conference on information and knowledge management, CIKM \u201909, Hong Kong, China, 2\u20136 November 2009, pp. 2011\u20132014. New York: Association for Computing Machinery."},{"key":"e_1_3_3_4_2","first-page":"711","volume-title":"Proceedings of the 30th annual international ACM SIGIR conference on research and development in information retrieval, SIGIR \u201907","author":"Ko Y","unstructured":"Ko Y, An H, Seo J. An effective snippet generation method using the pseudo relevance feedback technique. In: Proceedings of the 30th annual international ACM SIGIR conference on research and development in information retrieval, SIGIR \u201907, Amsterdam, 23\u201327 July 2007, pp. 711\u2013712. New York: Association for Computing Machinery."},{"key":"e_1_3_3_5_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2017.09.001"},{"issue":"4","key":"e_1_3_3_6_2","first-page":"29","article-title":"Effective and robust query-based stemming","volume":"31","author":"Paik JH","unstructured":"Paik JH, Parui SK, Pal D, et al. Effective and robust query-based stemming. ACM Trans Inf Syst 31(4): 18: 29.","journal-title":"ACM Trans Inf Syst"},{"key":"e_1_3_3_7_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2014.08.006"},{"key":"e_1_3_3_8_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2019.05.025"},{"key":"e_1_3_3_9_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2014.07.004"},{"key":"e_1_3_3_10_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2015.09.002"},{"key":"e_1_3_3_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2019.05.009"},{"key":"e_1_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.1177\/0165551518792210"},{"key":"e_1_3_3_13_2","doi-asserted-by":"publisher","DOI":"10.1145\/383952.383972"},{"key":"e_1_3_3_14_2","doi-asserted-by":"publisher","DOI":"10.1177\/0165551518816302"},{"key":"e_1_3_3_15_2","volume-title":"SIGIR \u201916: the 39th international ACM SIGIR conference on research and development in Information Retrieva","author":"Zhang Z","unstructured":"Zhang Z, Wang Q, Si L, et al. Learning for efficient supervised query expansion via two-stage feature selection. In: SIGIR \u201916: the 39th international ACM SIGIR conference on research and development in Information Retrieva, Pisa, 17\u201321 July 2016. New York: Association for Computing Machinery."},{"key":"e_1_3_3_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2018.2876425"},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.1177\/0165551519863346"},{"key":"e_1_3_3_18_2","first-page":"313","volume-title":"The Smart retrieval system-experiments in automatic document processing","author":"Rocchio J","year":"1971","unstructured":"Rocchio J. Relevance feedback in information retrieval. In: Rocchio J (ed.) The Smart retrieval system-experiments in automatic document processing. Englewood Cliffs, NJ: Prentice-Hall, 1971, pp. 313\u2013323."},{"key":"e_1_3_3_19_2","doi-asserted-by":"crossref","unstructured":"Stephen Robertson S Walker S Jones MM et al. Okapi at TREC-3. In: Harman DK (ed.) Overview of the Third Text REtrieval Conference (TREC-3). Gaithersburg MD: NIST 1995 pp. 109\u2013126 https:\/\/www.microsoft.com\/en-us\/research\/publication\/okapi-at-trec-3\/","DOI":"10.6028\/NIST.SP.500-225.routing-city"},{"key":"e_1_3_3_20_2","first-page":"143","volume-title":"Relevance weighting of search terms","author":"Robertson SE","year":"1988","unstructured":"Robertson SE, Jones KS. Relevance weighting of search terms. Dublin: Taylor Graham Publishing, pp. 143\u2013160, 1988."},{"key":"e_1_3_3_21_2","first-page":"1483","volume-title":"Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, CIKM 16","author":"Zamani H","unstructured":"Zamani H, Dadashkarimi J, Shakery A, et al. Pseudo-relevance feedback based on matrix factorization. In: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, CIKM 16, Indianapolis, IN, 24\u201328 October 2016, pp. 1483\u20131492. New York: Association for Computing Machinery."},{"key":"e_1_3_3_22_2","doi-asserted-by":"publisher","DOI":"10.1145\/3307624.3307626"},{"issue":"3","key":"e_1_3_3_23_2","first-page":"744","article-title":"Document summarization using NMF and pseudo relevance feedback based on k-means clustering","volume":"35","author":"Park S","year":"2016","unstructured":"Park S, Cha B, Kim JW. Document summarization using NMF and pseudo relevance feedback based on k-means clustering. Comput Inform 2016; 35(3): 744\u2013760.","journal-title":"Comput Inform"},{"key":"e_1_3_3_24_2","doi-asserted-by":"publisher","DOI":"10.1145\/3130348.3130364"},{"key":"e_1_3_3_25_2","first-page":"1929","volume-title":"Proceedings of the 25th ACM international on conference on information and knowledge management, CIKM 16","author":"Kuzi S","unstructured":"Kuzi S, Shtok A, Kurland O. Query expansion using word embeddings. In: Proceedings of the 25th ACM international on conference on information and knowledge management, CIKM 16, Indianapolis, IN, 24\u201328 October 2016, pp. 1929\u20131932. New York: Association for Computing Machinery."},{"key":"e_1_3_3_26_2","doi-asserted-by":"publisher","DOI":"10.1186\/s12911-019-0986-6"},{"key":"e_1_3_3_27_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-018-1269-8"},{"key":"e_1_3_3_28_2","doi-asserted-by":"publisher","DOI":"10.1002\/asi.24241"},{"key":"e_1_3_3_29_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2019.05.005"},{"key":"e_1_3_3_30_2","doi-asserted-by":"publisher","DOI":"10.1177\/0165551518799637"},{"key":"e_1_3_3_31_2","first-page":"123","volume-title":"Proceedings of the 2016 ACM international conference on the theory of information retrieval, ICTIR \u201916","author":"Zamani H","unstructured":"Zamani H, Bruce Croft W. Estimating embedding vectors for queries. In: Proceedings of the 2016 ACM international conference on the theory of information retrieval, ICTIR \u201916, Newark, DE, 12\u201316 September 2016, pp. 123\u2013132. New York: Association for Computing Machinery."},{"key":"e_1_3_3_32_2","first-page":"795","volume-title":"Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, SIGIR \u201915","author":"Ganguly D","unstructured":"Ganguly D, Roy D, Mitra M, et al. Word embedding based generalized language model for information retrieval. In: Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, SIGIR \u201915, Santiago, Chile, 9\u201313 August 2015, pp. 795\u2013798. New York: Association for Computing Machinery."},{"key":"e_1_3_3_33_2","first-page":"54","volume-title":"Proceedings of the 23rd international conference on computational linguistics: Posters, COLING 10","author":"Bernhard D","unstructured":"Bernhard D. Query expansion based on pseudo relevance feedback from definition clusters. In: Proceedings of the 23rd international conference on computational linguistics: Posters, COLING 10, Beijing, China, 23\u201327 2010, pp. 54\u201362. New York: Association for Computational Linguistics"},{"key":"e_1_3_3_34_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2013.01.001"},{"key":"e_1_3_3_35_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2016.07.004"},{"key":"e_1_3_3_36_2","doi-asserted-by":"publisher","DOI":"10.1145\/2983323.2983750"},{"key":"e_1_3_3_37_2","doi-asserted-by":"crossref","unstructured":"Zamani H Bruce Croft W. Embedding-based query language models. In: Carterette B Fang H Lalmas M et al. (eds) ICTIR. New York: ACM 2016 pp. 147\u2013156 http:\/\/dblp.uni-trier.de\/db\/conf\/ictir\/ictir2016.html#ZamaniC16a","DOI":"10.1145\/2970398.2970405"},{"key":"e_1_3_3_38_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2019.04.007"},{"key":"e_1_3_3_39_2","first-page":"1771","volume-title":"Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery and data mining, KDD \u201919","author":"Han FX","unstructured":"Han FX, Niu D, Chen H, et al. A deep generative approach to search extrapolation and recommendation. In: Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery and data mining, KDD \u201919, Anchorage, AK, 4\u20138 August 2019, pp. 1771\u20131779. New York: Association for Computing Machinery."},{"key":"e_1_3_3_40_2","doi-asserted-by":"publisher","DOI":"10.1145\/3308558.3313412"},{"key":"e_1_3_3_41_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2019.102182"},{"key":"e_1_3_3_42_2","first-page":"83","volume-title":"Proceedings of the 25th international conference companion on World Wide Web, WWW \u201916 Companion","author":"Nalisnick E","unstructured":"Nalisnick E, Mitra B, Craswell N, et al. Improving document ranking with dual word embeddings. In: Proceedings of the 25th international conference companion on World Wide Web, WWW \u201916 Companion, Montreal, QC, Canada, 11\u201315 April 2016, pp. 83\u201384. New York: Association for Computing Machinery."},{"key":"e_1_3_3_43_2","unstructured":"Devlin J Chang MW Lee K et al. BERT: pre-training of deep bidirectional transformers for language understanding 2018 http:\/\/arxiv.org\/abs\/1810.04805."},{"key":"e_1_3_3_44_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2020.102289"},{"key":"e_1_3_3_45_2","unstructured":"Nogueira R Cho K. Passage re-ranking with BERT 2019 http:\/\/arxiv.org\/abs\/1901.04085"},{"key":"e_1_3_3_46_2","first-page":"3490","volume-title":"Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP)","author":"Yilmaz ZA","unstructured":"Yilmaz ZA, Yang W, Zhang H, et al. Cross-domain modeling of sentence-level evidence for document retrieval. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), Hong Kong, China, 3\u20137 November 2019, pp. 3490\u20133496. New York: Association for Computational Linguistics."},{"key":"e_1_3_3_47_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2020.102342"},{"key":"e_1_3_3_48_2","volume-title":"Foundations of statistical natural language processing","author":"Manning C","year":"1999","unstructured":"Manning C, Sch\u00fctze H. Foundations of statistical natural language processing. Cambridge, MA: MIT press, 1999."},{"key":"e_1_3_3_49_2","doi-asserted-by":"publisher","DOI":"10.1177\/0165551519860043"},{"key":"e_1_3_3_50_2","doi-asserted-by":"publisher","DOI":"10.1145\/333135.333138"},{"key":"e_1_3_3_51_2","first-page":"313","volume-title":"Readings in information retrieval","author":"Porter MF","unstructured":"Porter MF. An algorithm for suffix stripping. In: Robertson SE (ed.) Readings in information retrieval. San Francisco, CA: Morgan Kaufmann Publishers, pp. 313\u2013316."},{"key":"e_1_3_3_52_2","unstructured":"Allan J Connell M Croft WB et al. Inquery and trec-9 https:\/\/www.researchgate.net\/publication\/221037360_INQUERY_and_TREC-9"},{"key":"e_1_3_3_53_2","first-page":"303","volume-title":"Proceedings of the 30th annual international ACM SIGIR conference on research and development in information retrieval, SIGIR \u201907","author":"Collins-Thompson K","unstructured":"Collins-Thompson K, Callan J. Estimation and use of uncertainty in pseudo-relevance feedback. In: Proceedings of the 30th annual international ACM SIGIR conference on research and development in information retrieval, SIGIR \u201907, Amsterdam, 23\u201327 July 2007, pp. 303\u2013310. New York: Association for Computing Machinery."},{"key":"e_1_3_3_54_2","first-page":"154","volume-title":"Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval, SIGIR 06","author":"Diaz F","unstructured":"Diaz F, Metzler D. Improving the estimation of relevance models using large external corpora. In: Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval, SIGIR 06, Seattle, WA, 6\u201311 August 2006, pp. 154\u2013161. New York: Association for Computing Machinery."},{"key":"e_1_3_3_55_2","first-page":"579","volume-title":"Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval","author":"Lv Y","unstructured":"Lv Y, Zhai CX. Positional relevance model for pseudo-relevance feedback. In: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval, Geneva, 19\u201323 July 2010, pp. 579\u2013586. New York: Association for Computing Machinery."},{"key":"e_1_3_3_56_2","first-page":"535","volume-title":"Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval","author":"Miao J","unstructured":"Miao J, Huang JX, Ye Z. Proximity-based Rocchio\u2019s model for pseudo relevance. In: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval, Portland, OR, 12\u201316 August 2012, pp. 535\u2013544. New York: Association for Computing Machinery."},{"key":"e_1_3_3_57_2","first-page":"323","volume-title":"Proceedings of the 37th International ACM SIGIR conference on research & development in information retrieval, SIGIR 14","author":"Ye Z","unstructured":"Ye Z, Huang JX. A simple term frequency transformation model for effective pseudo relevance feedback. In: Proceedings of the 37th International ACM SIGIR conference on research & development in information retrieval, SIGIR 14, Paris, 21\u201325 July 2019, pp. 323\u2013332. New York: Association for Computing Machinery."}],"container-title":["Journal of Information Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/01655515231184831","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/01655515231184831","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/01655515231184831","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,5,19]],"date-time":"2025-05-19T11:21:07Z","timestamp":1747653667000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/01655515231184831"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,22]]},"references-count":56,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2025,6]]}},"alternative-id":["10.1177\/01655515231184831"],"URL":"https:\/\/doi.org\/10.1177\/01655515231184831","relation":{},"ISSN":["0165-5515","1741-6485"],"issn-type":[{"value":"0165-5515","type":"print"},{"value":"1741-6485","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,7,22]]}}}