{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T17:48:10Z","timestamp":1754156890763,"version":"3.41.2"},"reference-count":52,"publisher":"Emerald","issue":"1","license":[{"start":{"date-parts":[[2023,5,18]],"date-time":"2023-05-18T00:00:00Z","timestamp":1684368000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["DTA"],"published-print":{"date-parts":[[2024,1,29]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title><jats:p>Question answering (QA) answers the questions asked by people in the form of natural language. In the QA, due to the subjectivity of users, the questions they query have different expressions, which increases the difficulty of text retrieval. Therefore, the purpose of this paper is to explore new query rewriting method for QA that integrates multiple related questions (RQs) to form an optimal question. Moreover, it is important to generate a new dataset of the original query (OQ) with multiple RQs.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title><jats:p>This study collects a new dataset SQuAD_extend by crawling the QA community and uses word-graph to model the collected OQs. Next, Beam search finds the best path to get the best question. To deeply represent the features of the question, pretrained model BERT is used to model sentences.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Findings<\/jats:title><jats:p>The experimental results show three outstanding findings. (1) The quality of the answers is better after adding the RQs of the OQs. (2) The word-graph that is used to model the problem and choose the optimal path is conducive to finding the best question. (3) Finally, BERT can deeply characterize the semantics of the exact problem.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title><jats:p>The proposed method can use word-graph to construct multiple questions and select the optimal path for rewriting the question, and the quality of answers is better than the baseline. In practice, the research results can help guide users to clarify their query intentions and finally achieve the best answer.<\/jats:p><\/jats:sec>","DOI":"10.1108\/dta-05-2022-0187","type":"journal-article","created":{"date-parts":[[2023,5,18]],"date-time":"2023-05-18T09:25:58Z","timestamp":1684401958000},"page":"1-23","source":"Crossref","is-referenced-by-count":0,"title":["A novel word-graph-based query rewriting method for question answering"],"prefix":"10.1108","volume":"58","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5107-4339","authenticated-orcid":false,"given":"Rongen","family":"Yan","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7923-9329","authenticated-orcid":false,"given":"Depeng","family":"Dang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8987-3956","authenticated-orcid":false,"given":"Hu","family":"Gao","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yan","family":"Wu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wenhui","family":"Yu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"140","published-online":{"date-parts":[[2023,5,18]]},"reference":[{"issue":"No. 2","key":"key2024012913142803600_ref001","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1145\/990301.990303","article-title":"Learning to find answers to questions on the web","volume":"Vol. 4","year":"2004","journal-title":"ACM Transactions on Internet Technology (TOIT)"},{"issue":"No. 5","key":"key2024012913142803600_ref002","doi-asserted-by":"crossref","first-page":"1698","DOI":"10.1016\/j.ipm.2019.05.009","article-title":"Query expansion techniques for information retrieval: a survey","volume":"Vol. 56","year":"2019","journal-title":"Information Processing & Management"},{"issue":"No. 3","key":"key2024012913142803600_ref003","doi-asserted-by":"crossref","first-page":"297","DOI":"10.1162\/089120105774321091","article-title":"Sentence fusion for multidocument news summarization","volume":"Vol. 31","year":"2005","journal-title":"Computational Linguistics"},{"key":"key2024012913142803600_ref004","doi-asserted-by":"crossref","first-page":"467","DOI":"10.1145\/1367497.1367561","article-title":"Finding the right facts in the crowd: factoid question answering over social media","volume-title":"Proceedings of the 17th International Conference on World Wide Web","year":"2008"},{"key":"key2024012913142803600_ref005","first-page":"102","article-title":"An interface for annotating science questions","volume-title":"Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing: System Demonstrations","year":"2018"},{"journal-title":"arXiv preprint arXiv:1806.00358","article-title":"A systematic classification of knowledge, reasoning, and context within the arc dataset","year":"2018","key":"key2024012913142803600_ref006"},{"key":"key2024012913142803600_ref007","first-page":"298","article-title":"Keyphrase extraction for n-best reranking in multi-sentence compression","year":"2013","journal-title":"Proceedings of NAACL-HLT 2013"},{"first-page":"426","article-title":"Efficient query evaluation using a two-level retrieval process","year":"2003","key":"key2024012913142803600_ref008"},{"key":"key2024012913142803600_ref009","article-title":"Ask the right questions: active question reformulation with reinforcement learning","volume":"abs\/1705.07830","year":"2017","journal-title":"arXiv preprint arXiv:1705.07830"},{"first-page":"69","volume-title":"Automatic Query Expansion Using Smart: Trec 3","year":"1995","key":"key2024012913142803600_ref010"},{"first-page":"353","article-title":"Building a question-answering corpus using social media and news articles","year":"2016","key":"key2024012913142803600_ref011"},{"key":"key2024012913142803600_ref012","article-title":"Reading Wikipedia to answer open-domain questions","volume":"abs\/1704.00051","year":"2017","journal-title":"arXiv preprint arXiv:1704.00051"},{"key":"key2024012913142803600_ref013","article-title":"Think you have solved question answering? Try arc, the ai2 reasoning challenge","volume":"abs\/1803.05457","year":"2018","journal-title":"arXiv preprint arXiv:1803.05457"},{"key":"key2024012913142803600_ref014","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1613\/jair.2433","article-title":"Global inference for sentence compression: an integer linear programming approach","volume":"Vol. 31","year":"2008","journal-title":"Journal of Artificial Intelligence Research"},{"key":"key2024012913142803600_ref015","doi-asserted-by":"crossref","first-page":"637","DOI":"10.1613\/jair.2655","article-title":"Sentence compression as tree transduction","volume":"Vol. 34","year":"2009","journal-title":"Journal of Artificial Intelligence Research"},{"issue":"No. 3","key":"key2024012913142803600_ref016","first-page":"1","article-title":"An abstractive approach to sentence compression","volume":"Vol. 4","year":"2013","journal-title":"ACM Transactions on Intelligent Systems and Technology (TIST)"},{"year":"2006","key":"key2024012913142803600_ref017","article-title":"Back to basics: Classy 2006"},{"first-page":"239","article-title":"Random walks on the click graph","year":"2007","key":"key2024012913142803600_ref018"},{"volume-title":"Search Engines: Information Retrieval in Practice","year":"2010","key":"key2024012913142803600_ref019"},{"key":"key2024012913142803600_ref020","article-title":"Bert: pre-training of deep bidirectional transformers for language understanding","volume":"abs\/1810.04805","year":"2018","journal-title":"arXiv preprint arXiv:1810.04805"},{"key":"key2024012913142803600_ref021","first-page":"322","article-title":"Multi-sentence compression: finding shortest paths in word graphs","volume-title":"Proceedings of the 23rd International Conference on Computational Linguistics'","year":"2010"},{"key":"key2024012913142803600_ref022","first-page":"177","article-title":"Sentence fusion via dependency graph compression","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing'","year":"2008"},{"key":"key2024012913142803600_ref023","first-page":"180","article-title":"Lexicalized Markov grammars for sentence compression","volume-title":"Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics","year":"2007"},{"key":"key2024012913142803600_ref024","first-page":"340","article-title":"Opinosis: a graph based approach to abstractive summarization of highly redundant opinions","volume-title":"Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)","year":"2010"},{"key":"key2024012913142803600_ref025","first-page":"666","article-title":"Learning Lexicon models from search logs for query expansion","volume-title":"Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning'","year":"2012"},{"key":"key2024012913142803600_ref026","article-title":"Annotation artifacts in natural language inference data","volume":"abs\/1803.02324","year":"2018","journal-title":"arXiv preprint arXiv:1803.02324"},{"key":"key2024012913142803600_ref027","article-title":"DuReader: a chinese machine reading comprehension dataset from real-world applications","volume":"abs\/1711.05073","year":"2017","journal-title":"arXiv preprint arXiv:1711.05073"},{"key":"key2024012913142803600_ref028","unstructured":"Hermann, K.M., Kocisky, T., Grefenstette, E., Espeholt, L., Kay, W., Suleyman, M. and Blunsom, P. (2015), \u201cTeaching machines to read and comprehend\u201d, in Hermann, K. M., Kocisky, T., Grefenstette, E., Espeholt, L., Kay, W., Suleyman, M. and Blunsom, P. (Eds), Advances in Neural Information Processing Systems, MIT Press Cambridge, MA, USA, pp. 1693-1701."},{"key":"key2024012913142803600_ref029","article-title":"Flowqa: Grasping flow in history for conversational machine comprehension","volume":"abs\/1810.06683","year":"2018","journal-title":"arXiv preprint arXiv:1810.06683"},{"first-page":"387","article-title":"Generating query substitutions","year":"2006","key":"key2024012913142803600_ref030"},{"key":"key2024012913142803600_ref031","first-page":"703","article-title":"Statistics-based summarization-step one: sentence compression","volume":"Vol. 2000","year":"2000","journal-title":"AAAI\/IAAI"},{"first-page":"1078","article-title":"Adversarial filters of dataset biases","year":"2020","key":"key2024012913142803600_ref032"},{"issue":"No. 1","key":"key2024012913142803600_ref033","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1002\/asi.23155","article-title":"Complementary QA network analysis for QA retrieval in social question-answering websites","volume":"Vol. 66","year":"2015","journal-title":"Journal of the Association for Information Science and Technology"},{"key":"key2024012913142803600_ref034","first-page":"297","article-title":"Discriminative sentence compression with soft syntactic evidence","volume-title":"11th Conference of the European Chapter of the Association for Computational Linguistics","year":"2006"},{"key":"key2024012913142803600_ref035","first-page":"136","article-title":"Abstractive meeting summarization with entailment and fusion","volume-title":"Proceedings of the 14th European Workshop on Natural Language Generation","year":"2013"},{"issue":"No. 1","key":"key2024012913142803600_ref036","first-page":"1","article-title":"Social question answering: textual, user, and network features for best answer prediction","volume":"Vol. 35","year":"2016","journal-title":"ACM Transactions on Information Systems (TOIS)"},{"key":"key2024012913142803600_ref037","article-title":"Answering science exam questions using query rewriting with background knowledge","volume":"abs\/1809.05726","year":"2018","journal-title":"arXiv preprint arXiv:1809.05726"},{"key":"key2024012913142803600_ref038","doi-asserted-by":"crossref","first-page":"639","DOI":"10.1145\/1277741.1277851","article-title":"Context sensitive stemming for web search","volume-title":"Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval","year":"2007"},{"key":"key2024012913142803600_ref039","article-title":"Knowledge Enhanced Contextual Contextual Word Representations","volume":"abs\/1909.04164","year":"2018","journal-title":"arXiv preprint arXiv:1909.04164"},{"year":"2018","key":"key2024012913142803600_ref040","article-title":"Improving language understanding by generative pre-training"},{"key":"key2024012913142803600_ref041","first-page":"2383","article-title":"Squad: 100,000+ questions for machine comprehension of text","year":"2016","journal-title":"Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing"},{"key":"key2024012913142803600_ref042","first-page":"193","article-title":"Mctest: A challenge dataset for the open-domain machine comprehension of text","volume-title":"Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing","year":"2013"},{"issue":"No. 3","key":"key2024012913142803600_ref043","doi-asserted-by":"crossref","first-page":"569","DOI":"10.1162\/coli_a_00010","article-title":"Query rewriting using monolingual statistical machine translation","volume":"Vol. 36","year":"2010","journal-title":"Computational Linguistics"},{"journal-title":"arXiv preprint arXiv:1611.01603","article-title":"Bidirectional attention flow for machine comprehension","year":"2016","key":"key2024012913142803600_ref044"},{"first-page":"685","article-title":"Summarizing microblogs automatically, Human Language Technologies: The 2010 Annual Conference of The North American Chapter of the Association for Computational linguistics","year":"2010","key":"key2024012913142803600_ref045"},{"first-page":"191","article-title":"Keyword query expansion on linked data using linguistic and semantic features","year":"2013","key":"key2024012913142803600_ref046"},{"issue":"No. 2","key":"key2024012913142803600_ref047","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1007\/s10791-006-7149-y","article-title":"Automatic question answering using the web: beyond the factoid","volume":"Vol. 9","year":"2006","journal-title":"Information Retrieval"},{"article-title":"The pythy summarization system: Microsoft research at duc 2007","volume-title":"Proceedings of DUC","year":"2007","key":"key2024012913142803600_ref048"},{"key":"key2024012913142803600_ref049","first-page":"290","article-title":"Supervised and unsupervised learning for sentence compression","volume-title":"Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics","year":"2005"},{"key":"key2024012913142803600_ref050","article-title":"A sentence compression based framework to query-focused multi-document summarization","volume":"abs\/1606.07548","year":"2016","journal-title":"arXiv preprint arXiv:1606.07548"},{"first-page":"43","article-title":"Improving search relevance for short queries in community question answering","year":"2014","key":"key2024012913142803600_ref051"},{"article-title":"Query expansion using local and global document analysis","volume-title":"Proceedings of the 19th Annual International ACM Sigir Conference on Research and Development in Information Retrieval","year":"1996","key":"key2024012913142803600_ref052"}],"container-title":["Data Technologies and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/DTA-05-2022-0187\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/DTA-05-2022-0187\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T23:15:09Z","timestamp":1753398909000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/dta\/article\/58\/1\/1-23\/1221168"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,5,18]]},"references-count":52,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,5,18]]},"published-print":{"date-parts":[[2024,1,29]]}},"alternative-id":["10.1108\/DTA-05-2022-0187"],"URL":"https:\/\/doi.org\/10.1108\/dta-05-2022-0187","relation":{},"ISSN":["2514-9288","2514-9288"],"issn-type":[{"type":"print","value":"2514-9288"},{"type":"electronic","value":"2514-9288"}],"subject":[],"published":{"date-parts":[[2023,5,18]]}}}