{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,7]],"date-time":"2025-10-07T14:26:45Z","timestamp":1759847205005,"version":"3.41.0"},"reference-count":75,"publisher":"Association for Computing Machinery (ACM)","issue":"9","license":[{"start":{"date-parts":[[2024,8,16]],"date-time":"2024-08-16T00:00:00Z","timestamp":1723766400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Ministry of Education and Training, Vietnam","award":["B2024-SKH-01"],"award-info":[{"award-number":["B2024-SKH-01"]}]},{"name":"Intelligent Integration Co., Ltd. (INT2), Vietnam"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2024,9,30]]},"abstract":"<jats:p>Question-answer generation (QAG) is a challenging task that generates both questions and answers from a given input paragraph context. The QAG task has recently achieved promising results thanks to the appearance of large pre-trained language models, yet, QAG models are mainly implemented in common languages, e.g., English. There still remains a gap in domain and language adaptation of these QAG models to low-resource languages such as Vietnamese. To address the gap, this article presents a large-scale and systematic study of QAG in Vietnamese. To do that, we first implement several QAG models by using the common fine-tuning techniques based on powerful pre-trained language models. We next introduce a set of instructions designed for the QAG task. These instructions are used to fine-tuned the pre-trained language and large language models. Extensive experimental results of both automatic and human evaluation on five benchmark machine reading comprehension datasets show two important points. First, the instruction-tuning method has the potential to enhance the performance of QAG models. Second, large language models trained in English need more data for fine-tuning to work well on the downstream QAG tasks of low-resource languages. We also provide a prototype system to demonstrate how our QAG models actually work. The code for fine-tuning QAG models and instructions are also made available.<\/jats:p>","DOI":"10.1145\/3675781","type":"journal-article","created":{"date-parts":[[2024,6,29]],"date-time":"2024-06-29T10:23:42Z","timestamp":1719656622000},"page":"1-28","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Towards Vietnamese Question and Answer Generation: An Empirical Study"],"prefix":"10.1145","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0009-0004-1610-4717","authenticated-orcid":false,"given":"Quoc-Hung","family":"Pham","sequence":"first","affiliation":[{"name":"Hung Yen University of Technology and Education, Hung Yen, Viet Nam and University of Engineering and Technology, Vietnam National University, Hanoi, Vietnam"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-4129-4655","authenticated-orcid":false,"given":"Huu-Loi","family":"Le","sequence":"additional","affiliation":[{"name":"Hung Yen University of Technology and Education, Hung Yen, Viet Nam"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-4224-3843","authenticated-orcid":false,"given":"Minh","family":"Dang Nhat","sequence":"additional","affiliation":[{"name":"Hung Yen University of Technology and Education, Hung Yen, Viet Nam"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-8181-9293","authenticated-orcid":false,"given":"Khang","family":"Tran T.","sequence":"additional","affiliation":[{"name":"Hung Yen University of Technology and Education, Hung Yen, Viet Nam"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-1639-4346","authenticated-orcid":false,"given":"Manh","family":"Tran-Tien","sequence":"additional","affiliation":[{"name":"Hung Yen University of Technology and Education, Hung Yen, Viet Nam"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-6103-3399","authenticated-orcid":false,"given":"Viet-Hung","family":"Dang","sequence":"additional","affiliation":[{"name":"Hung Yen University of Technology and Education, Hung Yen, Viet Nam"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5544-1300","authenticated-orcid":false,"given":"Huy-The","family":"Vu","sequence":"additional","affiliation":[{"name":"Hung Yen University of Technology and Education, Hung Yen, Viet Nam"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5028-0608","authenticated-orcid":false,"given":"Minh-Tien","family":"Nguyen","sequence":"additional","affiliation":[{"name":"Hung Yen University of Technology and Education, Hung Yen, Viet Nam"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7640-9190","authenticated-orcid":false,"given":"Xuan-Hieu","family":"Phan","sequence":"additional","affiliation":[{"name":"University of Engineering and Technology, Vietnam National University, Hanoi, Viet Nam"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,8,16]]},"reference":[{"key":"e_1_3_3_2_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1620"},{"key":"e_1_3_3_3_1","first-page":"65","volume-title":"Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and\/or Summarization","author":"Banerjee Satanjeev","year":"2005","unstructured":"Satanjeev Banerjee and Alon Lavie. 2005. METEOR: An automatic metric for MT evaluation with improved correlation with human judgments. In Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and\/or Summarization, Jade Goldstein, Alon Lavie, Chin-Yew Lin, and Clare Voss (Eds.). Association for Computational Linguistics, Ann Arbor, MI, 65\u201372. DOI:https:\/\/aclanthology.org\/W05-0909"},{"key":"e_1_3_3_4_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.696"},{"key":"e_1_3_3_5_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-5821"},{"key":"e_1_3_3_6_1","article-title":"Alpagasus: Training a better alpaca with fewer data","author":"Chen Lichang","year":"2023","unstructured":"Lichang Chen, Shiyang Li, Jun Yan, Hai Wang, Kalpa Gunaratna, Vikas Yadav, Zheng Tang, Vijay Srinivasan, Tianyi Zhou, Heng Huang, et\u00a0al. 2023. Alpagasus: Training a better alpaca with fewer data. arXiv preprint arXiv:2307.08701 (2023).","journal-title":"arXiv preprint arXiv:2307.08701"},{"key":"e_1_3_3_7_1","article-title":"Reinforcement learning based graph-to-sequence model for natural question generation","author":"Chen Yu","year":"2019","unstructured":"Yu Chen, Lingfei Wu, and Mohammed J. Zaki. 2019. Reinforcement learning based graph-to-sequence model for natural question generation. arXiv preprint arXiv:1908.04942 (2019).","journal-title":"arXiv preprint arXiv:1908.04942"},{"key":"e_1_3_3_8_1","article-title":"BERT: Pre-training of deep bidirectional transformers for language understanding","volume":"1810","author":"Devlin Jacob","year":"2018","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. BERT: Pre-training of deep bidirectional transformers for language understanding. CoRR abs\/1810.04805 (2018). arXiv:1810.04805DOI:http:\/\/arxiv.org\/abs\/1810.04805","journal-title":"CoRR"},{"key":"e_1_3_3_9_1","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), 4171-4186","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), 4171-4186."},{"key":"e_1_3_3_10_1","volume-title":"Proceedings of the 33rd International Conference on Neural Information Processing Systems","author":"Dong Li","year":"2019","unstructured":"Li Dong, Nan Yang, Wenhui Wang, Furu Wei, Xiaodong Liu, Yu Wang, Jianfeng Gao, Ming Zhou, and Hsiao-Wuen Hon. 2019. Unified language model pre-training for natural language understanding and generation. In Proceedings of the 33rd International Conference on Neural Information Processing Systems. Curran Associates Inc., Red Hook, NY, Article 1170, 13 pages."},{"key":"e_1_3_3_11_1","article-title":"Learning to ask: Neural question generation for reading comprehension","author":"Du Xinya","year":"2017","unstructured":"Xinya Du, Junru Shao, and Claire Cardie. 2017a. Learning to ask: Neural question generation for reading comprehension. arXiv:1705.00106 (2017).","journal-title":"arXiv:1705.00106"},{"key":"e_1_3_3_12_1","article-title":"Learning to Ask: Neural question generation for reading comprehension","volume":"1705","author":"Du Xinya","year":"2017","unstructured":"Xinya Du, Junru Shao, and Claire Cardie. 2017b. Learning to Ask: Neural question generation for reading comprehension. CoRR abs\/1705.00106 (2017). arXiv:1705.00106DOI:http:\/\/arxiv.org\/abs\/1705.00106","journal-title":"CoRR"},{"key":"e_1_3_3_13_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1123"},{"key":"e_1_3_3_14_1","article-title":"Towards diverse and effective question-answer pair generation from children storybooks","author":"Eo Sugyeong","year":"2023","unstructured":"Sugyeong Eo, Hyeonseok Moon, Jinsung Kim, Yuna Hur, Jeongwook Kim, Songeun Lee, Changwoo Chun, Sungsoo Park, and Heuiseok Lim. 2023. Towards diverse and effective question-answer pair generation from children storybooks. arXiv preprint arXiv:2306.06605 (2023).","journal-title":"arXiv preprint arXiv:2306.06605"},{"key":"e_1_3_3_15_1","first-page":"2573","volume-title":"Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing","author":"Fei Zichu","year":"2021","unstructured":"Zichu Fei, Qi Zhang, and Yaqian Zhou. 2021. Iterative GNN-based decoder for question generation. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. 2573\u20132582."},{"issue":"2","key":"e_1_3_3_16_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3432689","article-title":"Meaningful answer generation of e-commerce question-answering","volume":"39","author":"Gao Shen","year":"2021","unstructured":"Shen Gao, Xiuying Chen, Zhaochun Ren, Dongyan Zhao, and Rui Yan. 2021. Meaningful answer generation of e-commerce question-answering. ACM Transactions on Information Systems (TOIS) 39, 2 (2021), 1\u201326.","journal-title":"ACM Transactions on Information Systems (TOIS)"},{"key":"e_1_3_3_17_1","first-page":"429","volume-title":"Proceedings of the 12th ACM International Conference on Web Search and Data Mining","author":"Gao Shen","year":"2019","unstructured":"Shen Gao, Zhaochun Ren, Yihong Zhao, Dongyan Zhao, Dawei Yin, and Rui Yan. 2019. Product-aware answer generation in e-commerce question-answering. In Proceedings of the 12th ACM International Conference on Web Search and Data Mining. 429\u2013437."},{"key":"e_1_3_3_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/IHCI.2012.6481826"},{"key":"e_1_3_3_19_1","doi-asserted-by":"publisher","DOI":"10.5555\/1857999.1858085"},{"key":"e_1_3_3_20_1","article-title":"Training compute-optimal large language models","author":"Hoffmann Jordan","year":"2022","unstructured":"Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego de Las Casas, Lisa Anne Hendricks, Johannes Welbl, Aidan Clark, et\u00a0al. 2022. Training compute-optimal large language models. arXiv preprint arXiv:2203.15556 (2022).","journal-title":"arXiv preprint arXiv:2203.15556"},{"key":"e_1_3_3_21_1","first-page":"13064","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"35","author":"Huang Qingbao","year":"2021","unstructured":"Qingbao Huang, Mingyi Fu, Linzhang Mo, Yi Cai, Jingyun Xu, Pijian Li, Qing Li, and Ho-fung Leung. 2021. Entity guided question generation with contextual structure and sequence information capturing. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 13064\u201313072."},{"key":"e_1_3_3_22_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.naacl-main.229"},{"key":"e_1_3_3_23_1","doi-asserted-by":"publisher","DOI":"10.5555\/1870658.1870750"},{"key":"e_1_3_3_24_1","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 6130\u20136140","author":"Jia Xin","year":"2020","unstructured":"Xin Jia, Wenjie Zhou, Xu Sun, and Yunfang Wu. 2020. How to ask good questions? Try to leverage paraphrases. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 6130\u20136140."},{"key":"e_1_3_3_25_1","first-page":"889","volume-title":"Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)","author":"Labutov Igor","year":"2015","unstructured":"Igor Labutov, Sumit Basu, and Lucy Vanderwende. 2015. Deep questions without deep understanding. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 889\u2013898."},{"key":"e_1_3_3_26_1","doi-asserted-by":"publisher","DOI":"10.3115\/1626355.1626389"},{"key":"e_1_3_3_27_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10590-009-9059-4"},{"key":"e_1_3_3_28_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-30194-3_16"},{"key":"e_1_3_3_29_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.20"},{"key":"e_1_3_3_30_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.703"},{"key":"e_1_3_3_31_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1484"},{"key":"e_1_3_3_32_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00415"},{"key":"e_1_3_3_33_1","first-page":"1","volume-title":"Proceedings of the Knowledge Capture Conference","author":"Liang Chen","year":"2017","unstructured":"Chen Liang, Xiao Yang, Drew Wham, Bart Pursel, Rebecca Passonneaur, and C. Lee Giles. 2017. Distractor generation with generative adversarial nets for automatically creating fill-in-the-blank questions. In Proceedings of the Knowledge Capture Conference. 1\u20134."},{"key":"e_1_3_3_34_1","first-page":"74","volume-title":"Text Summarization Branches Out","author":"Lin Chin-Yew","year":"2004","unstructured":"Chin-Yew Lin. 2004. Rouge: A package for automatic evaluation of summaries. In Text Summarization Branches Out. 74\u201381."},{"key":"e_1_3_3_35_1","first-page":"105","volume-title":"Proceedings of the 14th European Workshop on Natural Language Generation","author":"Lindberg David","year":"2013","unstructured":"David Lindberg, Fred Popowich, John Nesbit, and Phil Winne. 2013. Generating natural language questions to support learning on-line. In Proceedings of the 14th European Workshop on Natural Language Generation. 105\u2013114."},{"key":"e_1_3_3_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/3366423.3380270"},{"key":"e_1_3_3_37_1","volume-title":"arXiv:1907.11692","author":"Liu Yinhan","year":"2019","unstructured":"Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019a. RoBERTa: A robustly optimized BERT pretraining approach. In arXiv:1907.11692."},{"key":"e_1_3_3_38_1","article-title":"Generative question refinement with deep reinforcement learning in retrieval-based QA system","author":"Liu Ye","year":"2019","unstructured":"Ye Liu, Chenwei Zhang, Xiaohui Yan, Yi Chang, and Philip S. Yu. 2019b. Generative question refinement with deep reinforcement learning in retrieval-based QA system. Proceedings of the 28th ACM International Conference on Information and Knowledge Management (2019). DOI:https:\/\/api.semanticscholar.org\/CorpusID:199668978","journal-title":"Proceedings of the 28th ACM International Conference on Information and Knowledge Management"},{"key":"e_1_3_3_39_1","article-title":"Conversational machine reading comprehension for Vietnamese healthcare texts","volume":"2105","author":"Luu Son T.","year":"2021","unstructured":"Son T. Luu, Mao Nguyen Bui, Loi Duc Nguyen, Khiem Vinh Tran, Kiet Van Nguyen, and Ngan Luu-Thuy Nguyen. 2021. Conversational machine reading comprehension for Vietnamese healthcare texts. CoRR abs\/2105.01542 (2021). arXiv:2105.01542DOI:https:\/\/arxiv.org\/abs\/2105.01542","journal-title":"CoRR"},{"key":"e_1_3_3_40_1","unstructured":"Son T. Luu Khoi Trong Hoang Tuong Quang Pham Kiet Van Nguyen and Ngan Luu-Thuy Nguyen. 2023. A Multiple Choices Reading Comprehension Corpus for Vietnamese Language Education. arxiv:2303.18162 [cs.CL]."},{"key":"e_1_3_3_41_1","first-page":"8464","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"34","author":"Ma Xiyao","year":"2020","unstructured":"Xiyao Ma, Qile Zhu, Yanlin Zhou, and Xiaolin Li. 2020. Improving question generation with sentence-level semantic matching and answer position inferring. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 8464\u20138471."},{"key":"e_1_3_3_42_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118894.1118897"},{"key":"e_1_3_3_43_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1429"},{"key":"e_1_3_3_44_1","doi-asserted-by":"crossref","first-page":"1037","DOI":"10.18653\/v1\/2020.findings-emnlp.92","volume-title":"Findings of the Association for Computational Linguistics: EMNLP 2020","author":"Nguyen Dat Quoc","year":"2020","unstructured":"Dat Quoc Nguyen and Anh Tuan Nguyen. 2020. PhoBERT: Pre-trained language models for Vietnamese. In Findings of the Association for Computational Linguistics: EMNLP 2020. 1037\u20131042."},{"key":"e_1_3_3_45_1","article-title":"A Vietnamese dataset for evaluating machine reading comprehension","volume":"2009","author":"Nguyen Kiet Van","year":"2020","unstructured":"Kiet Van Nguyen, Duc-Vu Nguyen, Anh Gia-Tuan Nguyen, and Ngan Luu-Thuy Nguyen. 2020. A Vietnamese dataset for evaluating machine reading comprehension. CoRR abs\/2009.14725 (2020). arXiv:2009.14725DOI:https:\/\/arxiv.org\/abs\/2009.14725","journal-title":"CoRR"},{"key":"e_1_3_3_46_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073083.1073135"},{"key":"e_1_3_3_47_1","doi-asserted-by":"crossref","first-page":"8864","DOI":"10.18653\/v1\/2020.emnlp-main.713","volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)","author":"Perez Ethan","year":"2020","unstructured":"Ethan Perez, Patrick Lewis, Wen-tau Yih, Kyunghyun Cho, and Douwe Kiela. 2020. Unsupervised question decomposition for question answering. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). 8864\u20138880."},{"key":"e_1_3_3_48_1","doi-asserted-by":"crossref","first-page":"136","DOI":"10.18653\/v1\/2022.naacl-srw.18","volume-title":"Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop","author":"Phan Long","year":"2022","unstructured":"Long Phan, Hieu Tran, Hieu Nguyen, and Trieu H. Trinh. 2022. ViT5: Pretrained text-to-text transformer for Vietnamese language generation. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop. 136\u2013142."},{"key":"e_1_3_3_49_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-021-06477-7"},{"key":"e_1_3_3_50_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.468"},{"key":"e_1_3_3_51_1","doi-asserted-by":"crossref","first-page":"1429","DOI":"10.18653\/v1\/2021.emnlp-main.108","volume-title":"Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing","author":"Pyatkin Valentina","year":"2021","unstructured":"Valentina Pyatkin, Paul Roit, Julian Michael, Yoav Goldberg, Reut Tsarfaty, and Ido Dagan. 2021. Asking it all: Generating contextualized questions for any semantic role. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. 1429\u20131441."},{"key":"e_1_3_3_52_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.findings-emnlp.217"},{"key":"e_1_3_3_53_1","doi-asserted-by":"publisher","DOI":"10.5555\/3455716.3455856"},{"key":"e_1_3_3_54_1","first-page":"140:1\u2013140:67","article-title":"Exploring the limits of transfer learning with a unified text-to-text transformer","volume":"21","author":"Raffel Colin","year":"2019","unstructured":"Colin Raffel, Noam M. Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J. Liu. 2019. Exploring the limits of transfer learning with a unified text-to-text transformer. J. Mach. Learn. Res. 21 (2019), 140:1\u2013140:67. DOI:https:\/\/api.semanticscholar.org\/CorpusID:204838007","journal-title":"J. Mach. Learn. Res."},{"key":"e_1_3_3_55_1","volume-title":"Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2383\u20132392","author":"Rajpurkar Pranav","year":"2016","unstructured":"Pranav Rajpurkar, Jian Zhang, Konstantin Lopyrev, and Percy Liang. 2016. SQuAD: 100,000+ questions for machine comprehension of text. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2383\u20132392."},{"key":"e_1_3_3_56_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.439"},{"issue":"6","key":"e_1_3_3_57_1","first-page":"7","article-title":"Alpaca: A strong, replicable instruction-following model","volume":"3","author":"Taori Rohan","year":"2023","unstructured":"Rohan Taori, Ishaan Gulrajani, Tianyi Zhang, Yann Dubois, Xuechen Li, Carlos Guestrin, Percy Liang, and Tatsunori B. Hashimoto. 2023a. Alpaca: A strong, replicable instruction-following model. Stanford Center for Research on Foundation Models. https:\/\/crfm.stanford.edu\/2023\/03\/13\/alpaca.html 3, 6 (2023), 7.","journal-title":"Stanford Center for Research on Foundation Models"},{"key":"e_1_3_3_58_1","article-title":"Stanford Alpaca: An Instruction-following LLaMA model","author":"Taori Rohan","year":"2023","unstructured":"Rohan Taori, Ishaan Gulrajani, Tianyi Zhang, Yann Dubois, Xuechen Li, Carlos Guestrin, Percy Liang, and Tatsunori B. Hashimoto. 2023b. Stanford Alpaca: An Instruction-following LLaMA model. https:\/\/github.com\/tatsu-lab\/stanford_alpaca","journal-title":"https:\/\/github.com\/tatsu-lab\/stanford_alpaca"},{"key":"e_1_3_3_59_1","unstructured":"Hugo Touvron Thibaut Lavril Gautier Izacard Xavier Martinet Marie-Anne Lachaux Timoth\u00e9e Lacroix Baptiste Rozi\u00e8re Naman Goyal Eric Hambro Faisal Azhar Aurelien Rodriguez Armand Joulin Edouard Grave and Guillaume Lample. 2023a. LLaMA: Open and Efficient Foundation Language Models. arxiv:2302.13971 [cs.CL]"},{"key":"e_1_3_3_60_1","article-title":"Llama 2: Open foundation and fine-tuned chat models","author":"Touvron Hugo","year":"2023","unstructured":"Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, et\u00a0al. 2023b. Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288 (2023).","journal-title":"arXiv preprint arXiv:2307.09288"},{"key":"e_1_3_3_61_1","article-title":"BARTPho: Pre-trained sequence-to-sequence models for Vietnamese","author":"Tran Nguyen Luong","year":"2021","unstructured":"Nguyen Luong Tran, Duong Minh Le, and Dat Quoc Nguyen. 2021. BARTPho: Pre-trained sequence-to-sequence models for Vietnamese. arXiv:2109.09701 (2021).","journal-title":"arXiv:2109.09701"},{"key":"e_1_3_3_62_1","doi-asserted-by":"crossref","first-page":"670","DOI":"10.18653\/v1\/2022.emnlp-main.42","volume-title":"Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing","author":"Ushio Asahi","year":"2022","unstructured":"Asahi Ushio, Fernando Alva-Manchego, and Jose Camacho-Collados. 2022. Generative language models for paragraph-level question generation. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 670\u2013688."},{"key":"e_1_3_3_63_1","doi-asserted-by":"crossref","first-page":"14262","DOI":"10.18653\/v1\/2023.findings-acl.899","volume-title":"Findings of the Association for Computational Linguistics: ACL 2023","author":"Ushio Asahi","year":"2023","unstructured":"Asahi Ushio, Fernando Alva-Manchego, and Jose Camacho-Collados. 2023a. An empirical comparison of LM-based question and answer generation methods. In Findings of the Association for Computational Linguistics: ACL 2023, Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki (Eds.). Association for Computational Linguistics, Toronto, Canada, 14262\u201314272."},{"key":"e_1_3_3_64_1","article-title":"A practical toolkit for multilingual question and answer generation","author":"Ushio Asahi","year":"2023","unstructured":"Asahi Ushio, Fernando Alva-Manchego, and Jose Camacho-Collados. 2023b. A practical toolkit for multilingual question and answer generation. arXiv preprint arXiv:2305.17416 (2023).","journal-title":"arXiv preprint arXiv:2305.17416"},{"issue":"5","key":"e_1_3_3_65_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3527631","article-title":"New Vietnamese corpus for machine reading comprehension of health news articles","volume":"21","author":"Nguyen Kiet Van","year":"2022","unstructured":"Kiet Van Nguyen, Tin Van Huynh, Duc-Vu Nguyen, Anh Gia-Tuan Nguyen, and Ngan Luu-Thuy Nguyen. 2022. New Vietnamese corpus for machine reading comprehension of health news articles. Transactions on Asian and Low-Resource Language Information Processing 21, 5 (2022), 1\u201328.","journal-title":"Transactions on Asian and Low-Resource Language Information Processing"},{"key":"e_1_3_3_66_1","first-page":"559","volume-title":"Intelligent Information and Database Systems","author":"Vu Nguyen","year":"2022","unstructured":"Nguyen Vu and Kiet Van Nguyen. 2022. Enhancing Vietnamese question generation with reinforcement learning. In Intelligent Information and Database Systems. 559\u2013570."},{"key":"e_1_3_3_67_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigDataCongress.2017.46"},{"key":"e_1_3_3_68_1","first-page":"414","volume-title":"International Conference on Artificial Neural Networks","author":"Wang YuChen","year":"2023","unstructured":"YuChen Wang and Li Li. 2023. Generating question-answer pairs for few-shot learning. In International Conference on Artificial Neural Networks. Springer, 414\u2013425."},{"key":"e_1_3_3_69_1","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2020\/553"},{"key":"e_1_3_3_70_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.41"},{"key":"e_1_3_3_71_1","article-title":"ProphetNet: Predicting future N-gram for sequence-to-sequence pre-training","volume":"2001","author":"Yan Yu","year":"2020","unstructured":"Yu Yan, Weizhen Qi, Yeyun Gong, Dayiheng Liu, Nan Duan, Jiusheng Chen, Ruofei Zhang, and Ming Zhou. 2020. ProphetNet: Predicting future N-gram for sequence-to-sequence pre-training. CoRR abs\/2001.04063 (2020). arXiv:2001.04063DOI:https:\/\/arxiv.org\/abs\/2001.04063","journal-title":"CoRR"},{"issue":"1","key":"e_1_3_3_72_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3468889","article-title":"A review on question generation from natural language text","volume":"40","author":"Zhang Ruqing","year":"2021","unstructured":"Ruqing Zhang, Jiafeng Guo, Lu Chen, Yixing Fan, and Xueqi Cheng. 2021. A review on question generation from natural language text. ACM Transactions on Information Systems (TOIS) 40, 1 (2021), 1\u201343.","journal-title":"ACM Transactions on Information Systems (TOIS)"},{"key":"e_1_3_3_73_1","doi-asserted-by":"crossref","first-page":"2495","DOI":"10.18653\/v1\/D19-1253","volume-title":"Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)","author":"Zhang Shiyue","year":"2019","unstructured":"Shiyue Zhang and Mohit Bansal. 2019. Addressing semantic drift in question generation for semi-supervised question answering. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 2495\u20132509."},{"key":"e_1_3_3_74_1","article-title":"BERTscore: Evaluating text generation with BERT","author":"Zhang Tianyi","year":"2019","unstructured":"Tianyi Zhang, Varsha Kishore, Felix Wu, Kilian Q. Weinberger, and Yoav Artzi. 2019. BERTscore: Evaluating text generation with BERT. arXiv preprint arXiv:1904.09675 (2019).","journal-title":"arXiv preprint arXiv:1904.09675"},{"key":"e_1_3_3_75_1","first-page":"662","volume-title":"National CCF Conference on Natural Language Processing and Chinese Computing","author":"Zhou Qingyu","year":"2017","unstructured":"Qingyu Zhou, Nan Yang, Furu Wei, Chuanqi Tan, Hangbo Bao, and Ming Zhou. 2017a. Neural question generation from text: A preliminary study. In National CCF Conference on Natural Language Processing and Chinese Computing. Springer, 662\u2013671."},{"key":"e_1_3_3_76_1","article-title":"Neural question generation from text: A preliminary study","volume":"1704","author":"Zhou Qingyu","year":"2017","unstructured":"Qingyu Zhou, Nan Yang, Furu Wei, Chuanqi Tan, Hangbo Bao, and Ming Zhou. 2017b. Neural question generation from text: A preliminary study. CoRR abs\/1704.01792 (2017). arXiv:1704.01792DOI:http:\/\/arxiv.org\/abs\/1704.01792","journal-title":"CoRR"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3675781","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3675781","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:04:09Z","timestamp":1750291449000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3675781"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,8,16]]},"references-count":75,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2024,9,30]]}},"alternative-id":["10.1145\/3675781"],"URL":"https:\/\/doi.org\/10.1145\/3675781","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"type":"print","value":"2375-4699"},{"type":"electronic","value":"2375-4702"}],"subject":[],"published":{"date-parts":[[2024,8,16]]},"assertion":[{"value":"2024-03-10","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-06-22","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-08-16","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}