{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,14]],"date-time":"2026-02-14T13:17:02Z","timestamp":1771075022592,"version":"3.50.1"},"reference-count":69,"publisher":"Association for Computing Machinery (ACM)","issue":"2","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2026,2,28]]},"abstract":"<jats:p>\n                    Machine Reading Comprehension (MRC) is a key task in Natural Language Understanding that enables automated systems to answer questions based on textual input. While MRC has made substantial progress for high-resource languages, low-resource languages pose significant challenges due to their complex linguistic features. This article presents a comprehensive human-annotated dataset for Urdu MRC, comprising 20,000 question-answer pairs derived from 1,540 articles across seven domains. Unlike previous translation-based datasets, this dataset contains question-answer pairs created through rigorous crowd-sourcing and expert annotation. The dataset encompasses diverse question types, including both answerable and unanswerable questions, with answers ranging from single words to complete sentences, effectively capturing Urdu\u2019s morphological richness and syntactic diversity. To address the limitations of traditional evaluation metrics like Exact Match (EM) and\n                    <jats:inline-formula content-type=\"math\/tex\">\n                      <jats:tex-math notation=\"LaTeX\" version=\"MathJax\">\\(F_1\\)<\/jats:tex-math>\n                    <\/jats:inline-formula>\n                    in assessing Urdu answers, we propose Semantic Match (SM), a metric designed to measure semantic equivalence between predicted and ground-truth answers. Our evaluation demonstrates the dataset\u2019s increased complexity, with state-of-the-art models achieving only 0.82% SM accuracy. Together, the dataset and evaluation metric establish a robust framework for advancing Urdu MRC research, bridging critical gaps in both dataset quality and evaluation methodology.\n                  <\/jats:p>","DOI":"10.1145\/3759455","type":"journal-article","created":{"date-parts":[[2025,8,7]],"date-time":"2025-08-07T11:13:14Z","timestamp":1754565194000},"page":"1-34","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["UQuAD+: Benchmark Dataset for Urdu Machine Reading Comprehension"],"prefix":"10.1145","volume":"25","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5394-8298","authenticated-orcid":false,"given":"Samreen","family":"Kazi","sequence":"first","affiliation":[{"name":"School of Mathematics and Computer Science, Institute of Business Administration","place":["Karachi, Pakistan"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2275-7464","authenticated-orcid":false,"given":"Shakeel","family":"Khoja","sequence":"additional","affiliation":[{"name":"School of Mathematics and Computer Science, Institute of Business Administration","place":["Karachi, Pakistan"]}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2026,2,14]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICWR51868.2021.9443126"},{"key":"e_1_3_2_3_2","unstructured":"Josh Achiam Steven Adler Sandhini Agarwal Lama Ahmad Ilge Akkaya Florencia Leoni Aleman Diogo Almeida Janko Altenschmidt Sam Altman Shyamal Anadkat et\u00a0al. 2023. GPT-4 technical report. arXiv:2303.08774. Retrieved from https:\/\/arxiv.org\/abs\/2303.08774"},{"key":"e_1_3_2_4_2","doi-asserted-by":"crossref","unstructured":"Mohammed Akour Sameer Abufardeh Kenneth Magel and Qasem Al-Radaideh. 2011. QArabPro: A rule based question answering system for reading comprehension tests in Arabic. American Journal of Applied Sciences 8 6 (2011) 652\u2013661.","DOI":"10.3844\/ajassp.2011.652.661"},{"key":"e_1_3_2_5_2","volume-title":"Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing","author":"Anuranjana Kaveri","year":"2019","unstructured":"Kaveri Anuranjana, Vijjini Rao, and Radhika Mamidi. 2019. HindiRC: A dataset for reading comprehension in Hindi. In Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing."},{"key":"e_1_3_2_6_2","doi-asserted-by":"crossref","unstructured":"Samee Arif Abdul Hameed Azeemi Agha Ali Raza and Awais Athar. 2024. Generalists vs. specialists: Evaluating large language models for Urdu. arXiv:2407.04459. Retrieved from https:\/\/arxiv.org\/abs\/2407.04459","DOI":"10.18653\/v1\/2024.findings-emnlp.426"},{"key":"e_1_3_2_7_2","unstructured":"Samee Arif Sualeha Farid Awais Athar and Agha Ali Raza. 2024. UQA: Corpus for Urdu question answering. arXiv:2405.01458. Retrieved from https:\/\/arxiv.org\/abs\/2405.01458"},{"key":"e_1_3_2_8_2","unstructured":"Mikel Artetxe Sebastian Ruder and Dani Yogatama. 2019. On the cross-lingual transferability of monolingual representations. arXiv:1910.11856. Retrieved from https:\/\/arxiv.org\/abs\/1910.11856"},{"key":"e_1_3_2_9_2","unstructured":"Payal Bajaj Daniel Campos Nick Craswell Li Deng Jianfeng Gao Xiaodong Liu Rangan Majumder Andrew McNamara Bhaskar Mitra Tri Nguyen et\u00a0al. 2016. MS MARCO: A human generated machine reading comprehension dataset. arXiv:1611.09268. Retrieved from https:\/\/arxiv.org\/abs\/1611.09268"},{"key":"e_1_3_2_10_2","unstructured":"Ondrej Bajgar Rudolf Kadlec and Jan Kleindienst. 2016. Embracing data abundance: BookTest dataset for reading comprehension. arXiv preprint arXiv:1610.00956 (2016)."},{"key":"e_1_3_2_11_2","doi-asserted-by":"crossref","unstructured":"Lucas Bandarkar Davis Liang Benjamin Muller Mikel Artetxe Satya Narayan Shukla Donald Husa Naman Goyal Abhinandan Krishnan Luke Zettlemoyer and Madian Khabsa. 2023. The belebele benchmark: A parallel reading comprehension dataset in 122 language variants. arXiv preprint arXiv:2308.16884 (2023).","DOI":"10.18653\/v1\/2024.acl-long.44"},{"key":"e_1_3_2_12_2","doi-asserted-by":"crossref","unstructured":"M. M. Biltawi A. Awajan and S. Tedmori. 2023. Arabic span extraction-based reading comprehension benchmark (ASER) and neural baseline models. ACM Transactions on Asian and Low-Resource Language Information Processing 22 5 (2023) 1\u201329.","DOI":"10.1145\/3579047"},{"key":"e_1_3_2_13_2","doi-asserted-by":"crossref","unstructured":"Danqi Chen Jason Bolton and Christopher D. Manning. 2016. A thorough examination of the CNN\/daily mail reading comprehension task. arXiv preprint arXiv:1606.02858 (2016).","DOI":"10.18653\/v1\/P16-1223"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-tutorials.8"},{"key":"e_1_3_2_15_2","doi-asserted-by":"crossref","unstructured":"Jonathan H. Clark Eunsol Choi Michael Collins Dan Garrette Tom Kwiatkowski Vitaly Nikolaev and Jennimaria Palomaki. 2020. TyDi QA: A benchmark for information-seeking question answering in typologically diverse languages. Transactions of the Association for Computational Linguistics 8 (2020) 454\u2013470.","DOI":"10.1162\/tacl_a_00317"},{"key":"e_1_3_2_16_2","unstructured":"Peter Clark Isaac Cowhey Oren Etzioni Tushar Khot Ashish Sabharwal Carissa Schoenick and Oyvind Tafjord. 2018. Think you have solved question answering? Try ARC the AI2 reasoning challenge. arXiv preprint arXiv:1803.05457 (2018)."},{"key":"e_1_3_2_17_2","doi-asserted-by":"crossref","unstructured":"Jacob Cohen. 1960. A coefficient of agreement for nominal scales. Educational and Psychological Measurement 20 1 (1960) 37\u201346.","DOI":"10.1177\/001316446002000104"},{"key":"e_1_3_2_18_2","unstructured":"Yiming Cui Ting Liu Wanxiang Che Li Xiao Zhipeng Chen Wentao Ma Shijin Wang and Guoping Hu. 2018. A span-extraction dataset for chinese machine reading comprehension. arXiv preprint arXiv:1810.07366 (2018)."},{"key":"e_1_3_2_19_2","doi-asserted-by":"crossref","unstructured":"Robert Dale. 2021. GPT-3: What\u2019s it good for? Natural Language Engineering 27 1 (2021) 113\u2013118.","DOI":"10.1017\/S1351324920000601"},{"key":"e_1_3_2_20_2","unstructured":"Jacob Devlin Ming-Wei Chang Kenton Lee and Kristina Toutanova. 2018. BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)."},{"key":"e_1_3_2_21_2","doi-asserted-by":"crossref","unstructured":"Martin d\u2019Hoffschmidt Wacim Belblidia Tom Brendl\u00e9 Quentin Heinrich and Maxime Vidal. 2020. FQuAD: French question answering dataset. arXiv preprint arXiv:2002.06071 (2020).","DOI":"10.18653\/v1\/2020.findings-emnlp.107"},{"key":"e_1_3_2_22_2","doi-asserted-by":"crossref","unstructured":"Daria Dzendzik Carl Vogel and Jennifer Foster. 2021. English machine reading comprehension datasets: A survey. arXiv preprint arXiv:2101.10421 (2021).","DOI":"10.18653\/v1\/2021.emnlp-main.693"},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58219-7_1"},{"key":"e_1_3_2_24_2","doi-asserted-by":"crossref","unstructured":"Wei He Kai Liu Jing Liu Yajuan Lyu Shiqi Zhao Xinyan Xiao Yuan Liu Yizhong Wang Hua Wu Qiaoqiao She et\u00a0al. 2017. DuReader: A chinese machine reading comprehension dataset from real-world applications. arXiv preprint arXiv:1711.05073 (2017).","DOI":"10.18653\/v1\/W18-2605"},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33016529"},{"key":"e_1_3_2_26_2","doi-asserted-by":"crossref","unstructured":"Yimin Jing Deyi Xiong and Yan Zhen. 2019. BiPaR: A bilingual parallel dataset for multilingual and cross-lingual reading comprehension on novels. arXiv preprint arXiv:1910.05040 (2019).","DOI":"10.18653\/v1\/D19-1249"},{"key":"e_1_3_2_27_2","doi-asserted-by":"crossref","unstructured":"Mandar Joshi Eunsol Choi Daniel S. Weld and Luke Zettlemoyer. 2017. TriviaQA: A large scale distantly supervised challenge dataset for reading comprehension. arXiv preprint arXiv:1705.03551 (2017).","DOI":"10.18653\/v1\/P17-1147"},{"key":"e_1_3_2_28_2","doi-asserted-by":"crossref","unstructured":"Vladimir Karpukhin Barlas O\u011fuz Sewon Min Patrick Lewis Ledell Wu Sergey Edunov Danqi Chen and Wen-tau Yih. 2020. Dense passage retrieval for open-domain question answering. arXiv preprint arXiv:2004.04906 (2020).","DOI":"10.18653\/v1\/2020.emnlp-main.550"},{"key":"e_1_3_2_29_2","unstructured":"Samreen Kazi and Shakeel Khoja. 2021. UQuAD1.0: Development of an Urdu question answering training data for machine reading comprehension. arXiv preprint arXiv:2111.01543 (2021)."},{"key":"e_1_3_2_30_2","first-page":"1009","volume-title":"Proceedings of the 38th Pacific Asia Conference on Language, Information and Computation (PACLIC \u201924)","author":"Kazi Samreen","year":"2024","unstructured":"Samreen Kazi and Shakeel Khoja. 2024. UPERF: Urdu proximity enhanced retrieval framework. In Proceedings of the 38th Pacific Asia Conference on Language, Information and Computation (PACLIC \u201924). Asian Federation of Natural Language Processing, 1009\u20131017."},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","unstructured":"Samreen Kazi and Shakeel Khoja. 2026. Towards building Urdu language document retrieval framework. Computer Speech & Language 95 (2026) 101797. DOI:10.1016\/j.csl.2025.101797","DOI":"10.1016\/j.csl.2025.101797"},{"key":"e_1_3_2_32_2","doi-asserted-by":"crossref","unstructured":"Samreen Kazi Shakeel Khoja and Ali Daud. 2023. A survey of deep learning techniques for machine reading comprehension. Artificial Intelligence Review 56 Suppl 2 (2023) 2509\u20132569.","DOI":"10.1007\/s10462-023-10583-4"},{"key":"e_1_3_2_33_2","doi-asserted-by":"crossref","unstructured":"Samreen Kazi Shakeel Khoja and Ali Daud. 2025. Bridging the gap: A survey of document retrieval techniques for high-resource and low-resource languages. Computer Science Review 57 (2025) 100756.","DOI":"10.1016\/j.cosrev.2025.100756"},{"key":"e_1_3_2_34_2","unstructured":"Samreen Kazi and Shakeel Ahmed Khoja. 2024. Context-aware question answering in Urdu. In Proceedings of the 7th International Conference on Natural Language and Speech Processing (ICNLSP 2024). Association for Computational Linguistics Trento 233\u2013242. Retrieved from https:\/\/aclanthology.org\/2024.icnlsp-1.26\/"},{"key":"e_1_3_2_35_2","first-page":"141","volume-title":"Proceedings of the First Workshop on Natural Language Processing for Indo-Aryan and Dravidian Languages","author":"Kazi Samreen","year":"2025","unstructured":"Samreen Kazi, Maria Rahim, and Shakeel Ahmed Khoja. 2025. Crossing language boundaries: Evaluation of large language models on Urdu-English question answering. In Proceedings of the First Workshop on Natural Language Processing for Indo-Aryan and Dravidian Languages. 141\u2013151."},{"key":"e_1_3_2_36_2","doi-asserted-by":"crossref","unstructured":"Tom\u00e1\u0161 Ko\u010disk\u1ef3 Jonathan Schwarz Phil Blunsom Chris Dyer Karl Moritz Hermann G\u00e1bor Melis and Edward Grefenstette. 2018. The narrativeQA reading comprehension challenge. Transactions of the Association for Computational Linguistics 6 (2018) 317\u2013328.","DOI":"10.1162\/tacl_a_00023"},{"key":"e_1_3_2_37_2","doi-asserted-by":"crossref","unstructured":"Anastasia Krithara Anastasios Nentidis Konstantinos Bougiatiotis and Georgios Paliouras. 2023. BioASQ-QA: A manually curated corpus for biomedical question answering. Scientific Data 10 1 (2023) 170.","DOI":"10.1038\/s41597-023-02068-4"},{"key":"e_1_3_2_38_2","doi-asserted-by":"crossref","unstructured":"Tom Kwiatkowski Jennimaria Palomaki Olivia Redfield Michael Collins Ankur Parikh Chris Alberti Danielle Epstein Illia Polosukhin Jacob Devlin Kenton Lee et\u00a0al. 2019. Natural questions: A benchmark for question answering research. Transactions of the Association for Computational Linguistics 7 (2019) 453\u2013466.","DOI":"10.1162\/tacl_a_00276"},{"key":"e_1_3_2_39_2","doi-asserted-by":"crossref","unstructured":"Guokun Lai Qizhe Xie Hanxiao Liu Yiming Yang and Eduard Hovy. 2017. RACE: Large-scale reading comprehension dataset from examinations. arXiv preprint arXiv:1704.04683 (2017).","DOI":"10.18653\/v1\/D17-1082"},{"key":"e_1_3_2_40_2","unstructured":"BigScience Workshop Teven Le Scao Angela Fan Christopher Akiki Ellie Pavlick Suzana Ili\u0107 Daniel Hesslow Roman Castagn\u00e9 Alexandra Sasha Luccioni Fran\u00e7ois Yvon Matthias Gall\u00e9 Jonathan Tow Alexander M. Rush Stella Biderman Albert Webson Pawan Sasanka Ammanamanchi Thomas Wang Beno\u00eet Sagot Niklas Muennighoff Albert Villanova del Moral Olatunji Ruwase Rachel Bawden Stas Bekman Angelina McMillan-Major Iz Beltagy Huu Nguyen Lucile Saulnier Samson Tan Pedro Ortiz Suarez Victor Sanh Hugo Lauren\u00e7on Yacine Jernite Julien Launay Margaret Mitchell Colin Raffel Aaron Gokaslan and many others. 2023. BLOOM: A 176B-Parameter Open-Access Multilingual Language Model. Retrieved from https:\/\/arxiv.org\/abs\/2211.05100"},{"key":"e_1_3_2_41_2","volume-title":"Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)","author":"Lee Kyungjae","year":"2018","unstructured":"Kyungjae Lee, Kyoungho Yoon, Sunghyun Park, and Seung-won Hwang. 2018. Semi-supervised training data generation for multilingual question answering. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). European Language Resources Association (ELRA)."},{"key":"e_1_3_2_42_2","doi-asserted-by":"crossref","unstructured":"Patrick Lewis Barlas O\u011fuz Ruty Rinott Sebastian Riedel and Holger Schwenk. 2019. MLQA: Evaluating cross-lingual extractive question answering. arXiv preprint arXiv:1910.07475 (2019).","DOI":"10.18653\/v1\/2020.acl-main.653"},{"key":"e_1_3_2_43_2","unstructured":"Seungyoung Lim Myungji Kim and Jooyoul Lee. 2019. KorQuAD1.0: Korean QA dataset for machine reading comprehension. arXiv preprint arXiv:1909.07005 (2019)."},{"key":"e_1_3_2_44_2","unstructured":"Haokun Liu Derek Tam Mohammed Muqeeth Jay Mohta Tenghao Huang Mohit Bansal and Colin A. Raffel. 2022. Few-shot parameter-efficient fine-tuning is better and cheaper than in-context learning. Advances in Neural Information Processing Systems 35 (2022) 1950\u20131965."},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1227"},{"key":"e_1_3_2_46_2","unstructured":"Yinhan Liu Myle Ott Naman Goyal Jingfei Du Mandar Joshi Danqi Chen Omer Levy Mike Lewis Luke Zettlemoyer and Veselin Stoyanov. 2019. RoBERTa: A robustly optimized BERT pretraining approach. arXiv preprint arXiv:1907.11692 (2019)."},{"key":"e_1_3_2_47_2","doi-asserted-by":"crossref","unstructured":"Shayne Longpre Yi Lu and Joachim Daiber. 2021. MKQA: A linguistically diverse benchmark for multilingual open domain question answering. Transactions of the Association for Computational Linguistics 9 (2021) 1389\u20131406.","DOI":"10.1162\/tacl_a_00433"},{"key":"e_1_3_2_48_2","doi-asserted-by":"crossref","unstructured":"Timo M\u00f6ller Julian Risch and Malte Pietsch. 2021. GermanQuAD and GermanDPR: Improving non-english question answering and passage retrieval. arXiv preprint arXiv:2104.12741 (2021).","DOI":"10.18653\/v1\/2021.mrqa-1.4"},{"key":"e_1_3_2_49_2","doi-asserted-by":"crossref","unstructured":"Hussein Mozannar Karl El Hajal Elie Maamary and Hazem Hajj. 2019. Neural Arabic question answering. arXiv preprint arXiv:1906.05394 (2019).","DOI":"10.18653\/v1\/W19-4612"},{"key":"e_1_3_2_50_2","unstructured":"Tri Nguyen Mir Rosenberg Xia Song Jianfeng Gao Saurabh Tiwary Rangan Majumder and Li Deng. 2016. MS MARCO: A human generated machine reading comprehension dataset. Choice 2640 (2016) 660."},{"key":"e_1_3_2_51_2","unstructured":"OpenAI. 2023. ChatGPT. (2023). Retrieved April 18 2025 from https:\/\/openai.com\/index\/chatgpt\/"},{"key":"e_1_3_2_52_2","doi-asserted-by":"crossref","unstructured":"Pranav Rajpurkar Robin Jia and Percy Liang. 2018. Know what you don\u2019t know: Unanswerable questions for SQuAD. arXiv preprint arXiv:1806.03822 (2018).","DOI":"10.18653\/v1\/P18-2124"},{"key":"e_1_3_2_53_2","doi-asserted-by":"crossref","unstructured":"Pranav Rajpurkar Jian Zhang Konstantin Lopyrev and Percy Liang. 2016. SQuAD: 100 000+ questions for machine comprehension of text. arXiv preprint arXiv:1606.05250 (2016).","DOI":"10.18653\/v1\/D16-1264"},{"key":"e_1_3_2_54_2","doi-asserted-by":"crossref","unstructured":"Siva Reddy Danqi Chen and Christopher D. Manning. 2019. CoQA: A conversational question answering challenge. Transactions of the Association for Computational Linguistics 7 (2019) 249\u2013266.","DOI":"10.1162\/tacl_a_00266"},{"key":"e_1_3_2_55_2","doi-asserted-by":"crossref","unstructured":"Vinay Samuel Houda Aynaou Arijit Ghosh Chowdhury Karthik Venkat Ramanan and Aman Chadha. 2023. Can LLMs augment low-resource reading comprehension datasets? Opportunities and challenges. arXiv preprint arXiv:2309.12426 (2023).","DOI":"10.18653\/v1\/2024.acl-srw.36"},{"key":"e_1_3_2_56_2","unstructured":"Minjoon Seo Aniruddha Kembhavi Ali Farhadi and Hannaneh Hajishirzi. 2016. Bidirectional attention flow for machine comprehension. arXiv preprint arXiv:1611.01603 (2016)."},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","unstructured":"Muhammad Shakeel and Rao Muhammad Adeel Nawab. 2025. Open-domain factoid question-answering in urdu: data and methods. IEEE Access 13 1 (2025) 30167\u201330185. DOI:10.1109\/ACCESS.2025.3540939","DOI":"10.1109\/ACCESS.2025.3540939"},{"key":"e_1_3_2_58_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.439"},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1075"},{"key":"e_1_3_2_60_2","unstructured":"Alex Tamkin Dan Jurafsky and Noah Goodman. 2020. Language through a prism: A spectral approach for multiscale language representations. Advances in Neural Information Processing Systems 33 (2020) 5492\u20135504."},{"key":"e_1_3_2_61_2","doi-asserted-by":"crossref","unstructured":"Adam Trischler Tong Wang Xingdi Yuan Justin Harris Alessandro Sordoni Philip Bachman and Kaheer Suleman. 2016. NewsQA: A machine comprehension dataset. arXiv preprint arXiv:1611.09830 (2016).","DOI":"10.18653\/v1\/W17-2623"},{"key":"e_1_3_2_62_2","first-page":"239","volume-title":"Proceedings of the 16th International Conference on Natural Language Processing","author":"Vaidya Shalaka","year":"2019","unstructured":"Shalaka Vaidya, Hiranmai Sri Adibhatla, and Radhika Mamidi. 2019. Samajh-boojh: A reading comprehension system in Hindi. In Proceedings of the 16th International Conference on Natural Language Processing. 239\u2013248."},{"key":"e_1_3_2_63_2","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3463099"},{"key":"e_1_3_2_64_2","doi-asserted-by":"publisher","unstructured":"Xiaoyi Wang Jie Liu Jiong Wang Jianyong Duan Guixia Guan Qing Zhang and Jianshe Zhou. 2024. Document-level relation extraction based on machine reading comprehension and hybrid pointer-sequence labeling. ACM Transactions on Asian and Low-Resource Language Information Processing 23 7 (2024) 1\u201316. DOI:10.1145\/3666042","DOI":"10.1145\/3666042"},{"key":"e_1_3_2_65_2","doi-asserted-by":"crossref","unstructured":"Dirk Weissenborn Georg Wiese and Laura Seiffe. 2017. Making neural QA as simple as possible but not simpler. arXiv preprint arXiv:1703.04816 (2017).","DOI":"10.18653\/v1\/K17-1028"},{"key":"e_1_3_2_66_2","doi-asserted-by":"crossref","unstructured":"Zhilin Yang Peng Qi Saizheng Zhang Yoshua Bengio William W. Cohen Ruslan Salakhutdinov and Christopher D. Manning. 2018. HotpotQA: A dataset for diverse explainable multi-hop question answering. arXiv preprint arXiv:1809.09600 (2018).","DOI":"10.18653\/v1\/D18-1259"},{"key":"e_1_3_2_67_2","unstructured":"Adams Wei Yu David Dohan Minh-Thang Luong Rui Zhao Kai Chen Mohammad Norouzi and Quoc V. Le. 2018. QANet: Combining local convolution with global self-attention for reading comprehension. arXiv preprint arXiv:1804.09541 (2018)."},{"key":"e_1_3_2_68_2","doi-asserted-by":"crossref","unstructured":"Chengzhi Zeng Shixian Li Qingjia Li Jing Hu and Jinquan Hu. 2020. A survey on machine reading comprehension\u2014tasks evaluation metrics and benchmark datasets. Applied Sciences 10 21 (2020) 7640.","DOI":"10.3390\/app10217640"},{"key":"e_1_3_2_69_2","unstructured":"Shengyu Zhang Linfeng Dong Xiaoya Li Sen Zhang Xiaofei Sun Shuhe Wang Jiwei Li Runyi Hu Tianwei Zhang Fei Wu et\u00a0al. 2023. Instruction tuning for large language models: A survey. arXiv preprint arXiv:2308.10792 (2023)."},{"key":"e_1_3_2_70_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.findings-emnlp.135"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3759455","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,2,14]],"date-time":"2026-02-14T12:47:27Z","timestamp":1771073247000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3759455"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,2,14]]},"references-count":69,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2026,2,28]]}},"alternative-id":["10.1145\/3759455"],"URL":"https:\/\/doi.org\/10.1145\/3759455","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,2,14]]},"assertion":[{"value":"2024-12-27","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-07-29","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2026-02-14","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}