{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T10:40:11Z","timestamp":1755859211012,"version":"3.44.0"},"publisher-location":"New York, NY, USA","reference-count":55,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,7,13]]},"DOI":"10.1145\/3726302.3730299","type":"proceedings-article","created":{"date-parts":[[2025,7,14]],"date-time":"2025-07-14T01:21:38Z","timestamp":1752456098000},"page":"3832-3842","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Wrong Answers Can Also Be Useful: P\n            <scp>lausible<\/scp>\n            QA - A Large-Scale QA Dataset with Answer Plausibility Scores"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4850-9239","authenticated-orcid":false,"given":"Jamshid","family":"Mozafari","sequence":"first","affiliation":[{"name":"University of Innsbruck, Innsbruck, Tyrol, Austria"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8747-4927","authenticated-orcid":false,"given":"Abdelrahman","family":"Abdallah","sequence":"additional","affiliation":[{"name":"University of Innsbruck, Innsbruck, Tyrol, Austria"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-3578-2393","authenticated-orcid":false,"given":"Bhawna","family":"Piryani","sequence":"additional","affiliation":[{"name":"University of Innsbruck, Innsbruck, Tyrol, Austria"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7235-0665","authenticated-orcid":false,"given":"Adam","family":"Jatowt","sequence":"additional","affiliation":[{"name":"University of Innsbruck, Innsbruck, Tyrol, Austria"}]}],"member":"320","published-online":{"date-parts":[[2025,7,13]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-022-01783-5"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.emnlp-main.799"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","unstructured":"Raghunath Arnab. 2017. Chapter 7 - Stratified Sampling. In Survey Sampling Theory and Applications Raghunath Arnab (Ed.). Academic Press 213-256. doi:10.1016\/B978-0-12-811848-1.00007-8","DOI":"10.1016\/B978-0-12-811848-1.00007-8"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3556538"},{"volume-title":"Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing","author":"Berant Jonathan","key":"e_1_3_2_1_5_1","unstructured":"Jonathan Berant, Andrew Chou, Roy Frostig, and Percy Liang. 2013. Semantic Parsing on Freebase from Question-Answer Pairs. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, David Yarowsky, Timothy Baldwin, Anna Korhonen, Karen Livescu, and Steven Bethard (Eds.). Association for Computational Linguistics, Seattle, Washington, USA, 1533-1544. https:\/\/aclanthology.org\/D13-1160\/"},{"key":"e_1_3_2_1_6_1","first-page":"324","article-title":"Rank Analysis of Incomplete Block Designs","volume":"39","author":"Bradley Ralph Allan","year":"1952","unstructured":"Ralph Allan Bradley and Milton E. Terry. 1952. Rank Analysis of Incomplete Block Designs: I. The Method of Paired Comparisons. Biometrika, Vol. 39, 3\/4 (1952), 324-345. http:\/\/www.jstor.org\/stable\/2334029","journal-title":"I. The Method of Paired Comparisons. Biometrika"},{"key":"e_1_3_2_1_7_1","first-page":"1877","volume-title":"Lin (Eds.)","volume":"33","author":"Brown Tom","year":"2020","unstructured":"Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel Ziegler, Jeffrey Wu, Clemens Winter, Chris Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, and Dario Amodei. 2020. Language Models are Few-Shot Learners. In Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin (Eds.), Vol. 33. Curran Associates, Inc., 1877-1901. https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2020\/file\/1457c0d6bfcb4967418bfb8ac142f64a-Paper.pdf"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.emnlp-main.20"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.findings-acl.248"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1803.05457"},{"key":"e_1_3_2_1_11_1","unstructured":"DeepSeek-AI Aixin Liu Bei Feng Bing Xue Bingxuan Wang Bochao Wu Chengda Lu Chenggang Zhao Chengqi Deng Chenyu Zhang Chong Ruan Damai Dai Daya Guo Dejian Yang Deli Chen Dongjie Ji Erhang Li Fangyun Lin Fucong Dai Fuli Luo Guangbo Hao Guanting Chen Guowei Li H. Zhang and Others Bao. 2024. DeepSeek-V3 Technical Report. arXiv e-prints (Dec. 2024). doi:10.48550\/arXiv.2412.19437"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1704.05179"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.findings-acl.872"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.findings-emnlp.117"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","unstructured":"Gemini Team Petko Georgiev Ving Ian Lei Ryan Burnell Libin Bai Anmol Gulati Garrett Tanzer Damien Vincent Zhufeng Pan Shibo Wang Soroosh Mariooryad Yifan Ding Xinyang Geng Fred Alcober Roy Frostig Mark Omernick Lexi Walker Cosmin Paduraru Christina Sorokin and Others Tacchetti. 2024. Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context. arXiv e-prints (March 2024). doi:10.48550\/arXiv.2403.05530","DOI":"10.48550\/arXiv.2403.05530"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","unstructured":"Gemma Team Morgane Riviere Shreya Pathak Pier Giuseppe Sessa Cassidy Hardin Surya Bhupatiraju L\u00e9onard Hussenot Thomas Mesnard Bobak Shahriari Alexandre Ram\u00e9 Johan Ferret Peter Liu Pouya Tafti Abe Friesen Michelle Casbon Sabela Ramos Ravin Kumar Charline Le Lan and Others. 2024. Gemma 2: Improving Open Language Models at a Practical Size. arXiv e-prints (July 2024). doi:10.48550\/arXiv.2408.00118","DOI":"10.48550\/arXiv.2408.00118"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2407.21783"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.findings-acl.263"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.emnlp-main.67"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D17-1215"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","unstructured":"Albert Q. Jiang Alexandre Sablayrolles Arthur Mensch Chris Bamford Devendra Singh Chaplot Diego de las Casas Florian Bressand Gianna Lengyel Guillaume Lample Lucile Saulnier L\u00e9lio Renard Lavaud Marie-Anne Lachaux Pierre Stock Teven Le Scao Thibaut Lavril Thomas Wang Timoth\u00e9e Lacroix and William El Sayed. 2023. Mistral 7B. arXiv e-prints (Oct. 2023). doi:10.48550\/arXiv.2310.06825","DOI":"10.48550\/arXiv.2310.06825"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1147"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.acl-long.307"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.550"},{"key":"e_1_3_2_1_25_1","volume-title":"Learning The Difference That Makes A Difference With Counterfactually-Augmented Data. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=Sklgs0NFvr","author":"Kaushik Divyansh","year":"2020","unstructured":"Divyansh Kaushik, Eduard Hovy, and Zachary Lipton. 2020. Learning The Difference That Makes A Difference With Counterfactually-Augmented Data. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=Sklgs0NFvr"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1214\/aoms\/1177729694"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00276"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D17-1082"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2501.13125"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","unstructured":"Mike Lewis Yinhan Liu Naman Goyal Marjan Ghazvininejad Abdelrahman Mohamed Omer Levy Veselin Stoyanov and Luke Zettlemoyer. 2020. BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation Translation and Comprehension. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics Dan Jurafsky Joyce Chai Natalie Schluter and Joel Tetreault (Eds.). Association for Computational Linguistics Online 7871-7880. doi:10.18653\/v1\/2020.acl-main.703","DOI":"10.18653\/v1\/2020.acl-main.703"},{"key":"e_1_3_2_1_31_1","volume-title":"Learning Question Classifiers. In COLING 2002: The 19th International Conference on Computational Linguistics. https:\/\/aclanthology.org\/C02-1150\/","author":"Li Xin","year":"2002","unstructured":"Xin Li and Dan Roth. 2002. Learning Question Classifiers. In COLING 2002: The 19th International Conference on Computational Linguistics. https:\/\/aclanthology.org\/C02-1150\/"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1907.11692"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1561\/1500000102"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.findings-emnlp.546"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/3626772.3657855"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2502.00857"},{"key":"e_1_3_2_1_37_1","unstructured":"OpenAI Josh Achiam Steven Adler Sandhini Agarwal Lama Ahmad Ilge Akkaya Florencia Leoni Aleman Diogo Almeida Janko Altenschmidt Sam Altman et al. 2023. GPT-4 Technical Report. arXiv e-prints (March 2023). doi:10.48550\/arXiv.2303.08774"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1098\/rspl.1895.0041"},{"key":"e_1_3_2_1_39_1","unstructured":"Radek Pel\u00e1nek Ji Rih\u00e1k et al. 2016. Properties and Applications of Wrong Answers in Online Educational Systems. International Educational Data Mining Society (2016). https:\/\/eric.ed.gov\/?id=ED592699"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.2307\/2346567"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2405.12819"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.findings-acl.47"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2412.15115"},{"key":"e_1_3_2_1_44_1","article-title":"Exploring the limits of transfer learning with a unified text-to-text transformer","volume":"21","author":"Raffel Colin","year":"2020","unstructured":"Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J. Liu. 2020. Exploring the limits of transfer learning with a unified text-to-text transformer. J. Mach. Learn. Res., Vol. 21, 1, Article 140 (Jan. 2020), 67 pages. http:\/\/jmlr.org\/papers\/v21\/20-074.html","journal-title":"J. Mach. Learn. Res."},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.eval4nlp-1.2"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D16-1264"},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00266"},{"key":"e_1_3_2_1_48_1","volume-title":"International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=9Vrb9D0WI4","author":"Sanh Victor","year":"2022","unstructured":"Victor Sanh, Albert Webson, Colin Raffel, Stephen Bach, Lintang Sutawika, Zaid Alyafeai, Antoine Chaffin, Arnaud Stiegler, Arun Raja, Manan Dey, et al., 2022. Multitask prompted training enables zero-shot task generalization. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=9Vrb9D0WI4"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1056"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.2307\/1412159"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2302.13971"},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-4413"},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1259"},{"key":"e_1_3_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.findings-acl.655"},{"key":"e_1_3_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.47"}],"event":{"name":"SIGIR '25: The 48th International ACM SIGIR Conference on Research and Development in Information Retrieval","sponsor":["SIGIR ACM Special Interest Group on Information Retrieval"],"location":"Padua Italy","acronym":"SIGIR '25"},"container-title":["Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3726302.3730299","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T10:01:29Z","timestamp":1755856889000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3726302.3730299"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,13]]},"references-count":55,"alternative-id":["10.1145\/3726302.3730299","10.1145\/3726302"],"URL":"https:\/\/doi.org\/10.1145\/3726302.3730299","relation":{},"subject":[],"published":{"date-parts":[[2025,7,13]]},"assertion":[{"value":"2025-07-13","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}