{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T10:00:39Z","timestamp":1775815239261,"version":"3.50.1"},"reference-count":205,"publisher":"Association for Computing Machinery (ACM)","issue":"12","license":[{"start":{"date-parts":[[2024,10,3]],"date-time":"2024-10-03T00:00:00Z","timestamp":1727913600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Shenzhen Science and Technology Program","award":["JCYJ20220818103001002"],"award-info":[{"award-number":["JCYJ20220818103001002"]}]},{"name":"Shenzhen Doctoral Startup Funding","award":["RCBS20221008093330065"],"award-info":[{"award-number":["RCBS20221008093330065"]}]},{"name":"Tianyuan Fund for Mathematics of National Natural Science Foundation of China","award":["12326608"],"award-info":[{"award-number":["12326608"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2024,12,31]]},"abstract":"<jats:p>\n            This survey article proposes a clearer view of\n            <jats:bold>Natural Language Reasoning (NLR)<\/jats:bold>\n            in the field of\n            <jats:bold>Natural Language Processing (NLP)<\/jats:bold>\n            , both conceptually and practically. Conceptually, we provide a distinct definition for NLR in NLP, based on both philosophy and NLP scenarios; discuss what types of tasks require reasoning; and introduce a taxonomy of reasoning. Practically, we conduct a comprehensive literature review on NLR in NLP, mainly covering classical logical reasoning,\n            <jats:bold>Natural Language Inference (NLI)<\/jats:bold>\n            , multi-hop question answering, and commonsense reasoning. The article also identifies and views backward reasoning, a powerful paradigm for multi-step reasoning, and introduces defeasible reasoning as one of the most important future directions in NLR research. We focus on single-modality unstructured natural language text, excluding neuro-symbolic research and mathematical reasoning.\n            <jats:xref ref-type=\"fn\">\n              <jats:sup>1<\/jats:sup>\n            <\/jats:xref>\n          <\/jats:p>","DOI":"10.1145\/3664194","type":"journal-article","created":{"date-parts":[[2024,5,9]],"date-time":"2024-05-09T08:33:00Z","timestamp":1715243580000},"page":"1-39","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":36,"title":["Natural Language Reasoning, A Survey"],"prefix":"10.1145","volume":"56","author":[{"ORCID":"https:\/\/orcid.org\/0009-0005-0400-4599","authenticated-orcid":false,"given":"Fei","family":"Yu","sequence":"first","affiliation":[{"name":"The Chinese University of Hong Kong, Shenzhen, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0425-3673","authenticated-orcid":false,"given":"Hongbo","family":"Zhang","sequence":"additional","affiliation":[{"name":"Shenzhen Research Institute of Big Data, The Chinese University of Hong Kong, Shenzhen, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2851-4260","authenticated-orcid":false,"given":"Prayag","family":"Tiwari","sequence":"additional","affiliation":[{"name":"School of Information Technology, Halmstad University, Halmstad, Sweden"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1501-9914","authenticated-orcid":false,"given":"Benyou","family":"Wang","sequence":"additional","affiliation":[{"name":"Shenzhen Research Institute of Big Data, The Chinese University of Hong Kong, Shenzhen, China"}]}],"member":"320","published-online":{"date-parts":[[2024,10,3]]},"reference":[{"key":"e_1_3_2_2_2","first-page":"3050","volume-title":"Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL\/IJCNLP\u201921), (Volume 1: Long Papers)","author":"Aggarwal Shourya","year":"2021","unstructured":"Shourya Aggarwal, Divyanshu Mandowara, Vishwajeet Agrawal, Dinesh Khandelwal, Parag Singla, and Dinesh Garg. 2021. Explanations for CommonsenseQA: New dataset and models. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL\/IJCNLP\u201921), (Volume 1: Long Papers), Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli (Eds.). Association for Computational Linguistics, 3050\u20133065. DOI:10.18653\/v1\/2021.acl-long.238"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2211.15661"},{"key":"e_1_3_2_4_2","volume-title":"Dictionary of Philosophy","author":"Angeles Peter Adam","year":"1981","unstructured":"Peter Adam Angeles. 1981. Dictionary of Philosophy. Barnes & Noble Books."},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2302.04023"},{"key":"e_1_3_2_6_2","unstructured":"Qiming Bao Alex Yuxuan Peng Tim Hartill Neset Tan Zhenyun Deng Michael Witbrock and Jiamou Liu. 2022. Multi-step deductive reasoning over natural language: An empirical study on out-of-distribution generalisation. In The 2nd International Joint Conference on Learning and Reasoning and 16th International Workshop on Neural-Symbolic Learning and Reasoning (IJCLR-NeSy\u201922)."},{"key":"e_1_3_2_7_2","first-page":"63","volume-title":"IWCS","author":"Betz Gregor","year":"2021","unstructured":"Gregor Betz, Christian Voigt, and Kyle Richardson. 2021. Critical thinking for language models. In IWCS. Association for Computational Linguistics, 63\u201375."},{"key":"e_1_3_2_8_2","volume-title":"ICLR","author":"Bhagavatula Chandra","year":"2020","unstructured":"Chandra Bhagavatula, Ronan Le Bras, Chaitanya Malaviya, Keisuke Sakaguchi, Ari Holtzman, Hannah Rashkin, Doug Downey, Wen-tau Yih, and Yejin Choi. 2020. Abductive commonsense reasoning. In ICLR. OpenReview.net."},{"key":"e_1_3_2_9_2","first-page":"7432","volume-title":"The 34th AAAI Conference on Artificial Intelligence (AAAI\u201920), The 32nd Innovative Applications of Artificial Intelligence Conference (IAAI\u201920), The 10th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI\u201920)","author":"Bisk Yonatan","year":"2020","unstructured":"Yonatan Bisk, Rowan Zellers, Ronan Le Bras, Jianfeng Gao, and Yejin Choi. 2020. PIQA: Reasoning about physical commonsense in natural language. In The 34th AAAI Conference on Artificial Intelligence (AAAI\u201920), The 32nd Innovative Applications of Artificial Intelligence Conference (IAAI\u201920), The 10th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI\u201920). AAAI Press, 7432\u20137439. https:\/\/ojs.aaai.org\/index.php\/AAAI\/article\/view\/6239"},{"key":"e_1_3_2_10_2","volume-title":"The Oxford Dictionary of Philosophy","author":"Blackburn Simon","year":"2008","unstructured":"Simon Blackburn. 2008. The Oxford Dictionary of Philosophy. Oxford University Press."},{"key":"e_1_3_2_11_2","first-page":"1122","volume-title":"EMNLP","author":"Boratko Michael","year":"2020","unstructured":"Michael Boratko, Xiang Li, Tim O\u2019Gorman, Rajarshi Das, Dan Le, and Andrew McCallum. 2020. ProtoQA: A question answering dataset for prototypical common-sense reasoning. In EMNLP. Association for Computational Linguistics, 1122\u20131136."},{"key":"e_1_3_2_12_2","doi-asserted-by":"crossref","first-page":"6266","DOI":"10.18653\/v1\/2021.emnlp-main.506","volume-title":"Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201921)","author":"Bostrom Kaj","year":"2021","unstructured":"Kaj Bostrom, Xinyu Zhao, Swarat Chaudhuri, and Greg Durrett. 2021. Flexible generation of natural language deductions. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201921), Marie-Francine Moens, Xuanjing Huang, Lucia Specia, and Scott Wen-tau Yih (Eds.). Association for Computational Linguistics, 6266\u20136278. DOI:10.18653\/v1\/2021.emnlp-main.506"},{"key":"e_1_3_2_13_2","doi-asserted-by":"crossref","first-page":"632","DOI":"10.18653\/v1\/D15-1075","volume-title":"Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201915)","author":"Bowman Samuel R.","year":"2015","unstructured":"Samuel R. Bowman, Gabor Angeli, Christopher Potts, and Christopher D. Manning. 2015. A large annotated corpus for learning natural language inference. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201915), Llu\u00eds M\u00e0rquez, Chris Callison-Burch, Jian Su, Daniele Pighin, and Yuval Marton (Eds.). Association for Computational Linguistics, 632\u2013642. DOI:10.18653\/v1\/d15-1075"},{"key":"e_1_3_2_14_2","article-title":"Inference","author":"Britannica The Editors of Encyclopaedia","year":"2017","unstructured":"The Editors of Encyclopaedia Britannica. 2017. Inference. Encyclopedia Britannica. https:\/\/www.britannica.com\/topic\/inference-reason","journal-title":"Encyclopedia Britannica"},{"key":"e_1_3_2_15_2","article-title":"Reason","author":"Britannica The Editors of Encyclopaedia","year":"2020","unstructured":"The Editors of Encyclopaedia Britannica. 2020. Reason. Encyclopedia Britannica. https:\/\/www.britannica.com\/topic\/reason","journal-title":"Encyclopedia Britannica"},{"key":"e_1_3_2_16_2","volume-title":"Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020 (NeurIPS\u201920)","author":"Brown Tom B.","year":"2020","unstructured":"Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, and Dario Amodei. 2020. Language models are few-shot learners. In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020 (NeurIPS\u201920), Hugo Larochelle, Marc\u2019Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, and Hsuan-Tien Lin (Eds.). https:\/\/proceedings.neurips.cc\/paper\/2020\/hash\/1457c0d6bfcb4967418bfb8ac142f64a-Abstract.html"},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2303.12712"},{"key":"e_1_3_2_18_2","volume-title":"3rd Annual Conference on Weblogs and Social Media (ICWSM\u201909)","author":"Burton Kevin","year":"2009","unstructured":"Kevin Burton, Akshay Java, and Ian Soboroff. 2009. The ICWSM 2009 spinn3r dataset. In 3rd Annual Conference on Weblogs and Social Media (ICWSM\u201909)."},{"key":"e_1_3_2_19_2","first-page":"9560","volume-title":"Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018 (NeurIPS\u201918)","author":"Camburu Oana-Maria","year":"2018","unstructured":"Oana-Maria Camburu, Tim Rockt\u00e4schel, Thomas Lukasiewicz, and Phil Blunsom. 2018. e-SNLI: Natural language inference with natural language explanations. In Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018 (NeurIPS\u201918), Samy Bengio, Hanna M. Wallach, Hugo Larochelle, Kristen Grauman, Nicol\u00f2 Cesa-Bianchi, and Roman Garnett (Eds.). 9560\u20139572. https:\/\/proceedings.neurips.cc\/paper\/2018\/hash\/4c7a167bb329bd92580a99ce422d6fa6-Abstract.html"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.findings-acl.297"},{"key":"e_1_3_2_21_2","first-page":"4026","volume-title":"NAACL-HLT","author":"Chen Jifan","year":"2019","unstructured":"Jifan Chen and Greg Durrett. 2019. Understanding dataset design choices for multi-hop reasoning. In NAACL-HLT. Association for Computational Linguistics, 4026\u20134032."},{"key":"e_1_3_2_22_2","doi-asserted-by":"crossref","first-page":"8772","DOI":"10.18653\/v1\/2020.acl-main.774","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL\u201920)","author":"Chen Tongfei","year":"2020","unstructured":"Tongfei Chen, Zhengping Jiang, Adam Poliak, Keisuke Sakaguchi, and Benjamin Van Durme. 2020. Uncertain natural language inference. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL\u201920), Dan Jurafsky, Joyce Chai, Natalie Schluter, and Joel R. Tetreault (Eds.). Association for Computational Linguistics, 8772\u20138779. DOI:10.18653\/v1\/2020.acl-main.774"},{"key":"e_1_3_2_23_2","volume-title":"8th International Conference on Learning Representations (ICLR\u201920)","author":"Chen Wenhu","year":"2020","unstructured":"Wenhu Chen, Hongmin Wang, Jianshu Chen, Yunkai Zhang, Hong Wang, Shiyang Li, Xiyou Zhou, and William Yang Wang. 2020. TabFact: A large-scale dataset for table-based fact verification. In 8th International Conference on Learning Representations (ICLR\u201920). OpenReview.net. https:\/\/openreview.net\/forum?id=rkeJRhNYDH"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2210.11416"},{"key":"e_1_3_2_25_2","article-title":"Think you have solved question answering? Try ARC, the AI2 reasoning challenge","volume":"1803","author":"Clark Peter","year":"2018","unstructured":"Peter Clark, Isaac Cowhey, Oren Etzioni, Tushar Khot, Ashish Sabharwal, Carissa Schoenick, and Oyvind Tafjord. 2018. Think you have solved question answering? Try ARC, the AI2 reasoning challenge. CoRR abs\/1803.05457 (2018). arXiv:1803.05457http:\/\/arxiv.org\/abs\/1803.05457","journal-title":"CoRR"},{"key":"e_1_3_2_26_2","first-page":"3882","volume-title":"IJCAI","author":"Clark Peter","year":"2020","unstructured":"Peter Clark, Oyvind Tafjord, and Kyle Richardson. 2020. Transformers as soft reasoners over language. In IJCAI. ijcai.org, 3882\u20133890."},{"key":"e_1_3_2_27_2","doi-asserted-by":"crossref","first-page":"2475","DOI":"10.18653\/v1\/D18-1269","volume-title":"Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing","author":"Conneau Alexis","year":"2018","unstructured":"Alexis Conneau, Ruty Rinott, Guillaume Lample, Adina Williams, Samuel R. Bowman, Holger Schwenk, and Veselin Stoyanov. 2018. XNLI: Evaluating cross-lingual sentence representations. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Ellen Riloff, David Chiang, Julia Hockenmaier, and Jun\u2019ichi Tsujii (Eds.). Association for Computational Linguistics, 2475\u20132485. DOI:10.18653\/v1\/d18-1269"},{"key":"e_1_3_2_28_2","article-title":"Faithful reasoning using large language models","volume":"2208","author":"Creswell Antonia","year":"2022","unstructured":"Antonia Creswell and Murray Shanahan. 2022. Faithful reasoning using large language models. CoRR abs\/2208.14271 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_29_2","article-title":"Selection-inference: Exploiting large language models for interpretable logical reasoning","volume":"2205","author":"Creswell Antonia","year":"2022","unstructured":"Antonia Creswell, Murray Shanahan, and Irina Higgins. 2022. Selection-inference: Exploiting large language models for interpretable logical reasoning. CoRR abs\/2205.09712 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.2200\/S00509ED1V01Y201305HLT023"},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2212.10559"},{"key":"e_1_3_2_32_2","first-page":"7358","volume-title":"EMNLP","author":"Dalvi Bhavana","year":"2021","unstructured":"Bhavana Dalvi, Peter Jansen, Oyvind Tafjord, Zhengnan Xie, Hannah Smith, Leighanna Pipatanangkura, and Peter Clark. 2021. Explaining answers with entailment trees. In EMNLP. Association for Computational Linguistics, 7358\u20137370."},{"key":"e_1_3_2_33_2","article-title":"Language models show human-like content effects on reasoning","volume":"2207","author":"Dasgupta Ishita","year":"2022","unstructured":"Ishita Dasgupta, Andrew K. Lampinen, Stephanie C. Y. Chan, Antonia Creswell, Dharshan Kumaran, James L. McClelland, and Felix Hill. 2022. Language models show human-like content effects on reasoning. CoRR abs\/2207.07051 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_34_2","doi-asserted-by":"crossref","first-page":"6112","DOI":"10.18653\/v1\/2021.emnlp-main.494","volume-title":"Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201921)","author":"Deng Xiang","year":"2021","unstructured":"Xiang Deng, Yu Su, Alyssa Lees, You Wu, Cong Yu, and Huan Sun. 2021. ReasonBERT: Pre-trained to reason with distant supervision. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201921), Marie-Francine Moens, Xuanjing Huang, Lucia Specia, and Scott Wen-tau Yih (Eds.). Association for Computational Linguistics, 6112\u20136127. DOI:10.18653\/v1\/2021.emnlp-main.494"},{"key":"e_1_3_2_35_2","first-page":"4171","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT\u201919), Volume 1 (Long and Short Papers)","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT\u201919), Volume 1 (Long and Short Papers), Jill Burstein, Christy Doran, and Thamar Solorio (Eds.). Association for Computational Linguistics, 4171\u20134186. DOI:10.18653\/v1\/n19-1423"},{"key":"e_1_3_2_36_2","doi-asserted-by":"crossref","first-page":"932","DOI":"10.18653\/v1\/2022.acl-long.66","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL\u201922)","author":"Dong Qingxiu","year":"2022","unstructured":"Qingxiu Dong, Ziwei Qin, Heming Xia, Tian Feng, Shoujie Tong, Haoran Meng, Lin Xu, Zhongyu Wei, Weidong Zhan, Baobao Chang, Sujian Li, Tianyu Liu, and Zhifang Sui. 2022. Premise-based multimodal reasoning: Conditional inference on joint textual and visual clues. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL\u201922), Smaranda Muresan, Preslav Nakov, and Aline Villavicencio (Eds.). Association for Computational Linguistics, 932\u2013946. DOI:10.18653\/v1\/2022.acl-long.66"},{"key":"e_1_3_2_37_2","first-page":"5181","volume-title":"ACL\/IJCNLP","author":"Du Li","year":"2021","unstructured":"Li Du, Xiao Ding, Ting Liu, and Bing Qin. 2021. Learning event graph knowledge for abductive reasoning. In ACL\/IJCNLP. Association for Computational Linguistics, 5181\u20135190."},{"key":"e_1_3_2_38_2","first-page":"432","volume-title":"ACL","author":"Du Li","year":"2022","unstructured":"Li Du, Xiao Ding, Kai Xiong, Ting Liu, and Bing Qin. 2022. e-CARE: A new dataset for exploring explainable causal reasoning. In ACL. Association for Computational Linguistics, 432\u2013446."},{"key":"e_1_3_2_39_2","first-page":"698","volume-title":"EMNLP","author":"Emelin Denis","year":"2021","unstructured":"Denis Emelin, Ronan Le Bras, Jena D. Hwang, Maxwell Forbes, and Yejin Choi. 2021. Moral stories: Situated reasoning about norms, intents, actions, and their consequences. In EMNLP. Association for Computational Linguistics, 698\u2013718."},{"key":"e_1_3_2_40_2","first-page":"6896","volume-title":"ACL","author":"Fei Zichu","year":"2022","unstructured":"Zichu Fei, Qi Zhang, Tao Gui, Di Liang, Sirui Wang, Wei Wu, and Xuanjing Huang. 2022. CQG: A simple and effective controlled generation framework for multi-hop question generation. In ACL. Association for Computational Linguistics, 6896\u20136906."},{"key":"e_1_3_2_41_2","first-page":"1295","volume-title":"EMNLP","author":"Feng Yanlin","year":"2020","unstructured":"Yanlin Feng, Xinyue Chen, Bill Yuchen Lin, Peifeng Wang, Jun Yan, and Xiang Ren. 2020. Scalable multi-hop relational reasoning for knowledge-aware question answering. In EMNLP. Association for Computational Linguistics, 1295\u20131309."},{"issue":"2","key":"e_1_3_2_42_2","article-title":"Informal logic and the theory of reasoning","volume":"6","author":"Finocchiaro Maurice A.","year":"1984","unstructured":"Maurice A. Finocchiaro. 1984. Informal logic and the theory of reasoning. Informal Logic 6, 2 (1984).","journal-title":"Informal Logic"},{"key":"e_1_3_2_43_2","first-page":"653","volume-title":"EMNLP","author":"Forbes Maxwell","year":"2020","unstructured":"Maxwell Forbes, Jena D. Hwang, Vered Shwartz, Maarten Sap, and Yejin Choi. 2020. Social Chemistry 101: Learning to reason about social and moral norms. In EMNLP. Association for Computational Linguistics, 653\u2013670."},{"key":"e_1_3_2_44_2","article-title":"Peirce\u2019s logic","author":"Bellucci Ahti-Veikko Pietarinen Francesco","year":"2022","unstructured":"Ahti-Veikko Pietarinen Francesco Bellucci. 2022. Peirce\u2019s logic. In The Internet Encyclopedia of Philosophy, ISSN 2161-0002 (2022). https:\/\/iep.utm.edu\/peir-log\/","journal-title":"The Internet Encyclopedia of Philosophy, ISSN 2161-0002"},{"key":"e_1_3_2_45_2","first-page":"3108","volume-title":"ACL","author":"Gabriel Saadia","year":"2022","unstructured":"Saadia Gabriel, Skyler Hallinan, Maarten Sap, Pemi Nguyen, Franziska Roesner, Eunsol Choi, and Yejin Choi. 2022. Misinfo reaction frames: Reasoning about readers\u2019 reactions to news headlines. In ACL. Association for Computational Linguistics, 3108\u20133127."},{"key":"e_1_3_2_46_2","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1162\/tacl_a_00370","article-title":"Did Aristotle use a laptop? A question answering benchmark with implicit reasoning strategies","volume":"9","author":"Geva Mor","year":"2021","unstructured":"Mor Geva, Daniel Khashabi, Elad Segal, Tushar Khot, Dan Roth, and Jonathan Berant. 2021. Did Aristotle use a laptop? A question answering benchmark with implicit reasoning strategies. Trans. Assoc. Comput. Linguistics 9 (2021), 346\u2013361.","journal-title":"Trans. Assoc. Comput. Linguistics"},{"key":"e_1_3_2_47_2","volume-title":"Epistemology and Cognition","author":"Goldman Alvin I.","year":"1986","unstructured":"Alvin I. Goldman. 1986. Epistemology and Cognition. Harvard University Press."},{"key":"e_1_3_2_48_2","doi-asserted-by":"publisher","DOI":"10.48550\/ARXIV.2305.11738"},{"issue":"2","key":"e_1_3_2_49_2","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1007\/BF00128143","article-title":"Critical thinking as argument analysis","volume":"3","author":"Govier Trudy","year":"1989","unstructured":"Trudy Govier. 1989. Critical thinking as argument analysis. Argumentation 3, 2 (1989), 115\u2013126.","journal-title":"Argumentation"},{"key":"e_1_3_2_50_2","first-page":"1930","volume-title":"NAACL-HLT","author":"Habernal Ivan","year":"2018","unstructured":"Ivan Habernal, Henning Wachsmuth, Iryna Gurevych, and Benno Stein. 2018. The argument reasoning comprehension task: Identification and reconstruction of implicit warrants. In NAACL-HLT. Association for Computational Linguistics, 1930\u20131940."},{"key":"e_1_3_2_51_2","article-title":"FOLIO: Natural language reasoning with first-order logic","volume":"2209","author":"Han Simeng","year":"2022","unstructured":"Simeng Han, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Luke Benson, Lucy Sun, Ekaterina Zubova, Yujie Qiao, Matthew Burtell, David Peng, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Shafiq Joty, Alexander R. Fabbri, Wojciech Kryscinski, Xi Victoria Lin, Caiming Xiong, and Dragomir Radev. 2022. FOLIO: Natural language reasoning with first-order logic. CoRR abs\/2209.00840 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_52_2","volume-title":"9th International Conference on Learning Representations (ICLR\u201921)","author":"Hendrycks Dan","year":"2021","unstructured":"Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Song, and Jacob Steinhardt. 2021. Measuring massive multitask language understanding. In 9th International Conference on Learning Representations (ICLR\u201921). OpenReview.net. https:\/\/openreview.net\/forum?id=d7KBjmI3GmQ"},{"key":"e_1_3_2_53_2","article-title":"Logic","author":"Hintikka Jaakko J.","year":"2022","unstructured":"Jaakko J. Hintikka. 2022. Logic. In Encyclopedia Britannica. https:\/\/www.britannica.com\/topic\/logic","journal-title":"Encyclopedia Britannica"},{"key":"e_1_3_2_54_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2210.12152"},{"key":"e_1_3_2_55_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2212.10071"},{"key":"e_1_3_2_56_2","first-page":"6609","volume-title":"COLING","author":"Ho Xanh","year":"2020","unstructured":"Xanh Ho, Anh-Khoa Duong Nguyen, Saku Sugawara, and Akiko Aizawa. 2020. Constructing a multi-hop QA dataset for comprehensive evaluation of reasoning steps. In COLING. International Committee on Computational Linguistics, 6609\u20136625."},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.findings-naacl.145"},{"key":"e_1_3_2_58_2","doi-asserted-by":"crossref","first-page":"9106","DOI":"10.18653\/v1\/2020.emnlp-main.732","volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201920)","author":"Hossain Md Mosharaf","year":"2020","unstructured":"Md Mosharaf Hossain, Venelin Kovatchev, Pranoy Dutta, Tiffany Kao, Elizabeth Wei, and Eduardo Blanco. 2020. An analysis of natural language inference benchmarks through the lens of negation. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201920), Bonnie Webber, Trevor Cohn, Yulan He, and Yang Liu (Eds.). Association for Computational Linguistics, 9106\u20139118. DOI:10.18653\/v1\/2020.emnlp-main.732"},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.18653\/V1\/2020.FINDINGS-EMNLP.314"},{"key":"e_1_3_2_60_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2212.10403"},{"key":"e_1_3_2_61_2","doi-asserted-by":"publisher","DOI":"10.48550\/ARXIV.2310.01798"},{"key":"e_1_3_2_62_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2210.11610"},{"key":"e_1_3_2_63_2","first-page":"2391","volume-title":"EMNLP\/IJCNLP","author":"Huang Lifu","year":"2019","unstructured":"Lifu Huang, Ronan Le Bras, Chandra Bhagavatula, and Yejin Choi. 2019. Cosmos QA: Machine reading comprehension with contextual commonsense reasoning. In EMNLP\/IJCNLP. Association for Computational Linguistics, 2391\u20132401."},{"key":"e_1_3_2_64_2","first-page":"5810","volume-title":"NAACL-HLT","author":"Huang Yongjie","year":"2021","unstructured":"Yongjie Huang and Meng Yang. 2021. Breadth first reasoning graph for multi-hop question answering. In NAACL-HLT. Association for Computational Linguistics, 5810\u20135821."},{"key":"e_1_3_2_65_2","article-title":"MetaLogic: Logical reasoning explanations with fine-grained structure","volume":"2210","author":"Huang Yinya","year":"2022","unstructured":"Yinya Huang, Hongming Zhang, Ruixin Hong, Xiaodan Liang, Changshui Zhang, and Dong Yu. 2022. MetaLogic: Logical reasoning explanations with fine-grained structure. CoRR abs\/2210.12487 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_66_2","volume-title":"A Concise Introduction to Logic","author":"Hurley Patrick J.","year":"2014","unstructured":"Patrick J. Hurley. 2014. A Concise Introduction to Logic. Cengage Learning."},{"key":"e_1_3_2_67_2","first-page":"6384","volume-title":"35th AAAI Conference on Artificial Intelligence (AAAI\u201921), 33rd Conference on Innovative Applications of Artificial Intelligence (IAAI\u201921), The 11th Symposium on Educational Advances in Artificial Intelligence (EAAI\u201921)","author":"Hwang Jena D.","year":"2021","unstructured":"Jena D. Hwang, Chandra Bhagavatula, Ronan Le Bras, Jeff Da, Keisuke Sakaguchi, Antoine Bosselut, and Yejin Choi. 2021. (Comet-) Atomic 2020: On symbolic and neural commonsense knowledge graphs. In 35th AAAI Conference on Artificial Intelligence (AAAI\u201921), 33rd Conference on Innovative Applications of Artificial Intelligence (IAAI\u201921), The 11th Symposium on Educational Advances in Artificial Intelligence (EAAI\u201921). AAAI Press, 6384\u20136392. https:\/\/ojs.aaai.org\/index.php\/AAAI\/article\/view\/16792"},{"key":"e_1_3_2_68_2","first-page":"6740","volume-title":"ACL","author":"Inoue Naoya","year":"2020","unstructured":"Naoya Inoue, Pontus Stenetorp, and Kentaro Inui. 2020. R4C: A benchmark for evaluating RC systems to get the right answer for the right reason. In ACL. Association for Computational Linguistics, 6740\u20136750."},{"key":"e_1_3_2_69_2","first-page":"6064","volume-title":"EMNLP","author":"Inoue Naoya","year":"2021","unstructured":"Naoya Inoue, Harsh Trivedi, Steven Sinha, Niranjan Balasubramanian, and Kentaro Inui. 2021. Summarize-then-answer: Generating concise explanations for multi-hop reading comprehension. In EMNLP. Association for Computational Linguistics, 6064\u20136080."},{"key":"e_1_3_2_70_2","first-page":"137","volume-title":"EMNLP","author":"Jhamtani Harsh","year":"2020","unstructured":"Harsh Jhamtani and Peter Clark. 2020. Learning to explain: Datasets and models for identifying valid reasoning chains in multihop question-answering. In EMNLP. Association for Computational Linguistics, 137\u2013150."},{"key":"e_1_3_2_71_2","first-page":"2726","volume-title":"ACL","author":"Jiang Yichen","year":"2019","unstructured":"Yichen Jiang and Mohit Bansal. 2019. Avoiding reasoning shortcuts: Adversarial evaluation, training, and model development for multi-hop QA. In ACL. Association for Computational Linguistics, 2726\u20132736."},{"key":"e_1_3_2_72_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.findings-acl.276"},{"key":"e_1_3_2_73_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2205.11822"},{"key":"e_1_3_2_74_2","volume-title":"Thinking, Fast and Slow","author":"Kahneman Daniel","year":"2011","unstructured":"Daniel Kahneman. 2011. Thinking, Fast and Slow. Macmillan."},{"key":"e_1_3_2_75_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2212.13894"},{"key":"e_1_3_2_76_2","doi-asserted-by":"publisher","DOI":"10.1162\/TACL_A_00419"},{"key":"e_1_3_2_77_2","first-page":"8082","volume-title":"AAAI","author":"Khot Tushar","year":"2020","unstructured":"Tushar Khot, Peter Clark, Michal Guerquin, Peter Jansen, and Ashish Sabharwal. 2020. QASC: A dataset for question answering via sentence composition. In AAAI. AAAI Press, 8082\u20138090."},{"key":"e_1_3_2_78_2","first-page":"1264","volume-title":"Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT\u201921)","author":"Khot Tushar","year":"2021","unstructured":"Tushar Khot, Daniel Khashabi, Kyle Richardson, Peter Clark, and Ashish Sabharwal. 2021. Text modular networks: Learning to decompose tasks in the language of existing models. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT\u201921), Kristina Toutanova, Anna Rumshisky, Luke Zettlemoyer, Dilek Hakkani-T\u00fcr, Iz Beltagy, Steven Bethard, Ryan Cotterell, Tanmoy Chakraborty, and Yichao Zhou (Eds.). Association for Computational Linguistics, 1264\u20131279. DOI:10.18653\/v1\/2021.naacl-main.99"},{"key":"e_1_3_2_79_2","doi-asserted-by":"publisher","DOI":"10.18653\/V1\/2022.FINDINGS-ACL.142"},{"key":"e_1_3_2_80_2","first-page":"5189","volume-title":"Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI\u201918), the 30th innovative Applications of Artificial Intelligence (IAAI\u201918), and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI\u201918)","author":"Khot Tushar","year":"2018","unstructured":"Tushar Khot, Ashish Sabharwal, and Peter Clark. 2018. SciTaiL: A textual entailment dataset from science question answering. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI\u201918), the 30th innovative Applications of Artificial Intelligence (IAAI\u201918), and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI\u201918), Sheila A. McIlraith and Kilian Q. Weinberger (Eds.). AAAI Press, 5189\u20135197. https:\/\/www.aaai.org\/ocs\/index.php\/AAAI\/AAAI18\/paper\/view\/17368"},{"key":"e_1_3_2_81_2","first-page":"4831","volume-title":"ACL","author":"Klein Tassilo","year":"2019","unstructured":"Tassilo Klein and Moin Nabi. 2019. Attention is (not) all you need for commonsense reasoning. In ACL. Association for Computational Linguistics, 4831\u20134836."},{"key":"e_1_3_2_82_2","article-title":"Large language models are zero-shot reasoners","volume":"2205","author":"Kojima Takeshi","year":"2022","unstructured":"Takeshi Kojima, Shixiang Shane Gu, Machel Reid, Yutaka Matsuo, and Yusuke Iwasawa. 2022. Large language models are zero-shot reasoners. CoRR abs\/2205.11916 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_83_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.findings-acl.53"},{"key":"e_1_3_2_84_2","first-page":"5651","volume-title":"Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL\/IJCNLP\u201921) (Volume 1: Long Papers)","author":"Le Hung","year":"2021","unstructured":"Hung Le, Chinnadhurai Sankar, Seungwhan Moon, Ahmad Beirami, Alborz Geramifard, and Satwik Kottur. 2021. DVD: A diagnostic dataset for multi-step reasoning in video grounded dialogue. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL\/IJCNLP\u201921) (Volume 1: Long Papers), Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli (Eds.). Association for Computational Linguistics, 5651\u20135665. DOI:10.18653\/v1\/2021.acl-long.439"},{"key":"e_1_3_2_85_2","first-page":"6110","volume-title":"ACL\/IJCNLP","author":"Lee Kyungjae","year":"2021","unstructured":"Kyungjae Lee, Seung-won Hwang, Sang-eun Han, and Dohyeon Lee. 2021. Robustifying multi-hop QA through pseudo-evidentiality training. In ACL\/IJCNLP. Association for Computational Linguistics, 6110\u20136119."},{"key":"e_1_3_2_86_2","doi-asserted-by":"publisher","DOI":"10.1145\/219717.219745"},{"key":"e_1_3_2_87_2","first-page":"1225","volume-title":"NAACL-HLT","author":"Liang Zhengzhong","year":"2021","unstructured":"Zhengzhong Liang, Steven Bethard, and Mihai Surdeanu. 2021. Explainable multi-hop verbal reasoning through internal monologue. In NAACL-HLT. Association for Computational Linguistics, 1225\u20131250."},{"key":"e_1_3_2_88_2","first-page":"4611","volume-title":"NAACL-HLT","author":"Lin Bill Yuchen","year":"2021","unstructured":"Bill Yuchen Lin, Haitian Sun, Bhuwan Dhingra, Manzil Zaheer, Xiang Ren, and William W. Cohen. 2021. Differentiable open-ended commonsense reasoning. In NAACL-HLT. Association for Computational Linguistics, 4611\u20134625."},{"key":"e_1_3_2_89_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.findings-emnlp.165"},{"key":"e_1_3_2_90_2","first-page":"58","volume-title":"Proceedings of the 2nd Workshop on Machine Reading for Question Answering (MRQA@EMNLP\u201919)","author":"Lin Kevin","year":"2019","unstructured":"Kevin Lin, Oyvind Tafjord, Peter Clark, and Matt Gardner. 2019. Reasoning over paragraph effects in situations. In Proceedings of the 2nd Workshop on Machine Reading for Question Answering (MRQA@EMNLP\u201919), Adam Fisch, Alon Talmor, Robin Jia, Minjoon Seo, Eunsol Choi, and Danqi Chen (Eds.). Association for Computational Linguistics, 58\u201362. DOI:10.18653\/v1\/D19-5808"},{"key":"e_1_3_2_91_2","first-page":"13388","volume-title":"35th AAAI Conference on Artificial Intelligence (AAAI\u201921), 33rd Conference on Innovative Applications of Artificial Intelligence (IAAI\u201921), The 11th Symposium on Educational Advances in Artificial Intelligence (EAAI\u201921)","author":"Liu Hanmeng","year":"2021","unstructured":"Hanmeng Liu, Leyang Cui, Jian Liu, and Yue Zhang. 2021. Natural language inference in context\u2014Investigating contextual reasoning over long texts. In 35th AAAI Conference on Artificial Intelligence (AAAI\u201921), 33rd Conference on Innovative Applications of Artificial Intelligence (IAAI\u201921), The 11th Symposium on Educational Advances in Artificial Intelligence (EAAI\u201921). AAAI Press, 13388\u201313396. https:\/\/ojs.aaai.org\/index.php\/AAAI\/article\/view\/17580"},{"issue":"4","key":"e_1_3_2_92_2","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1023\/B:BTTJ.0000047600.45421.6d","article-title":"ConceptNet\u2013a practical commonsense reasoning tool-kit","volume":"22","author":"Liu Hugo","year":"2004","unstructured":"Hugo Liu and Push Singh. 2004. ConceptNet\u2013a practical commonsense reasoning tool-kit. BT Technol. J. 22, 4 (2004), 211\u2013226.","journal-title":"BT Technol. J."},{"key":"e_1_3_2_93_2","first-page":"3622","volume-title":"Proceedings of the 29th International Joint Conference on Artificial Intelligence (IJCAI\u201920)","author":"Liu Jian","year":"2020","unstructured":"Jian Liu, Leyang Cui, Hanmeng Liu, Dandan Huang, Yile Wang, and Yue Zhang. 2020. LogiQA: A challenge dataset for machine reading comprehension with logical reasoning. In Proceedings of the 29th International Joint Conference on Artificial Intelligence (IJCAI\u201920), Christian Bessiere (Ed.). ijcai.org, 3622\u20133628. DOI:10.24963\/ijcai.2020\/501"},{"key":"e_1_3_2_94_2","first-page":"3154","volume-title":"ACL","author":"Liu Jiacheng","year":"2022","unstructured":"Jiacheng Liu, Alisa Liu, Ximing Lu, Sean Welleck, Peter West, Ronan Le Bras, Yejin Choi, and Hannaneh Hajishirzi. 2022. Generated knowledge prompting for commonsense reasoning. In ACL. Association for Computational Linguistics, 3154\u20133169."},{"key":"e_1_3_2_95_2","volume-title":"An Essay Concerning Human Understanding","author":"Locke John","year":"1847","unstructured":"John Locke. 1847. An Essay Concerning Human Understanding. Kay & Troutman."},{"key":"e_1_3_2_96_2","doi-asserted-by":"publisher","DOI":"10.48550\/ARXIV.2310.00836"},{"key":"e_1_3_2_97_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.findings-acl.456"},{"key":"e_1_3_2_98_2","first-page":"6291","volume-title":"EMNLP","author":"Madaan Aman","year":"2021","unstructured":"Aman Madaan, Niket Tandon, Dheeraj Rajagopal, Peter Clark, Yiming Yang, and Eduard H. Hovy. 2021. Think about it! Improving defeasible reasoning by first modeling the question scenario. In EMNLP. Association for Computational Linguistics, 6291\u20136310."},{"key":"e_1_3_2_99_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2212.08410"},{"key":"e_1_3_2_100_2","first-page":"3428","volume-title":"Proceedings of the 57th Conference of the Association for Computational Linguistics (ACL\u201919), Volume 1: Long Papers","author":"McCoy Tom","year":"2019","unstructured":"Tom McCoy, Ellie Pavlick, and Tal Linzen. 2019. Right for the wrong reasons: Diagnosing syntactic heuristics in natural language inference. In Proceedings of the 57th Conference of the Association for Computational Linguistics (ACL\u201919), Volume 1: Long Papers, Anna Korhonen, David R. Traum, and Llu\u00eds M\u00e0rquez (Eds.). Association for Computational Linguistics, 3428\u20133448. DOI:10.18653\/v1\/p19-1334"},{"key":"e_1_3_2_101_2","doi-asserted-by":"crossref","first-page":"2381","DOI":"10.18653\/v1\/D18-1260","volume-title":"Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing","author":"Mihaylov Todor","year":"2018","unstructured":"Todor Mihaylov, Peter Clark, Tushar Khot, and Ashish Sabharwal. 2018. Can a suit of armor conduct electricity? A new dataset for open book question answering. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Ellen Riloff, David Chiang, Julia Hockenmaier, and Jun\u2019ichi Tsujii (Eds.). Association for Computational Linguistics, 2381\u20132391. DOI:10.18653\/v1\/d18-1260"},{"key":"e_1_3_2_102_2","first-page":"4249","volume-title":"ACL","author":"Min Sewon","year":"2019","unstructured":"Sewon Min, Eric Wallace, Sameer Singh, Matt Gardner, Hannaneh Hajishirzi, and Luke Zettlemoyer. 2019. Compositional questions do not necessitate multi-hop reasoning. In ACL. Association for Computational Linguistics, 4249\u20134257."},{"key":"e_1_3_2_103_2","first-page":"6097","volume-title":"ACL","author":"Min Sewon","year":"2019","unstructured":"Sewon Min, Victor Zhong, Luke Zettlemoyer, and Hannaneh Hajishirzi. 2019. Multi-hop reading comprehension through question decomposition and rescoring. In ACL. Association for Computational Linguistics, 6097\u20136109."},{"key":"e_1_3_2_104_2","first-page":"839","volume-title":"The 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT\u201916)","author":"Mostafazadeh Nasrin","year":"2016","unstructured":"Nasrin Mostafazadeh, Nathanael Chambers, Xiaodong He, Devi Parikh, Dhruv Batra, Lucy Vanderwende, Pushmeet Kohli, and James F. Allen. 2016. A corpus and cloze evaluation for deeper understanding of commonsense stories. In The 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT\u201916), Kevin Knight, Ani Nenkova, and Owen Rambow (Eds.). Association for Computational Linguistics, 839\u2013849. DOI:10.18653\/v1\/n16-1098"},{"key":"e_1_3_2_105_2","first-page":"4885","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL\u201920)","author":"Nie Yixin","year":"2020","unstructured":"Yixin Nie, Adina Williams, Emily Dinan, Mohit Bansal, Jason Weston, and Douwe Kiela. 2020. Adversarial NLI: A new benchmark for natural language understanding. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL\u201920), Dan Jurafsky, Joyce Chai, Natalie Schluter, and Joel R. Tetreault (Eds.). Association for Computational Linguistics, 4885\u20134901. DOI:10.18653\/V1\/2020.ACL-MAIN.441"},{"key":"e_1_3_2_106_2","volume-title":"NeurIPS Datasets and Benchmarks","author":"Onoe Yasumasa","year":"2021","unstructured":"Yasumasa Onoe, Michael J. Q. Zhang, Eunsol Choi, and Greg Durrett. 2021. CREAK: A dataset for commonsense reasoning over entity knowledge. In NeurIPS Datasets and Benchmarks."},{"key":"e_1_3_2_107_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2203.15099"},{"key":"e_1_3_2_108_2","first-page":"5866","volume-title":"NAACL-HLT","author":"Pan Liangming","year":"2021","unstructured":"Liangming Pan, Wenhu Chen, Wenhan Xiong, Min-Yen Kan, and William Yang Wang. 2021. Unsupervised multi-hop question answering by question generation. In NAACL-HLT. Association for Computational Linguistics, 5866\u20135880."},{"key":"e_1_3_2_109_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2205.12538"},{"key":"e_1_3_2_110_2","volume-title":"Reasoning and the Logic of Things: The Cambridge Conferences Lectures of 1898","author":"Peirce Charles Sanders","year":"1992","unstructured":"Charles Sanders Peirce. 1992. Reasoning and the Logic of Things: The Cambridge Conferences Lectures of 1898. Harvard University Press."},{"key":"e_1_3_2_111_2","doi-asserted-by":"crossref","first-page":"8864","DOI":"10.18653\/v1\/2020.emnlp-main.713","volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201920)","author":"Perez Ethan","year":"2020","unstructured":"Ethan Perez, Patrick S. H. Lewis, Wen-tau Yih, Kyunghyun Cho, and Douwe Kiela. 2020. Unsupervised question decomposition for question answering. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201920), Bonnie Webber, Trevor Cohn, Yulan He, and Yang Liu (Eds.). Association for Computational Linguistics, 8864\u20138880. DOI:10.18653\/v1\/2020.emnlp-main.713"},{"key":"e_1_3_2_112_2","article-title":"Reasoning like program executors","volume":"2201","author":"Pi Xinyu","year":"2022","unstructured":"Xinyu Pi, Qian Liu, Bei Chen, Morteza Ziyadi, Zeqi Lin, Yan Gao, Qiang Fu, Jian-Guang Lou, and Weizhu Chen. 2022. Reasoning like program executors. CoRR abs\/2201.11473 (2022). arXiv:2201.11473https:\/\/arxiv.org\/abs\/2201.11473","journal-title":"CoRR"},{"key":"e_1_3_2_113_2","doi-asserted-by":"crossref","first-page":"180","DOI":"10.18653\/v1\/S18-2023","volume-title":"Proceedings of the 7th Joint Conference on Lexical and Computational Semantics (SEM@NAACL-HLT\u201918)","author":"Poliak Adam","year":"2018","unstructured":"Adam Poliak, Jason Naradowsky, Aparajita Haldar, Rachel Rudinger, and Benjamin Van Durme. 2018. Hypothesis only baselines in natural language inference. In Proceedings of the 7th Joint Conference on Lexical and Computational Semantics (SEM@NAACL-HLT\u201918), Malvina Nissim, Jonathan Berant, and Alessandro Lenci (Eds.). Association for Computational Linguistics, 180\u2013191. DOI:10.18653\/v1\/s18-2023"},{"key":"e_1_3_2_114_2","article-title":"Measuring and narrowing the compositionality gap in language models","volume":"2210","author":"Press Ofir","year":"2022","unstructured":"Ofir Press, Muru Zhang, Sewon Min, Ludwig Schmidt, Noah A. Smith, and Mike Lewis. 2022. Measuring and narrowing the compositionality gap in language models. CoRR abs\/2210.03350 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_115_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2304.03843"},{"key":"e_1_3_2_116_2","first-page":"3599","volume-title":"EMNLP","author":"Qi Peng","year":"2021","unstructured":"Peng Qi, Haejun Lee, Tg Sido, and Christopher D. Manning. 2021. Answering open-domain questions of varying reasoning steps from text. In EMNLP. Association for Computational Linguistics, 3599\u20133614."},{"key":"e_1_3_2_117_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2212.09597"},{"key":"e_1_3_2_118_2","first-page":"5042","volume-title":"EMNLP\/IJCNLP","author":"Qin Lianhui","year":"2019","unstructured":"Lianhui Qin, Antoine Bosselut, Ari Holtzman, Chandra Bhagavatula, Elizabeth Clark, and Yejin Choi. 2019. Counterfactual story reasoning and generation. In EMNLP\/IJCNLP. Association for Computational Linguistics, 5042\u20135052."},{"key":"e_1_3_2_119_2","first-page":"794","volume-title":"EMNLP","author":"Qin Lianhui","year":"2020","unstructured":"Lianhui Qin, Vered Shwartz, Peter West, Chandra Bhagavatula, Jena D. Hwang, Ronan Le Bras, Antoine Bosselut, and Yejin Choi. 2020. Back to the future: Unsupervised backprop-based decoding for counterfactual and abductive commonsense reasoning. In EMNLP. Association for Computational Linguistics, 794\u2013805."},{"key":"e_1_3_2_120_2","first-page":"6140","volume-title":"ACL","author":"Qiu Lin","year":"2019","unstructured":"Lin Qiu, Yunxuan Xiao, Yanru Qu, Hao Zhou, Lei Li, Weinan Zhang, and Yong Yu. 2019. Dynamically fused graph network for multi-hop reasoning. In ACL. Association for Computational Linguistics, 6140\u20136150."},{"key":"e_1_3_2_121_2","first-page":"2968","volume-title":"NAACL-HLT","author":"Qu Hanhao","year":"2022","unstructured":"Hanhao Qu, Yu Cao, Jun Gao, Liang Ding, and Ruifeng Xu. 2022. Interpretable proof generation via iterative backward reasoning. In NAACL-HLT. Association for Computational Linguistics, 2968\u20132981."},{"key":"e_1_3_2_122_2","unstructured":"Alec Radford Karthik Narasimhan Tim Salimans and Ilya Sutskever. 2018. Improving language understanding by generative pre-training. OpenAI."},{"key":"e_1_3_2_123_2","first-page":"140:1\u2013140:67","article-title":"Exploring the limits of transfer learning with a unified text-to-text transformer","volume":"21","author":"Raffel Colin","year":"2020","unstructured":"Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J. Liu. 2020. Exploring the limits of transfer learning with a unified text-to-text transformer. J. Mach. Learn. Res. 21 (2020), 140:1\u2013140:67. http:\/\/jmlr.org\/papers\/v21\/20-074.html","journal-title":"J. Mach. Learn. Res."},{"key":"e_1_3_2_124_2","first-page":"4932","volume-title":"ACL","author":"Rajani Nazneen Fatema","year":"2019","unstructured":"Nazneen Fatema Rajani, Bryan McCann, Caiming Xiong, and Richard Socher. 2019. Explain yourself! Leveraging language models for commonsense reasoning. In ACL. Association for Computational Linguistics, 4932\u20134942."},{"key":"e_1_3_2_125_2","doi-asserted-by":"crossref","first-page":"463","DOI":"10.18653\/v1\/P18-1043","volume-title":"Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL\u201918), Volume 1: Long Papers","author":"Rashkin Hannah","year":"2018","unstructured":"Hannah Rashkin, Maarten Sap, Emily Allaway, Noah A. Smith, and Yejin Choi. 2018. Event2Mind: Commonsense inference on events, intents, and reactions. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL\u201918), Volume 1: Long Papers, Iryna Gurevych and Yusuke Miyao (Eds.). Association for Computational Linguistics, 463\u2013473. DOI:10.18653\/v1\/P18-1043"},{"key":"e_1_3_2_126_2","doi-asserted-by":"crossref","first-page":"8729","DOI":"10.18653\/v1\/2022.emnlp-main.598","volume-title":"Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201922)","author":"Ravichander Abhilasha","year":"2022","unstructured":"Abhilasha Ravichander, Matt Gardner, and Ana Marasovic. 2022. CONDAQA: A contrastive reading comprehension dataset for reasoning about negation. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201922), Yoav Goldberg, Zornitsa Kozareva, and Yue Zhang (Eds.). Association for Computational Linguistics, 8729\u20138755. https:\/\/aclanthology.org\/2022.emnlp-main.598"},{"key":"e_1_3_2_127_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.findings-naacl.35"},{"key":"e_1_3_2_128_2","first-page":"11209","volume-title":"36th AAAI Conference on Artificial Intelligence (AAAI\u201922), 34th Conference on Innovative Applications of Artificial Intelligence (IAAI\u201922), The 12th Symposium on Educational Advances in Artificial Intelligence (EAAI\u201922","author":"Richardson Kyle","year":"2022","unstructured":"Kyle Richardson and Ashish Sabharwal. 2022. Pushing the limits of rule reasoning in transformers through natural language satisfiability. In 36th AAAI Conference on Artificial Intelligence (AAAI\u201922), 34th Conference on Innovative Applications of Artificial Intelligence (IAAI\u201922), The 12th Symposium on Educational Advances in Artificial Intelligence (EAAI\u201922. AAAI Press, 11209\u201311219. DOI:10.1609\/AAAI.V36I10.21371"},{"key":"e_1_3_2_129_2","volume-title":"Logical Formalizations of Commonsense Reasoning, Papers from the 2011 AAAI Spring Symposium, Technical Report SS-11-06","author":"Roemmele Melissa","year":"2011","unstructured":"Melissa Roemmele, Cosmin Adrian Bejan, and Andrew S. Gordon. 2011. Choice of plausible alternatives: An evaluation of commonsense causal reasoning. In Logical Formalizations of Commonsense Reasoning, Papers from the 2011 AAAI Spring Symposium, Technical Report SS-11-06. AAAI. http:\/\/www.aaai.org\/ocs\/index.php\/SSS\/SSS11\/paper\/view\/2418"},{"key":"e_1_3_2_130_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.findings-emnlp.418"},{"key":"e_1_3_2_131_2","volume-title":"The Dictionary of Philosophy","author":"Runes Dagobert D.","year":"2001","unstructured":"Dagobert D. Runes. 2001. The Dictionary of Philosophy. Citadel Press."},{"key":"e_1_3_2_132_2","doi-asserted-by":"crossref","first-page":"7399","DOI":"10.18653\/v1\/2022.acl-long.511","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL\u201922)","author":"Sadat Mobashir","year":"2022","unstructured":"Mobashir Sadat and Cornelia Caragea. 2022. SciNLI: A corpus for natural language inference on scientific text. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL\u201922), Smaranda Muresan, Preslav Nakov, and Aline Villavicencio (Eds.). Association for Computational Linguistics, 7399\u20137409. DOI:10.18653\/v1\/2022.acl-long.511"},{"key":"e_1_3_2_133_2","doi-asserted-by":"crossref","first-page":"2087","DOI":"10.18653\/v1\/D18-1233","volume-title":"Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing","author":"Saeidi Marzieh","year":"2018","unstructured":"Marzieh Saeidi, Max Bartolo, Patrick S. H. Lewis, Sameer Singh, Tim Rockt\u00e4schel, Mike Sheldon, Guillaume Bouchard, and Sebastian Riedel. 2018. Interpretation of natural language rules in conversational machine reading. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Ellen Riloff, David Chiang, Julia Hockenmaier, and Jun\u2019ichi Tsujii (Eds.). Association for Computational Linguistics, 2087\u20132097. DOI:10.18653\/v1\/d18-1233"},{"key":"e_1_3_2_134_2","first-page":"122","volume-title":"EMNLP","author":"Saha Swarnadeep","year":"2020","unstructured":"Swarnadeep Saha, Sayan Ghosh, Shashank Srivastava, and Mohit Bansal. 2020. PRover: Proof generation for interpretable reasoning over rules. In EMNLP. Association for Computational Linguistics, 122\u2013136."},{"key":"e_1_3_2_135_2","doi-asserted-by":"crossref","first-page":"8240","DOI":"10.18653\/v1\/2020.emnlp-main.661","volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201920)","author":"Saha Swarnadeep","year":"2020","unstructured":"Swarnadeep Saha, Yixin Nie, and Mohit Bansal. 2020. ConjNLI: Natural language inference over conjunctive sentences. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201920), Bonnie Webber, Trevor Cohn, Yulan He, and Yang Liu (Eds.). Association for Computational Linguistics, 8240\u20138252. DOI:10.18653\/v1\/2020.emnlp-main.661"},{"key":"e_1_3_2_136_2","first-page":"3662","volume-title":"NAACL-HLT","author":"Saha Swarnadeep","year":"2021","unstructured":"Swarnadeep Saha, Prateek Yadav, and Mohit Bansal. 2021. multiPRover: Generating multiple proofs for improved interpretability in rule reasoning. In NAACL-HLT. Association for Computational Linguistics, 3662\u20133677."},{"key":"e_1_3_2_137_2","first-page":"8732","volume-title":"The 34th AAAI Conference on Artificial Intelligence (AAAI\u201920), The 32nd Innovative Applications of Artificial Intelligence Conference (IAAI\u201920), The 10th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI\u201920)","author":"Sakaguchi Keisuke","year":"2020","unstructured":"Keisuke Sakaguchi, Ronan Le Bras, Chandra Bhagavatula, and Yejin Choi. 2020. WinoGrande: An adversarial winograd schema challenge at scale. In The 34th AAAI Conference on Artificial Intelligence (AAAI\u201920), The 32nd Innovative Applications of Artificial Intelligence Conference (IAAI\u201920), The 10th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI\u201920). AAAI Press, 8732\u20138740. https:\/\/ojs.aaai.org\/index.php\/AAAI\/article\/view\/6399"},{"key":"e_1_3_2_138_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2205.12598"},{"key":"e_1_3_2_139_2","first-page":"1075","volume-title":"ACL","author":"Sanyal Soumya","year":"2022","unstructured":"Soumya Sanyal, Harman Singh, and Xiang Ren. 2022. FaiRR: Faithful and robust deductive reasoning over natural language. In ACL. Association for Computational Linguistics, 1075\u20131093."},{"key":"e_1_3_2_140_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2212.09282"},{"key":"e_1_3_2_141_2","first-page":"3027","volume-title":"AAAI","author":"Sap Maarten","year":"2019","unstructured":"Maarten Sap, Ronan Le Bras, Emily Allaway, Chandra Bhagavatula, Nicholas Lourie, Hannah Rashkin, Brendan Roof, Noah A. Smith, and Yejin Choi. 2019. ATOMIC: An atlas of machine commonsense for If-Then reasoning. In AAAI. AAAI Press, 3027\u20133035."},{"key":"e_1_3_2_142_2","first-page":"5477","volume-title":"ACL","author":"Sap Maarten","year":"2020","unstructured":"Maarten Sap, Saadia Gabriel, Lianhui Qin, Dan Jurafsky, Noah A. Smith, and Yejin Choi. 2020. Social bias frames: Reasoning about social and power implications of language. In ACL. Association for Computational Linguistics, 5477\u20135490."},{"key":"e_1_3_2_143_2","first-page":"4462","volume-title":"EMNLP\/IJCNLP","author":"Sap Maarten","year":"2019","unstructured":"Maarten Sap, Hannah Rashkin, Derek Chen, Ronan Le Bras, and Yejin Choi. 2019. Social IQa: Commonsense reasoning about social interactions. In EMNLP\/IJCNLP. Association for Computational Linguistics, 4462\u20134472."},{"key":"e_1_3_2_144_2","article-title":"Language models are greedy reasoners: A systematic formal analysis of chain-of-thought","volume":"2210","author":"Saparov Abulhair","year":"2022","unstructured":"Abulhair Saparov and He He. 2022. Language models are greedy reasoners: A systematic formal analysis of chain-of-thought. CoRR abs\/2210.01240 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_145_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2212.00193"},{"key":"e_1_3_2_146_2","first-page":"4505","volume-title":"EMNLP\/IJCNLP","author":"Sinha Koustuv","year":"2019","unstructured":"Koustuv Sinha, Shagun Sodhani, Jin Dong, Joelle Pineau, and William L. Hamilton. 2019. CLUTRR: A diagnostic benchmark for inductive reasoning from text. In EMNLP\/IJCNLP. Association for Computational Linguistics, 4505\u20134514."},{"key":"e_1_3_2_147_2","first-page":"4444","volume-title":"Proceedings of the 31st AAAI Conference on Artificial Intelligence","author":"Speer Robyn","year":"2017","unstructured":"Robyn Speer, Joshua Chin, and Catherine Havasi. 2017. ConceptNet 5.5: An open multilingual graph of general knowledge. In Proceedings of the 31st AAAI Conference on Artificial Intelligence, Satinder Singh and Shaul Markovitch (Eds.). AAAI Press, 4444\u20134451. http:\/\/aaai.org\/ocs\/index.php\/AAAI\/AAAI17\/paper\/view\/14972"},{"key":"e_1_3_2_148_2","article-title":"Beyond the imitation game: Quantifying and extrapolating the capabilities of language models","volume":"2206","author":"Srivastava Aarohi","year":"2022","unstructured":"Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adri\u00e0 Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Santilli, Andreas Stuhlm\u00fcller, Andrew M. Dai, Andrew La, Andrew K. Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakas, et\u00a0al. 2022. Beyond the imitation game: Quantifying and extrapolating the capabilities of language models. CoRR abs\/2206.04615 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_149_2","first-page":"6418","volume-title":"Proceedings of the 57th Conference of the Association for Computational Linguistics (ACL\u201919), Volume 1: Long Papers","author":"Suhr Alane","year":"2019","unstructured":"Alane Suhr, Stephanie Zhou, Ally Zhang, Iris Zhang, Huajun Bai, and Yoav Artzi. 2019. A corpus for reasoning about natural language grounded in photographs. In Proceedings of the 57th Conference of the Association for Computational Linguistics (ACL\u201919), Volume 1: Long Papers, Anna Korhonen, David R. Traum, and Llu\u00eds M\u00e0rquez (Eds.). Association for Computational Linguistics, 6418\u20136428. DOI:10.18653\/v1\/p19-1644"},{"key":"e_1_3_2_150_2","article-title":"Challenging BIG-bench tasks and whether chain-of-thought can solve them","volume":"2210","author":"Suzgun Mirac","year":"2022","unstructured":"Mirac Suzgun, Nathan Scales, Nathanael Sch\u00e4rli, Sebastian Gehrmann, Yi Tay, Hyung Won Chung, Aakanksha Chowdhery, Quoc V. Le, Ed H. Chi, Denny Zhou, and Jason Wei. 2022. Challenging BIG-bench tasks and whether chain-of-thought can solve them. CoRR abs\/2210.09261 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_151_2","series-title":"Findings of ACL","first-page":"3621","volume-title":"ACL\/IJCNLP (Findings)","author":"Tafjord Oyvind","year":"2021","unstructured":"Oyvind Tafjord, Bhavana Dalvi, and Peter Clark. 2021. ProofWriter: Generating implications, proofs, and abductive statements over natural language. In ACL\/IJCNLP (Findings)(Findings of ACL, Vol. ACL\/IJCNLP 2021). Association for Computational Linguistics, 3621\u20133634."},{"key":"e_1_3_2_152_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2210.12217"},{"key":"e_1_3_2_153_2","first-page":"641","volume-title":"Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT\u201918), Volume 1 (Long Papers)","author":"Talmor Alon","year":"2018","unstructured":"Alon Talmor and Jonathan Berant. 2018. The web as a knowledge-base for answering complex questions. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT\u201918), Volume 1 (Long Papers), Marilyn A. Walker, Heng Ji, and Amanda Stent (Eds.). Association for Computational Linguistics, 641\u2013651. DOI:10.18653\/V1\/N18-1059"},{"key":"e_1_3_2_154_2","first-page":"4149","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, (NAACL-HLT\u201919), Volume 1 (Long and Short Papers)","author":"Talmor Alon","year":"2019","unstructured":"Alon Talmor, Jonathan Herzig, Nicholas Lourie, and Jonathan Berant. 2019. CommonsenseQA: A question answering challenge targeting commonsense knowledge. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, (NAACL-HLT\u201919), Volume 1 (Long and Short Papers), Jill Burstein, Christy Doran, and Thamar Solorio (Eds.). Association for Computational Linguistics, 4149\u20134158. DOI:10.18653\/v1\/n19-1421"},{"key":"e_1_3_2_155_2","volume-title":"NeurIPS","author":"Talmor Alon","year":"2020","unstructured":"Alon Talmor, Oyvind Tafjord, Peter Clark, Yoav Goldberg, and Jonathan Berant. 2020. Leap-of-thought: Teaching pre-trained models to systematically reason over implicit knowledge. In NeurIPS."},{"key":"e_1_3_2_156_2","volume-title":"Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1 (NeurIPS Datasets and Benchmarks\u201921)","author":"Talmor Alon","year":"2021","unstructured":"Alon Talmor, Ori Yoran, Ronan Le Bras, Chandra Bhagavatula, Yoav Goldberg, Yejin Choi, and Jonathan Berant. 2021. CommonsenseQA 2.0: Exposing the limits of AI through gamification. In Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1 (NeurIPS Datasets and Benchmarks\u201921), Joaquin Vanschoren and Sai-Kit Yeung (Eds.). https:\/\/datasets-benchmarks-proceedings.neurips.cc\/paper\/2021\/hash\/3ef815416f775098fe977004015c6193-Abstract-round1.html"},{"key":"e_1_3_2_157_2","first-page":"3878","volume-title":"ACL","author":"Tamborrino Alexandre","year":"2020","unstructured":"Alexandre Tamborrino, Nicola Pellican\u00f2, Baptiste Pannier, Pascal Voitot, and Louise Naudin. 2020. Pre-training is (almost) all you need: An application to commonsense reasoning. In ACL. Association for Computational Linguistics, 3878\u20133887."},{"key":"e_1_3_2_158_2","doi-asserted-by":"crossref","first-page":"57","DOI":"10.18653\/v1\/D18-1006","volume-title":"Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing","author":"Tandon Niket","year":"2018","unstructured":"Niket Tandon, Bhavana Dalvi, Joel Grus, Wen-tau Yih, Antoine Bosselut, and Peter Clark. 2018. Reasoning about actions and state changes by injecting commonsense knowledge. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Ellen Riloff, David Chiang, Julia Hockenmaier, and Jun\u2019ichi Tsujii (Eds.). Association for Computational Linguistics, 57\u201366. DOI:10.18653\/v1\/d18-1006"},{"key":"e_1_3_2_159_2","first-page":"6075","volume-title":"EMNLP\/IJCNLP","author":"Tandon Niket","year":"2019","unstructured":"Niket Tandon, Bhavana Dalvi, Keisuke Sakaguchi, Peter Clark, and Antoine Bosselut. 2019. WIQA: A dataset for \u201cWhat if...\u201d reasoning over procedural text. In EMNLP\/IJCNLP. Association for Computational Linguistics, 6075\u20136084."},{"key":"e_1_3_2_160_2","first-page":"8846","volume-title":"EMNLP","author":"Trivedi Harsh","year":"2020","unstructured":"Harsh Trivedi, Niranjan Balasubramanian, Tushar Khot, and Ashish Sabharwal. 2020. Is Multihop QA in DiRe condition? Measuring and reducing disconnected reasoning. In EMNLP. Association for Computational Linguistics, 8846\u20138863."},{"key":"e_1_3_2_161_2","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00475"},{"key":"e_1_3_2_162_2","volume-title":"Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC\u201918)","author":"Tsuchiya Masatoshi","year":"2018","unstructured":"Masatoshi Tsuchiya. 2018. Performance impact caused by hidden bias of training data for recognizing textual entailment. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC\u201918), Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, K\u00f4iti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, H\u00e9l\u00e8ne Mazo, Asunci\u00f3n Moreno, Jan Odijk, Stelios Piperidis, and Takenobu Tokunaga (Eds.). European Language Resources Association (ELRA). http:\/\/www.lrec-conf.org\/proceedings\/lrec2018\/summaries\/786.html"},{"key":"e_1_3_2_163_2","doi-asserted-by":"publisher","DOI":"10.48550\/ARXIV.2311.08516"},{"key":"e_1_3_2_164_2","first-page":"5998","volume-title":"Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, Isabelle Guyon, Ulrike von Luxburg, Samy Bengio, Hanna M. Wallach, Rob Fergus, S. V. N. Vishwanathan, and Roman Garnett (Eds.). 5998\u20136008. https:\/\/proceedings.neurips.cc\/paper\/2017\/hash\/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html"},{"key":"e_1_3_2_165_2","first-page":"960","volume-title":"ACL","author":"Vilares David","year":"2019","unstructured":"David Vilares and Carlos G\u00f3mez-Rodr\u00edguez. 2019. HEAD-QA: A healthcare dataset for complex reasoning. In ACL. Association for Computational Linguistics, 960\u2013966."},{"issue":"8","key":"e_1_3_2_166_2","doi-asserted-by":"crossref","first-page":"399","DOI":"10.2307\/2026735","article-title":"What is reasoning? What is an argument?","volume":"87","author":"Walton Douglas N.","year":"1990","unstructured":"Douglas N. Walton. 1990. What is reasoning? What is an argument? J. Philos. 87, 8 (1990), 399\u2013419.","journal-title":"J. Philos."},{"key":"e_1_3_2_167_2","first-page":"3261","volume-title":"Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019 (NeurIPS\u201919)","author":"Wang Alex","year":"2019","unstructured":"Alex Wang, Yada Pruksachatkun, Nikita Nangia, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, and Samuel R. Bowman. 2019. SuperGLUE: A stickier benchmark for general-purpose language understanding systems. In Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019 (NeurIPS\u201919), Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d\u2019Alch\u00e9-Buc, Emily B. Fox, and Roman Garnett (Eds.). 3261\u20133275. https:\/\/proceedings.neurips.cc\/paper\/2019\/hash\/4496bf24afe7fab6f046bf4923da8de6-Abstract.html"},{"key":"e_1_3_2_168_2","volume-title":"7th International Conference on Learning Representations (ICLR\u201919)","author":"Wang Alex","year":"2019","unstructured":"Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, and Samuel R. Bowman. 2019. GLUE: A multi-task benchmark and analysis platform for natural language understanding. In 7th International Conference on Learning Representations (ICLR\u201919). OpenReview.net. https:\/\/openreview.net\/forum?id=rJ4km2R5t7"},{"key":"e_1_3_2_169_2","article-title":"Self-consistency improves chain of thought reasoning in language models","volume":"2203","author":"Wang Xuezhi","year":"2022","unstructured":"Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc V. Le, Ed H. Chi, and Denny Zhou. 2022. Self-consistency improves chain of thought reasoning in language models. CoRR abs\/2203.11171 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_170_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2206.07682"},{"key":"e_1_3_2_171_2","article-title":"Chain of thought prompting elicits reasoning in large language models","volume":"2201","author":"Wei Jason","year":"2022","unstructured":"Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Ed H. Chi, Quoc Le, and Denny Zhou. 2022. Chain of thought prompting elicits reasoning in large language models. CoRR abs\/2201.11903 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_172_2","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1162\/tacl_a_00021","article-title":"Constructing datasets for multi-hop reading comprehension across documents","volume":"6","author":"Welbl Johannes","year":"2018","unstructured":"Johannes Welbl, Pontus Stenetorp, and Sebastian Riedel. 2018. Constructing datasets for multi-hop reading comprehension across documents. Trans. Assoc. Comput. Linguistics 6 (2018), 287\u2013302.","journal-title":"Trans. Assoc. Comput. Linguistics"},{"key":"e_1_3_2_173_2","volume-title":"ICLR (Poster)","author":"Weston Jason","year":"2016","unstructured":"Jason Weston, Antoine Bordes, Sumit Chopra, and Tom\u00e1s Mikolov. 2016. Towards AI-complete question answering: A set of prerequisite toy tasks. In ICLR (Poster)."},{"key":"e_1_3_2_174_2","first-page":"1112","volume-title":"Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT\u201918), Volume 1 (Long Papers)","author":"Williams Adina","year":"2018","unstructured":"Adina Williams, Nikita Nangia, and Samuel R. Bowman. 2018. A broad-coverage challenge corpus for sentence understanding through inference. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT\u201918), Volume 1 (Long Papers), Marilyn A. Walker, Heng Ji, and Amanda Stent (Eds.). Association for Computational Linguistics, 1112\u20131122. DOI:10.18653\/v1\/n18-1101"},{"key":"e_1_3_2_175_2","doi-asserted-by":"publisher","DOI":"10.1162\/TACL_A_00309"},{"key":"e_1_3_2_176_2","first-page":"2660","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL\u201922)","author":"Wu Yuxiang","year":"2022","unstructured":"Yuxiang Wu, Matt Gardner, Pontus Stenetorp, and Pradeep Dasigi. 2022. Generating data to mitigate spurious correlations in natural language inference datasets. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL\u201922), Smaranda Muresan, Preslav Nakov, and Aline Villavicencio (Eds.). Association for Computational Linguistics, 2660\u20132676. DOI:10.18653\/v1\/2022.acl-long.190"},{"key":"e_1_3_2_177_2","volume-title":"The 10th International Conference on Learning Representations (ICLR\u201922)","author":"Xie Sang Michael","year":"2022","unstructured":"Sang Michael Xie, Aditi Raghunathan, Percy Liang, and Tengyu Ma. 2022. An explanation of in-context learning as implicit Bayesian inference. In The 10th International Conference on Learning Representations (ICLR\u201922). OpenReview.net. https:\/\/openreview.net\/forum?id=RdJVFCHjUMI"},{"key":"e_1_3_2_178_2","first-page":"4762","volume-title":"Proceedings of the 28th International Conference on Computational Linguistics (COLING\u201920)","author":"Xu Liang","year":"2020","unstructured":"Liang Xu, Hai Hu, Xuanwei Zhang, Lu Li, Chenjie Cao, Yudong Li, Yechen Xu, Kai Sun, Dian Yu, Cong Yu, Yin Tian, Qianqian Dong, Weitang Liu, Bo Shi, Yiming Cui, Junyi Li, Jun Zeng, Rongzhao Wang, Weijian Xie, Yanting Li, Yina Patterson, Zuoyu Tian, Yiwen Zhang, He Zhou, Shaoweihua Liu, Zhe Zhao, Qipeng Zhao, Cong Yue, Xinrui Zhang, Zhengliang Yang, Kyle Richardson, and Zhenzhong Lan. 2020. CLUE: A Chinese language understanding evaluation benchmark. In Proceedings of the 28th International Conference on Computational Linguistics (COLING\u201920), Donia Scott, N\u00faria Bel, and Chengqing Zong (Eds.). International Committee on Computational Linguistics, 4762\u20134772. DOI:10.18653\/V1\/2020.COLING-MAIN.419"},{"key":"e_1_3_2_179_2","doi-asserted-by":"crossref","first-page":"31","DOI":"10.18653\/v1\/W19-4804","volume-title":"Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP@ACL\u201919)","author":"Yanaka Hitomi","year":"2019","unstructured":"Hitomi Yanaka, Koji Mineshima, Daisuke Bekki, Kentaro Inui, Satoshi Sekine, Lasha Abzianidze, and Johan Bos. 2019. Can neural networks understand monotonicity reasoning? In Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP@ACL\u201919), Tal Linzen, Grzegorz Chrupala, Yonatan Belinkov, and Dieuwke Hupkes (Eds.). Association for Computational Linguistics, 31\u201340. DOI:10.18653\/v1\/W19-4804"},{"key":"e_1_3_2_180_2","first-page":"250","volume-title":"Proceedings of the 8th Joint Conference on Lexical and Computational Semantics (SEM@NAACL-HLT\u201919)","author":"Yanaka Hitomi","year":"2019","unstructured":"Hitomi Yanaka, Koji Mineshima, Daisuke Bekki, Kentaro Inui, Satoshi Sekine, Lasha Abzianidze, and Johan Bos. 2019. HELP: A dataset for identifying shortcomings of neural models in monotonicity reasoning. In Proceedings of the 8th Joint Conference on Lexical and Computational Semantics (SEM@NAACL-HLT\u201919), Rada Mihalcea, Ekaterina Shutova, Lun-Wei Ku, Kilian Evang, and Soujanya Poria (Eds.). Association for Computational Linguistics, 250\u2013255. DOI:10.18653\/v1\/s19-1027"},{"key":"e_1_3_2_181_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2205.12443"},{"key":"e_1_3_2_182_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2212.10923"},{"key":"e_1_3_2_183_2","first-page":"2369","volume-title":"EMNLP","author":"Yang Zhilin","year":"2018","unstructured":"Zhilin Yang, Peng Qi, Saizheng Zhang, Yoshua Bengio, William W. Cohen, Ruslan Salakhutdinov, and Christopher D. Manning. 2018. HotpotQA: A dataset for diverse, explainable multi-hop question answering. In EMNLP. Association for Computational Linguistics, 2369\u20132380."},{"key":"e_1_3_2_184_2","first-page":"535","volume-title":"NAACL-HLT","author":"Yasunaga Michihiro","year":"2021","unstructured":"Michihiro Yasunaga, Hongyu Ren, Antoine Bosselut, Percy Liang, and Jure Leskovec. 2021. QA-GNN: Reasoning with language models and knowledge graphs for question answering. In NAACL-HLT. Association for Computational Linguistics, 535\u2013546."},{"key":"e_1_3_2_185_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2211.13892"},{"key":"e_1_3_2_186_2","doi-asserted-by":"crossref","first-page":"2115","DOI":"10.18653\/v1\/2021.emnlp-main.162","volume-title":"Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201921)","author":"Yin Da","year":"2021","unstructured":"Da Yin, Liunian Harold Li, Ziniu Hu, Nanyun Peng, and Kai-Wei Chang. 2021. Broaden the vision: Geo-diverse visual commonsense reasoning. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201921), Marie-Francine Moens, Xuanjing Huang, Lucia Specia, and Scott Wen-tau Yih (Eds.). Association for Computational Linguistics, 2115\u20132129. DOI:10.18653\/v1\/2021.emnlp-main.162"},{"key":"e_1_3_2_187_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.findings-acl.435"},{"key":"e_1_3_2_188_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.findings-acl.19"},{"key":"e_1_3_2_189_2","first-page":"6729","volume-title":"ACL","author":"Yu Jianxing","year":"2020","unstructured":"Jianxing Yu, Wei Liu, Shuang Qiu, Qinliang Su, Kai Wang, Xiaojun Quan, and Jian Yin. 2020. Low-resource generation of multi-hop reasoning questions. In ACL. Association for Computational Linguistics, 6729\u20136739."},{"key":"e_1_3_2_190_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2212.08286"},{"key":"e_1_3_2_191_2","volume-title":"ICLR","author":"Yu Weihao","year":"2020","unstructured":"Weihao Yu, Zihang Jiang, Yanfei Dong, and Jiashi Feng. 2020. ReClor: A reading comprehension dataset requiring logical reasoning. In ICLR. OpenReview.net."},{"key":"e_1_3_2_192_2","article-title":"STaR: Bootstrapping reasoning with reasoning","volume":"2203","author":"Zelikman Eric","year":"2022","unstructured":"Eric Zelikman, Yuhuai Wu, Jesse Mu, and Noah D. Goodman. 2022. STaR: Bootstrapping reasoning with reasoning. CoRR abs\/2203.14465 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_193_2","doi-asserted-by":"crossref","first-page":"93","DOI":"10.18653\/v1\/D18-1009","volume-title":"Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing","author":"Zellers Rowan","year":"2018","unstructured":"Rowan Zellers, Yonatan Bisk, Roy Schwartz, and Yejin Choi. 2018. SWAG: A large-scale adversarial dataset for grounded commonsense inference. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Ellen Riloff, David Chiang, Julia Hockenmaier, and Jun\u2019ichi Tsujii (Eds.). Association for Computational Linguistics, 93\u2013104. DOI:10.18653\/v1\/d18-1009"},{"key":"e_1_3_2_194_2","first-page":"4791","volume-title":"Proceedings of the 57th Conference of the Association for Computational Linguistics (ACL\u201919), Volume 1: Long Papers","author":"Zellers Rowan","year":"2019","unstructured":"Rowan Zellers, Ari Holtzman, Yonatan Bisk, Ali Farhadi, and Yejin Choi. 2019. HellaSwag: Can a machine really finish your sentence?. In Proceedings of the 57th Conference of the Association for Computational Linguistics (ACL\u201919), Volume 1: Long Papers, Anna Korhonen, David R. Traum, and Llu\u00eds M\u00e0rquez (Eds.). Association for Computational Linguistics, 4791\u20134800. DOI:10.18653\/v1\/p19-1472"},{"key":"e_1_3_2_195_2","article-title":"On the paradox of learning to reason from data","volume":"2205","author":"Zhang Honghua","year":"2022","unstructured":"Honghua Zhang, Liunian Harold Li, Tao Meng, Kai-Wei Chang, and Guy Van den Broeck. 2022. On the paradox of learning to reason from data. CoRR abs\/2205.11502 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_196_2","first-page":"4630","volume-title":"EMNLP","author":"Zhang Li","year":"2020","unstructured":"Li Zhang, Qing Lyu, and Chris Callison-Burch. 2020. Reasoning about goals, steps, and temporal ordering with WikiHow. In EMNLP. Association for Computational Linguistics, 4630\u20134639."},{"key":"e_1_3_2_197_2","volume-title":"ICLR","author":"Zhang Xikun","year":"2022","unstructured":"Xikun Zhang, Antoine Bosselut, Michihiro Yasunaga, Hongyu Ren, Percy Liang, Christopher D. Manning, and Jure Leskovec. 2022. GreaseLM: Graph REASoning enhanced language models. In ICLR. OpenReview.net."},{"key":"e_1_3_2_198_2","article-title":"Automatic chain of thought prompting in large language models","volume":"2210","author":"Zhang Zhuosheng","year":"2022","unstructured":"Zhuosheng Zhang, Aston Zhang, Mu Li, and Alex Smola. 2022. Automatic chain of thought prompting in large language models. CoRR abs\/2210.03493 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_199_2","volume-title":"ICLR","author":"Zhao Chen","year":"2020","unstructured":"Chen Zhao, Chenyan Xiong, Corby Rosset, Xia Song, Paul N. Bennett, and Saurabh Tiwary. 2020. Transformer-XH: Multi-evidence reasoning with eXtra hop attention. In ICLR. OpenReview.net."},{"key":"e_1_3_2_200_2","first-page":"8881","volume-title":"EMNLP","author":"Zheng Chen","year":"2020","unstructured":"Chen Zheng and Parisa Kordjamshidi. 2020. SRLGRN: Semantic role labeling graph reasoning network. In EMNLP. Association for Computational Linguistics, 8881\u20138891."},{"key":"e_1_3_2_201_2","first-page":"2310","volume-title":"Proceedings of the 57th Conference of the Association for Computational Linguistics (ACL\u201919), Volume 1: Long Papers","author":"Zhong Victor","year":"2019","unstructured":"Victor Zhong and Luke Zettlemoyer. 2019. E3: Entailment-driven extracting and editing for conversational machine reading. In Proceedings of the 57th Conference of the Association for Computational Linguistics (ACL\u201919), Volume 1: Long Papers, Anna Korhonen, David R. Traum, and Llu\u00eds M\u00e0rquez (Eds.). Association for Computational Linguistics, 2310\u20132320. DOI:10.18653\/v1\/p19-1223"},{"key":"e_1_3_2_202_2","article-title":"Disentangling reasoning capabilities from language models with compositional reasoning transformers","volume":"2210","author":"Zhong Wanjun","year":"2022","unstructured":"Wanjun Zhong, Tingting Ma, Jiahai Wang, Jian Yin, Tiejun Zhao, Chin-Yew Lin, and Nan Duan. 2022. Disentangling reasoning capabilities from language models with compositional reasoning transformers. CoRR abs\/2210.11265 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_203_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.findings-naacl.177"},{"key":"e_1_3_2_204_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2210.16865"},{"key":"e_1_3_2_205_2","article-title":"Least-to-most prompting enables complex reasoning in large language models","volume":"2205","author":"Zhou Denny","year":"2022","unstructured":"Denny Zhou, Nathanael Sch\u00e4rli, Le Hou, Jason Wei, Nathan Scales, Xuezhi Wang, Dale Schuurmans, Claire Cui, Olivier Bousquet, Quoc Le, and Ed Chi. 2022. Least-to-most prompting enables complex reasoning in large language models. CoRR abs\/2205.10625 (2022).","journal-title":"CoRR"},{"key":"e_1_3_2_206_2","doi-asserted-by":"crossref","first-page":"7560","DOI":"10.18653\/v1\/2021.emnlp-main.598","volume-title":"Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201921)","author":"Zhou Pei","year":"2021","unstructured":"Pei Zhou, Rahul Khanna, Seyeon Lee, Bill Yuchen Lin, Daniel Ho, Jay Pujara, and Xiang Ren. 2021. RICA: Evaluating robust inference capabilities based on commonsense axioms. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP\u201921), Marie-Francine Moens, Xuanjing Huang, Lucia Specia, and Scott Wen-tau Yih (Eds.). Association for Computational Linguistics, 7560\u20137579. DOI:10.18653\/v1\/2021.emnlp-main.598"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3664194","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3664194","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T18:44:02Z","timestamp":1750272242000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3664194"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,10,3]]},"references-count":205,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2024,12,31]]}},"alternative-id":["10.1145\/3664194"],"URL":"https:\/\/doi.org\/10.1145\/3664194","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,10,3]]},"assertion":[{"value":"2023-05-06","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-04-26","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-10-03","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}