{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,13]],"date-time":"2026-06-13T04:22:09Z","timestamp":1781324529405,"version":"3.54.1"},"reference-count":175,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2024,12,24]],"date-time":"2024-12-24T00:00:00Z","timestamp":1734998400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2025,4,30]]},"abstract":"<jats:p>\n            Humans possess an extraordinary ability to create and utilize tools. With the advent of foundation models, artificial intelligence systems have the potential to be equally adept in tool use as humans. This paradigm, which is dubbed as\n            <jats:italic>tool learning with foundation models<\/jats:italic>\n            , combines the strengths of tools and foundation models to achieve enhanced accuracy, efficiency, and automation in problem-solving. This article presents a systematic investigation and comprehensive review of tool learning. We first introduce the background of tool learning, including its cognitive origins, the paradigm shift of foundation models, and the complementary roles of tools and models. Then we recapitulate existing tool learning research and formulate a general framework: starting from understanding the user instruction, models should learn to decompose a complex task into several subtasks, dynamically adjust their plan through reasoning, and effectively conquer each sub-task by selecting appropriate tools. We also discuss how to train models for improved tool-use capabilities and facilitate generalization in tool learning. Finally, we discuss several open problems that require further investigation, such as ensuring trustworthy tool use, enabling tool creation with foundation models, and addressing personalization challenges. Overall, we hope this article could inspire future research in integrating tools with foundation models.\n          <\/jats:p>","DOI":"10.1145\/3704435","type":"journal-article","created":{"date-parts":[[2024,11,21]],"date-time":"2024-11-21T11:15:42Z","timestamp":1732187742000},"page":"1-40","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":104,"title":["Tool Learning with Foundation Models"],"prefix":"10.1145","volume":"57","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3608-5061","authenticated-orcid":false,"given":"Yujia","family":"Qin","sequence":"first","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-8037-2055","authenticated-orcid":false,"given":"Shengding","family":"Hu","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0151-6178","authenticated-orcid":false,"given":"Yankai","family":"Lin","sequence":"additional","affiliation":[{"name":"Renmin University of China, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-5882-8870","authenticated-orcid":false,"given":"Weize","family":"Chen","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8758-9484","authenticated-orcid":false,"given":"Ning","family":"Ding","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6385-8547","authenticated-orcid":false,"given":"Ganqu","family":"Cui","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6901-5292","authenticated-orcid":false,"given":"Zheni","family":"Zeng","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2285-7836","authenticated-orcid":false,"given":"Xuanhe","family":"Zhou","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1040-2604","authenticated-orcid":false,"given":"Yufei","family":"Huang","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6039-0942","authenticated-orcid":false,"given":"Chaojun","family":"Xiao","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6235-5841","authenticated-orcid":false,"given":"Chi","family":"Han","sequence":"additional","affiliation":[{"name":"University of Illinois at Urbana-Champaign, Urbana, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-1869-0363","authenticated-orcid":false,"given":"Yi Ren","family":"Fung","sequence":"additional","affiliation":[{"name":"University of Illinois at Urbana-Champaign, Urbana, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9509-9573","authenticated-orcid":false,"given":"Yusheng","family":"Su","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3260-4119","authenticated-orcid":false,"given":"Huadong","family":"Wang","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9913-820X","authenticated-orcid":false,"given":"Cheng","family":"Qian","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-7885-1490","authenticated-orcid":false,"given":"Runchu","family":"Tian","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-9107-7401","authenticated-orcid":false,"given":"Kunlun","family":"Zhu","sequence":"additional","affiliation":[{"name":"OpenBMB Inc., Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-9877-3363","authenticated-orcid":false,"given":"Shihao","family":"Liang","sequence":"additional","affiliation":[{"name":"OpenBMB Inc., Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-1597-4201","authenticated-orcid":false,"given":"Xingyu","family":"Shen","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-0661-507X","authenticated-orcid":false,"given":"Bokai","family":"Xu","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-5585-9114","authenticated-orcid":false,"given":"Zhen","family":"Zhang","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-1390-4311","authenticated-orcid":false,"given":"Yining","family":"Ye","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9444-7652","authenticated-orcid":false,"given":"Bowen","family":"Li","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-8385-4296","authenticated-orcid":false,"given":"Ziwei","family":"Tang","sequence":"additional","affiliation":[{"name":"Beijing University of Posts and Telecommunications, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7424-9605","authenticated-orcid":false,"given":"Jing","family":"Yi","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-1441-7613","authenticated-orcid":false,"given":"Yuzhang","family":"Zhu","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-3084-5882","authenticated-orcid":false,"given":"Zhenning","family":"Dai","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4317-0504","authenticated-orcid":false,"given":"Lan","family":"Yan","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2370-306X","authenticated-orcid":false,"given":"Xin","family":"Cong","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-0197-9284","authenticated-orcid":false,"given":"Yaxi","family":"Lu","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8016-1952","authenticated-orcid":false,"given":"Weilin","family":"Zhao","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-9208-9840","authenticated-orcid":false,"given":"Yuxiang","family":"Huang","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-2974-7321","authenticated-orcid":false,"given":"Junxi","family":"Yan","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4726-7621","authenticated-orcid":false,"given":"Xu","family":"Han","sequence":"additional","affiliation":[{"name":"Computer Science, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-6005-3206","authenticated-orcid":false,"given":"Xian","family":"Sun","sequence":"additional","affiliation":[{"name":"Zhihu Inc., Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-8934-0759","authenticated-orcid":false,"given":"Dahai","family":"Li","sequence":"additional","affiliation":[{"name":"Zhihu Inc., Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3522-1869","authenticated-orcid":false,"given":"Jason","family":"Phang","sequence":"additional","affiliation":[{"name":"New York University, New York, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7821-0030","authenticated-orcid":false,"given":"Cheng","family":"Yang","sequence":"additional","affiliation":[{"name":"Beijing University of Posts and Telecommunications, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1630-0588","authenticated-orcid":false,"given":"Tongshuang","family":"Wu","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0464-7966","authenticated-orcid":false,"given":"Heng","family":"Ji","sequence":"additional","affiliation":[{"name":"University of Illinois at Urbana-Champaign, Urbana, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1398-0621","authenticated-orcid":false,"given":"Guoliang","family":"Li","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7709-2543","authenticated-orcid":false,"given":"Zhiyuan","family":"Liu","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6011-6115","authenticated-orcid":false,"given":"Maosong","family":"Sun","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2024,12,24]]},"reference":[{"key":"e_1_3_4_2_2","unstructured":"Michael Ahn Anthony Brohan Noah Brown Yevgen Chebotar Omar Cortes Byron David Chelsea Finn Keerthana Gopalakrishnan Karol Hausman Alex Herzog et\u00a0al. 2022. Do as I can not as I say: Grounding language in robotic affordances. arXiv:2204.01691. Retrieved from https:\/\/arxiv.org\/abs\/2204.01691"},{"key":"e_1_3_4_3_2","unstructured":"Ilge Akkaya Marcin Andrychowicz Maciek Chociej Mateusz Litwin Bob McGrew Arthur Petron Alex Paino Matthias Plappert Glenn Powell Raphael Ribas et\u00a0al. 2019. Solving Rubik\u2019s cube with a robot hand. arXiv:1910.07113. Retrieved from https:\/\/arxiv.org\/abs\/1910.07113"},{"key":"e_1_3_4_4_2","doi-asserted-by":"crossref","unstructured":"Kelsey R. Allen Kevin A. Smith and Joshua B. Tenenbaum. 2020. Rapid trial-and-error learning with simulation supports flexible tool use and physical reasoning. Proceedings of the National Academy of Sciences 117 47 (2020) 29302\u201329310.","DOI":"10.1073\/pnas.1912341117"},{"issue":"1","key":"e_1_3_4_5_2","first-page":"S135\u2013S147","article-title":"Coevolution of composite-tool technology, constructive memory, and language: Implications for the evolution of modern human behavior","volume":"51","author":"Ambrose Stanley H.","year":"2010","unstructured":"Stanley H. Ambrose. 2010. Coevolution of composite-tool technology, constructive memory, and language: Implications for the evolution of modern human behavior. Current Anthropology 51, S1 (2010), S135\u2013S147.","journal-title":"Current Anthropology"},{"key":"e_1_3_4_6_2","unstructured":"Dario Amodei Chris Olah Jacob Steinhardt Paul Christiano John Schulman and Dan Man\u00e9. 2016. Concrete Problems in AI Safety. ArXiv preprint abs\/1606.06565 (2016). https:\/\/arxiv.org\/abs\/1606.06565"},{"key":"e_1_3_4_7_2","first-page":"103","volume-title":"Machine Intelligence 15","author":"Bain Michael","year":"1995","unstructured":"Michael Bain and Claude Sammut. 1995. A framework for behavioural cloning. Machine Intelligence 15. 103\u2013129."},{"key":"e_1_3_4_8_2","unstructured":"Bowen Baker Ilge Akkaya Peter Zhokhov Joost Huizinga Jie Tang Adrien Ecoffet Brandon Houghton Raul Sampedro and Jeff Clune. 2022. Video pretraining (VPT): Learning to act by watching unlabeled online videos. In Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022 NeurIPS 2022 New Orleans LA USA November 28 - December 9 2022 Sanmi Koyejo S. Mohamed A. Agarwal Danielle Belgrave K. Cho and A. Oh (Eds.). Vol. 35. Curran Associates Inc. 24639\u201324654. http:\/\/papers.nips.cc\/paper_files\/paper\/2022\/hash\/9c7008aff45b5d8f0973b23e1a22ada0-Abstract-Conference.html"},{"key":"e_1_3_4_9_2","unstructured":"Bowen Baker Ingmar Kanitscheider Todor M. Markov Yi Wu Glenn Powell Bob McGrew and Igor Mordatch. 2020. Emergent tool use from multi agent autocurricula. In 8th International Conference on Learning Representations ICLR 2020 Addis Ababa Ethiopia April 26-30 2020. OpenReview.net. https:\/\/openreview.net\/forum?id=SkxpxJBKwS"},{"key":"e_1_3_4_10_2","doi-asserted-by":"publisher","DOI":"10.1145\/1553374.1553380"},{"key":"e_1_3_4_11_2","unstructured":"Christopher Berner Greg Brockman Brooke Chan Vicki Cheung Przemys\u0142aw D\u0119biak Christy Dennison David Farhi Quirin Fischer Shariq Hashme Chris Hesse et\u00a0al. 2019. Dota 2 with large scale deep reinforcement learning. arXiv:1912.06680. Retrieved from https:\/\/arxiv.org\/abs\/1912.06680"},{"issue":"1","key":"e_1_3_4_12_2","doi-asserted-by":"crossref","first-page":"1515","DOI":"10.1038\/s41598-018-38392-8","article-title":"Learning curves and teaching when acquiring nut-cracking in humans and chimpanzees","volume":"9","author":"Boesch Christophe","year":"2019","unstructured":"Christophe Boesch, Da\u0161a Bombjakov\u00e1, Amelia Meier, and Roger Mundry. 2019. Learning curves and teaching when acquiring nut-cracking in humans and chimpanzees. Scientific Reports 9, 1 (2019), 1515.","journal-title":"Scientific Reports"},{"key":"e_1_3_4_13_2","unstructured":"Daniil A. Boiko Robert MacKnight and Gabe Gomes. 2023. Emergent autonomous scientific research capabilities of large language models. arXiv:2304.05332. Retrieved from https:\/\/arxiv.org\/abs\/2304.05332"},{"key":"e_1_3_4_14_2","unstructured":"Rishi Bommasani Drew A. Hudson Ehsan Adeli Russ Altman Simran Arora Sydney von Arx Michael S. Bernstein Jeannette Bohg Antoine Bosselut Emma Brunskill et\u00a0al. 2021. On the opportunities and risks of foundation models. arXiv:2108.07258. Retrieved from https:\/\/arxiv.org\/abs\/2108.07258"},{"key":"e_1_3_4_15_2","doi-asserted-by":"crossref","unstructured":"Ali Borji. 2023. A categorical archive of ChatGPT failures. arXiv:2302.03494. Retrieved from https:\/\/arxiv.org\/abs\/2302.03494","DOI":"10.21203\/rs.3.rs-2895792\/v1"},{"key":"e_1_3_4_16_2","unstructured":"Tom B. Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared Kaplan Prafulla Dhariwal Arvind Neelakantan Pranav Shyam Girish Sastry Amanda Askell et\u00a0al. 2020. Language models are few-shot learners. In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020 NeurIPS 2020 December 6-12 2020 virtual Hugo Larochelle Marc\u2019Aurelio Ranzato Raia Hadsell Maria-Florina Balcan and Hsuan-Tien Lin (Eds.). Vol. 33. Curran Associates Inc. 1877\u20131901."},{"key":"e_1_3_4_17_2","unstructured":"S\u00e9bastien Bubeck Varun Chandrasekaran Ronen Eldan Johannes Gehrke Eric Horvitz Ece Kamar Peter Lee Yin Tat Lee Yuanzhi Li Scott Lundberg et\u00a0al. 2023. Sparks of artificial general intelligence: Early experiments with GPT-4. arXiv:2303.12712. Retrieved from https:\/\/arxiv.org\/abs\/2303.12712"},{"key":"e_1_3_4_18_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41586-020-2442-2"},{"key":"e_1_3_4_19_2","unstructured":"Nicholas Carlini Jamie Hayes Milad Nasr Matthew Jagielski Vikash Sehwag Florian Tram\u00e8r Borja Balle Daphne Ippolito and Eric Wallace. 2023. Extracting training data from diffusion models. (Aug. 2023) 5253\u20135270. https:\/\/www.usenix.org\/conference\/usenixsecurity23\/presentation\/carlini"},{"key":"e_1_3_4_20_2","unstructured":"Nicholas Carlini Daphne Ippolito Matthew Jagielski Katherine Lee Florian Tram\u00e8r and Chiyuan Zhang. 2023. Quantifying memorization across neural language models. In The Eleventh International Conference on Learning Representations ICLR 2023 Kigali Rwanda May 1-5 2023. OpenReview.net. https:\/\/openreview.net\/pdf?id=TatRHT_1cK"},{"key":"e_1_3_4_21_2","first-page":"2633","volume-title":"Proceedings of the 30th USENIX Security Symposium (USENIX Security 21)","author":"Carlini Nicholas","year":"2021","unstructured":"Nicholas Carlini, Florian Tramer, Eric Wallace, Matthew Jagielski, Ariel Herbert-Voss, Katherine Lee, Adam Roberts, Tom Brown, Dawn Song, Ulfar Erlingsson, et\u00a0al. 2021. Extracting training data from large language models. In Proceedings of the 30th USENIX Security Symposium (USENIX Security 21). 2633\u20132650."},{"key":"e_1_3_4_22_2","unstructured":"Baian Chen Chang Shu Ehsan Shareghi Nigel Collier Karthik Narasimhan and Shunyu Yao. 2023. Fireact: Toward language agent fine-tuning. ArXiv preprint abs\/2310.05915 (2023). https:\/\/arxiv.org\/abs\/2310.05915"},{"key":"e_1_3_4_23_2","unstructured":"Mark Chen Jerry Tworek Heewoo Jun Qiming Yuan Henrique Ponde de Oliveira Pinto Jared Kaplan Harri Edwards Yuri Burda Nicholas Joseph Greg Brockman et\u00a0al. 2021. Evaluating large language models trained on code. arXiv:2107.03374. Retrieved from https:\/\/arxiv.org\/abs\/2107.03374"},{"key":"e_1_3_4_24_2","unstructured":"Wenhu Chen Xueguang Ma Xinyi Wang and William W. Cohen. 2022. Program of thoughts prompting: Disentangling computation from reasoning for numerical reasoning tasks. arXiv:2211.12588. Retrieved from https:\/\/arxiv.org\/abs\/2211.12588"},{"key":"e_1_3_4_25_2","doi-asserted-by":"publisher","unstructured":"Yangyi Chen Lifan Yuan Ganqu Cui Zhiyuan Liu and Heng Ji. 2023. A close look into the calibration of pre-trained language models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Anna Rogers Jordan Boyd-Graber and Naoaki Okazaki (Eds.). Association for Computational Linguistics Toronto Canada 1343\u20131367. 10.18653\/v1\/2023.acl-long.75","DOI":"10.18653\/v1\/2023.acl-long.75"},{"key":"e_1_3_4_26_2","volume-title":"Advances in Neural Information Processing Systems","author":"Christiano Paul F.","year":"2017","unstructured":"Paul F. Christiano, Jan Leike, Tom Brown, Miljan Martic, Shane Legg, and Dario Amodei. 2017. Deep reinforcement learning from human preferences. In Advances in Neural Information Processing Systems, I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.). Vol. 30. Curran Associates, Inc."},{"key":"e_1_3_4_27_2","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1007\/978-1-4419-5546-3_13","article-title":"Metacognition and the use of tools","author":"Clarebout Geraldine","year":"2013","unstructured":"Geraldine Clarebout, Jan Elen, Norma A. Juarez Collazo, Griet Lust, and Lai Jiang. 2013. Metacognition and the use of tools. In International Handbook of Metacognition and Learning Technologies. 187\u2013195.","journal-title":"International Handbook of Metacognition and Learning Technologies"},{"key":"e_1_3_4_28_2","unstructured":"Karl Cobbe Vineet Kosaraju Mohammad Bavarian Mark Chen Heewoo Jun Lukasz Kaiser Matthias Plappert Jerry Tworek Jacob Hilton Reiichiro Nakano et\u00a0al. 2021. Training verifiers to solve math word problems. arXiv:2110.14168. Retrieved from https:\/\/arxiv.org\/abs\/2110.14168"},{"key":"e_1_3_4_29_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00942"},{"key":"e_1_3_4_30_2","unstructured":"Antonia Creswell Murray Shanahan and Irina Higgins. 2023. Selection-inference: Exploiting large language models for interpretable logical reasoning. In The Eleventh International Conference on Learning Representations ICLR 2023 Kigali Rwanda May 1-5 2023. OpenReview.net. https:\/\/openreview.net\/pdf?id=3Pf3Wg6o-A4"},{"key":"e_1_3_4_31_2","article-title":"A unified evaluation of textual backdoor learning: Frameworks and benchmarks","author":"Cui Ganqu","year":"2022","unstructured":"Ganqu Cui, Lifan Yuan, Bingxiang He, Yangyi Chen, Zhiyuan Liu, and Maosong Sun. 2022. A unified evaluation of textual backdoor learning: Frameworks and benchmarks. Proceedings of the International Conference on Neural Information Processing Systems .","journal-title":"Proceedings of the International Conference on Neural Information Processing Systems"},{"key":"e_1_3_4_32_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-1423"},{"key":"e_1_3_4_33_2","unstructured":"Danny Driess Fei Xia Mehdi S. M. Sajjadi Corey Lynch Aakanksha Chowdhery Brian Ichter Ayzaan Wahid Jonathan Tompson Quan Vuong Tianhe Yu et\u00a0al. 2023. PaLM-E: An embodied multimodal language model. In International Conference on Machine Learning ICML 2023 23-29 July 2023 Honolulu Hawaii USA (Proceedings of Machine Learning Research Vol. 202) Andreas Krause Emma Brunskill Kyunghyun Cho Barbara Engelhardt Sivan Sabato and Jonathan Scarlett (Eds.). PMLR 8469\u20138488. https:\/\/proceedings.mlr.press\/v202\/driess23a.html"},{"key":"e_1_3_4_34_2","doi-asserted-by":"crossref","unstructured":"Andreas K. Engel Alexander Maye Martin Kurthen and Peter K\u00f6nig. 2013. Where\u2019s the action? The pragmatic turn in cognitive science. Trends in Cognitive Sciences 17 5 (2013) 202\u2013209.","DOI":"10.1016\/j.tics.2013.03.006"},{"key":"e_1_3_4_35_2","first-page":"267","article-title":"What does it take for an infant to learn how to use a tool by observation?","volume":"7","author":"Fagard Jacqueline","year":"2016","unstructured":"Jacqueline Fagard, Lauriane Rat-Fischer, Rana Esseily, Eszter Somogyi, and J. K. O\u2019Regan. 2016. What does it take for an infant to learn how to use a tool by observation? Frontiers in Psychology 7 (2016), 267.","journal-title":"Frontiers in Psychology"},{"issue":"3","key":"e_1_3_4_36_2","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1016\/S0010-9452(08)70462-3","article-title":"What puts the how in where? Tool use and the divided visual streams hypothesis","volume":"43","author":"Frey Scott H.","year":"2007","unstructured":"Scott H. Frey. 2007. What puts the how in where? Tool use and the divided visual streams hypothesis. Cortex 43, 3 (2007), 368\u2013375.","journal-title":"Cortex"},{"key":"e_1_3_4_37_2","unstructured":"Daniel Fried Armen Aghajanyan Jessy Lin Sida Wang Eric Wallace Freda Shi Ruiqi Zhong Scott Yih Luke Zettlemoyer and Mike Lewis. 2023. InCoder: A generative model for code infilling and synthesis. In The Eleventh International Conference on Learning Representations ICLR 2023 Kigali Rwanda May 1-5 2023. OpenReview.net. https:\/\/openreview.net\/pdf?id=hQwb-lbM6EL"},{"key":"e_1_3_4_38_2","unstructured":"Luyu Gao Aman Madaan Shuyan Zhou Uri Alon Pengfei Liu Yiming Yang Jamie Callan and Graham Neubig. 2023. PAL: Program-aided language models. In International Conference on Machine Learning ICML 2023 23-29 July 2023 Honolulu Hawaii USA (Proceedings of Machine Learning Research Vol. 202) Andreas Krause Emma Brunskill Kyunghyun Cho Barbara Engelhardt Sivan Sabato and Jonathan Scarlett (Eds.). PMLR 10764\u201310799. https:\/\/proceedings.mlr.press\/v202\/gao23f.html"},{"key":"e_1_3_4_39_2","volume-title":"Tools, Language and Cognition in Human Evolution","author":"Gibson Kathleen R.","year":"1993","unstructured":"Kathleen R. Gibson, Kathleen Rita Gibson, and Tim Ingold. 1993. Tools, Language and Cognition in Human Evolution. Cambridge University Press."},{"key":"e_1_3_4_40_2","unstructured":"Kelvin Guu Kenton Lee Zora Tung Panupong Pasupat and Ming-Wei Chang. 2020. Retrieval augmented language model pre-training. In Proceedings of the 37th International Conference on Machine Learning ICML 2020 13-18 July 2020 Virtual Event (Proceedings of Machine Learning Research Vol. 119). PMLR 3929\u20133938. http:\/\/proceedings.mlr.press\/v119\/guu20a.html"},{"key":"e_1_3_4_41_2","doi-asserted-by":"publisher","unstructured":"Xu Han Zhengyan Zhang Ning Ding Yuxian Gu Xiao Liu Yuqi Huo Jiezhong Qiu Yuan Yao andAo Zhang Liang Zhang et\u00a0al. 2021. Pre-Trained models: Past present and future. AI Open 2 5044 (2021) 225\u2013250. 10.1016\/j.aiopen.2021.08.002","DOI":"10.1016\/j.aiopen.2021.08.002"},{"key":"e_1_3_4_42_2","unstructured":"Nicklas Hansen Rishabh Jangir Yu Sun Guillem Aleny\u00e0 Pieter Abbeel Alexei A. Efros Lerrel Pinto and Xiaolong Wang. 2021. Self-supervised policy adaptation during deployment. In 9th International Conference on Learning Representations ICLR 2021 Virtual Event Austria May 3-7 2021. OpenReview.net. https:\/\/openreview.net\/forum?id=o_V-MjyyGV"},{"key":"e_1_3_4_43_2","unstructured":"Dan Hendrycks Nicholas Carlini John Schulman and Jacob Steinhardt. 2021. Unsolved problems in ML safety. arXiv:2109.13916. Retrieved from https:\/\/arxiv.org\/abs\/2109.13916"},{"key":"e_1_3_4_44_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.conb.2009.05.003"},{"key":"e_1_3_4_45_2","doi-asserted-by":"publisher","DOI":"10.4159\/9780674985155"},{"key":"e_1_3_4_46_2","unstructured":"Wenlong Huang Pieter Abbeel Deepak Pathak and Igor Mordatch. 2022. Language models as zero-shot planners: Extracting actionable knowledge for embodied agents. In International Conference on Machine Learning ICML 2022 17-23 July 2022 Baltimore Maryland USA (Proceedings of Machine Learning Research Vol. 162) Kamalika Chaudhuri Stefanie Jegelka Le Song Csaba Szepesv\u00e1ri Gang Niu and Sivan Sabato (Eds.). PMLR 9118\u20139147. https:\/\/proceedings.mlr.press\/v162\/huang22a.html"},{"key":"e_1_3_4_47_2","unstructured":"Wenlong Huang Fei Xia Ted Xiao Harris Chan Jacky Liang Pete Florence Andy Zeng Jonathan Tompson Igor Mordatch Yevgen Chebotar et\u00a0al. 2022. Inner monologue: Embodied reasoning through planning with language models. arXiv:2207.05608. Retrieved from https:\/\/arxiv.org\/abs\/2207.05608"},{"issue":"6562","key":"e_1_3_4_48_2","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1038\/379249a0","article-title":"Manufacture and use of hook-tools by New Caledonian crows","volume":"379","author":"Hunt Gavin R.","year":"1996","unstructured":"Gavin R. Hunt. 1996. Manufacture and use of hook-tools by New Caledonian crows. Nature 379, 6562 (1996), 249\u2013251.","journal-title":"Nature"},{"key":"e_1_3_4_49_2","doi-asserted-by":"publisher","DOI":"10.1145\/3054912"},{"key":"e_1_3_4_50_2","unstructured":"Srinivasan Iyer Xi Victoria Lin Ramakanth Pasunuru Todor Mihaylov D\u00e1niel Simig Ping Yu Kurt Shuster Tianlu Wang Qing Liu Punit Singh Koura et\u00a0al. 2022. OPT-IML: Scaling language model instruction meta learning through the lens of generalization. ArXiv preprint abs\/2212.12017 (2022). https:\/\/arxiv.org\/abs\/2212.12017"},{"key":"e_1_3_4_51_2","unstructured":"Gautier Izacard Patrick Lewis Maria Lomeli Lucas Hosseini Fabio Petroni Timo Schick Jane Dwivedi-Yu Armand Joulin Sebastian Riedel and Edouard Grave. 2022. Few-shot learning with retrieval augmented language models. arXiv:2208.03299. Retrieved from https:\/\/arxiv.org\/abs\/2208.03299"},{"key":"e_1_3_4_52_2","doi-asserted-by":"publisher","DOI":"10.1145\/1242572.1242739"},{"key":"e_1_3_4_53_2","first-page":"8018","volume-title":"Proceedings of the 34th AAAI Conference on Artificial Intelligence, AAAI 2020, The 32nd Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The 10th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020","author":"Jin Di","year":"2020","unstructured":"Di Jin, Zhijing Jin, Joey Tianyi Zhou, and Peter Szolovits. 2020. Is BERT really robust? A strong baseline for natural language attack on text classification and entailment. In Proceedings of the 34th AAAI Conference on Artificial Intelligence, AAAI 2020, The 32nd Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The 10th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020. AAAI Press, 8018\u20138025."},{"key":"e_1_3_4_54_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41586-021-03819-2"},{"key":"e_1_3_4_55_2","volume-title":"Groundwork for the Metaphysics of Morals","author":"Kant Immanuel","year":"2002","unstructured":"Immanuel Kant and Jerome B. Schneewind. 2002. Groundwork for the Metaphysics of Morals. Yale University Press."},{"key":"e_1_3_4_56_2","volume-title":"Proceedings of the 8th International Conference on Learning Representations, ICLR 2020","author":"Khandelwal Urvashi","year":"2020","unstructured":"Urvashi Khandelwal, Omer Levy, Dan Jurafsky, Luke Zettlemoyer, and Mike Lewis. 2020. Generalization through memorization: Nearest neighbor language models. In Proceedings of the 8th International Conference on Learning Representations, ICLR 2020. OpenReview.net."},{"key":"e_1_3_4_57_2","unstructured":"Hannah Rose Kirk Bertie Vidgen Paul R\u00f6ttger and Scott A. Hale. 2023. Personalisation within bounds: A risk taxonomy and policy framework for the alignment of large language models with personalised feedback. arXiv:2303.05453. Retrieved from https:\/\/arxiv.org\/abs\/2303.05453"},{"issue":"1","key":"e_1_3_4_58_2","article-title":"Origins of human intelligence: The chain of tool-making and brain evolution","volume":"22","author":"Ko Kwang Hyun","year":"2016","unstructured":"Kwang Hyun Ko. 2016. Origins of human intelligence: The chain of tool-making and brain evolution. Anthropological Notebooks 22, 1 (2016), 5\u201322.","journal-title":"Anthropological Notebooks"},{"key":"e_1_3_4_59_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-02131-2"},{"key":"e_1_3_4_60_2","doi-asserted-by":"publisher","unstructured":"Keita Kurita Paul Michel and Graham Neubig. 2020. Weight poisoning attacks on pretrained models. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics Dan Jurafsky Joyce Chai Natalie Schluter and Joel Tetreault (Eds.). Association for Computational Linguistics Online 2793\u20132806. 10.18653\/v1\/2020.acl-main.249","DOI":"10.18653\/v1\/2020.acl-main.249"},{"key":"e_1_3_4_61_2","doi-asserted-by":"crossref","first-page":"269","DOI":"10.4324\/9780203052594-16","volume-title":"Computers as Cognitive Tools","author":"Lajoie Susanne P.","year":"2013","unstructured":"Susanne P. Lajoie and Sharon J. Derry. 2013. Computer environments as cognitive tools for enhancing learning. In Computers as Cognitive Tools, Susanne P. Lajoie and Sharon J. Derry (Eds.). Routledge, 269\u2013296."},{"key":"e_1_3_4_62_2","first-page":"26337","article-title":"HyperTree proof search for neural theorem proving","author":"Lample Guillaume","year":"2022","unstructured":"Guillaume Lample, Timothee Lacroix, Marie-Anne Lachaux, Aurelien Rodriguez, Amaury Hayat, Thibaut Lavril, Gabriel Ebner, and Xavier Martinet. 2022. HyperTree proof search for neural theorem proving. In Proceedings of the 36th International Conference on Neural Information Processing System . 26337\u201326349.","journal-title":"Proceedings of the 36th International Conference on Neural Information Processing System"},{"key":"e_1_3_4_63_2","unstructured":"Angeliki Lazaridou Elena Gribovskaya Wojciech Stokowiec and Nikolai Grigorev. 2022. Internet-augmented language models through few-shot prompting for open-domain question answering. .arXiv:2203.05115. Retrieved from https:\/\/arxiv.org\/abs\/2203.05115"},{"key":"e_1_3_4_64_2","doi-asserted-by":"publisher","DOI":"10.1177\/0278364917710318"},{"key":"e_1_3_4_65_2","unstructured":"Patrick S. H. Lewis Ethan Perez Aleksandra Piktus Fabio Petroni Vladimir Karpukhin Naman Goyal Heinrich K\u00fcttler Mike Lewis Wen-tau Yih Tim Rockt\u00e4schel et\u00a0al. 2020. Retrieval-augmented generation for knowledge-intensive NLP tasks. In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020 NeurIPS 2020 December 6-12 2020 virtual Hugo Larochelle Marc\u2019Aurelio Ranzato Raia Hadsell Maria-Florina Balcan and Hsuan-Tien Lin (Eds.). Curran Associates Inc. https:\/\/proceedings.neurips.cc\/paper\/2020\/hash\/6b493230205f780e1bc26945df7481e5-Abstract.html"},{"key":"e_1_3_4_66_2","unstructured":"Shuang Li Xavier Puig Chris Paxton Yilun Du Clinton Wang Linxi Fan Tao Chen De-An Huang Ekin Aky\u00fcrek Anima Anandkumar et\u00a0al. 2022. Pre-trained language models for interactive decision-making. In Advances in Neural Information Processing Systems S. Koyejo S. Mohamed A. Agarwal D. Belgrave K. Cho and A. Oh (Eds.). Vol. 35. Curran Associates Inc. 31199\u201331212. https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2022\/file\/ca3b1f24fc0238edf5ed1ad226b9d655-Paper-Conference.pdf"},{"key":"e_1_3_4_67_2","doi-asserted-by":"publisher","unstructured":"Jacky Liang Wenlong Huang Fei Xia Peng Xu Karol Hausman Brian Ichter Pete Florence and Andy Zeng. 2023. Code as policies: Language model programs for embodied control. In 2023 IEEE International Conference on Robotics and Automation (ICRA). 9493\u20139500. 10.1109\/ICRA48891.2023.10160591","DOI":"10.1109\/ICRA48891.2023.10160591"},{"key":"e_1_3_4_68_2","unstructured":"Percy Liang Rishi Bommasani Tony Lee Dimitris Tsipras Dilara Soylu Michihiro Yasunaga Yian Zhang Deepak Narayanan Yuhuai Wu Ananya Kumar et\u00a0al. 2022. Holistic evaluation of language models. arXiv:2211.09110. Retrieved from https:\/\/arxiv.org\/abs\/2211.09110"},{"key":"e_1_3_4_69_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acl-long.229"},{"key":"e_1_3_4_70_2","doi-asserted-by":"crossref","unstructured":"Pantelis Linardatos Vasilis Papastefanopoulos and Sotiris Kotsiantis. 2020. Explainable AI: A review of machine learning interpretability methods. Entropy 23 1 (2020) 18.","DOI":"10.3390\/e23010018"},{"key":"e_1_3_4_71_2","unstructured":"Evan Zheran Liu Kelvin Guu Panupong Pasupat Tianlin Shi and Percy Liang. 2018. Reinforcement learning on Web interfaces using workflow-guided exploration. In 6th International Conference on Learning Representations ICLR 2018 Vancouver BC Canada April 30 - May 3 2018 Conference Track Proceedings. OpenReview.net. https:\/\/openreview.net\/forum?id=ryTp3f-0-"},{"key":"e_1_3_4_72_2","unstructured":"Jason Xinyu Liu Ziyi Yang Ifrah Idrees Sam Liang Benjamin Schornstein Stefanie Tellex and Ankit Shah. 2022. Lang2LTL: Translating Natural Language Commands to Temporal Robot Task Specification. (2022). https:\/\/openreview.net\/forum?id=VxfjGZzrdn"},{"key":"e_1_3_4_73_2","unstructured":"Ruibo Liu Jason Wei Shixiang Shane Gu Te-Yen Wu Soroush Vosoughi Claire Cui Denny Zhou and Andrew M. Dai. 2023. Mind\u2019s Eye: Grounded language model reasoning through simulation. In The Eleventh International Conference on Learning Representations ICLR 2023 Kigali Rwanda May 1-5 2023. OpenReview.net. https:\/\/openreview.net\/pdf?id=4rXMRuoJlai"},{"key":"e_1_3_4_74_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8462901"},{"key":"e_1_3_4_75_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.565"},{"issue":"2","key":"e_1_3_4_76_2","first-page":"195","article-title":"Learning to drive by imitation: An overview of deep behavior cloning methods","volume":"6","author":"Ly Abdoulaye O.","year":"2020","unstructured":"Abdoulaye O. Ly and Moulay Akhloufi. 2020. Learning to drive by imitation: An overview of deep behavior cloning methods. IEEE Transactions on Intelligent Vehicles 6, 2 (2020), 195\u2013209.","journal-title":"IEEE Transactions on Intelligent Vehicles"},{"key":"e_1_3_4_77_2","unstructured":"Aman Madaan Niket Tandon Prakhar Gupta Skyler Hallinan Luyu Gao Sarah Wiegreffe Uri Alon Nouha Dziri Shrimai Prabhumoye Yiming Yang et\u00a0al. 2023. Self-refine: Iterative refinement with self-feedback. In Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023 NeurIPS 2023 New Orleans LA USA December 10-16 2023 Alice Oh Tristan Naumann Amir Globerson Kate Saenko Moritz Hardt and Sergey Levine (Eds.). http:\/\/papers.nips.cc\/paper_files\/paper\/2023\/hash\/91edff07232fb1b55a505a9e9f6c0ff3-Abstract-Conference.html"},{"key":"e_1_3_4_78_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1542"},{"key":"e_1_3_4_79_2","volume-title":"AI in Context: The Labor of Integrating New Technologies","author":"Mateescu Alexandra","year":"2019","unstructured":"Alexandra Mateescu and Madeleine Elish. 2019. AI in Context: The Labor of Integrating New Technologies. Technical Report. Data & Society Research Institute."},{"key":"e_1_3_4_80_2","article-title":"Matlab","author":"Matlab Starting","year":"2012","unstructured":"Starting Matlab. 2012. Matlab. The MathWorks, Natick, MA .","journal-title":"The MathWorks, Natick, MA"},{"key":"e_1_3_4_81_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1298"},{"key":"e_1_3_4_82_2","unstructured":"Jacob Menick Maja Trebacz Vladimir Mikulik John Aslanides Francis Song Martin Chadwick Mia Glaese.Susannah Young Lucy Campbell-G;illingham et\u00a0al. 2022. Teaching language models to support answers with verifiedquotes. arXiv:2203.11147. https:\/\/arxiv.org\/abs\/2203.11147"},{"key":"e_1_3_4_83_2","unstructured":"Gr\u00e9goire Mialon Roberto Dess\u00ec Maria Lomeli Christoforos Nalmpantis Ram Pasunuru Roberta Raileanu Baptiste Rozi\u00e8re Timo Schick Jane Dwivedi-Yu Asli Celikyilmaz et\u00a0al. 2023. Augmented language models: A survey. arXiv:2302.07842. Retrieved from https:\/\/arxiv.org\/abs\/2302.07842"},{"key":"e_1_3_4_84_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-2050"},{"key":"e_1_3_4_85_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1238"},{"key":"e_1_3_4_86_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acl-long.244"},{"key":"e_1_3_4_87_2","volume-title":"The Prehistory of the Mind: The Cognitive Origins of Art and Science","author":"Mithen Steven","year":"1996","unstructured":"Steven Mithen. 1996. The Prehistory of the Mind: The Cognitive Origins of Art and Science. Thames & Hudson Ltd."},{"key":"e_1_3_4_88_2","unstructured":"Volodymyr Mnih Koray Kavukcuoglu David Silver Alex Graves Ioannis Antonoglou Daan Wierstra and Martin Riedmiller. 2013. Playing atari with deep reinforcement learning. arXiv:1312.5602. Retrieved from https:\/\/arxiv.org\/abs\/1312.5602"},{"key":"e_1_3_4_89_2","doi-asserted-by":"publisher","DOI":"10.1075\/li.30.1.03nad"},{"key":"e_1_3_4_90_2","unstructured":"Reiichiro Nakano Jacob Hilton Suchir Balaji Jeff Wu Long Ouyang Christina Kim Christopher Hesse Shantanu Jain Vineet Kosaraju William Saunders et\u00a0al. 2021. WebGPT: Browser-assisted question-answering with human feedback. arXiv:2112.09332. https:\/\/arxiv.org\/abs\/2112.09332"},{"key":"e_1_3_4_91_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4614-3223-4_3#citeas"},{"key":"e_1_3_4_92_2","unstructured":"Maxwell Nye Anders Johan Andreassen Guy Gur-Ari Henryk Michalewski Jacob Austin David Bieber et\u00a0al. 2021. Show your work: Scratchpads for intermediate computation with language models. arXiv:2112.00114. https:\/\/arxiv.org\/abs\/2112.00114"},{"key":"e_1_3_4_93_2","unstructured":"OpenAI. 2022. OpenAI: Introducing ChatGPT. (2022). Retrieved Nov. 30 2022 from https:\/\/openai.com\/blog\/chatgpt"},{"key":"e_1_3_4_94_2","unstructured":"OpenAI. 2023. GPT-4Technical Report. https:\/\/arxiv.org\/abs\/2303.08774"},{"key":"e_1_3_4_95_2","doi-asserted-by":"publisher","DOI":"10.3389\/fpsyg.2014.00310\/full"},{"key":"e_1_3_4_96_2","doi-asserted-by":"publisher","DOI":"10.1037\/rev0000027"},{"issue":"2","key":"e_1_3_4_97_2","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1037\/amp0000162","article-title":"Looking for intoolligence: A unified framework for the cognitive study of human tool use and technology.","volume":"73","author":"Osiurak Fran\u00e7ois","year":"2018","unstructured":"Fran\u00e7ois Osiurak and Dietmar Heinke. 2018. Looking for intoolligence: A unified framework for the cognitive study of human tool use and technology. American Psychologist 73, 2 (2018), 169.","journal-title":"American Psychologist"},{"issue":"1","key":"e_1_3_4_98_2","doi-asserted-by":"crossref","first-page":"10429","DOI":"10.1038\/s41598-018-28759-2","article-title":"Tool use and generalized motor programs: We all are natural born poly-dexters","volume":"8","author":"Osiurak Fran\u00e7ois","year":"2018","unstructured":"Fran\u00e7ois Osiurak, Mathieu Lesourd, Ludovic Delporte, and Yves Rossetti. 2018. Tool use and generalized motor programs: We all are natural born poly-dexters. Scientific Reports 8, 1 (2018), 10429.","journal-title":"Scientific Reports"},{"key":"e_1_3_4_99_2","doi-asserted-by":"publisher","DOI":"10.1017\/S0140525X19003236"},{"key":"e_1_3_4_100_2","unstructured":"Long Ouyang Jeff Wu Xu Jiang Diogo Almeida Carroll L. Wainwright Pamela Mishkin Chong Zhang Sandhini Agarwal Katarina Slama Alex Ray et\u00a0al. 2022. Training language models to follow instructions with human feedback. arXiv:2203.02155. Retrieved from https:\/\/arxiv.org\/abs\/2203.02155"},{"key":"e_1_3_4_101_2","unstructured":"Aaron Parisi Yao Zhao and Noah Fiedel. 2022. TALM: Tool augmented language models. arXiv:2205.12255. Retrieved from https:\/\/arxiv.org\/abs\/2205.12255"},{"key":"e_1_3_4_102_2","unstructured":"Joon Sung Park Joseph C. O\u2019Brien Carrie J. Cai Meredith Ringel Morris Percy Liang and Michael S. Bernstein. 2023. Generative agents: Interactive simulacra of human behavior. arXiv:2304.03442. Retrieved from https:\/\/arxiv.org\/abs\/2304.03442"},{"key":"e_1_3_4_103_2","unstructured":"F\u00e1bio Perez and Ian Ribeiro. 2022. Ignore previous prompt: Attack techniques for language models. arXiv:2211.09527. Retrieved from https:\/\/arxiv.org\/abs\/2211.09527"},{"key":"e_1_3_4_104_2","unstructured":"Aleksandra Piktus Fabio Petroni Vladimir Karpukhin Dmytro Okhonko Samuel Broscheit et\u00a0al. 2021. The web is your oyster\u2013knowledge-intensive NLP against a very large web corpus. arXiv:2112.09924. Retrieved from https:\/\/arxiv.org\/abs\/2112.09924"},{"key":"e_1_3_4_105_2","article-title":"ALVINN: An autonomous land vehicle in a neural network","author":"Pomerleau Dean A.","year":"1988","unstructured":"Dean A. Pomerleau. 1988. ALVINN: An autonomous land vehicle in a neural network. In Proceedings of the 1st International Conference on Neural Information Processing Systems .","journal-title":"Proceedings of the 1st International Conference on Neural Information Processing Systems"},{"key":"e_1_3_4_106_2","doi-asserted-by":"crossref","unstructured":"Ofir Press Muru Zhang Sewon Min Ludwig Schmidt Noah A. Smith and Mike Lewis. 2022. Measuring and narrowing the compositionality gap in language models. arXiv:2210.03350. Retrieved from https:\/\/arxiv.org\/abs\/2210.03350","DOI":"10.18653\/v1\/2023.findings-emnlp.378"},{"key":"e_1_3_4_107_2","unstructured":"Cheng Qian Chenyan Xiong Zhenghao Liu and Zhiyuan Liu. 2023. Toolink: Linking toolkit creation and using through chain-of-solving on open-source model. arXiv:2310.05155. Retrieved from https:\/\/arxiv.org\/abs\/2310.05155"},{"key":"e_1_3_4_108_2","unstructured":"Cheng Qian Xinran Zhao and Sherry Tongshuang Wu. 2023. \u201cMerge Conflicts!\u201d exploring the impacts of external distractors to parametric knowledge graphs. arXiv:2309.08594. Retrieved from https:\/\/arxiv.org\/abs\/2309.08594"},{"key":"e_1_3_4_109_2","unstructured":"Yujia Qin Shihao Liang Yining Ye Kunlun Zhu Lan Yan Yaxi Lu Yankai Lin Xin Cong Xiangru Tang Bill Qian et\u00a0al. 2023. ToolLLM: Facilitating large language models to master 16000+ real-world APIs. arXiv:2307.16789. Retrieved from https:\/\/arxiv.org\/abs\/2307.16789"},{"key":"e_1_3_4_110_2","article-title":"Direct preference optimization: Your language model is secretly a reward model","author":"Rafailov Rafael","year":"2024","unstructured":"Rafael Rafailov, Archit Sharma, Eric Mitchell, Christopher D. Manning, Stefano Ermon, and Chelsea Finn. 2024. Direct preference optimization: Your language model is secretly a reward model. InProceedings of the 37th International Conference on Neural Information Processing Systems .","journal-title":"Proceedings of the 37th International Conference on Neural Information Processing Systems"},{"key":"e_1_3_4_111_2","first-page":"140:1\u2013140:67","article-title":"Exploring the limits of transfer learning with a unified text-to-text transformer","volume":"21","author":"Raffel Colin","year":"2020","unstructured":"Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J. Liu. 2020. Exploring the limits of transfer learning with a unified text-to-text transformer. Journal of Machine Learning Research 21 (2020), 140:1\u2013140:67. https:\/\/colinraffel.com\/publications\/arxiv2019exploring.pdf","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_4_112_2","unstructured":"Aditya Ramesh Prafulla Dhariwal Alex Nichol Casey Chu and Mark Chen. 2022. Hierarchical text-conditional image generation with clip latents. arXiv:2204.06125. Retrieved from https:\/\/arxiv.org\/abs\/2204.06125"},{"key":"e_1_3_4_113_2","volume-title":"Proceedings of the 8th International Conference on Learning Representations, ICLR 2020","author":"Reddy Siddharth","year":"2020","unstructured":"Siddharth Reddy, Anca D. Dragan, and Sergey Levine. 2020. SQIL: Imitation learning via reinforcement learning with sparse rewards. In Proceedings of the 8th International Conference on Learning Representations, ICLR 2020. OpenReview.net."},{"key":"e_1_3_4_114_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.eacl-main.24"},{"key":"e_1_3_4_115_2","unstructured":"Kevin Roose. 2023. A conversation with Bing\u2019s chatbot left me deeply unsettled. Retrieved Feb. 16 2023 from https:\/\/www.nytimes.com\/2023\/02\/16\/technology\/bing-chatbot-microsoft-chatgpt.html"},{"key":"e_1_3_4_116_2","volume-title":"Proceedings of the 10th International Conference on Learning Representations, ICLR 2022","author":"Sanh Victor","year":"2022","unstructured":"Victor Sanh, Albert Webson, Colin Raffel, Stephen H. Bach, Lintang Sutawika, Zaid Alyafeai, Antoine Chaffin, Arnaud Stiegler, Arun Raja, Manan Dey, M. Saiful Bari, et\u00a0al. 2022. Multitask prompted training enables zero-shot task generalization. In Proceedings of the 10th International Conference on Learning Representations, ICLR 2022. OpenReview.net."},{"key":"e_1_3_4_117_2","volume-title":"Proceedings of the 9th International Conference on Learning Representations, ICLR 2021","author":"Sasaki Fumihiro","year":"2021","unstructured":"Fumihiro Sasaki and Ryota Yamashina. 2021. Behavioral cloning from noisy demonstrations. In Proceedings of the 9th International Conference on Learning Representations, ICLR 2021. OpenReview.net."},{"key":"e_1_3_4_118_2","unstructured":"Timo Schick Jane Dwivedi-Yu Roberto Dess\u00ec Roberta Raileanu Maria Lomeli Luke Zettlemoyer Nicola Cancedda and Thomas Scialom. 2023. Toolformer: Language models can teach themselves to use tools. arXiv:2302.04761. Retrieved from https:\/\/arxiv.org\/abs\/2302.04761"},{"key":"e_1_3_4_119_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41586-020-03051-4"},{"key":"e_1_3_4_120_2","unstructured":"John Schulman Filip Wolski Prafulla Dhariwal Alec Radford and Oleg Klimov. 2017. Proximal policy optimization algorithms. arXiv:1707.06347. Retrieved from https:\/\/arxiv.org\/abs\/1707.06347"},{"issue":"23","key":"e_1_3_4_121_2","first-page":"R1032\u2013R1039","article-title":"Animal tool-use","volume":"20","author":"Seed Amanda","year":"2010","unstructured":"Amanda Seed and Richard Byrne. 2010. Animal tool-use. Current Biology 20, 23 (2010), R1032\u2013R1039.","journal-title":"Current Biology"},{"key":"e_1_3_4_122_2","first-page":"3135","volume-title":"Proceedings of the 34th International Conference on Machine Learning ICML 2017","volume":"70","author":"Shi Tianlin","year":"2017","unstructured":"Tianlin Shi, Andrej Karpathy, Linxi Fan, Jonathan Hernandez, and Percy Liang. 2017. World of bits: An open-domain platform for web-based agents. In Proceedings of the 34th International Conference on Machine Learning ICML 2017, Doina Precup and Yee Whye Teh (Eds.). Proceedings of Machine Learning Research, Vol. 70. PMLR, 3135\u20133144."},{"key":"e_1_3_4_123_2","doi-asserted-by":"publisher","DOI":"10.1353\/book.98237"},{"key":"e_1_3_4_124_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.findings-emnlp.320"},{"key":"e_1_3_4_125_2","unstructured":"Kurt Shuster Jing Xu Mojtaba Komeili Da Ju Eric Michael Smith Stephen Roller Megan Ung Moya Chen Kushal Arora Joshua Lane et\u00a0al. 2022. Blenderbot 3: A deployed conversational agent that continually learns to responsibly engage. arXiv:2208.03188. Retrieved from https:\/\/arxiv.org\/abs\/2208.03188"},{"key":"e_1_3_4_126_2","doi-asserted-by":"publisher","DOI":"10.1126\/science.aar6404"},{"key":"e_1_3_4_127_2","doi-asserted-by":"crossref","unstructured":"Ishika Singh Valts Blukis Arsalan Mousavian Ankit Goyal Danfei Xu Jonathan Tremblay Dieter Fox Jesse Thomason and Animesh Garg. 2022. Progprompt: Generating situated robot task plans using large language models. arXiv:2209.11302. Retrieved from https:\/\/arxiv.org\/abs\/2209.11302","DOI":"10.1109\/ICRA48891.2023.10161317"},{"key":"e_1_3_4_128_2","unstructured":"Chan Hee Song Jiaman Wu Clayton Washington Brian M. Sadler Wei-Lun Chao and Yu Su. 2022. LLM-Planner: Few-shot grounded planning for embodied agents with large language models. arXiv:2212.04088. Retrieved from https:\/\/arxiv.org\/abs\/2212.04088"},{"key":"e_1_3_4_129_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.14"},{"key":"e_1_3_4_130_2","unstructured":"Yifan Song Da Yin Xiang Yue Jie Huang Sujian Li and Bill Yuchen Lin. 2024. Trial and error: Exploration-based trajectory optimization for LLM agents. arXiv:2403.02502. Retrieved from https:\/\/arxiv.org\/abs\/2403.02502"},{"issue":"4","key":"e_1_3_4_131_2","doi-asserted-by":"crossref","first-page":"292","DOI":"10.1037\/1089-2680.3.4.292","article-title":"The theory of successful intelligence","volume":"3","author":"Sternberg Robert J.","year":"1999","unstructured":"Robert J. Sternberg. 1999. The theory of successful intelligence. Review of General Psychology 3, 4 (1999), 292\u2013316.","journal-title":"Review of General Psychology"},{"key":"e_1_3_4_132_2","volume-title":"Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020","author":"Stiennon Nisan","year":"2020","unstructured":"Nisan Stiennon, Long Ouyang, Jeffrey Wu, Daniel M. Ziegler, et\u00a0al. 2020. Learning to summarize with human feedback. In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, Hugo Larochelle, Marc\u2019Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, and Hsuan-Tien Lin (Eds.). https:\/\/proceedings.neurips.cc\/paper\/2020\/hash\/1f89885d556929e98d3ef9b86448f951-Abstract.html"},{"key":"e_1_3_4_133_2","doi-asserted-by":"publisher","DOI":"10.5555\/2671144"},{"key":"e_1_3_4_134_2","volume-title":"Proceedings of the 2nd International Conference on Learning Representations ICLR 2014","author":"Szegedy Christian","year":"2014","unstructured":"Christian Szegedy, Wojciech Zaremba, Ilya Sutskever, Joan Bruna, Dumitru Erhan, Ian J. Goodfellow, and Rob Fergus. 2014. Intriguing properties of neural networks. In Proceedings of the 2nd International Conference on Learning Representations ICLR 2014, Yoshua Bengio and Yann LeCun (Eds.). https:\/\/arxiv.org\/abs\/1312.6199"},{"key":"e_1_3_4_135_2","volume-title":"Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020 NeurIPS 2020","author":"Taori Rohan","year":"2020","unstructured":"Rohan Taori, Achal Dave, Vaishaal Shankar, et\u00a0al. 2020. Measuring robustness to natural distribution shifts in image classification. In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020 NeurIPS 2020, Hugo Larochelle, Marc\u2019Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, and Hsuan-Tien Lin (Eds.). https:\/\/proceedings.neurips.cc\/paper\/2020\/hash\/d8330f857a17c53d217014ee776bfd50-Abstract.html"},{"issue":"1630","key":"e_1_3_4_136_2","doi-asserted-by":"crossref","first-page":"20120418","DOI":"10.1098\/rstb.2012.0418","article-title":"Did tool-use evolve with enhanced physical cognitive abilities?","volume":"368","author":"Teschke Irmgard","year":"2013","unstructured":"Irmgard Teschke, Claudia A. F. Wascher, Madeleine F. Scriba, Auguste M. P. von Bayern, V. Huml, B. Siemers, and Sabine Tebbich. 2013. Did tool-use evolve with enhanced physical cognitive abilities? Philosophical Transactions of the Royal Society B: Biological Sciences 368, 1630 (2013), 20120418.","journal-title":"Philosophical Transactions of the Royal Society B: Biological Sciences"},{"key":"e_1_3_4_137_2","unstructured":"Romal Thoppilan Daniel De Freitas Jamie Hall Noam Shazeer Apoorv Kulshreshtha Heng-Tze Cheng Alicia Jin Taylor Bos Leslie Baker Yu Du et\u00a0al. 2022. Lamda: Language models for dialog applications. arXiv:2201.08239. Retrieved from https:\/\/arxiv.org\/abs\/2201.08239"},{"key":"e_1_3_4_138_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2012.6386109"},{"key":"e_1_3_4_139_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2018\/687"},{"key":"e_1_3_4_140_2","doi-asserted-by":"crossref","unstructured":"Harsh Trivedi Tushar Khot Mareike Hartmann Ruskin Manku Vinty Dong Edward Li Shashank Gupta Ashish Sabharwal and Niranjan Balasubramanian. 2024. AppWorld: A controllable world of apps and people for benchmarking interactive coding agents. arXiv:2407.18901. Retrieved from https:\/\/arxiv.org\/abs\/2407.18901","DOI":"10.18653\/v1\/2024.acl-long.850"},{"key":"e_1_3_4_141_2","first-page":"5998","volume-title":"Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, et\u00a0al. 2017. Attention is all you need. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, Isabelle Guyon, Ulrike von Luxburg, Samy Bengio, Hanna M. Wallach, Rob Fergus, S. V. N. Vishwanathan, and Roman Garnett (Eds.). 5998\u20136008. https:\/\/arxiv.org\/abs\/1706.03762"},{"key":"e_1_3_4_142_2","volume-title":"ChatGPT for Robotics: Design Principles and Model Abilities","author":"Vemprala Sai","year":"2023","unstructured":"Sai Vemprala, Rogerio Bonatti, Arthur Bucker, and Ashish Kapoor. 2023. ChatGPT for Robotics: Design Principles and Model Abilities. Technical Report MSR-TR-2023-8. Microsoft."},{"key":"e_1_3_4_143_2","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/7209.001.0001","volume-title":"What is Cognitive Science?","author":"Eckardt Barbara Von","year":"1995","unstructured":"Barbara Von Eckardt. 1995. What is Cognitive Science?MIT Press."},{"key":"e_1_3_4_144_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1221"},{"key":"e_1_3_4_145_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.13"},{"key":"e_1_3_4_146_2","first-page":"3261","volume-title":"Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019 NeurIPS 2019","author":"Wang Alex","year":"2019","unstructured":"Alex Wang, Yada Pruksachatkun, Nikita Nangia, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, and Samuel R. Bowman. 2019. SuperGLUE: A stickier benchmark for general-purpose language understanding systems. In Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019 NeurIPS 2019, Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d\u2019Alch\u00e9-Buc, Emily B. Fox, and Roman Garnett (Eds.). 3261\u20133275."},{"key":"e_1_3_4_147_2","volume-title":"Proceedings of the 7th International Conference on Learning Representations, ICLR 2019","author":"Wang Alex","year":"2019","unstructured":"Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, et\u00a0al. 2019. GLUE: A multi-task benchmark and analysis platform for natural language understanding. In Proceedings of the 7th International Conference on Learning Representations, ICLR 2019. OpenReview.net. https:\/\/arxiv.org\/abs\/1804.07461"},{"key":"e_1_3_4_148_2","unstructured":"Jindong Wang Xixu Hu Wenxin Hou Hao Chen Runkai Zheng Yidong Wang Linyi Yang Haojun Huang Wei Ye Xiubo Geng et\u00a0al. 2023. On the robustness of ChatGPT: An adversarial and out-of-distribution perspective. arXiv:2302.12095. Retrieved from https:\/\/arxiv.org\/abs\/2302.12095"},{"key":"e_1_3_4_149_2","unstructured":"Xuezhi Wang Jason Wei Dale Schuurmans Quoc Le Ed Chi and Denny Zhou. 2022. Self-consistency improves chain of thought reasoning in language models. arXiv:2203.11171. Retrieved from https:\/\/arxiv.org\/abs\/2203.11171"},{"key":"e_1_3_4_150_2","unstructured":"Zihao Wang Shaofei Cai Anji Liu Xiaojian Ma and Yitao Liang. 2023. Describe explain plan and select: Interactive planning with large language models enables open-world multi-task agents. arXiv:2302.01560. Retrieved from https:\/\/arxiv.org\/abs\/2302.01560"},{"issue":"3","key":"e_1_3_4_151_2","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1038\/scientificamerican0960-62","article-title":"Tools and human evolution","volume":"203","author":"Washburn Sherwood L.","year":"1960","unstructured":"Sherwood L. Washburn. 1960. Tools and human evolution. Scientific American 203, 3 (1960), 62\u201375.","journal-title":"Scientific American"},{"key":"e_1_3_4_152_2","volume-title":"Proceedings of the 10th International Conference on Learning Representations, ICLR 2022","author":"Wei Jason","year":"2022","unstructured":"Jason Wei, Maarten Bosma, Vincent Y. Zhao, Kelvin Guu, Adams Wei Yu, et\u00a0al. 2022. Finetuned language models are zero-shot learners. In Proceedings of the 10th International Conference on Learning Representations, ICLR 2022. OpenReview.net. https:\/\/arxiv.org\/abs\/2109.01652"},{"key":"e_1_3_4_153_2","unstructured":"Jason Wei Yi Tay Rishi Bommasani Colin Raffel Barret Zoph Sebastian Borgeaud Dani Yogatama Maarten Bosma Denny Zhou Donald Metzler et\u00a0al. 2022. Emergent abilities of large language models. arXiv:2206.07682. Retrieved from https:\/\/arxiv.org\/abs\/2206.07682"},{"key":"e_1_3_4_154_2","unstructured":"Jason Wei Xuezhi Wang Dale Schuurmans Maarten Bosma Ed Chi Quoc Le and Denny Zhou. 2022. Chain of thought prompting elicits reasoning in large language models. arXiv:2201.11903. Retrieved from https:\/\/arxiv.org\/abs\/2201.11903"},{"key":"e_1_3_4_155_2","article-title":"Ethical and Societal Implications of Algorithms, Data, and Artificial Intelligence: A Roadmap for Research","author":"Whittlestone Jess","year":"2019","unstructured":"Jess Whittlestone, Rune Nyrup, Anna Alexandrova, Kanta Dihal, and Stephen Cave. 2019. Ethical and Societal Implications of Algorithms, Data, and Artificial Intelligence: A Roadmap for Research. Nuffield Foundation, London .","journal-title":"Nuffield Foundation, London"},{"key":"e_1_3_4_156_2","unstructured":"Chenfei Wu Shengming Yin Weizhen Qi Xiaodong Wang Zecheng Tang and Nan Duan. 2023. Visual ChatGPT: Talking drawing and editing with visual foundation models. arXiv:2303.04671. Retrieved from https:\/\/arxiv.org\/abs\/2303.04671"},{"key":"e_1_3_4_157_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.157"},{"key":"e_1_3_4_158_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1104"},{"key":"e_1_3_4_159_2","volume-title":"Proceedings of the ICLR 2024 Workshop on Large Language Model (LLM) Agents","author":"Xu Lin","year":"2023","unstructured":"Lin Xu, Zhiyuan Hu, Daquan Zhou, Hongyu Ren, Zhen Dong, et\u00a0al. 2023. Magic: Investigation of large language model powered multi-agent in cognition, adaptability, rationality and collaboration. In Proceedings of the ICLR 2024 Workshop on Large Language Model (LLM) Agents. https:\/\/aclanthology.org\/2024.emnlp-main.416"},{"key":"e_1_3_4_160_2","first-page":"1342","volume-title":"Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing","author":"Yan Rui","year":"2011","unstructured":"Rui Yan, Jian-Yun Nie, and Xiaoming Li. 2011. Summarize what you are interested in: An optimization framework for interactive personalized summarization. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Edinburgh, Scotland, UK., 1342\u20131351."},{"key":"e_1_3_4_161_2","first-page":"3","volume-title":"Proceedings of the 24th International Conference on Text, Speech, and Dialogue, TSD 2021","author":"Yang Diyi","year":"2021","unstructured":"Diyi Yang and Lucie Flek. 2021. Towards user-centric text-to-text generation: A survey. In Proceedings of the 24th International Conference on Text, Speech, and Dialogue, TSD 2021. Springer, 3\u201322."},{"key":"e_1_3_4_162_2","unstructured":"John Yang Carlos E. Jimenez Alexander Wettig Kilian Lieret Shunyu Yao Karthik Narasimhan and Ofir Press. 2024. SWE-agent: Agent-computer interfaces enable automated software engineering. arXiv:2405.15793. Retrieved from https:\/\/arxiv.org\/abs\/2405.15793"},{"key":"e_1_3_4_163_2","unstructured":"Sherry Yang Ofir Nachum Yilun Du Jason Wei Pieter Abbeel and Dale Schuurmans. 2023. Foundation models for decision making: Problems methods and opportunities. arXiv:2303.04129. Retrieved from https:\/\/arxiv.org\/abs\/2303.04129"},{"key":"e_1_3_4_164_2","unstructured":"Shunyu Yao Howard Chen John Yang and Karthik Narasimhan. 2022. Webshop: Towards scalable real-world web interaction with grounded language agents. arXiv:2207.01206. Retrieved from https:\/\/arxiv.org\/abs\/2207.01206"},{"key":"e_1_3_4_165_2","unstructured":"Shunyu Yao Jeffrey Zhao Dian Yu Nan Du Izhak Shafran Karthik Narasimhan and Yuan Cao. 2022. React: Synergizing reasoning and acting in language models. arXiv:2210.03629. Retrieved from https:\/\/arxiv.org\/abs\/2210.03629"},{"key":"e_1_3_4_166_2","unstructured":"Yining Ye Xin Cong Shizuo Tian Jiannan Cao Hao Wang Yujia Qin Yaxi Lu Heyang Yu Huadong Wang Yankai Lin et\u00a0al. 2023. Proagent: From robotic process automation to agentic process automation. arXiv:2311.10751. Retrieved from https:\/\/arxiv.org\/abs\/2311.10751"},{"key":"e_1_3_4_167_2","unstructured":"Da Yin Faeze Brahman Abhilasha Ravichander Khyathi Chandu Kai-Wei Chang Yejin Choi and Bill Yuchen Lin. 2023. Lumos: Learning agents with unified data modular design and open-source LLMs. arXiv:2311.05657. Retrieved from https:\/\/arxiv.org\/abs\/2311.05657"},{"key":"e_1_3_4_168_2","first-page":"9051","volume-title":"Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019 NeurIPS 2019","author":"Zellers Rowan","year":"2019","unstructured":"Rowan Zellers, Ari Holtzman, Hannah Rashkin, Yonatan Bisk, et\u00a0al. 2019. Defending against neural fake news. In Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019 NeurIPS 2019, Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d\u2019Alch\u00e9-Buc, Emily B. Fox, and Roman Garnett (Eds.). 9051\u20139062. https:\/\/arxiv.org\/abs\/1905.12616"},{"key":"e_1_3_4_169_2","unstructured":"Aohan Zeng Mingdao Liu Rui Lu Bowen Wang Xiao Liu Yuxiao Dong and Jie Tang. 2023. AgentTuning: Enabling generalized agent abilities for LLMs. arXiv:2310.12823. Retrieved from https:\/\/arxiv.org\/abs\/2310.12823"},{"key":"e_1_3_4_170_2","unstructured":"Andy Zeng Adrian Wong Stefan Welker Krzysztof Choromanski Federico Tombari Aveek Purohit Michael Ryoo Vikas Sindhwani Johnny Lee Vincent Vanhoucke et\u00a0al. 2022. Socratic models: Composing zero-shot multimodal reasoning with language. arXiv:2204.00598. Retrieved from https:\/\/arxiv.org\/abs\/2204.00598"},{"key":"e_1_3_4_171_2","doi-asserted-by":"crossref","unstructured":"Kechi Zhang Jia Li Ge Li Xianjie Shi and Zhi Jin. 2024. Codeagent: Enhancing code generation with tool-integrated agent systems for real-world repo-level coding challenges. arXiv:2401.07339. Retrieved from https:\/\/arxiv.org\/abs\/2401.07339","DOI":"10.18653\/v1\/2024.acl-long.737"},{"key":"e_1_3_4_172_2","unstructured":"Kechi Zhang Huangzhao Zhang Ge Li Jia Li Zhuo Li and Zhi Jin. 2023. ToolCoder: Teach code generation models to use API search tools. arXiv:2305.04032. Retrieved from https:\/\/arxiv.org\/abs\/2305.04032"},{"key":"e_1_3_4_173_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1205"},{"key":"e_1_3_4_174_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.naacl-main.426"},{"key":"e_1_3_4_175_2","unstructured":"Shuyan Zhou Uri Alon Frank F. Xu Zhiruo Wang Zhengbao Jiang and Graham Neubig. 2023. DocPrompting: Generating code by retrieving the docs. arXiv:2207.05987. Retrieved from https:\/\/arxiv.org\/abs\/2207.05987"},{"key":"e_1_3_4_176_2","unstructured":"Daniel M. Ziegler Nisan Stiennon Jeffrey Wu Tom B. Brown Alec Radford Dario Amodei Paul Christiano and Geoffrey Irving. 2019. Fine-tuning language models from human preferences. arXiv:1909.08593. Retrieved from https:\/\/arxiv.org\/abs\/1909.08593"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3704435","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3704435","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T01:18:05Z","timestamp":1750295885000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3704435"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,12,24]]},"references-count":175,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,4,30]]}},"alternative-id":["10.1145\/3704435"],"URL":"https:\/\/doi.org\/10.1145\/3704435","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,12,24]]},"assertion":[{"value":"2023-07-19","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-11-03","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-12-24","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}