{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,28]],"date-time":"2026-02-28T18:30:38Z","timestamp":1772303438478,"version":"3.50.1"},"publisher-location":"California","reference-count":0,"publisher":"International Joint Conferences on Artificial Intelligence Organization","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,9]]},"abstract":"<jats:p>Reinforcement learning (RL) has shown impressive results in sequential decision-making tasks. Large Language Models (LLMs) and Vision-Language Models (VLMs) have recently emerged, exhibiting impressive capabilities in multimodal understanding and reasoning. These advances have led to a surge of research integrating LLMs and VLMs into RL. This survey reviews representative works in which LLMs and VLMs are used to overcome key challenges in RL, such as lack of prior knowledge, long-horizon planning, and reward design. We present a taxonomy that categorizes these LLM\/VLM-assisted RL approaches into three roles: agent, planner, and reward. We conclude by exploring open problems, including grounding, bias mitigation, improved representations, and action advice. By consolidating existing research and identifying future directions, this survey establishes a framework for integrating LLMs and VLMs into RL, advancing approaches that unify natural language and visual understanding with sequential decision-making.<\/jats:p>","DOI":"10.24963\/ijcai.2025\/1181","type":"proceedings-article","created":{"date-parts":[[2025,9,19]],"date-time":"2025-09-19T08:10:40Z","timestamp":1758269440000},"page":"10641-10649","source":"Crossref","is-referenced-by-count":1,"title":["The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning"],"prefix":"10.24963","author":[{"given":"Sheila","family":"Schoepp","sequence":"first","affiliation":[{"name":"University of Alberta"}]},{"given":"Masoud","family":"Jafaripour","sequence":"additional","affiliation":[{"name":"University of Alberta"}]},{"given":"Yingyue","family":"Cao","sequence":"additional","affiliation":[{"name":"University of Alberta"}]},{"given":"Tianpei","family":"Yang","sequence":"additional","affiliation":[{"name":"Nanjing University"}]},{"given":"Fatemeh","family":"Abdollahi","sequence":"additional","affiliation":[{"name":"University of Alberta"}]},{"given":"Shadan","family":"Golestan","sequence":"additional","affiliation":[{"name":"Alberta Machine Intelligence Institute"}]},{"given":"Zahin","family":"Sufiyan","sequence":"additional","affiliation":[{"name":"University of Alberta"}]},{"given":"Osmar R.","family":"Zaiane","sequence":"additional","affiliation":[{"name":"University of Alberta"},{"name":"Alberta Machine Intelligence Institute (Amii)"}]},{"given":"Matthew E.","family":"Taylor","sequence":"additional","affiliation":[{"name":"University of Alberta"},{"name":"Alberta Machine Intelligence Institute (Amii)"}]}],"member":"10584","event":{"name":"Thirty-Fourth International Joint Conference on Artificial Intelligence {IJCAI-25}","theme":"Artificial Intelligence","location":"Montreal, Canada","acronym":"IJCAI-2025","number":"34","sponsor":["International Joint Conferences on Artificial Intelligence Organization (IJCAI)"],"start":{"date-parts":[[2025,8,16]]},"end":{"date-parts":[[2025,8,22]]}},"container-title":["Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence"],"original-title":[],"deposited":{"date-parts":[[2025,9,23]],"date-time":"2025-09-23T11:36:22Z","timestamp":1758627382000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.ijcai.org\/proceedings\/2025\/1181"}},"subtitle":[],"proceedings-subject":"Artificial Intelligence Research Articles","short-title":[],"issued":{"date-parts":[[2025,9]]},"references-count":0,"URL":"https:\/\/doi.org\/10.24963\/ijcai.2025\/1181","relation":{},"subject":[],"published":{"date-parts":[[2025,9]]}}}