{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T10:29:43Z","timestamp":1773224983646,"version":"3.50.1"},"reference-count":195,"publisher":"Association for Computing Machinery (ACM)","issue":"2","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Hum.-Robot Interact."],"published-print":{"date-parts":[[2026,3,31]]},"abstract":"<jats:p>\n                    Robot learning from humans has been proposed and researched for several decades as a means to enable robots to learn new skills or adapt existing ones to new situations. Recent advances in AI, including learning approaches like reinforcement learning and architectures like transformers and foundation models, combined with access to massive datasets, have created attractive opportunities to apply those data-hungry techniques to this problem. We argue that the focus on massive amounts of pre-collected data, and the resulting learning paradigm, where humans demonstrate and robots learn in isolation, is overshadowing a specialized area of work we term Human-Interactive Robot Learning (HIRL). This paradigm, wherein robots and humans interact\n                    <jats:italic toggle=\"yes\">during the learning process<\/jats:italic>\n                    , is at the intersection of multiple fields (AI, robotics, human\u2013computer interaction, design and others) and holds unique promise. Using HIRL, robots can achieve greater sample efficiency (as humans can provide task knowledge through interaction), align with human preferences (as humans can guide the robot behavior toward their expectations), and explore more meaningfully and safely (as humans can utilize domain knowledge to guide learning and prevent catastrophic failures). This can result in robotic systems that can more quickly and easily adapt to new tasks in human environments. 
The objective of this article is to provide a broad and consistent overview of HIRL research and to guide researchers toward understanding the scope of HIRL, and current open or underexplored challenges related to four themes\u2014namely, human, robot learning, interaction, and broader context. The article includes concrete use cases to illustrate the interaction between these challenges and inspire further research according to broad recommendations and a call for action for the growing HIRL community.\n                  <\/jats:p>","DOI":"10.1145\/3779297","type":"journal-article","created":{"date-parts":[[2025,12,9]],"date-time":"2025-12-09T16:17:08Z","timestamp":1765297028000},"page":"1-31","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Human-Interactive Robot Learning: Definition, Challenges, and Recommendations"],"prefix":"10.1145","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4381-4234","authenticated-orcid":false,"given":"Kim","family":"Baraka","sequence":"first","affiliation":[{"name":"Vrije Universiteit Amsterdam, Amsterdam, Netherlands"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0693-0711","authenticated-orcid":false,"given":"Ifrah","family":"Idrees","sequence":"additional","affiliation":[{"name":"Brown University, Providence, Rhode Island, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5838-0021","authenticated-orcid":false,"given":"Taylor","family":"Kessler Faulkner","sequence":"additional","affiliation":[{"name":"University of Washington, Seattle, Washington, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9516-3130","authenticated-orcid":false,"given":"Erdem","family":"Biyik","sequence":"additional","affiliation":[{"name":"University of Southern California, Los Angeles, California, 
USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7738-4418","authenticated-orcid":false,"given":"Serena","family":"Booth","sequence":"additional","affiliation":[{"name":"Brown University, Providence, Rhode Island, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2920-4539","authenticated-orcid":false,"given":"Mohamed","family":"Chetouani","sequence":"additional","affiliation":[{"name":"Institut des Syst\u00e8mes Intelligents et de Robotique, Sorbonne University, CNRS, Paris, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8757-727X","authenticated-orcid":false,"given":"Daniel H.","family":"Grollman","sequence":"additional","affiliation":[{"name":"Plus One Robotics Inc., Boulder, Colorado, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9637-6699","authenticated-orcid":false,"given":"Akanksha","family":"Saran","sequence":"additional","affiliation":[{"name":"Sony AI, San Francisco, California, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7160-4352","authenticated-orcid":false,"given":"Emmanuel","family":"Senft","sequence":"additional","affiliation":[{"name":"Idiap Research Institute, Martigny, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6826-370X","authenticated-orcid":false,"given":"Silvia","family":"Tulli","sequence":"additional","affiliation":[{"name":"Sorbonne University, CNRS, Paris, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9378-7249","authenticated-orcid":false,"given":"Anna-Lisa","family":"Vollmer","sequence":"additional","affiliation":[{"name":"Bielefeld University, Bielefeld, 
Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6641-6450","authenticated-orcid":false,"given":"Antonio","family":"Andriella","sequence":"additional","affiliation":[{"name":"Artificial Intelligence Research Institute (IIIA-CSIC), Barcelona, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4159-6608","authenticated-orcid":false,"given":"Helen","family":"Beierling","sequence":"additional","affiliation":[{"name":"Bielefeld University, Bielefeld, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-1358-1912","authenticated-orcid":false,"given":"Tiffany","family":"Horter","sequence":"additional","affiliation":[{"name":"University of Oxford, Oxford, United Kingdom of Great Britain and Northern Ireland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7257-5434","authenticated-orcid":false,"given":"Jens","family":"Kober","sequence":"additional","affiliation":[{"name":"TU Delft, Delft, Netherlands"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-4487-8357","authenticated-orcid":false,"given":"Isaac","family":"Sheidlower","sequence":"additional","affiliation":[{"name":"Tufts University, Medford, Massachusetts, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8946-0211","authenticated-orcid":false,"given":"Matthew E.","family":"Taylor","sequence":"additional","affiliation":[{"name":"University of Alberta & Alberta Machine Intelligence Institute (Amii), Edmonton, Alberta, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3729-157X","authenticated-orcid":false,"given":"Sanne","family":"van Waveren","sequence":"additional","affiliation":[{"name":"Georgia Institute of Technology, Atlanta, Georgia, 
USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5151-2186","authenticated-orcid":false,"given":"Xuesu","family":"Xiao","sequence":"additional","affiliation":[{"name":"George Mason University, Fairfax, Virginia, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2026,2,23]]},"reference":[{"key":"e_1_3_3_2_2","unstructured":"Competition of International Conference of Social Robotics. 2024. Retrieved December 9 2024 from https:\/\/www.icsr2024-competition.org\/competition"},{"key":"e_1_3_3_3_2","unstructured":"Educational module Human-Interactive Robot Learning (HIRL). 2024. Retrieved December 9 2024 from https:\/\/www.humane-ai.eu\/project\/tmp-038\/"},{"key":"e_1_3_3_4_2","unstructured":"Human-Robot Interaction A Research Portal for the HRI Community. 2024. Retrieved December 9 2024 from https:\/\/humanrobotinteraction.org\/"},{"key":"e_1_3_3_5_2","unstructured":"Performance Evaluation & Benchmarking of Robotic and Automation Systems. 2024. Retrieved December 9 2024 from https:\/\/www.ieee-ras.org\/performance-evaluation"},{"key":"e_1_3_3_6_2","unstructured":"RoboCup@Home League. 2024. Retrieved December 9 2024 from https:\/\/athome.robocup.org\/"},{"key":"e_1_3_3_7_2","doi-asserted-by":"publisher","DOI":"10.1145\/3434073.3444647"},{"key":"e_1_3_3_8_2","unstructured":"Michael Ahn Anthony Brohan Noah Brown Yevgen Chebotar Omar Cortes Byron David Chelsea Finn Chuyuan Fu Keerthana Gopalakrishnan Karol Hausman et al. 2022. Do as I can and not as I say: Grounding language in robotic affordances. arXiv:2204.01691. 
Retrieved from https:\/\/arxiv.org\/abs\/2204.01691"},{"key":"e_1_3_3_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2020.2996587"},{"key":"e_1_3_3_10_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2008.10.024"},{"key":"e_1_3_3_11_2","doi-asserted-by":"publisher","DOI":"10.1145\/3357236.3395525"},{"key":"e_1_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICAD65464.2025.11114031"},{"key":"e_1_3_3_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS60139.2025.11245930"},{"key":"e_1_3_3_14_2","doi-asserted-by":"publisher","DOI":"10.3389\/frobt.2020.00062"},{"key":"e_1_3_3_15_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-42307-0\\_2"},{"key":"e_1_3_3_16_2","doi-asserted-by":"publisher","DOI":"10.1126\/scirobotics.aat5954"},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.1177\/02783649211041652"},{"key":"e_1_3_3_18_2","volume-title":"Proceedings of 3rd Conference on Robot Learning","author":"Biyik Erdem","year":"2019","unstructured":"Erdem Biyik, Malayandi Palan, Nicholas C. Landolfi, Dylan P. Losey, and Dorsa Sadigh. 2019. Asking easy questions: A user-friendly approach to active reward learning. In Proceedings of 3rd Conference on Robot Learning."},{"key":"e_1_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.1145\/3610977.3634987"},{"key":"e_1_3_3_20_2","doi-asserted-by":"publisher","DOI":"10.1145\/3610977.3634987"},{"key":"e_1_3_3_21_2","doi-asserted-by":"publisher","DOI":"10.1145\/3319502.3374811"},{"key":"e_1_3_3_22_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v37i5.25733"},{"key":"e_1_3_3_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/HRI53351.2022.9889398"},{"key":"e_1_3_3_24_2","unstructured":"Connor Brooks and Daniel Szafir. 2019. Building second-order mental models for human-robot interaction. arXiv:1909.06508. 
Retrieved from https:\/\/arxiv.org\/abs\/1909.06508"},{"key":"e_1_3_3_25_2","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Brown Daniel S.","year":"2019","unstructured":"Daniel S. Brown, Wonjoon Goo, Prabhat Nagarajan, and Scott Niekum. 2019. Extrapolating beyond suboptimal demonstrations via inverse reinforcement learning from observations. In Proceedings of the International Conference on Machine Learning. Retrieved from https:\/\/api.semanticscholar.org\/CorpusID:119111734"},{"key":"e_1_3_3_26_2","article-title":"Open problems and fundamental limitations of reinforcement learning from human feedback","author":"Casper Stephen","year":"2023","unstructured":"Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, J\u00e9r\u00e9my Scheurer, Javier Rando, Rachel Freedman, Tomek Korbak, David Lindner, Pedro Freire, et al. 2023. Open problems and fundamental limitations of reinforcement learning from human feedback. Transactions on Machine Learning Research (2023). Retrieved from https:\/\/openreview.net\/forum?id=bx24KpJ4EbSurvey","journal-title":"Transactions on Machine Learning Research"},{"key":"e_1_3_3_27_2","doi-asserted-by":"publisher","DOI":"10.1561\/2300000072"},{"key":"e_1_3_3_28_2","first-page":"2083","volume-title":"Proceedings of the Conference on Robot Learning","author":"Chen Letian","year":"2023","unstructured":"Letian Chen, Sravan Jayanthi, Rohan R. Paleja, Daniel Martin, Viacheslav Zakharov, and Matthew Gombolay. 2023. Fast lifelong adaptive inverse reinforcement learning from demonstrations. In Proceedings of the Conference on Robot Learning. PMLR, 2083\u20132094."},{"key":"e_1_3_3_29_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-01570-0"},{"key":"e_1_3_3_30_2","first-page":"140","volume-title":"Interactive Robot Learning: An Overview","author":"Chetouani Mohamed","year":"2023","unstructured":"Mohamed Chetouani. 2023. Interactive Robot Learning: An Overview. 
Springer International Publishing, 140\u2013172."},{"key":"e_1_3_3_31_2","doi-asserted-by":"publisher","DOI":"10.1145\/3610978.3640740"},{"key":"e_1_3_3_32_2","doi-asserted-by":"publisher","DOI":"10.1109\/HRI53351.2022.9889421"},{"key":"e_1_3_3_33_2","doi-asserted-by":"publisher","DOI":"10.5555\/3032527.3032529"},{"key":"e_1_3_3_34_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v38i10.29049"},{"key":"e_1_3_3_35_2","first-page":"429","article-title":"Humans in the loop","volume":"76","author":"Crootof Rebecca","year":"2023","unstructured":"Rebecca Crootof, Margot E. Kaminski, and W. Nicholson Price II. 2023. Humans in the loop. Vanderbilt Law Review 76 (2023), 429.","journal-title":"Vanderbilt Law Review"},{"key":"e_1_3_3_36_2","doi-asserted-by":"crossref","unstructured":"Yuchen Cui Siddharth Karamcheti Raj Palleti Nidhya Shivakumar Percy Liang and Dorsa Sadigh. 2023. No to the right: Online language corrections for robotic manipulation via shared autonomy. In Proceedings of the 2023 ACM\/IEEE International Conference on Human-Robot Interaction 93\u2013101.","DOI":"10.1145\/3568162.3578623"},{"key":"e_1_3_3_37_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2021\/599"},{"key":"e_1_3_3_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8460854"},{"key":"e_1_3_3_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8460854"},{"key":"e_1_3_3_40_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i18.17998"},{"key":"e_1_3_3_41_2","first-page":"507","volume-title":"Proceedings of the International Conference on Information Technology & Systems","author":"Rodrigues da Costa Larissa","year":"2023","unstructured":"Larissa Rodrigues da Costa, Jaelson Castro, Cinthya Lins, Judith Kelner, Maria Lencastre, and \u00d3scar Pastor. 2023. On the use of social robots for rehabilitation: The case of NAO physio. In Proceedings of the International Conference on Information Technology & Systems. 
Springer, 507\u2013517."},{"key":"e_1_3_3_42_2","doi-asserted-by":"publisher","DOI":"10.1145\/3008665.3008674"},{"key":"e_1_3_3_43_2","volume-title":"Proceedings of 18th International Symposium on Experimental Robotics (ISER)","author":"Dennler Nathaniel","year":"2023","unstructured":"Nathaniel Dennler, David Delgado, Daniel Zeng, Stefanos Nikolaidis, and Maja Matari\u0107. 2023. The RoSiD tool: Empowering users to design multimodal signals for human-robot collaboration. In Proceedings of 18th International Symposium on Experimental Robotics (ISER)."},{"key":"e_1_3_3_44_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.sbspro.2015.06.134"},{"key":"e_1_3_3_45_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.injury.2020.04.050"},{"key":"e_1_3_3_46_2","doi-asserted-by":"publisher","DOI":"10.15607\/RSS.2023.XIX.011"},{"key":"e_1_3_3_47_2","doi-asserted-by":"publisher","DOI":"10.1080\/01691864.2019.1694068"},{"key":"e_1_3_3_48_2","unstructured":"European Parliament and Council of the European Union. 2024. Regulation (EU) 2024\/1689 on artificial intelligence act. Official Journal of the European Union. Retrieved from https:\/\/www.artificial-intelligence-act.com\/"},{"key":"e_1_3_3_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA40945.2020.9197219"},{"key":"e_1_3_3_50_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-34103-8_20"},{"key":"e_1_3_3_51_2","volume-title":"Proceedings of 6th Annual Conference on Robot Learning","author":"Fitzgerald Tesca","year":"2022","unstructured":"Tesca Fitzgerald, Pallavi Koppol, Patrick Callaghan, Russell Quinlan Jun Hei Wong, Reid Simmons, Oliver Kroemer, and Henny Admoni. 2022. INQUIRE: INteractive querying for user-aware informative REasoning. 
In Proceedings of 6th Annual Conference on Robot Learning."},{"key":"e_1_3_3_52_2","doi-asserted-by":"publisher","DOI":"10.1145\/3476412"},{"key":"e_1_3_3_53_2","doi-asserted-by":"publisher","DOI":"10.3389\/fpsyt.2021.596055"},{"key":"e_1_3_3_54_2","volume-title":"Children Teach Children: Learning by Teaching","author":"Gartner Alan","year":"1971","unstructured":"Alan Gartner, Mary Conway Kohler, and Frank Riessman. 1971. Children Teach Children: Learning by Teaching. Harper & Row."},{"key":"e_1_3_3_55_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41598-023-35231-3"},{"key":"e_1_3_3_56_2","doi-asserted-by":"crossref","unstructured":"Ken Goldberg. 2025. Good old-fashioned engineering can close the 100 000-year \u201cdata gap\u201d in robotics. Science Robotics 10 105 (2025) eaea7390.","DOI":"10.1126\/scirobotics.aea7390"},{"key":"e_1_3_3_57_2","article-title":"Policy shaping: Integrating human feedback with reinforcement learning","volume":"26","author":"Griffith Shane","year":"2013","unstructured":"Shane Griffith, Kaushik Subramanian, Jonathan Scholz, Charles L. Isbell, and Andrea L. Thomaz. 2013. Policy shaping: Integrating human feedback with reinforcement learning. Advances in Neural Information Processing Systems 26 (2013).","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_3_58_2","doi-asserted-by":"publisher","DOI":"10.1007\/s12369-012-0161-z"},{"key":"e_1_3_3_59_2","unstructured":"James P. Gunderson and Louise F. Gunderson. 2004. Intelligence (is not equal to) autonomy (is not equal to) capability. In Proceedings of the 2004 Performance Metrics for Intelligent Systems Workshop (PerMIS '04) Gaithersburg MD."},{"key":"e_1_3_3_60_2","unstructured":"Soheil Habibian Antonio Alvarez Valdivia Laura H. Blumenschein and Dylan P. Losey. 2023. A review of communicating robot learning during human-robot interaction. arXiv:2312.00948. 
Retrieved from https:\/\/arxiv.org\/abs\/2312.00948"},{"key":"e_1_3_3_61_2","volume-title":"Proceedings of 9th USENIX Symposium on Operating Systems Design and Implementation (OSDI \u201910)","author":"Haeberlen Andreas","year":"2010","unstructured":"Andreas Haeberlen, Paarijaat Aditya, Rodrigo Rodrigues, and Peter Druschel. 2010. Accountable virtual machines. In Proceedings of 9th USENIX Symposium on Operating Systems Design and Implementation (OSDI \u201910)."},{"key":"e_1_3_3_62_2","first-page":"234","volume-title":"An Introduction to Vygotsky","author":"Hedegaard Mariane","year":"2012","unstructured":"Mariane Hedegaard. 2012. The zone of proximal development as basis for instruction. In An Introduction to Vygotsky. Luis C. Moll (Ed.), Routledge, 234\u2013258."},{"key":"e_1_3_3_63_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACIIW63320.2024.00020"},{"key":"e_1_3_3_64_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2022.104104"},{"key":"e_1_3_3_65_2","unstructured":"Matthew Hong Anthony Liang Kevin Kim Harshitha Rajaprakash Jesse Thomason Erdem B\u0131y\u0131k and Jesse Zhang. 2025. HAND me the data: Fast robot adaptation via hand path retrieval. arXiv:2505.20455. Retrieved from https:\/\/arxiv.org\/abs\/2505.20455"},{"key":"e_1_3_3_66_2","doi-asserted-by":"publisher","DOI":"10.1145\/2701973.2702091"},{"key":"e_1_3_3_67_2","doi-asserted-by":"publisher","DOI":"10.1109\/RO-MAN57019.2023.10309481"},{"key":"e_1_3_3_68_2","doi-asserted-by":"publisher","DOI":"10.1145\/3687272.3688298"},{"key":"e_1_3_3_69_2","doi-asserted-by":"publisher","DOI":"10.23919\/SoftCOM52868.2021.9559079"},{"key":"e_1_3_3_70_2","doi-asserted-by":"crossref","unstructured":"Jindan Huang Reuben M. Aronson and Elaine Schaertl Short. 2024. Modeling variation in human feedback with user inputs: An exploratory methodology. 
In Proceedings of the 2024 ACM\/IEEE International Conference on Human-Robot Interaction 303\u2013312.","DOI":"10.1145\/3610977.3634925"},{"key":"e_1_3_3_71_2","doi-asserted-by":"publisher","DOI":"10.1145\/3613904.3642181"},{"key":"e_1_3_3_72_2","volume-title":"Robots for Health and Elderly Care (RoboHEC) Workshop\u2014IROS","author":"Idrees Ifrah","unstructured":"Ifrah Idrees and Stefanie Tellex. [n.\u2009d.]. Towards conversational interfaces and visual memory representation for social robots helping the elderly. Robots for Health and Elderly Care (RoboHEC) Workshop\u2014IROS."},{"key":"e_1_3_3_73_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS55552.2023.10342380"},{"key":"e_1_3_3_74_2","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2020.2974707"},{"key":"e_1_3_3_75_2","doi-asserted-by":"publisher","DOI":"10.4324\/9781315587622"},{"key":"e_1_3_3_76_2","first-page":"4415","article-title":"Reward-rational (implicit) choice: A unifying formalism for reward learning","volume":"33","author":"Jeon Hong Jun","year":"2020","unstructured":"Hong Jun Jeon, Smitha Milli, and Anca Dragan. 2020. Reward-rational (implicit) choice: A unifying formalism for reward learning. Advances in Neural Information Processing Systems 33 (2020), 4415\u20134426.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_3_77_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA55743.2025.11127989"},{"key":"e_1_3_3_78_2","unstructured":"Timo Kaufmann Paul Weng Viktor Bengs and Eyke H\u00fcllermeier. 2024. A survey of reinforcement learning from human feedback. arXiv:2312.14925. 
Retrieved from https:\/\/arxiv.org\/abs\/2312.14925"},{"key":"e_1_3_3_79_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2019.8793698"},{"key":"e_1_3_3_80_2","first-page":"728","volume-title":"Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems","author":"Faulkner Taylor Kessler","year":"2019","unstructured":"Taylor Kessler Faulkner, Reymundo A. Gutierrez, Elaine Schaertl Short, Guy Hoffman, and Andrea L. Thomaz. 2019. Active attention-modified policy shaping: Socially interactive agents track. In Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 728\u2013736."},{"key":"e_1_3_3_81_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA40945.2020.9197219"},{"key":"e_1_3_3_82_2","volume-title":"ACM International Conference on Multimodal Interaction","author":"Knierim Matilda","year":"2024","unstructured":"Matilda Knierim, Sahil Jain, Murat Han Aydo\u011fan, Kenneth Mitra, Kush Desai, Akanksha Saran, and Kim Baraka. 2024. Prosody as a teaching signal for agent learning: Exploratory studies and algorithmic implications. ACM International Conference on Multimodal Interaction"},{"key":"e_1_3_3_83_2","unstructured":"W. Bradley Knox Stephane Hatgis-Kessell Serena Booth Scott Niekum Peter Stone and Alessandro Allievi. 2022. Models of human preference for learning reward functions. arXiv:2206.02231. Retrieved from https:\/\/arxiv.org\/abs\/2206.02231"},{"key":"e_1_3_3_84_2","doi-asserted-by":"publisher","DOI":"10.1145\/1597735.1597738"},{"key":"e_1_3_3_85_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA55743.2025.11128018"},{"key":"e_1_3_3_86_2","doi-asserted-by":"publisher","unstructured":"Samantha Krening and Karen M. Feigh. 2018. Interaction algorithm effect on human experience with reinforcement learning. Journal of Human-Robot Interaction 7 2 Article 16 (Oct. 2018) 22. 
DOI: 10.1145\/3277904","DOI":"10.1145\/3277904"},{"key":"e_1_3_3_87_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCDS.2016.2628365"},{"key":"e_1_3_3_88_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-51532-8_10"},{"key":"e_1_3_3_89_2","doi-asserted-by":"publisher","DOI":"10.1145\/3319502.3374832"},{"key":"e_1_3_3_90_2","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2017.3121552"},{"key":"e_1_3_3_91_2","doi-asserted-by":"publisher","DOI":"10.1561\/2300000052"},{"key":"e_1_3_3_92_2","unstructured":"Kimin Lee Laura Smith Anca Dragan and Pieter Abbeel. 2021. B-Pref: Benchmarking preference-based reinforcement learning. arXiv:2111.03026. Retrieved from https:\/\/arxiv.org\/abs\/2111.03026"},{"key":"e_1_3_3_93_2","first-page":"4545","volume-title":"Proceedings of 9th Conference on Robot Learning (Proceedings of Machine Learning Research","volume":"305","author":"Lepert Marion","year":"2025","unstructured":"Marion Lepert, Jiaying Fang, and Jeannette Bohg. 2025. Phantom: Training robots without robots using only human videos. In Proceedings of 9th Conference on Robot Learning (Proceedings of Machine Learning Research), Vol. 305. PMLR, 4545\u20134565."},{"key":"e_1_3_3_94_2","doi-asserted-by":"publisher","DOI":"10.1109\/THMS.2019.2912447"},{"key":"e_1_3_3_95_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS58592.2024.10801388"},{"key":"e_1_3_3_96_2","doi-asserted-by":"crossref","unstructured":"Jacky Liang Fei Xia Wenhao Yu Andy Zeng Montse Gonzalez Arenas Maria Attarian Maria Bauz\u00e1 Matthew Bennice Alex Bewley Adil Dostmohamed et al. 2024. Learning to Learn Faster from Human Feedback with Language Model Predictive Control. arXiv:2402.11450. Retrieved from https:\/\/arxiv.org\/abs\/2402.11450","DOI":"10.15607\/RSS.2024.XX.125"},{"key":"e_1_3_3_97_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.3006254"},{"key":"e_1_3_3_98_2","unstructured":"Hao Liu Lisa Lee Kimin Lee and P. Abbeel. 2022. 
Instruction-following agents with multimodal transformer. arXiv:2210.13431. Retrieved from https:\/\/arxiv.org\/abs\/2210.13431"},{"key":"e_1_3_3_99_2","first-page":"1084","volume-title":"Proceedings of the Conference on Robot Learning","author":"Liu Jason Xinyu","year":"2023","unstructured":"Jason Xinyu Liu, Ziyi Yang, Ifrah Idrees, Sam Liang, Benjamin Schornstein, Stefanie Tellex, and Ankit Shah. 2023. Grounding complex natural language commands for temporal tasks in unseen environments. In Proceedings of the Conference on Robot Learning. PMLR, 1084\u20131110."},{"key":"e_1_3_3_100_2","doi-asserted-by":"publisher","DOI":"10.1037\/xge0001182"},{"key":"e_1_3_3_101_2","doi-asserted-by":"publisher","DOI":"10.1177\/02783649211050958"},{"key":"e_1_3_3_102_2","volume-title":"Proceedings of 12th International Conference on Learning Representations","author":"Luo Jianlan","year":"2024","unstructured":"Jianlan Luo, Perry Dong, Yuexiang Zhai, Yi Ma, and Sergey Levine. 2024. RLIF: Interactive imitation learning as reinforcement learning. In Proceedings of 12th International Conference on Learning Representations."},{"key":"e_1_3_3_103_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA40945.2020.9196664"},{"key":"e_1_3_3_104_2","doi-asserted-by":"publisher","DOI":"10.5555\/3305890.3305917"},{"key":"e_1_3_3_105_2","unstructured":"Jessica Maghakian Paul Mineiro Kishan Panaganti Mark Rucker Akanksha Saran and Cheng Tan. 2022. Personalized reward learning with interaction-grounded learning (IGL). arXiv:2211.15823. Retrieved from https:\/\/arxiv.org\/abs\/2211.15823"},{"key":"e_1_3_3_106_2","doi-asserted-by":"crossref","unstructured":"Bertram F. Malle and Matthias Scheutz. 2020. Moral competence for social robots. In Machine Ethics and Robot Ethics. 
Routledge 225\u2013230.","DOI":"10.4324\/9781003074991-19"},{"key":"e_1_3_3_107_2","volume-title":"Proceedings of the Conference on Robot Learning","author":"Mandlekar Ajay","year":"2021","unstructured":"Ajay Mandlekar, Danfei Xu, J. Wong, Soroush Nasiriany, Chen Wang, Rohun Kulkarni, Li Fei-Fei, Silvio Savarese, Yuke Zhu, and Roberto Mart\u00edn-Mart\u00edn. 2021. What matters in learning from offline human demonstrations for robot manipulation. In Proceedings of the Conference on Robot Learning. Retrieved from https:\/\/api.semanticscholar.org\/CorpusID:236956615"},{"key":"e_1_3_3_108_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICAR.2015.7251439"},{"key":"e_1_3_3_109_2","doi-asserted-by":"crossref","unstructured":"Gaurav Menghani. 2023. Efficient deep learning: A survey on making deep learning models smaller faster and better. ACM Computing Surveys 55 12 (2023) 1\u201337.","DOI":"10.1145\/3578938"},{"issue":"1","key":"e_1_3_3_110_2","article-title":"What is robotics made of? The interdisciplinary politics of robotics research","volume":"8","author":"Michalec Ola","year":"2021","unstructured":"Ola Michalec, Cian O\u2019Donovan, and Mehdi Sobhani. 2021. What is robotics made of? The interdisciplinary politics of robotics research. Humanities and Social Sciences Communications 8, 1 (2021).","journal-title":"Humanities and Social Sciences Communications"},{"key":"e_1_3_3_111_2","unstructured":"Tim Miller. 2017. Explanation in artificial intelligence: Insights from the social sciences. arXiv:1706.07269. 
Retrieved from http:\/\/arxiv.org\/abs\/1706.07269"},{"key":"e_1_3_3_112_2","doi-asserted-by":"publisher","DOI":"10.1109\/HRI53351.2022.9889551"},{"key":"e_1_3_3_113_2","doi-asserted-by":"publisher","DOI":"10.1145\/3568294.3579962"},{"key":"e_1_3_3_114_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-01589-2"},{"key":"e_1_3_3_115_2","doi-asserted-by":"publisher","DOI":"10.5555\/3463952.3463959"},{"key":"e_1_3_3_116_2","unstructured":"Dipendra Misra Akanksha Saran Tengyang Xie Alex Lamb and John Langford. 2024. Towards principled representation learning from videos for reinforcement learning. arXiv:2403.13765. Retrieved from https:\/\/arxiv.org\/abs\/2403.13765"},{"key":"e_1_3_3_117_2","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2023.3270034"},{"key":"e_1_3_3_118_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41598-024-60905-x"},{"key":"e_1_3_3_119_2","doi-asserted-by":"publisher","DOI":"10.1145\/153571.255960"},{"key":"e_1_3_3_120_2","first-page":"342","volume-title":"Proceedings of the Conference on Robot Learning","author":"Myers Vivek","year":"2022","unstructured":"Vivek Myers, Erdem Biyik, Nima Anari, and Dorsa Sadigh. 2022. Learning multimodal rewards from rankings. In Proceedings of the Conference on Robot Learning. PMLR, 342\u2013352."},{"key":"e_1_3_3_121_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA48891.2023.10160439"},{"key":"e_1_3_3_122_2","doi-asserted-by":"publisher","DOI":"10.1109\/ROBOT.2008.4543753"},{"key":"e_1_3_3_123_2","article-title":"Reinforcement learning with human advice: A survey","volume":"8","author":"Najar Anis","year":"2020","unstructured":"Anis Najar and Mohamed Chetouani. 2020. Reinforcement learning with human advice: A survey. Frontiers in Robotics and AI 8 (2020). 
Retrieved from https:\/\/api.semanticscholar.org\/CorpusID:218862857","journal-title":"Frontiers in Robotics and AI"},{"key":"e_1_3_3_124_2","doi-asserted-by":"publisher","DOI":"10.1145\/3568162.3576988"},{"key":"e_1_3_3_125_2","doi-asserted-by":"publisher","DOI":"10.1145\/3568162.3576988"},{"key":"e_1_3_3_126_2","doi-asserted-by":"publisher","DOI":"10.1109\/RO-MAN46459.2019.8956465"},{"key":"e_1_3_3_127_2","doi-asserted-by":"publisher","DOI":"10.1145\/3568162.3576965"},{"key":"e_1_3_3_128_2","doi-asserted-by":"publisher","DOI":"10.1145\/2909824.3020252"},{"key":"e_1_3_3_129_2","doi-asserted-by":"publisher","DOI":"10.2522\/ptj.20150240"},{"key":"e_1_3_3_130_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33016128"},{"key":"e_1_3_3_131_2","doi-asserted-by":"publisher","DOI":"10.3233\/FAIA230094"},{"key":"e_1_3_3_132_2","unstructured":"Carolina Parada. 2025. Gemini robotics on-device brings AI to local robotic devices. Retrieved from https:\/\/deepmind.google\/discover\/blog\/gemini-robotics-on-device-brings-ai-to-local-robotic-devices\/"},{"key":"e_1_3_3_133_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS40897.2019.8967974"},{"key":"e_1_3_3_134_2","unstructured":"Andi Peng Ilia Sucholutsky Belinda Z. Li Theodore R. Sumers Thomas L. Griffiths Jacob Andreas and Julie A. Shah. 2024. Learning with language-guided state abstractions. arXiv:2402.18759. Retrieved from https:\/\/arxiv.org\/abs\/2402.18759"},{"issue":"09","key":"e_1_3_3_135_2","article-title":"On the loop of action modification and the recipient\u2019s gaze in adult-child interaction","volume":"24","author":"Pitsch Karola","year":"2009","unstructured":"Karola Pitsch, Anna-Lisa Vollmer, Jannik Fritsch, Britta Wrede, Katharina Rohlfing, and Gerhard Sagerer. 2009. On the loop of action modification and the recipient\u2019s gaze in adult-child interaction. 
Gesture and Speech in Interaction 24, 09 (2009).","journal-title":"Gesture and Speech in Interaction"},{"key":"e_1_3_3_136_2","article-title":"Alvinn: An autonomous land vehicle in a neural network","volume":"1","author":"Pomerleau Dean A.","year":"1988","unstructured":"Dean A. Pomerleau. 1988. Alvinn: An autonomous land vehicle in a neural network. Advances in Neural Information Processing Systems 1 (1988).","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_3_137_2","doi-asserted-by":"publisher","DOI":"10.1145\/1121241.1121280"},{"key":"e_1_3_3_138_2","unstructured":"Rob Price. 2016. Microsoft is deleting its AI chatbot\u2019s incredibly racist tweets. Business Insider. Retrieved from https:\/\/web.archive.org\/web\/20190130071430\/https:\/\/www.businessinsider.com\/microsoft-deletes-racist-genocidal-tweets-from-ai-chatbot-tay-2016-3"},{"key":"e_1_3_3_139_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2023.104466"},{"key":"e_1_3_3_140_2","doi-asserted-by":"publisher","DOI":"10.1145\/3610978.3638160"},{"key":"e_1_3_3_141_2","doi-asserted-by":"publisher","DOI":"10.5435\/JAAOS-D-14-00432"},{"issue":"3","key":"e_1_3_3_142_2","doi-asserted-by":"crossref","first-page":"717","DOI":"10.1109\/TCDS.2020.3044366","article-title":"Explanation as a social practice: Toward a conceptual framework for the social design of AI systems","volume":"13","author":"Rohlfing Katharina J.","year":"2020","unstructured":"Katharina J. Rohlfing, Philipp Cimiano, Ingrid Scharlau, Tobias Matzner, Heike M. Buhl, Hendrik Buschmeier, Elena Esposito, Angela Grimminger, Barbara Hammer, Reinhold H\u00e4b-Umbach, et al. 2020. Explanation as a social practice: Toward a conceptual framework for the social design of AI systems. 
IEEE Transactions on Cognitive and Developmental Systems 13, 3 (2020), 717\u2013728.","journal-title":"IEEE Transactions on Cognitive and Developmental Systems"},{"key":"e_1_3_3_143_2","doi-asserted-by":"publisher","unstructured":"Astrid Marieke Rosenthal-von der P\u00fctten David Sirkin Anna Abrams and Laura Platte. 2020. The forgotten in HRI: Incidental encounters with robots in public spaces 656\u2013657. DOI: 10.1145\/3371382.3374852","DOI":"10.1145\/3371382.3374852"},{"key":"e_1_3_3_144_2","doi-asserted-by":"publisher","unstructured":"S. A. E. International. 2021. Taxonomy and definitions for terms related to driving automation systems for on-road motor vehicles. Technical Report J3016_202104. SAE International. DOI: 10.4271\/J3016_202104","DOI":"10.4271\/J3016_202104"},{"key":"e_1_3_3_145_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS47612.2022.9981053"},{"key":"e_1_3_3_146_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2018.8593580"},{"key":"e_1_3_3_147_2","first-page":"1247","volume-title":"Proceedings of the Conference on Robot Learning","author":"Saran Akanksha","year":"2020","unstructured":"Akanksha Saran, Elaine Schaertl Short, Andrea Thomaz, and Scott Niekum. 2020. Understanding teacher gaze patterns for robot learning. In Proceedings of the Conference on Robot Learning. PMLR, 1247\u20131258."},{"key":"e_1_3_3_148_2","doi-asserted-by":"publisher","DOI":"10.65109\/EVCM1060"},{"key":"e_1_3_3_149_2","unstructured":"William Saunders Girish Sastry Andreas Stuhlmueller and Owain Evans. 2017. Trial without error: Towards safe reinforcement learning via human intervention. arXiv:1707.05173. 
Retrieved from https:\/\/arxiv.org\/abs\/1707.05173"},{"key":"e_1_3_3_150_2","doi-asserted-by":"publisher","DOI":"10.1145\/3610978.3640655"},{"key":"e_1_3_3_151_2","doi-asserted-by":"publisher","DOI":"10.1109\/HRI53351.2022.9889616"},{"key":"e_1_3_3_152_2","first-page":"1315","article-title":"Negligence and AI\u2019s human users","volume":"100","author":"Selbst Andrew D.","year":"2020","unstructured":"Andrew D. Selbst. 2020. Negligence and AI\u2019s human users. BUL Rev 100 (2020), 1315.","journal-title":"BUL Rev"},{"key":"e_1_3_3_153_2","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2021.3100603"},{"key":"e_1_3_3_154_2","doi-asserted-by":"publisher","DOI":"10.1561\/2300000081"},{"key":"e_1_3_3_155_2","doi-asserted-by":"publisher","DOI":"10.15607\/RSS.2022.XVIII.065"},{"key":"e_1_3_3_156_2","article-title":"Balancing multiple sources of reward in reinforcement learning","volume":"13","author":"Shelton Christian","year":"2000","unstructured":"Christian Shelton. 2000. Balancing multiple sources of reward in reinforcement learning. In Advances in Neural Information Processing Systems. T. Leen, T. Dietterich, and V. Tresp (Eds.), Vol. 13, MIT Press. Retrieved from https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2000\/file\/e0ab531ec312161511493b002f9be2ee-Paper.pdf","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_3_157_2","unstructured":"Hua Shen Tiffany Knearem Reshmi Ghosh Kenan Alkiek Kundan Krishna Yachuan Liu Ziqiao Ma Savvas Petridis Yi-Hao Peng Li Qiwei et al. 2024. Towards bidirectional human-AI alignment: A systematic review for clarifications framework and future directions. arXiv:2406.09264. 
Retrieved from https:\/\/arxiv.org\/abs\/2406.09264"},{"key":"e_1_3_3_158_2","doi-asserted-by":"publisher","DOI":"10.1177\/0018720816644364"},{"key":"e_1_3_3_159_2","first-page":"101","volume-title":"Proceedings of the 2024 International Symposium on Technological Advances in Human-Robot Interaction","author":"Shpiro Einav","year":"2024","unstructured":"Einav Shpiro and Reuth Mirsky. 2024. Recognition and identification of intentional blocking in social navigation. In Proceedings of the 2024 International Symposium on Technological Advances in Human-Robot Interaction, 101\u2013110."},{"key":"e_1_3_3_160_2","unstructured":"Adrian Stoica. 1995. Motion Learning by Robot Apprentices: A Fuzzy Neural Approach. Ph.D. Dissertation. Victoria University of Technology."},{"key":"e_1_3_3_161_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.cognition.2022.105326"},{"key":"e_1_3_3_162_2","volume-title":"Reinforcement Learning: An Introduction","author":"Sutton Richard S.","year":"2018","unstructured":"Richard S. Sutton and Andrew G. Barto. 2018. Reinforcement Learning: An Introduction. MIT Press."},{"key":"e_1_3_3_163_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-54173-6_11"},{"key":"e_1_3_3_164_2","doi-asserted-by":"publisher","DOI":"10.1007\/s43154-020-00019-0"},{"key":"e_1_3_3_165_2","first-page":"360","article-title":"Guide dog robot","author":"Tachi Susumu","year":"1984","unstructured":"Susumu Tachi and Kiyoshi Komoriya. 1984. Guide dog robot. 
Autonomous Mobile Robots: Control, Planning, and Architecture (1984), 360\u2013367.","journal-title":"Autonomous Mobile Robots: Control, Planning, and Architecture"},{"key":"e_1_3_3_166_2","doi-asserted-by":"publisher","DOI":"10.1080\/01691864.2021.1928552"},{"key":"e_1_3_3_167_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-021-06375-y"},{"key":"e_1_3_3_168_2","doi-asserted-by":"publisher","DOI":"10.5555\/2031678.2031705"},{"key":"e_1_3_3_169_2","doi-asserted-by":"publisher","DOI":"10.3233\/FAIA230071"},{"key":"e_1_3_3_170_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2013.6630576"},{"key":"e_1_3_3_171_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA48891.2023.10160504"},{"key":"e_1_3_3_172_2","doi-asserted-by":"publisher","DOI":"10.3389\/frobt.2018.00077"},{"key":"e_1_3_3_173_2","unstructured":"Anna-Lisa Vollmer Daniel Leidner Michael Beetz and Britta Wrede. 2023. From interactive to co-constructive task learning. arXiv:2305.15535. Retrieved from https:\/\/arxiv.org\/abs\/2305.15535"},{"key":"e_1_3_3_174_2","doi-asserted-by":"publisher","DOI":"10.1109\/DEVLRN.2009.5175516"},{"key":"e_1_3_3_175_2","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0091349"},{"key":"e_1_3_3_176_2","doi-asserted-by":"publisher","DOI":"10.1007\/s13164-017-0353-4"},{"key":"e_1_3_3_177_2","first-page":"10","article-title":"Pragmatic frames for teaching and learning in human\u2013robot interaction: Review and challenges","volume":"10","author":"Vollmer Anna-Lisa","year":"2016","unstructured":"Anna-Lisa Vollmer, Britta Wrede, Katharina J. Rohlfing, and Pierre-Yves Oudeyer. 2016. Pragmatic frames for teaching and learning in human\u2013robot interaction: Review and challenges. 
Frontiers in Neurorobotics 10 (2016), 10.","journal-title":"Frontiers in Neurorobotics"},{"key":"e_1_3_3_178_2","first-page":"137","volume-title":"Proceedings of the Australasian Joint Conference on Artificial Intelligence","author":"Vu Thuy-Trang","year":"2024","unstructured":"Thuy-Trang Vu, Shahram Khadivi, Mahsa Ghorbanali, Dinh Phung, and Gholamreza Haffari. 2024. Active continual learning: On balancing knowledge retention and learnability. In Proceedings of the Australasian Joint Conference on Artificial Intelligence. Springer, 137\u2013150."},{"key":"e_1_3_3_179_2","doi-asserted-by":"publisher","DOI":"10.1109\/TASE.2021.3074873"},{"issue":"9","key":"e_1_3_3_180_2","first-page":"4555","article-title":"A survey on curriculum learning","volume":"44","author":"Wang Xin","year":"2021","unstructured":"Xin Wang, Yudong Chen, and Wenwu Zhu. 2021. A survey on curriculum learning. IEEE Transactions on Pattern Analysis and Machine Intelligence 44, 9 (2021), 4555\u20134576.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_3_181_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2017.7989121"},{"key":"e_1_3_3_182_2","volume-title":"Proceedings of the 5th Conference on Robot Learning","author":"Wilde Nils","year":"2021","unstructured":"Nils Wilde, Erdem Biyik, Dorsa Sadigh, and Stephen L. Smith. 2021. Learning reward functions from scale feedback. In Proceedings of the 5th Conference on Robot Learning."},{"key":"e_1_3_3_183_2","doi-asserted-by":"publisher","DOI":"10.15607\/RSS.2020.XVI.059"},{"key":"e_1_3_3_184_2","first-page":"11414","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Xie Tengyang","year":"2021","unstructured":"Tengyang Xie, John Langford, Paul Mineiro, and Ida Momennejad. 2021. Interaction-grounded learning. In Proceedings of the International Conference on Machine Learning. 
PMLR, 11414\u201311423."},{"key":"e_1_3_3_185_2","first-page":"12529","article-title":"Interaction-grounded learning with action-inclusive feedback","volume":"35","author":"Xie Tengyang","year":"2022","unstructured":"Tengyang Xie, Akanksha Saran, Dylan J. Foster, Lekan Molu, Ida Momennejad, Nan Jiang, Paul Mineiro, and John Langford. 2022. Interaction-grounded learning with action-inclusive feedback. Advances in Neural Information Processing Systems 35 (2022), 12529\u201312541.","journal-title":"Advances in Neural Information Processing Systems"},{"issue":"9","key":"e_1_3_3_186_2","first-page":"1813","article-title":"Safe reinforcement learning: A survey","volume":"49","author":"Rong Wang","year":"2023","unstructured":"Wang Rong, Xue-Song Wang Rong, and Cheng Yu-Hu. 2023. Safe reinforcement learning: A survey. Acta Automatica Sinica 49, 9 (2023), 1813\u20131835.","journal-title":"Acta Automatica Sinica"},{"key":"e_1_3_3_187_2","doi-asserted-by":"crossref","unstructured":"Yutao Yang Jie Zhou Junsong Li Qianjun Pan Bihao Zhan Qin Chen Xipeng Qiu and Liang He. 2025. Reinforced interactive continual learning via real-time noisy human feedback. arXiv:2505.09925. Retrieved from https:\/\/arxiv.org\/abs\/2505.09925","DOI":"10.2139\/ssrn.5966945"},{"key":"e_1_3_3_188_2","volume-title":"Proceedings of the Conference on Robot Learning","author":"Yang Zhaojing","year":"2024","unstructured":"Zhaojing Yang, Miru Jun, Jeremy Tien, Stuart J. Russell, Anca Dragan, and Erdem B\u0131y\u0131k. 2024. Trajectory improvement and reward learning from comparative language feedback. 
In Proceedings of the Conference on Robot Learning."},{"key":"e_1_3_3_189_2","doi-asserted-by":"publisher","DOI":"10.1108\/IJHG-02-2019-0012"},{"key":"e_1_3_3_190_2","doi-asserted-by":"publisher","DOI":"10.1145\/3434074.3447207"},{"key":"e_1_3_3_191_2","doi-asserted-by":"publisher","DOI":"10.1109\/access.2020.2983149"},{"issue":"1","key":"e_1_3_3_192_2","article-title":"Systems of collaboration: Challenges and solutions for interdisciplinary research in AI and social robotics","volume":"2","author":"Zeller Frauke","year":"2022","unstructured":"Frauke Zeller and Lauren Dwyer. 2022. Systems of collaboration: Challenges and solutions for interdisciplinary research in AI and social robotics. Discover Artificial Intelligence 2, 1 (2022), 12.","journal-title":"Discover Artificial Intelligence"},{"key":"e_1_3_3_193_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2020\/689"},{"key":"e_1_3_3_194_2","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation","author":"Zhang Tianhao","year":"2017","unstructured":"Tianhao Zhang, Zoe McCarthy, Owen Jow, Dennis Lee, Ken Goldberg, and P. Abbeel. 2017. Deep imitation learning for complex manipulation tasks from virtual reality teleoperation. In Proceedings of the IEEE International Conference on Robotics and Automation. Retrieved from https:\/\/api.semanticscholar.org\/CorpusID:3720790"},{"key":"e_1_3_3_195_2","doi-asserted-by":"crossref","unstructured":"Gaoyue Zhou Victoria Dean Mohan Kumar Srirama Aravind Rajeswaran Jyothish Pari Kyle Hatch Aryan Jain Tianhe Yu Pieter Abbeel Lerrel Pinto Chelsea Finn and Abhinav Gupta. 2023. Train offline test online: A real robot learning benchmark. arXiv:2306.00942. 
Retrieved from https:\/\/arxiv.org\/abs\/2306.00942","DOI":"10.1109\/ICRA48891.2023.10160594"},{"key":"e_1_3_3_196_2","first-page":"850","volume-title":"Proceedings of the Conference on Robot Learning","author":"Zhou Yilun","year":"2022","unstructured":"Yilun Zhou, Serena Booth, Nadia Figueroa, and Julie Shah. 2022. RoCUS: Robot controller understanding via sampling. In Proceedings of the Conference on Robot Learning. PMLR, 850\u2013860."}],"container-title":["ACM Transactions on Human-Robot Interaction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3779297","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,10]],"date-time":"2026-03-10T13:51:49Z","timestamp":1773150709000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3779297"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,2,23]]},"references-count":195,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2026,3,31]]}},"alternative-id":["10.1145\/3779297"],"URL":"https:\/\/doi.org\/10.1145\/3779297","relation":{},"ISSN":["2573-9522"],"issn-type":[{"value":"2573-9522","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,2,23]]},"assertion":[{"value":"2024-12-20","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-10-31","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2026-02-23","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}