{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T01:22:34Z","timestamp":1760059354497,"version":"build-2065373602"},"reference-count":68,"publisher":"Association for Computing Machinery (ACM)","issue":"OOPSLA2","license":[{"start":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T00:00:00Z","timestamp":1759968000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"NSF","doi-asserted-by":"publisher","award":["CCF-2124155"],"award-info":[{"award-number":["CCF-2124155"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. ACM Program. Lang."],"published-print":{"date-parts":[[2025,10,9]]},"abstract":"<jats:p>Over the past decade, deep reinforcement learning (RL) techniques have significantly advanced robotic systems. However, due to the complex architectures of neural network models, ensuring their trustworthiness is a considerable challenge. Programmatic reinforcement learning has surfaced as a promising approach. Nonetheless, synthesizing robot-control programs remains challenging. Existing methods rely on domain-specific languages (DSLs) populated with user-defined state abstraction predicates and a library of low-level controllers as abstract actions to boot synthesis, which is impractical in unknown environments that lack such predefined components. To address this limitation, we introduce RoboScribe, a novel abstraction refinement-guided program synthesis framework that automatically derives robot state and action abstractions from raw, unsegmented task demonstrations in high-dimensional, continuous spaces. It iteratively enriches and refines an initially coarse abstraction until it generates a task-solving program over the abstracted robot environment. RoboScribe is effective in synthesizing iterative programs by inferring recurring subroutines directly from the robot\u2019s raw, continuous state and action spaces, without needing predefined abstractions. Experimental results show that RoboScribe programs inductively generalize to long-horizon robot tasks involving arbitrary numbers of objects, outperforming baseline methods in terms of both interpretability and efficiency.<\/jats:p>","DOI":"10.1145\/3763070","type":"journal-article","created":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T08:49:50Z","timestamp":1759999790000},"page":"555-583","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Abstraction Refinement-Guided Program Synthesis for Robot Learning from Demonstrations"],"prefix":"10.1145","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7994-915X","authenticated-orcid":false,"given":"Guofeng","family":"Cui","sequence":"first","affiliation":[{"name":"Rutgers University, New Brunswick, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-4317-9758","authenticated-orcid":false,"given":"Yuning","family":"Wang","sequence":"additional","affiliation":[{"name":"Rutgers University, New Brunswick, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-6182-6556","authenticated-orcid":false,"given":"Wensen","family":"Mao","sequence":"additional","affiliation":[{"name":"Rutgers University, New Brunswick, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2696-2668","authenticated-orcid":false,"given":"Yuanlin","family":"Duan","sequence":"additional","affiliation":[{"name":"Rutgers University, New Brunswick, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9606-150X","authenticated-orcid":false,"given":"He","family":"Zhu","sequence":"additional","affiliation":[{"name":"Rutgers University, New Brunswick, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,10,9]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1613\/JAIR.1.13073"},{"key":"e_1_2_1_2_1","volume-title":"Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence.","author":"Andre David","unstructured":"David Andre and Stuart J. Russell. 2002. State Abstraction for Programmable Reinforcement Learning Agents. In Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence."},{"key":"e_1_2_1_3_1","volume-title":"Hindsight Experience Replay. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017","author":"Andrychowicz Marcin","year":"2017","unstructured":"Marcin Andrychowicz, Dwight Crow, Alex Ray, Jonas Schneider, Rachel Fong, Peter Welinder, Bob McGrew, Josh Tobin, Pieter Abbeel, and Wojciech Zaremba. 2017. Hindsight Experience Replay. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, December 4-9, 2017, Long Beach, CA, USA. 5048\u20135058."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.5555\/3298483.3298491"},{"key":"e_1_2_1_5_1","volume-title":"Proc. ACM Program. Lang..","author":"Barke Shraddha","year":"2020","unstructured":"Shraddha Barke, Hila Peleg, and Nadia Polikarpova. 2020. Just-in-time learning for bottom-up enumerative synthesis. Proc. ACM Program. Lang.."},{"key":"e_1_2_1_6_1","volume-title":"Advances in Neural Information Processing Systems, NeurIPS","author":"Bastani Osbert","year":"2018","unstructured":"Osbert Bastani, Yewen Pu, and Armando Solar-Lezama. 2018. Verifiable Reinforcement Learning via Policy Extraction. In Advances in Neural Information Processing Systems, NeurIPS 2018."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.24963\/IJCAI.2019\/875"},{"key":"e_1_2_1_8_1","volume-title":"Syntia: Synthesizing the Semantics of Obfuscated Code. In 26th USENIX Security Symposium, USENIX Security","author":"Blazytko Tim","year":"2017","unstructured":"Tim Blazytko, Moritz Contag, Cornelius Aschermann, and Thorsten Holz. 2017. Syntia: Synthesizing the Semantics of Obfuscated Code. In 26th USENIX Security Symposium, USENIX Security 2017."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3571234"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/3571207"},{"key":"e_1_2_1_11_1","volume-title":"CAV","author":"Chen Yanju","year":"2020","unstructured":"Yanju Chen, Chenglong Wang, Osbert Bastani, Isil Dillig, and Yu Feng. 2020. Program Synthesis Using Deduction-Guided Reinforcement Learning. In Computer Aided Verification - 32nd International Conference, CAV 2020."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASE56229.2023.00129"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","unstructured":"Guofeng Cui Yuning Wang Wensen Mao Yuanlin Duan and He Zhu. 2025. Abstraction Refinement-Guided Program Synthesis for Robot Learning from Demonstrations (Artifact). Zenodo. https:\/\/doi.org\/10.5281\/zenodo.16929200 10.5281\/zenodo.16929200","DOI":"10.5281\/zenodo.16929200"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3656447"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1609\/AAAI.V36I5.20475"},{"key":"e_1_2_1_16_1","volume-title":"Conditional Abstraction Trees for Sample-Efficient Reinforcement Learning. In The 39th Conference on Uncertainty in Artificial Intelligence. https:\/\/openreview.net\/forum?id=tQP094M0j8G","author":"Dadvar Mehdi","year":"2023","unstructured":"Mehdi Dadvar, Rashmeet Kaur Nayyar, and Siddharth Srivastava. 2023. Conditional Abstraction Trees for Sample-Efficient Reinforcement Learning. In The 39th Conference on Uncertainty in Artificial Intelligence. https:\/\/openreview.net\/forum?id=tQP094M0j8G"},{"key":"e_1_2_1_17_1","volume-title":"Conference on Robot Learning","volume":"223","author":"de Haro Joaquim Ortiz","year":"2021","unstructured":"Joaquim Ortiz de Haro, Jung-Su Ha, Danny Driess, and Marc Toussaint. 2021. Structured deep generative models for sampling on constraint manifolds in sequential manipulation. In Conference on Robot Learning, 8-11 November 2021, London, UK, Aleksandra Faust, David Hsu, and Gerhard Neumann (Eds.) (Proceedings of Machine Learning Research, Vol. 164). PMLR, 213\u2013223. https:\/\/proceedings.mlr.press\/v164\/ortiz-haro22a.html"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-43421-1_36"},{"key":"e_1_2_1_19_1","doi-asserted-by":"crossref","unstructured":"Thomas G. Dietterich. 2000. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition. J. Artif. Intell. Res..","DOI":"10.1007\/3-540-44914-0_2"},{"key":"e_1_2_1_20_1","volume-title":"Goal-conditioned Imitation Learning. In Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019","author":"Ding Yiming","year":"2019","unstructured":"Yiming Ding, Carlos Florensa, Pieter Abbeel, and Mariano Phielipp. 2019. Goal-conditioned Imitation Learning. In Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8-14, 2019, Vancouver, BC, Canada, Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d\u2019Alch\u00e9-Buc, Emily B. Fox, and Roman Garnett (Eds.). 15298\u201315309. https:\/\/proceedings.neurips.cc\/paper\/2019\/hash\/c8d3a760ebab631565f8509d84b3b3f1-Abstract.html"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/3453483.3454080"},{"key":"e_1_2_1_22_1","volume-title":"Proceedings of the 38th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2017.","author":"Feng Yu","year":"2017","unstructured":"Yu Feng, Ruben Martins, Jacob Van Geffen, Isil Dillig, and Swarat Chaudhuri. 2017. Component-based synthesis of table consolidation and transformation tasks from examples. In Proceedings of the 38th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2017."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1146\/ANNUREV-CONTROL-091420-084139"},{"key":"e_1_2_1_24_1","volume-title":"International conference on machine learning. 1861\u20131870","author":"Haarnoja Tuomas","year":"2018","unstructured":"Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, and Sergey Levine. 2018. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. In International conference on machine learning. 1861\u20131870."},{"key":"e_1_2_1_25_1","volume-title":"Generative Adversarial Imitation Learning. In Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016","author":"Ho Jonathan","year":"2016","unstructured":"Jonathan Ho and Stefano Ermon. 2016. Generative Adversarial Imitation Learning. In Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5-10, 2016, Barcelona, Spain, Daniel D. Lee, Masashi Sugiyama, Ulrike von Luxburg, Isabelle Guyon, and Roman Garnett (Eds.). 4565\u20134573. https:\/\/proceedings.neurips.cc\/paper\/2016\/hash\/cc7e2b878868cbae992d1fb743995d8f-Abstract.html"},{"key":"e_1_2_1_26_1","volume-title":"4th Conference on Robot Learning, CoRL 2020","volume":"1480","author":"Holtz Jarrett","year":"2020","unstructured":"Jarrett Holtz, Arjun Guha, and Joydeep Biswas. 2020. Robot Action Selection Learning via Layered Dimension Informed Program Synthesis. In 4th Conference on Robot Learning, CoRL 2020, 16-18 November 2020, Virtual Event \/ Cambridge, MA, USA, Jens Kober, Fabio Ramos, and Claire J. Tomlin (Eds.) (Proceedings of Machine Learning Research, Vol. 155). PMLR, 1471\u20131480. https:\/\/proceedings.mlr.press\/v155\/holtz21a.html"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS47612.2022.9981962"},{"key":"e_1_2_1_28_1","volume-title":"8th International Conference on Learning Representations, ICLR","author":"Inala Jeevana Priya","year":"2020","unstructured":"Jeevana Priya Inala, Osbert Bastani, Zenna Tavares, and Armando Solar-Lezama. 2020. Synthesizing Programmatic Policies that Inductively Generalize. In 8th International Conference on Learning Representations, ICLR 2020."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1613\/JAIR.5575"},{"key":"e_1_2_1_30_1","volume-title":"7th International Conference on Learning Representations, ICLR","author":"Lee Youngwoon","year":"2019","unstructured":"Youngwoon Lee, Shao-Hua Sun, Sriram Somasundaram, Edward S. Hu, and Joseph J. Lim. 2019. Composing Complex Skills by Learning Transition Policies. In 7th International Conference on Learning Representations, ICLR 2019."},{"key":"e_1_2_1_31_1","volume-title":"8th International Conference on Learning Representations, ICLR","author":"Lee Youngwoon","year":"2020","unstructured":"Youngwoon Lee, Jingyun Yang, and Joseph J. Lim. 2020. Learning to Coordinate Manipulation Skills via Skill Behavior Diversification. In 8th International Conference on Learning Representations, ICLR 2020."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA40945.2020.9197468"},{"key":"e_1_2_1_33_1","volume-title":"Proceedings of the 2023 International Conference on Robotics and Automation (ICRA).","author":"Liang Junchi","year":"2023","unstructured":"Junchi Liang and Abdeslam Boularias. 2023. Learning Category-Level Manipulation Tasks from Point Clouds with Dynamic Graph CNNs. In Proceedings of the 2023 International Conference on Robotics and Automation (ICRA)."},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA48891.2023.10160591"},{"key":"e_1_2_1_35_1","volume-title":"8th International Conference on Learning Representations, ICLR","author":"Lin Zhixuan","year":"2020","unstructured":"Zhixuan Lin, Yi-Fu Wu, Skand Vishwanath Peri, Weihao Sun, Gautam Singh, Fei Deng, Jindong Jiang, and Sungjin Ahn. 2020. SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition. In 8th International Conference on Learning Representations, ICLR 2020."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS45743.2020.9341535"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2017.8202206"},{"key":"e_1_2_1_38_1","volume-title":"British Machine Vision Conference 2018, BMVC 2018.","author":"Mitash Chaitanya","unstructured":"Chaitanya Mitash, Abdeslam Boularias, and Kostas E. Bekris. 2018. Robust 6D Object Pose Estimation with Stochastic Congruent Sets. In British Machine Vision Conference 2018, BMVC 2018."},{"key":"e_1_2_1_39_1","volume-title":"Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, NeurIPS Datasets and Benchmarks 2021","author":"Mu Tongzhou","year":"2021","unstructured":"Tongzhou Mu, Zhan Ling, Fanbo Xiang, Derek Yang, Xuanlin Li, Stone Tao, Zhiao Huang, Zhiwei Jia, and Hao Su. 2021. ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale Demonstrations. In Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, NeurIPS Datasets and Benchmarks 2021, December 2021, virtual, Joaquin Vanschoren and Sai-Kit Yeung (Eds.). https:\/\/datasets-benchmarks-proceedings.neurips.cc\/paper\/2021\/hash\/eda80a3d5b344bc40f3bc04f65b7a357-Abstract-round2.html"},{"key":"e_1_2_1_40_1","volume-title":"Data-Efficient Hierarchical Reinforcement Learning. In Annual Conference on Neural Information Processing Systems, NeurIPS","author":"Nachum Ofir","year":"2018","unstructured":"Ofir Nachum, Shixiang Gu, Honglak Lee, and Sergey Levine. 2018. Data-Efficient Hierarchical Reinforcement Learning. In Annual Conference on Neural Information Processing Systems, NeurIPS 2018."},{"key":"e_1_2_1_41_1","volume-title":"Overcoming Exploration in Reinforcement Learning with Demonstrations. CoRR, abs\/1709.10089","author":"Nair Ashvin","year":"2017","unstructured":"Ashvin Nair, Bob McGrew, Marcin Andrychowicz, Wojciech Zaremba, and Pieter Abbeel. 2017. Overcoming Exploration in Reinforcement Learning with Demonstrations. CoRR, abs\/1709.10089 (2017), arXiv:1709.10089. arxiv:1709.10089"},{"key":"e_1_2_1_42_1","volume-title":"Planning with Goal-Conditioned Policies. In Annual Conference on Neural Information Processing Systems, NeurIPS","author":"Nasiriany Soroush","year":"2019","unstructured":"Soroush Nasiriany, Vitchyr Pong, Steven Lin, and Sergey Levine. 2019. Planning with Goal-Conditioned Policies. In Annual Conference on Neural Information Processing Systems, NeurIPS 2019."},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1613\/JAIR.2113"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/3632860"},{"key":"e_1_2_1_45_1","volume-title":"MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies. In Annual Conference on Neural Information Processing Systems, NeurIPS","author":"Peng Xue Bin","year":"2019","unstructured":"Xue Bin Peng, Michael Chang, Grace Zhang, Pieter Abbeel, and Sergey Levine. 2019. MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies. In Annual Conference on Neural Information Processing Systems, NeurIPS 2019."},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/3568162.3576991"},{"key":"e_1_2_1_47_1","volume-title":"10th International Conference on Learning Representations, ICLR","author":"Qiu Wenjie","year":"2022","unstructured":"Wenjie Qiu and He Zhu. 2022. Programmatic Reinforcement Learning without Oracles. In 10th International Conference on Learning Representations, ICLR 2022."},{"key":"e_1_2_1_48_1","volume-title":"Composing Task-Agnostic Policies with Deep Reinforcement Learning. In 8th International Conference on Learning Representations, ICLR 2020","author":"Qureshi Ahmed Hussain","year":"2020","unstructured":"Ahmed Hussain Qureshi, Jacob J. Johnson, Yuzhe Qin, Taylor Henderson, Byron Boots, and Michael C. Yip. 2020. Composing Task-Agnostic Policies with Deep Reinforcement Learning. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020."},{"key":"e_1_2_1_49_1","first-page":"1","article-title":"Stable-Baselines3: Reliable Reinforcement Learning Implementations","volume":"22","author":"Raffin Antonin","year":"2021","unstructured":"Antonin Raffin, Ashley Hill, Adam Gleave, Anssi Kanervisto, Maximilian Ernestus, and Noah Dormann. 2021. Stable-Baselines3: Reliable Reinforcement Learning Implementations. Journal of Machine Learning Research, 22, 268 (2021), 1\u20138. http:\/\/jmlr.org\/papers\/v22\/20-1364.html","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_2_1_50_1","volume-title":"Real-Time Object Detection. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR","author":"Redmon Joseph","year":"2016","unstructured":"Joseph Redmon, Santosh Kumar Divvala, Ross B. Girshick, and Ali Farhadi. 2016. You Only Look Once: Unified, Real-Time Object Detection. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016. 779\u2013788."},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/TAC.2024.3394313"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2019.8793966"},{"key":"e_1_2_1_53_1","volume-title":"Few-Shot Bayesian Imitation Learning with Logical Program Policies. In The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI","author":"Silver Tom","year":"2020","unstructured":"Tom Silver, Kelsey R. Allen, Alex K. Lew, Leslie Pack Kaelbling, and Josh Tenenbaum. 2020. Few-Shot Bayesian Imitation Learning with Logical Program Policies. In The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020."},{"key":"e_1_2_1_54_1","volume-title":"Learning Neuro-Symbolic Skills for Bilevel Planning. In Conference on Robot Learning, CoRL 2022","volume":"714","author":"Silver Tom","year":"2022","unstructured":"Tom Silver, Ashay Athalye, Joshua B. Tenenbaum, Tom\u00e1s Lozano-P\u00e9rez, and Leslie Pack Kaelbling. 2022. Learning Neuro-Symbolic Skills for Bilevel Planning. In Conference on Robot Learning, CoRL 2022, 14-18 December 2022, Auckland, New Zealand, Karen Liu, Dana Kulic, and Jeffrey Ichnowski (Eds.) (Proceedings of Machine Learning Research, Vol. 205). PMLR, 701\u2013714. https:\/\/proceedings.mlr.press\/v205\/silver23a.html"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1609\/AAAI.V37I10.26429"},{"key":"e_1_2_1_56_1","volume-title":"Learning Symbolic Operators for Task and Motion Planning. In IEEE\/RSJ International Conference on Intelligent Robots and Systems, IROS","author":"Silver Tom","year":"2021","unstructured":"Tom Silver, Rohan Chitnis, Joshua B. Tenenbaum, Leslie Pack Kaelbling, and Tom\u00e1s Lozano-P\u00e9rez. 2021. Learning Symbolic Operators for Task and Motion Planning. In IEEE\/RSJ International Conference on Intelligent Robots and Systems, IROS 2021."},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2019.2894875"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS40897.2019.8967548"},{"key":"e_1_2_1_59_1","doi-asserted-by":"crossref","unstructured":"Siddharth Srivastava Neil Immerman and Shlomo Zilberstein. 2011. A new representation and associated algorithms for generalized planning. Artif. Intell..","DOI":"10.1016\/j.artint.2010.10.006"},{"key":"e_1_2_1_60_1","volume-title":"Annual Conference on Neural Information Processing Systems 2021","author":"Trivedi Dweep","year":"2021","unstructured":"Dweep Trivedi, Jesse Zhang, Shao-Hua Sun, and Joseph J Lim. 2021. Learning to Synthesize Programs as Interpretable and Generalizable Policies. In Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021."},{"key":"e_1_2_1_61_1","volume-title":"CoRR, abs\/1706.03762","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention Is All You Need. CoRR, abs\/1706.03762 (2017), arXiv:1706.03762. arxiv:1706.03762"},{"key":"e_1_2_1_62_1","volume-title":"Imitation-Projected Programmatic Reinforcement Learning. In Annual Conference on Neural Information Processing Systems 2019","author":"Verma Abhinav","year":"2019","unstructured":"Abhinav Verma, Hoang Minh Le, Yisong Yue, and Swarat Chaudhuri. 2019. Imitation-Projected Programmatic Reinforcement Learning. In Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019."},{"key":"e_1_2_1_63_1","volume-title":"Proceedings of the 35th International Conference on Machine Learning, ICML 2018.","author":"Verma Abhinav","year":"2018","unstructured":"Abhinav Verma, Vijayaraghavan Murali, Rishabh Singh, Pushmeet Kohli, and Swarat Chaudhuri. 2018. Programmatically Interpretable Reinforcement Learning. In Proceedings of the 35th International Conference on Machine Learning, ICML 2018."},{"key":"e_1_2_1_64_1","volume-title":"Strategic Attentive Writer for Learning Macro-Actions. In Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016","author":"Vezhnevets Alexander","year":"2016","unstructured":"Alexander Vezhnevets, Volodymyr Mnih, Simon Osindero, Alex Graves, Oriol Vinyals, John P. Agapiou, and Koray Kavukcuoglu. 2016. Strategic Attentive Writer for Learning Macro-Actions. In Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5-10, 2016, Barcelona, Spain."},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2024.3385691"},{"key":"e_1_2_1_66_1","volume-title":"Program Synthesis Guided Reinforcement Learning for Partially Observed Environments. In Annual Conference on Neural Information Processing Systems 2021","author":"Yang Yichen","year":"2021","unstructured":"Yichen Yang, Jeevana Priya Inala, Osbert Bastani, Yewen Pu, Armando Solar-Lezama, and Martin Rinard. 2021. Program Synthesis Guided Reinforcement Learning for Partially Observed Environments. In Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021."},{"key":"e_1_2_1_67_1","volume-title":"Proceedings of the 1st Reinforcement Learning Conference, RLC 2024","author":"Zhou Allan","year":"2024","unstructured":"Allan Zhou, Vikash Kumar, Chelsea Finn, and Aravind Rajeswaran. 2024. Policy Architectures for Compositional Generalization in Control. In Proceedings of the 1st Reinforcement Learning Conference, RLC 2024, Amherst, MA, USA, August 9-12, 2024, Philip S. Thomas, Feryal M. P. Behbahani, Glen Berseth, Scott M. Jordan, Scott Niekum, Andrew Patterson, Eugene Vinitsky, Adam White, Martha White, and Amy Zhang (Eds.). University of Massachusetts Amherst, MA, USA, 2264\u20132283."},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2018\/451"}],"container-title":["Proceedings of the ACM on Programming Languages"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3763070","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3763070","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T17:48:34Z","timestamp":1760032114000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3763070"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,9]]},"references-count":68,"journal-issue":{"issue":"OOPSLA2","published-print":{"date-parts":[[2025,10,9]]}},"alternative-id":["10.1145\/3763070"],"URL":"https:\/\/doi.org\/10.1145\/3763070","relation":{},"ISSN":["2475-1421"],"issn-type":[{"value":"2475-1421","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,10,9]]},"assertion":[{"value":"2025-03-26","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-08-12","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-10-09","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}