{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,5]],"date-time":"2026-02-05T09:03:37Z","timestamp":1770282217667,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":25,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,11,16]]},"DOI":"10.1145\/3769002.3769975","type":"proceedings-article","created":{"date-parts":[[2026,2,4]],"date-time":"2026-02-04T19:16:19Z","timestamp":1770232579000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["FormalGym: Deep Reinforcement Learning Agent Based Formal Compiler Optimization Framework"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9923-4063","authenticated-orcid":false,"given":"Abhilash","family":"Majumder","sequence":"first","affiliation":[{"name":"Software GPU, NVIDIA Corporation, Bengaluru, Karnataka, India"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2026,2,4]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-71489-9"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/2928270"},{"key":"e_1_3_2_1_3_1","volume-title":"arXiv:1606.01540","author":"Brockman Greg","year":"2016","unstructured":"Greg Brockman, Vicki Cheung, Ludwig Pettersson, Jonas Schneider, John Schulman, Jie Tang, and Wojciech Zaremba. 2016. OpenAI Gym. arXiv:1606.01540 (2016)."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"crossref","unstructured":"Stefano Cereda Gianluca Palermo Paolo Cremonesi and Stefano Doni. 2020. A Collaborative Filtering Approach for the Automatic Tuning of Compiler Optimisations. In LCTES.","DOI":"10.1145\/3372799.3394361"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"crossref","unstructured":"Chris Cummins Bram Wasti Jiadong Guo Brandon Cui Jason Ansel Sahir Gomez Somya Jain Jia Liu Olivier Teytaud Benoit Steiner Yuandong Tian and Hugh Leather. 2022. CompilerGym: Robust Performant Compiler Optimization Environments for AI Research. In CGO.","DOI":"10.1109\/CGO53902.2022.9741258"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"crossref","unstructured":"Grigori Fursin John Cavazos Michael O'Boyle and Olivier Temam. 2007. MiDataSets: Creating the conditions for a more realistic evaluation of iterative optimization. In HiPEAC.","DOI":"10.1007\/978-3-540-69338-3_17"},{"key":"e_1_3_2_1_7_1","unstructured":"Tuomas Haarnoja Aurick Zhou Pieter Abbeel and Sergey Levine. 2018. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. In ICML."},{"key":"e_1_3_2_1_8_1","volume-title":"Krste Asanovic, and Ion Stoica.","author":"Haj-Ali Ameer","year":"2020","unstructured":"Ameer Haj-Ali, Nesreen K Ahmed, Ted Willke, Yakun Sophia Shao, Krste Asanovic, and Ion Stoica. 2020. NeuroVectorizer: End-to-end Vectorization with Deep Reinforcement Learning. In CGO."},{"key":"e_1_3_2_1_9_1","unstructured":"Ameer Haj-Ali Qijing Huang William Moses John Xiang John Wawrzynek Krste Asanovic and Ion Stoica. 2020. AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning. In MLSys."},{"key":"e_1_3_2_1_10_1","unstructured":"Sergey Levine John Schulman Philipp Moritz Michael I. Jordan and Pieter Abbeel. 2015. Trust Region Policy Optimization. In ICML."},{"key":"e_1_3_2_1_11_1","unstructured":"Vijay R Konda and John N Tsitsiklis. 2000. Actor-Critic Algorithms. In NeurIPS."},{"key":"e_1_3_2_1_12_1","volume-title":"LLVM: A Compilation Framework for Lifelong Program Analysis & Transformation. In CGO.","author":"Lattner Chris","year":"2004","unstructured":"Chris Lattner and Vikram Adve. 2004. LLVM: A Compilation Framework for Lifelong Program Analysis & Transformation. In CGO."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"crossref","unstructured":"Hugh Leather and Chris Cummins. 2020. Machine Learning in Compilers: Past Present and Future. In FDL.","DOI":"10.1109\/FDL50818.2020.9232934"},{"key":"e_1_3_2_1_14_1","unstructured":"Timothy P. Lillicrap Jonathan J. Hunt Alexander Pritzel Nicolas Heess Tom Erez Yuval Tassa David Silver and Daan Wierstra. 2016. Continuous control with deep reinforcement learning. In ICLR."},{"key":"e_1_3_2_1_15_1","volume-title":"An SMT Encoding of LLVM's Memory Model for Bounded Translation Validation. Computer Aided Verification 33","author":"Lopes Nuno P.","year":"2021","unstructured":"Nuno P. Lopes, Juneyoung Lee, Chung-Kil Hur, and Dongjoo Kim. 2021. An SMT Encoding of LLVM's Memory Model for Bounded Translation Validation. Computer Aided Verification 33 (2021)."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Nuno P. Lopes Juneyoung Lee Chung-Kil Hur Zhengyang Liu and John Regehr. 2021. Alive2: Bounded Translation Validation for LLVM. In PLDI.","DOI":"10.1145\/3453483.3454030"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"crossref","unstructured":"Rahim Mammadli Ali Jannesari and Felix Wolf. 2020. Static Neural Compiler Optimization via Deep Reinforcement Learning. In LLVM-HPC.","DOI":"10.1109\/LLVMHPCHiPar51896.2020.00006"},{"key":"e_1_3_2_1_18_1","volume-title":"Luiz GA Martins, and Jo\u00e3o MP Cardoso","author":"Nobre Ricardo","year":"2016","unstructured":"Ricardo Nobre, Luiz GA Martins, and Jo\u00e3o MP Cardoso. 2016. A graph-based iterative compiler pass selection and phase ordering approach. In LCTES."},{"key":"e_1_3_2_1_19_1","unstructured":"Adam Paszke Sam Gross Francisco Massa Adam Lerer James Bradbury Gregory Chanan Trevor Killeen Zeming Lin Natalia Gimelshein Luca Antiga et al. 2019. PyTorch: An Imperative Style High-performance Deep Learning Library. arXiv:1912.01703 (2019)."},{"key":"e_1_3_2_1_20_1","volume-title":"Proximal Policy Optimization Algorithms. arXiv:1707.06347","author":"Schulman John","year":"2017","unstructured":"John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. 2017. Proximal Policy Optimization Algorithms. arXiv:1707.06347 (2017)."},{"key":"e_1_3_2_1_21_1","volume-title":"On Information And Sufficiency. Project Euclid","author":"Kullback S.","year":"1951","unstructured":"S. Kullback and RA Lieblar. 1951. On Information And Sufficiency. Project Euclid (1951)."},{"key":"e_1_3_2_1_22_1","volume-title":"MLGO: a Machine Learning Guided Compiler Optimizations Framework. arXiv:2101.04808","author":"Trofin Mircea","year":"2021","unstructured":"Mircea Trofin, Yundi Qian, Eugene Brevdo, Zinan Lin, Krzysztof Choromanski, and David Li. 2021. MLGO: a Machine Learning Guided Compiler Optimizations Framework. arXiv:2101.04808 (2021)."},{"key":"e_1_3_2_1_23_1","unstructured":"Yue Wang and Shaofeng Zou. 2022. Policy Gradient Method For Robust Reinforcement Learning. In ICML."},{"key":"e_1_3_2_1_24_1","volume-title":"Technical Note: Q-Learning. In DBLP.","author":"Watkins Christopher","year":"1989","unstructured":"Christopher Watkins. 1989. Technical Note: Q-Learning. In DBLP."},{"key":"e_1_3_2_1_25_1","volume-title":"ACM SIGPLAN","author":"John Regehr Xuejun Yang Eric Eide","unstructured":"Eric Eide John Regehr Xuejun Yang, Yang Chen. 2011. Finding and Understanding Bugs in C Compilers. In ACM SIGPLAN Volume 46, Issue 6."}],"event":{"name":"RACS '25: International Conference on Research in Adaptive and Convergent Systems","location":"Ho Chi Minh Vietnam","acronym":"RACS '25","sponsor":["SIGAPP ACM Special Interest Group on Applied Computing"]},"container-title":["Proceedings of the International Conference on Research in Adaptive and Convergent Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3769002.3769975","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,2,4]],"date-time":"2026-02-04T19:16:34Z","timestamp":1770232594000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3769002.3769975"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,16]]},"references-count":25,"alternative-id":["10.1145\/3769002.3769975","10.1145\/3769002"],"URL":"https:\/\/doi.org\/10.1145\/3769002.3769975","relation":{},"subject":[],"published":{"date-parts":[[2025,11,16]]},"assertion":[{"value":"2026-02-04","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}