{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T01:47:55Z","timestamp":1773193675329,"version":"3.50.1"},"reference-count":42,"publisher":"Association for Computing Machinery (ACM)","issue":"OOPSLA1","license":[{"start":{"date-parts":[[2024,4,29]],"date-time":"2024-04-29T00:00:00Z","timestamp":1714348800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"NSF","award":["CNS-2147909,CNS-2211882,CNS-2239351"],"award-info":[{"award-number":["CNS-2147909,CNS-2211882,CNS-2239351"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. ACM Program. Lang."],"published-print":{"date-parts":[[2024,4,29]]},"abstract":"<jats:p>Optimizing quantum circuits is challenging due to the very large search space of functionally equivalent circuits and the necessity of applying transformations that temporarily decrease performance to achieve a final performance improvement. This paper presents Quarl, a learning-based quantum circuit optimizer. Applying reinforcement learning (RL) to quantum circuit optimization raises two main challenges: the large and varying action space and the non-uniform state representation. Quarl addresses these issues with a novel neural architecture and RL-training procedure. Our neural architecture decomposes the action space into two parts and leverages graph neural networks in its state representation, both of which are guided by the intuition that optimization decisions can be mostly guided by local reasoning while allowing global circuit-wide reasoning. Our evaluation shows that Quarl significantly outperforms existing circuit optimizers on almost all benchmark circuits. Surprisingly, Quarl can learn to perform rotation merging\u2014a complex, non-local circuit optimization implemented as a separate pass in existing optimizers.<\/jats:p>","DOI":"10.1145\/3649831","type":"journal-article","created":{"date-parts":[[2024,4,29]],"date-time":"2024-04-29T17:53:50Z","timestamp":1714413230000},"page":"555-582","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":15,"title":["Quarl: A Learning-Based Quantum Circuit Optimizer"],"prefix":"10.1145","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6746-6225","authenticated-orcid":false,"given":"Zikun","family":"Li","sequence":"first","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-0202-2122","authenticated-orcid":false,"given":"Jinjun","family":"Peng","sequence":"additional","affiliation":[{"name":"Columbia University, New York City, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-5781-9164","authenticated-orcid":false,"given":"Yixuan","family":"Mei","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-7837-5925","authenticated-orcid":false,"given":"Sina","family":"Lin","sequence":"additional","affiliation":[{"name":"Microsoft, Seattle, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9057-5817","authenticated-orcid":false,"given":"Yi","family":"Wu","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-4209-1635","authenticated-orcid":false,"given":"Oded","family":"Padon","sequence":"additional","affiliation":[{"name":"VMware Research, Palo Alto, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1270-5185","authenticated-orcid":false,"given":"Zhihao","family":"Jia","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, USA"}]}],"member":"320","published-online":{"date-parts":[[2024,4,29]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"2023. The Perlmutter Supercomputer. https:\/\/docs.nersc.gov\/systems\/perlmutter\/"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","unstructured":"Abien Fred Agarap. 2018. Deep Learning using Rectified Linear Units (ReLU). https:\/\/doi.org\/10.48550\/ARXIV.1803.08375","DOI":"10.48550\/ARXIV.1803.08375"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","unstructured":"Gadi Aleksandrowicz Thomas Alexander Panagiotis Barkoutsos Luciano Bello Yael Ben-Haim David Bucher Francisco Jose Cabrera-Hern\u00e1ndez Jorge Carballo-Franquis Adrian Chen Chun-Fu Chen Jerry M. Chow Antonio D. C\u00f3rcoles-Gonzales Abigail J. Cross Andrew Cross Juan Cruz-Benito Chris Culver Salvador De La Puente Gonz\u00e1lez Enrique De La Torre Delton Ding Eugene Dumitrescu Ivan Duran Pieter Eendebak Mark Everitt Ismael Faro Sertage Albert Frisch Andreas Fuhrer Jay Gambetta Borja Godoy Gago Juan Gomez-Mosquera Donny Greenberg Ikko Hamamura Vojtech Havlicek Joe Hellmers \u0141ukasz Herok Hiroshi Horii Shaohan Hu Takashi Imamichi Toshinari Itoko Ali Javadi-Abhari Naoki Kanazawa Anton Karazeev Kevin Krsulich Peng Liu Yang Luh Yunho Maeng Manoel Marques Francisco Jose Mart\u00edn-Fern\u00e1ndez Douglas T. McClure David McKay Srujan Meesala Antonio Mezzacapo Nikolaj Moll Diego Moreda Rodr\u00edguez Giacomo Nannicini Paul Nation Pauline Ollitrault Lee James O\u2019Riordan Hanhee Paik Jes\u00fas P\u00e9rez Anna Phan Marco Pistoia Viktor Prutyanov Max Reuter Julia Rice Abd\u00f3n Rodr\u00edguez Davila Raymond Harry Putra Rudy Mingi Ryu Ninad Sathaye Chris Schnabel Eddie Schoute Kanav Setia Yunong Shi Adenilton Silva Yukio Siraichi Seyon Sivarajah John A. Smolin Mathias Soeken Hitomi Takahashi Ivano Tavernelli Charles Taylor Pete Taylour Kenso Trabing Matthew Treinish Wes Turner Desiree Vogt-Lee Christophe Vuillot Jonathan A. Wildstrom Jessica Wilson Erick Winston Christopher Wood Stephen Wood Stefan W\u00f6rner Ismail Yunus Akhalwaya and Christa Zoufal. 2019. Qiskit: An Open-source Framework for Quantum Computing. https:\/\/doi.org\/10.5281\/zenodo.2562111 10.5281\/zenodo.2562111","DOI":"10.5281\/zenodo.2562111"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCAD.2014.2341953"},{"key":"e_1_2_1_5_1","volume-title":"Amazon EC2 M6i Instances. https:\/\/aws.amazon.com\/ec2\/instance-types\/m6i\/ Accessed: 19th","author":"AWS.","year":"2023","unstructured":"AWS. 2023. Amazon EC2 M6i Instances. https:\/\/aws.amazon.com\/ec2\/instance-types\/m6i\/ Accessed: 19th Oct 2023"},{"key":"e_1_2_1_6_1","volume-title":"Quantum machine learning. Nature, 549, 7671","author":"Biamonte Jacob","year":"2017","unstructured":"Jacob Biamonte, Peter Wittek, Nicola Pancotti, Patrick Rebentrost, Nathan Wiebe, and Seth Lloyd. 2017. Quantum machine learning. Nature, 549, 7671 (2017), 195\u2013202."},{"key":"e_1_2_1_7_1","doi-asserted-by":"crossref","first-page":"1810","DOI":"10.1109\/TCAD.2020.3032630","article-title":"Advanced equivalence checking for quantum circuits","volume":"40","author":"Burgholzer Lukas","year":"2020","unstructured":"Lukas Burgholzer and Robert Wille. 2020. Advanced equivalence checking for quantum circuits. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 40, 9 (2020), 1810\u20131824.","journal-title":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems"},{"key":"e_1_2_1_8_1","volume-title":"Quantum chemistry in the age of quantum computing. Chemical reviews, 119, 19","author":"Cao Yudong","year":"2019","unstructured":"Yudong Cao, Jonathan Romero, Jonathan P Olson, Matthias Degroote, Peter D Johnson, M\u00e1ria Kieferov\u00e1, Ian D Kivlichan, Tim Menke, Borja Peropadre, and Nicolas PD Sawaya. 2019. Quantum chemistry in the age of quantum computing. Chemical reviews, 119, 19 (2019), 10856\u201310915."},{"key":"e_1_2_1_9_1","unstructured":"Edward Farhi Jeffrey Goldstone and Sam Gutmann. 2014. A quantum approximate optimization algorithm. arXiv preprint arXiv:1411.4028."},{"key":"e_1_2_1_10_1","volume-title":"Florian Marquardt, and Li Li.","author":"F\u00f6sel Thomas","year":"2021","unstructured":"Thomas F\u00f6sel, Murphy Yuezhen Niu, Florian Marquardt, and Li Li. 2021. Quantum circuit optimization with deep reinforcement learning. arXiv preprint arXiv:2103.07585."},{"key":"e_1_2_1_11_1","volume-title":"Artificial neural networks (the multilayer perceptron)\u2014a review of applications in the atmospheric sciences. Atmospheric environment, 32, 14-15","author":"Gardner Matt W","year":"1998","unstructured":"Matt W Gardner and SR Dorling. 1998. Artificial neural networks (the multilayer perceptron)\u2014a review of applications in the atmospheric sciences. Atmospheric environment, 32, 14-15 (1998), 2627\u20132636."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/3209108.3209128"},{"key":"e_1_2_1_13_1","volume-title":"Inductive representation learning on large graphs. Advances in neural information processing systems, 30","author":"Hamilton Will","year":"2017","unstructured":"Will Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. Advances in neural information processing systems, 30 (2017)."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.asoc.2018.05.012"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3434318"},{"key":"e_1_2_1_16_1","volume-title":"International Conference on Parallel Architectures and Languages Europe. 1\u201313","author":"Hinton Geoffrey E","year":"1987","unstructured":"Geoffrey E Hinton. 1987. Learning translation invariant recognition in a massively parallel networks. In International Conference on Parallel Architectures and Languages Europe. 1\u201313."},{"key":"e_1_2_1_17_1","unstructured":"IBM. 2023. The IBM Washington quantum device.. https:\/\/reversiblebenchmarks.github.io\/"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3209108.3209131"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.4204\/eptcs.318.14"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3453483.3454071"},{"key":"e_1_2_1_21_1","volume-title":"On-demand GPU cloud pricing. https:\/\/lambdalabs.com\/service\/gpu-cloud Accessed: 19th","year":"2023","unstructured":"Lambda. 2023. On-demand GPU cloud pricing. https:\/\/lambdalabs.com\/service\/gpu-cloud Accessed: 19th Oct 2023"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","unstructured":"Gushu Li Yufei Ding and Yuan Xie. 2018. Tackling the Qubit Mapping Problem for NISQ-Era Quantum Devices. https:\/\/doi.org\/10.48550\/ARXIV.1809.02573","DOI":"10.48550\/ARXIV.1809.02573"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.10463907"},{"key":"e_1_2_1_24_1","volume-title":"International conference on machine learning. 1928\u20131937","author":"Mnih Volodymyr","year":"2016","unstructured":"Volodymyr Mnih, Adria Puigdomenech Badia, Mehdi Mirza, Alex Graves, Timothy Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. 2016. Asynchronous methods for deep reinforcement learning. In International conference on machine learning. 1928\u20131937."},{"key":"e_1_2_1_25_1","volume-title":"Realization of a scalable Shor algorithm. Science, 351, 6277","author":"Monz Thomas","year":"2016","unstructured":"Thomas Monz, Daniel Nigg, Esteban A Martinez, Matthias F Brandl, Philipp Schindler, Richard Rines, Shannon X Wang, Isaac L Chuang, and Rainer Blatt. 2016. Realization of a scalable Shor algorithm. Science, 351, 6277 (2016), 1068\u20131070."},{"key":"e_1_2_1_26_1","volume-title":"Automated optimization of large quantum circuits with continuous parameters. npj Quantum Information, 4, 1","author":"Nam Yunseong","year":"2018","unstructured":"Yunseong Nam, Neil J Ross, Yuan Su, Andrew M Childs, and Dmitri Maslov. 2018. Automated optimization of large quantum circuits with continuous parameters. npj Quantum Information, 4, 1 (2018), 1\u201312."},{"key":"e_1_2_1_27_1","first-page":"18182","article-title":"Reinforcement learning for optimization of variational quantum circuit architectures","volume":"34","author":"Ostaszewski Mateusz","year":"2021","unstructured":"Mateusz Ostaszewski, Lea M Trenkwalder, Wojciech Masarczyk, Eleanor Scerri, and Vedran Dunjko. 2021. Reinforcement learning for optimization of variational quantum circuit architectures. Advances in Neural Information Processing Systems, 34 (2021), 18182\u201318194.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_2_1_28_1","volume-title":"PyTorch: An Imperative Style","author":"Paszke Adam","unstructured":"Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems 32, H. Wallach, H. Larochelle, A. Beygelzimer, F. d' Alch\u00e9-Buc, E. Fox, and R. Garnett (Eds.). Curran Associates, Inc., 8024\u20138035. http:\/\/papers.neurips.cc\/paper\/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf"},{"key":"e_1_2_1_29_1","volume-title":"A variational eigenvalue solver on a photonic quantum processor. Nature communications, 5, 1","author":"Peruzzo Alberto","year":"2014","unstructured":"Alberto Peruzzo, Jarrod McClean, Peter Shadbolt, Man-Hong Yung, Xiao-Qi Zhou, Peter J Love, Al\u00e1n Aspuru-Guzik, and Jeremy L O\u2019brien. 2014. A variational eigenvalue solver on a photonic quantum processor. Nature communications, 5, 1 (2014), 4213."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2111.11387"},{"key":"e_1_2_1_31_1","unstructured":"Matteo G Pozzi Steven J Herbert Akash Sengupta and Robert D Mullins. 2020. Using reinforcement learning to perform qubit routing in quantum compilers. arXiv preprint arXiv:2007.15957."},{"key":"e_1_2_1_32_1","doi-asserted-by":"crossref","unstructured":"Nils Quetschlich Lukas Burgholzer and Robert Wille. 2022. MQT Bench: Benchmarking Software and Design Automation Tools for Quantum Computing. arxiv:2204.13719. MQT Bench is available at https:\/\/www.cda.cit.tum.de\/mqtbench\/","DOI":"10.22331\/q-2023-07-20-1062"},{"key":"e_1_2_1_33_1","unstructured":"John Schulman Filip Wolski Prafulla Dhariwal Alec Radford and Oleg Klimov. 2017. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347."},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence. 36","author":"Sinha Animesh","year":"2022","unstructured":"Animesh Sinha, Utkarsh Azad, and Harjinder Singh. 2022. Qubit routing using graph neural network aided Monte Carlo tree search. In Proceedings of the AAAI Conference on Artificial Intelligence. 36, 9935\u20139943."},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1088\/2058-9565"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","unstructured":"Mark Skilbeck Eric Peterson appleby Erik Davis Peter Karalekas Juan M. Bello-Rivas Daniel Kochmanski Zach Beane Robert Smith Andrew Shi Cole Scott Adam Paszke Eric Hulburd Matthew Young Aaron S. Jackson BHAVISHYA M. Sohaib Alam Wilfredo Vel\u00e1zquez-Rodr\u00edguez c. b. osborn fengdlm and jmackeyrigetti. 2020. rigetti\/quilc: v1.21.0. https:\/\/doi.org\/10.5281\/zenodo.3967926 10.5281\/zenodo.3967926","DOI":"10.5281\/zenodo.3967926"},{"key":"e_1_2_1_37_1","volume-title":"Policy gradient methods for reinforcement learning with function approximation. Advances in neural information processing systems, 12","author":"Sutton Richard S","year":"1999","unstructured":"Richard S Sutton, David McAllester, Satinder Singh, and Yishay Mansour. 1999. Policy gradient methods for reinforcement learning with function approximation. Advances in neural information processing systems, 12 (1999)."},{"key":"e_1_2_1_38_1","unstructured":"Minjie Wang Da Zheng Zihao Ye Quan Gan Mufei Li Xiang Song Jinjing Zhou Chao Ma Lingfan Yu and Yu Gai. 2019. Deep graph library: A graph-centric highly-performant package for graph neural networks. arXiv preprint arXiv:1909.01315."},{"key":"e_1_2_1_39_1","unstructured":"Minjie Wang Da Zheng Zihao Ye Quan Gan Mufei Li Xiang Song Jinjing Zhou Chao Ma Lingfan Yu and Yu Gai. 2019. Deep graph library: A graph-centric highly-performant package for graph neural networks. arXiv preprint arXiv:1909.01315."},{"key":"e_1_2_1_40_1","unstructured":"Amanda Xu Abtin Molavi Lauren Pick Swamit Tannu and Aws Albarghouthi. 2022. Synthesizing Quantum-Circuit Optimizers. arXiv preprint arXiv:2211.09691."},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3519939.3523433"},{"key":"e_1_2_1_42_1","volume-title":"Quartz: Superoptimization of quantum circuits (extended version). arxiv:2204.09033.","author":"Xu Mingkuan","year":"2022","unstructured":"Mingkuan Xu, Zikun Li, Oded Padon, Sina Lin, Jessica Pointing, Auguste Hirth, Henry Ma, Jens Palsberg, Alex Aiken, Umut A. Acar, and Zhihao Jia. 2022. Quartz: Superoptimization of quantum circuits (extended version). arxiv:2204.09033."}],"container-title":["Proceedings of the ACM on Programming Languages"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3649831","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3649831","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T22:54:06Z","timestamp":1750287246000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3649831"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,29]]},"references-count":42,"journal-issue":{"issue":"OOPSLA1","published-print":{"date-parts":[[2024,4,29]]}},"alternative-id":["10.1145\/3649831"],"URL":"https:\/\/doi.org\/10.1145\/3649831","relation":{},"ISSN":["2475-1421"],"issn-type":[{"value":"2475-1421","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,4,29]]},"assertion":[{"value":"2024-04-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}