{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T20:25:33Z","timestamp":1776111933789,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":51,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,6,13]],"date-time":"2022-06-13T00:00:00Z","timestamp":1655078400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,6,13]]},"DOI":"10.1145\/3520312.3534867","type":"proceedings-article","created":{"date-parts":[[2022,6,10]],"date-time":"2022-06-10T15:14:19Z","timestamp":1654874059000},"page":"50-59","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":19,"title":["ExeBench: an ML-scale dataset of executable C functions"],"prefix":"10.1145","author":[{"given":"Jordi","family":"Armengol-Estap\u00e9","sequence":"first","affiliation":[{"name":"University of Edinburgh, UK"}]},{"given":"Jackson","family":"Woodruff","sequence":"additional","affiliation":[{"name":"University of Edinburgh, UK"}]},{"given":"Alexander","family":"Brauckmann","sequence":"additional","affiliation":[{"name":"University of Edinburgh, UK"}]},{"given":"Jos\u00e9 Wesley de Souza","family":"Magalh\u00e3es","sequence":"additional","affiliation":[{"name":"University of Edinburgh, UK"}]},{"given":"Michael F. P.","family":"O'Boyle","sequence":"additional","affiliation":[{"name":"University of Edinburgh, UK"}]}],"member":"320","published-online":{"date-parts":[[2022,6,13]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_2_2_2_1","volume-title":"Language models are few-shot learners. Advances in neural information processing systems, 33: 1877\u20131901","author":"Brown Tom","year":"2020","unstructured":"Tom Brown , Benjamin Mann , Nick Ryder , Melanie Subbiah , Jared D Kaplan , Prafulla Dhariwal , Arvind Neelakantan , Pranav Shyam , Girish Sastry , Amanda Askell , Language models are few-shot learners. Advances in neural information processing systems, 33: 1877\u20131901 , 2020 . Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al. Language models are few-shot learners. Advances in neural information processing systems, 33: 1877\u20131901, 2020."},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3212695"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE.2019.00101"},{"key":"e_1_3_2_2_5_1","volume-title":"Convolutional neural networks over tree structures for programming language processing","author":"Mou Lili","year":"2016","unstructured":"Lili Mou , Ge Li , Lu Zhang , Tao Wang , and Zhi Jin . Convolutional neural networks over tree structures for programming language processing . 2016 . Lili Mou, Ge Li, Lu Zhang, Tao Wang, and Zhi Jin. Convolutional neural networks over tree structures for programming language processing. 2016."},{"key":"e_1_3_2_2_6_1","volume-title":"Unpublished","author":"Puri Ruchir","year":"2021","unstructured":"Ruchir Puri , David S Kung , Wei Zhang , Giacomo Domeniconi , Vladmir Zolotov , Julian Dolby , Jie Chen , Mihir Choudbury , Lindsey Decker , Veronika Thost , Saurabh Buratti , Luca n ad Pujar , and Ulrich Finkler . Project codenet : A large-scale AI for code dataset for learning a diversity of coding tasks . Unpublished , 2021 . Ruchir Puri, David S Kung, Wei Zhang, Giacomo Domeniconi, Vladmir Zolotov, Julian Dolby, Jie Chen, Mihir Choudbury, Lindsey Decker, Veronika Thost, Saurabh Buratti, Luca nad Pujar, and Ulrich Finkler. Project codenet: A large-scale AI for code dataset for learning a diversity of coding tasks. Unpublished, 2021."},{"key":"e_1_3_2_2_7_1","volume-title":"AAAI","author":"Zhu Ming","year":"2022","unstructured":"Ming Zhu , Karthik Suresh , and Chandan K Reddy . Multilingual code snippets training for program translation . AAAI , 2022 . Ming Zhu, Karthik Suresh, and Chandan K Reddy. Multilingual code snippets training for program translation. AAAI, 2022."},{"key":"e_1_3_2_2_8_1","volume-title":"Advances in Programming Languages and Neurosymbolic Systems Workshop","author":"Armengol-Estap\u00e9 Jordi","year":"2021","unstructured":"Jordi Armengol-Estap\u00e9 and Michael O\u2019Boyle . Learning c to x86 translation : An experiment in neural compilation . In Advances in Programming Languages and Neurosymbolic Systems Workshop , 2021 . URL https:\/\/openreview.net\/forum?id=444ug_EYXet. Jordi Armengol-Estap\u00e9 and Michael O\u2019Boyle. Learning c to x86 translation: An experiment in neural compilation. In Advances in Programming Languages and Neurosymbolic Systems Workshop, 2021. URL https:\/\/openreview.net\/forum?id=444ug_EYXet."},{"key":"e_1_3_2_2_9_1","volume-title":"Towards neural decompilation. CoRR, abs\/1905.08325","author":"Katz Omer","year":"2019","unstructured":"Omer Katz , Yuval Olshaker , Yoav Goldberg , and Eran Yahav . Towards neural decompilation. CoRR, abs\/1905.08325 , 2019 . URL http:\/\/arxiv.org\/abs\/1905.08325. Omer Katz, Yuval Olshaker, Yoav Goldberg, and Eran Yahav. Towards neural decompilation. CoRR, abs\/1905.08325, 2019. URL http:\/\/arxiv.org\/abs\/1905.08325."},{"key":"e_1_3_2_2_10_1","volume-title":"Learning to infer program sketches","author":"Nye Maxwell","year":"2019","unstructured":"Maxwell Nye , Luke Hewitt , Joshua Tenenbaum , and Armando Solar-Lezama . Learning to infer program sketches . 2019 . Maxwell Nye, Luke Hewitt, Joshua Tenenbaum, and Armando Solar-Lezama. Learning to infer program sketches. 2019."},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3425898.3426952"},{"key":"e_1_3_2_2_12_1","volume-title":"Synthetic datasets for neural program synthesis","author":"Shin Richard","year":"2019","unstructured":"Richard Shin , Neel Kant , Kavi Gupta , C Christopher Bender , Brandon Trabucco , Rishabh Singh , and Dawn Song . Synthetic datasets for neural program synthesis . 2019 . Richard Shin, Neel Kant, Kavi Gupta, CChristopher Bender, Brandon Trabucco, Rishabh Singh, and Dawn Song. Synthetic datasets for neural program synthesis. 2019."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3425898.3426952"},{"key":"e_1_3_2_2_14_1","first-page":"102","volume-title":"2020 35th IEEE\/ACM International Conference on Automated Software Engineering (ASE)","author":"Collie Bruce","unstructured":"Bruce Collie , Philip Ginsbach , Jackson Woodruff , Ajitha Rajan , and Michael FP O\u2019Boyle . M3 : Semantic api migrations . In 2020 35th IEEE\/ACM International Conference on Automated Software Engineering (ASE) , pages 90\u2013 102 . IEEE, 2020. Bruce Collie, Philip Ginsbach, Jackson Woodruff, Ajitha Rajan, and Michael FP O\u2019Boyle. M3: Semantic api migrations. In 2020 35th IEEE\/ACM International Conference on Automated Software Engineering (ASE), pages 90\u2013102. IEEE, 2020."},{"key":"e_1_3_2_2_15_1","volume-title":"Deepcoder: Learning to write programs","author":"Balog Matej","year":"2017","unstructured":"Matej Balog , Alexander L. Gaunt , Marc Brockschmidt , Sebastian Nowozin , and Daniel Tarlow . Deepcoder: Learning to write programs . 2017 . Matej Balog, Alexander L. Gaunt, Marc Brockschmidt, Sebastian Nowozin, and Daniel Tarlow. Deepcoder: Learning to write programs. 2017."},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3315508.3329976"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO51591.2021.9370322"},{"key":"e_1_3_2_2_18_1","volume-title":"PMLR","author":"Koh Pang Wei","year":"2021","unstructured":"Pang Wei Koh , Shiori Sagawa , Henrik Marklund , Sang Michael Xie , Marvin Zhang , Akshay Balsubramani , Weihua Hu , Michihiro Yasunaga , Richard Lanas Phillips , Irena Gao , Tony Lee , Etienne David , Ian Stavness , Wei Guo , Berton A Earnshaw , Imran S Haque , Sara Beery , Jure Leskovec , Anshul Kundaje , Emma Pierson , Sergey Levine , Chelsea Finn , and Percy Liang . Wilds : A benchmark of in-the-wild distribution shifts . PMLR , 2021 . Pang Wei Koh, Shiori Sagawa, Henrik Marklund, Sang Michael Xie, Marvin Zhang, Akshay Balsubramani, Weihua Hu, Michihiro Yasunaga, Richard Lanas Phillips, Irena Gao, Tony Lee, Etienne David, Ian Stavness, Wei Guo, Berton A Earnshaw, Imran S Haque, Sara Beery, Jure Leskovec, Anshul Kundaje, Emma Pierson, Sergey Levine, Chelsea Finn, and Percy Liang. Wilds: A benchmark of in-the-wild distribution shifts. PMLR, 2021."},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/2597073.2597126"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3359591.3359735"},{"key":"e_1_3_2_2_21_1","volume-title":"PMLR","author":"A","year":"2021","unstructured":"A large-scale benchmark for few-shot program induction and synthesis . PMLR , 2021 . A large-scale benchmark for few-shot program induction and synthesis. PMLR, 2021."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3196398.3196464"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSR.2013.6624047"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3196398.3196450"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/APSEC.2010.46"},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/msr.2013.6624029"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ESEM.2017.11"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3360589"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2017.7863731"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.findings-acl.2"},{"key":"e_1_3_2_2_31_1","volume-title":"CoRR","author":"Korbak Tamasz","year":"2021","unstructured":"Tamasz Korbak , Hady Elsahar , Marc Dymetman , and German Kruszewski . Energy-based models for code generation under compilability constraints . CoRR , 2021 . Tamasz Korbak, Hady Elsahar, Marc Dymetman, and German Kruszewski. Energy-based models for code generation under compilability constraints. CoRR, 2021."},{"key":"e_1_3_2_2_32_1","volume-title":"NeurIPS","author":"Chen Xinyun","year":"2021","unstructured":"Xinyun Chen , Dawn Song , and Yuandong Tian . Latent execution for neural program synthesis . NeurIPS , 2021 . Xinyun Chen, Dawn Song, and Yuandong Tian. Latent execution for neural program synthesis. NeurIPS, 2021."},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSME.2018.00031"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/3368089.3409706"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2931037.2931058"},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/2901739.2901767"},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240765.3240848"},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3133915"},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/1007512.1007526"},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10009-020-00568-x"},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3328778.3372685"},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/2876034.2876050"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/3519939.3523439"},{"key":"e_1_3_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE43902.2021.00109"},{"key":"e_1_3_2_2_45_1","volume-title":"Umap: Uniform manifold approximation and projection for dimension reduction","author":"McInnes Leland","year":"2018","unstructured":"Leland McInnes , John Healy , and James Melville . Umap: Uniform manifold approximation and projection for dimension reduction , 2018 . URL https:\/\/arxiv.org\/abs\/1802.03426. Leland McInnes, John Healy, and James Melville. Umap: Uniform manifold approximation and projection for dimension reduction, 2018. URL https:\/\/arxiv.org\/abs\/1802.03426."},{"key":"e_1_3_2_2_46_1","volume-title":"Do we train on test data? purging CIFAR of near-duplicates. CoRR, abs\/1902.00423","author":"Barz Bj\u00f6rn","year":"2019","unstructured":"Bj\u00f6rn Barz and Joachim Denzler . Do we train on test data? purging CIFAR of near-duplicates. CoRR, abs\/1902.00423 , 2019 . URL http:\/\/arxiv.org\/abs\/1902.00423. Bj\u00f6rn Barz and Joachim Denzler. Do we train on test data? purging CIFAR of near-duplicates. CoRR, abs\/1902.00423, 2019. URL http:\/\/arxiv.org\/abs\/1902.00423."},{"key":"e_1_3_2_2_47_1","volume-title":"Deduplicating training data makes language models better. CoRR, abs\/2107.06499","author":"Lee Katherine","year":"2021","unstructured":"Katherine Lee , Daphne Ippolito , Andrew Nystrom , Chiyuan Zhang , Douglas Eck , Chris Callison-Burch , and Nicholas Carlini . Deduplicating training data makes language models better. CoRR, abs\/2107.06499 , 2021 . URL https:\/\/arxiv.org\/abs\/2107.06499. Katherine Lee, Daphne Ippolito, Andrew Nystrom, Chiyuan Zhang, Douglas Eck, Chris Callison-Burch, and Nicholas Carlini. Deduplicating training data makes language models better. CoRR, abs\/2107.06499, 2021. URL https:\/\/arxiv.org\/abs\/2107.06499."},{"key":"e_1_3_2_2_48_1","volume-title":"The pile: An 800gb dataset of diverse text for language modeling. CoRR, abs\/2101.00027","author":"Gao Leo","year":"2021","unstructured":"Leo Gao , Stella Biderman , Sid Black , Laurence Golding , Travis Hoppe , Charles Foster , Jason Phang , Horace He , Anish Thite , Noa Nabeshima , Shawn Presser , and Connor Leahy . The pile: An 800gb dataset of diverse text for language modeling. CoRR, abs\/2101.00027 , 2021 . URL https:\/\/arxiv.org\/abs\/2101.00027. Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, Charles Foster, Jason Phang, Horace He, Anish Thite, Noa Nabeshima, Shawn Presser, and Connor Leahy. The pile: An 800gb dataset of diverse text for language modeling. CoRR, abs\/2101.00027, 2021. URL https:\/\/arxiv.org\/abs\/2101.00027."},{"key":"e_1_3_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-demo.21"},{"key":"e_1_3_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.5639822"},{"key":"e_1_3_2_2_51_1","first-page":"2253","volume-title":"International Conference on Machine Learning","author":"Cummins Chris","unstructured":"Chris Cummins , Zacharias V Fisches , Tal Ben-Nun , Torsten Hoefler , Michael FP O\u2019Boyle , and Hugh Leather . Programl : A graph-based program representation for data flow analysis and compiler optimizations . In International Conference on Machine Learning , pages 2244\u2013 2253 . PMLR, 2021. Chris Cummins, Zacharias V Fisches, Tal Ben-Nun, Torsten Hoefler, Michael FP O\u2019Boyle, and Hugh Leather. Programl: A graph-based program representation for data flow analysis and compiler optimizations. In International Conference on Machine Learning, pages 2244\u20132253. PMLR, 2021."}],"event":{"name":"MAPS '22: 6th ACM SIGPLAN International Symposium on Machine Programming","location":"San Diego CA USA","acronym":"MAPS '22","sponsor":["SIGPLAN ACM Special Interest Group on Programming Languages"]},"container-title":["Proceedings of the 6th ACM SIGPLAN International Symposium on Machine Programming"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3520312.3534867","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3520312.3534867","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:09:31Z","timestamp":1750183771000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3520312.3534867"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,6,13]]},"references-count":51,"alternative-id":["10.1145\/3520312.3534867","10.1145\/3520312"],"URL":"https:\/\/doi.org\/10.1145\/3520312.3534867","relation":{},"subject":[],"published":{"date-parts":[[2022,6,13]]},"assertion":[{"value":"2022-06-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}