{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,18]],"date-time":"2026-05-18T11:29:01Z","timestamp":1779103741448,"version":"3.51.4"},"reference-count":268,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2018,9,18]],"date-time":"2018-09-18T00:00:00Z","timestamp":1537228800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"EU Commission H2020-FET-HPC program","award":["ANTAREX-671623"],"award-info":[{"award-number":["ANTAREX-671623"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2019,9,30]]},"abstract":"<jats:p>Since the mid-1990s, researchers have been trying to use machine-learning-based approaches to solve a number of different compiler optimization problems. These techniques primarily enhance the quality of the obtained results and, more importantly, make it feasible to tackle two main compiler optimization problems: optimization selection (choosing which optimizations to apply) and phase-ordering (choosing the order of applying optimizations). The compiler optimization space continues to grow due to the advancement of applications, increasing number of compiler optimizations, and new target architectures. Generic optimization passes in compilers cannot fully leverage newly introduced optimizations and, therefore, cannot keep up with the pace of increasing options. This survey summarizes and classifies the recent advances in using machine learning for the compiler optimization field, particularly on the two major problems of (1) selecting the best optimizations, and (2) the phase-ordering of optimizations. The survey highlights the approaches taken so far, the obtained results, the fine-grain classification among different approaches, and finally, the influential papers of the field.<\/jats:p>","DOI":"10.1145\/3197978","type":"journal-article","created":{"date-parts":[[2018,9,18]],"date-time":"2018-09-18T12:11:32Z","timestamp":1537272692000},"page":"1-42","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":171,"title":["A Survey on Compiler Autotuning using Machine Learning"],"prefix":"10.1145","volume":"51","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8606-6497","authenticated-orcid":false,"given":"Amir H.","family":"Ashouri","sequence":"first","affiliation":[{"name":"University of Toronto, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"William","family":"Killian","sequence":"additional","affiliation":[{"name":"Millersville University of Pennsylvania, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"John","family":"Cavazos","sequence":"additional","affiliation":[{"name":"University of Delaware, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Gianluca","family":"Palermo","sequence":"additional","affiliation":[{"name":"Politecnico di Milano, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Cristina","family":"Silvano","sequence":"additional","affiliation":[{"name":"Politecnico di Milano, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2018,9,18]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.5555\/646662.699219"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/277652.277740"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2006.37"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.5555\/6448"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/390013.808479"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/998300.997196"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.5555\/645678.663957"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.5555\/2635955"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.5555\/647472.727289"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1542476.1542481"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2628071.2628092"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.3182\/20050703-6-CZ-1902.01073"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1120725.1120762"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCSoC.2017.31"},{"key":"e_1_2_1_15_1","volume-title":"Proceedings of the IEEE Embedded Systems for Real-Time Multimedia (ESTIMedia). 90--97","author":"Ashouri A. H.","unstructured":"A. H. Ashouri, G. Mariani, G. Palermo, and C. Silvano. 2014. A Bayesian network approach for compiler auto-tuning for embedded processors. In Proceedings of the IEEE Embedded Systems for Real-Time Multimedia (ESTIMedia). 90--97."},{"key":"e_1_2_1_16_1","unstructured":"Amir Hossein Ashouri. 2012. Design space exploration methodology for compiler parameters in VLIW processors. Master\u2019s thesis. M. Sc. Dissertation. Politecnico Di Milano Italy. Retrieved from http:\/\/hdl.handle.net\/10589\/72083."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","unstructured":"Amir Hossein Ashouri Andrea Bignoli Gianluca Palermo and Cristina Silvano. 2016. Predictive modeling methodology for compiler phase-ordering. In Proceedings of the 7th Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and the 5th Workshop on Design Tools and Architectures For Multicore Embedded Computing Platforms (PARMA-DITAM\u201916). ACM New York NY 7--12. 10.1145\/2872421.2872424","DOI":"10.1145\/2872421.2872424"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3124452"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3197978"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2928270"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.5555\/3203505"},{"key":"e_1_2_1_23_1","doi-asserted-by":"crossref","unstructured":"Amir H. Ashouri Gianluca Palermo John Cavazos and Cristina Silvano. 2018. Background. Springer International Publishing Cham 1--22.","DOI":"10.1007\/978-3-319-71489-9_1"},{"key":"e_1_2_1_24_1","volume-title":"Design Space Exploration of Compiler Passes: A Co-Exploration Approach for the Embedded Domain","author":"Ashouri Amir H.","unstructured":"Amir H. Ashouri, Gianluca Palermo, John Cavazos, and Cristina Silvano. 2018. Design Space Exploration of Compiler Passes: A Co-Exploration Approach for the Embedded Domain. Springer International Publishing, Cham, 23--39."},{"key":"e_1_2_1_25_1","volume-title":"The Phase-Ordering Problem: A Complete Sequence Prediction Approach","author":"Ashouri Amir H.","unstructured":"Amir H. Ashouri, Gianluca Palermo, John Cavazos, and Cristina Silvano. 2018. The Phase-Ordering Problem: A Complete Sequence Prediction Approach. Springer International Publishing, Cham, 85--113."},{"key":"e_1_2_1_26_1","volume-title":"The Phase-Ordering Problem: An Intermediate Speedup Prediction Approach","author":"Ashouri Amir H.","unstructured":"Amir H. Ashouri, Gianluca Palermo, John Cavazos, and Cristina Silvano. 2018. The Phase-Ordering Problem: An Intermediate Speedup Prediction Approach. Springer International Publishing, Cham, 71--83."},{"key":"e_1_2_1_27_1","volume-title":"Selecting the Best Compiler Optimizations: A Bayesian Network Approach","author":"Ashouri Amir H.","unstructured":"Amir H. Ashouri, Gianluca Palermo, John Cavazos, and Cristina Silvano. 2018. Selecting the Best Compiler Optimizations: A Bayesian Network Approach. Springer International Publishing, Cham, 41--70."},{"key":"e_1_2_1_28_1","volume-title":"Proceedings of the Workshop on Resource Awareness and Application Autotuning in Adaptive and Heterogeneous Computing (RES4ANT\u201916)","author":"Ashouri Amir Hossein","unstructured":"Amir Hossein Ashouri, Gianluca Palermo, and Cristina Silvano. An evaluation of autotuning techniques for the compiler optimization problems. In Proceedings of the Workshop on Resource Awareness and Application Autotuning in Adaptive and Heterogeneous Computing (RES4ANT\u201916), colocated with the Design Automation and Test in Europe Conference and Expo (DATE\u201916). 23--27. http:\/\/ceur-ws.org\/Vol-1643\/#paper-05"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/VLSI-SoC.2013.6673262"},{"key":"e_1_2_1_30_1","first-page":"1","article-title":"Energy-aware compilation and hardware design for VLIW embedded systems","volume":"3","author":"Ayala Jose L.","year":"2007","unstructured":"Jose L. Ayala, Marisa L\u00f3pez-Vallejo, David Atienza, Praveen Raghavan, Francky Catthoor, and Diederik Verkest. 2007. Energy-aware compilation and hardware design for VLIW embedded systems. Int. J. Embed. Syst. 3, 1--2 (2007), 73--82.","journal-title":"Int. J. Embed. Syst."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/857076.857077"},{"key":"e_1_2_1_32_1","volume-title":"Proceedings of the 2002 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE\u201902)","volume":"2","author":"Babuka R.","unstructured":"R. Babuka, P. J. Van der Veen, and U. Kaymak. 2002. Improved covariance estimation for Gustafson-Kessel clustering. In Proceedings of the 2002 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE\u201902), Vol. 2. IEEE, 1081--1085."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/197405.197406"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSE.1975.6312870"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1177\/1094342013493644"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2017.04.002"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-11970-5_16"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1017\/S1471068415000174"},{"key":"e_1_2_1_39_1","volume-title":"Automatically tuning the GCC compiler to optimize the performance of applications running on the ARM cortex-M3. arXiv preprint arXiv:1703.08228","author":"Blackmore Craig","year":"2017","unstructured":"Craig Blackmore, Oliver Ray, and Kerstin Eder. 2017. Automatically tuning the GCC compiler to optimize the performance of applications running on the ARM cortex-M3. arXiv preprint arXiv:1703.08228 (2017)."},{"key":"e_1_2_1_40_1","volume-title":"Automatically tuning the GCC compiler to optimize the performance of applications running on the ARM cortex-M3. CoRR abs\/1703.08228","author":"Blackmore Craig","year":"2017","unstructured":"Craig Blackmore, Oliver Ray, and Kerstin Eder. 2017. Automatically tuning the GCC compiler to optimize the performance of applications running on the ARM cortex-M3. CoRR abs\/1703.08228 (2017). arxiv:1703.08228, retrieved from http:\/\/arxiv.org\/abs\/1703.08228."},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/2967938.2967963"},{"key":"e_1_2_1_42_1","volume-title":"Proceedings of the Workshop on Profile and Feedback-Directed Compilation.","author":"Bodin Fran\u00e7ois","year":"1998","unstructured":"Fran\u00e7ois Bodin, Toru Kisuki, Peter Knijnenburg, Mike O\u2019Boyle, and Erven Rohou. 1998. Iterative compilation in a non-linear optimisation space. In Proceedings of the Workshop on Profile and Feedback-Directed Compilation."},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.5555\/1788374.1788386"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","unstructured":"U. Bondhugula and A. Hartono. 2008. A practical automatic polyhedral parallelizer and locality optimizer. (2008). Retrieved from http:\/\/dl.acm.org\/citation.cfm?id&equals;1375595. 10.1145\/1375581.1375595","DOI":"10.1145\/1375581.1375595"},{"key":"e_1_2_1_45_1","volume-title":"Proceedings of the ACM SIGPLAN 2008 Conference on Programming Language Design and Implementation (PLDI\u201908)","author":"Bondhugula Uday","unstructured":"Uday Bondhugula, A. Hartono, J. Ramanujam, and P. Sadayappan. 2008. PLuTo: A practical and fully automatic polyhedral program optimization system. In Proceedings of the ACM SIGPLAN 2008 Conference on Programming Language Design and Implementation (PLDI\u201908). Citeseer."},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2005.132"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCC.2008.172"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/TGRS.2007.895416"},{"key":"e_1_2_1_49_1","volume-title":"Jos\u00e9 Gabriel de Figueiredo Coutinho, and Pedro C. Diniz","author":"Paiva Cardoso Jo\u00e3o Manuel","year":"2017","unstructured":"Jo\u00e3o Manuel Paiva Cardoso, Jos\u00e9 Gabriel de Figueiredo Coutinho, and Pedro C. Diniz. 2017. Embedded Computing for High Performance: Efficient Mapping of Computations Using Customization, Code Transformations and Compilation. Morgan Kaufmann."},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/1176760.1176765"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2007.32"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/996893.996864"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1007\/11688839_12"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2005.14"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/1167515.1167492"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.5555\/2245737.2245881"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNN.2009.2015974"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2005.10"},{"key":"e_1_2_1_59_1","unstructured":"Chun Chen Jacqueline Chame and Mary Hall. 2008. CHiLL: A Framework for Composing High-level Loop Transformations. Technical report. Citeseer."},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/2355585.2355594"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/1806596.1806647"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2005.2"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/2742854.2742883"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1145\/201059.201061"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/508352.508353"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1145\/1454115.1454122"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/1070891.1065921"},{"key":"e_1_2_1_68_1","unstructured":"K. Cooper Timothy J. Harvey Devika Subramanian and Linda Torczon. 2002. Compilation Order Matters. Technical report."},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","unstructured":"K. D. Cooper P. J. Schielke and D. Subramanian. 1999. Optimizing for reduced code space using genetic algorithms. ACM SIGPLAN Notices. Retrieved from http:\/\/dl.acm.org\/citation.cfm?id&equals;314414. 10.1145\/315253.314414","DOI":"10.1145\/315253.314414"},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","unstructured":"K. D. Cooper D. Subramanian and L. Torczon. 2002. Adaptive optimizing compilers for the 21st Century. J. Supercomput. Retrieved from http:\/\/link.springer.com\/article\/10.1023\/A:1015729001611. 10.1023\/A:1015729001611","DOI":"10.1023\/A:1015729001611"},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1145\/3078659.3078664"},{"key":"e_1_2_1_72_1","unstructured":"Chris Cummins Pavlos Petoumenos Michel Steuwer and Hugh Leather. 2015. Autotuning OpenCL workgroup size for stencil patterns. arXiv preprint arXiv:1511.02490."},{"key":"e_1_2_1_73_1","volume-title":"Proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques (PACT\u201917)","author":"Cummins C.","unstructured":"C. Cummins, P. Petoumenos, Z. Wang, and H. Leather. 2017. End-to-end deep learning of optimization heuristics. In Proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques (PACT\u201917). 219--232."},{"key":"e_1_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2017.7863731"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1109\/4235.996017"},{"key":"e_1_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.5555\/648054.743935"},{"key":"e_1_2_1_77_1","doi-asserted-by":"publisher","DOI":"10.1145\/2813885.2737969"},{"key":"e_1_2_1_78_1","doi-asserted-by":"publisher","DOI":"10.1145\/1242531.1242553"},{"key":"e_1_2_1_79_1","doi-asserted-by":"publisher","DOI":"10.1145\/1669112.1669124"},{"key":"e_1_2_1_80_1","doi-asserted-by":"publisher","unstructured":"Chris Eagle. 2011. The IDA Pro Book: The Unofficial Guide to the World\u2019s Most Popular Disassembler. No Starch Press.","DOI":"10.5555\/2049962"},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1145\/2000064.2000108"},{"key":"e_1_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW.2015.85"},{"key":"e_1_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.1145\/2739048"},{"key":"e_1_2_1_84_1","doi-asserted-by":"publisher","DOI":"10.1145\/342001.339682"},{"key":"e_1_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1051\/ro\/1988220302431"},{"key":"e_1_2_1_86_1","doi-asserted-by":"publisher","DOI":"10.1145\/24039.24041"},{"key":"e_1_2_1_87_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSSC.2009.932433"},{"key":"e_1_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.1109\/TC.1981.1675827"},{"key":"e_1_2_1_89_1","doi-asserted-by":"publisher","DOI":"10.5555\/2843494"},{"key":"e_1_2_1_90_1","doi-asserted-by":"publisher","DOI":"10.5555\/2843494"},{"key":"e_1_2_1_91_1","doi-asserted-by":"publisher","unstructured":"B. Franke M. O\u2019Boyle J. Thomson and G. Fursin. 2005. Probabilistic source-level optimisation of embedded programs. ACM SIGPLAN Notices (2005). Retrieved from http:\/\/dl.acm.org\/citation.cfm?id&equals;1065922. 10.1145\/1070891.1065922","DOI":"10.1145\/1070891.1065922"},{"key":"e_1_2_1_92_1","doi-asserted-by":"publisher","DOI":"10.1145\/301631.301672"},{"key":"e_1_2_1_93_1","volume-title":"Ruttenberg","author":"Freudenberger Stefan M.","year":"1992","unstructured":"Stefan M. Freudenberger and John C. Ruttenberg. 1992. Phase ordering of register allocation and instruction scheduling. In Code Generation\u00e2\u0102\u0164 Concepts, Tools, Techniques. Springer, 146--170."},{"key":"e_1_2_1_94_1","volume-title":"The Elements of Statistical Learning","author":"Friedman Jerome","unstructured":"Jerome Friedman, Trevor Hastie, and Robert Tibshirani. 2001. The Elements of Statistical Learning. Vol. 1. Springer, Berlin."},{"key":"e_1_2_1_95_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007465528199"},{"key":"e_1_2_1_96_1","unstructured":"G. G. Fursin. 2004. Iterative compilation and performance prediction for numerical applications. Retrieved from https:\/\/www.era.lib.ed.ac.uk\/handle\/1842\/565."},{"key":"e_1_2_1_97_1","unstructured":"Grigori Fursin. 2010. Collective benchmark (cbench) a collection of open-source programs with multiple datasets assembled by the community to enable realistic benchmarking and research on program and architecture optimization. Retrieved from http:\/\/ctuning.org\/wiki\/index.php\/CTools:CBench."},{"key":"e_1_2_1_98_1","doi-asserted-by":"publisher","DOI":"10.5555\/1762146.1762170"},{"key":"e_1_2_1_99_1","volume-title":"Workshop Proceedings.","author":"Fursin G.","unstructured":"G. Fursin and A. Cohen. 2007. Building a practical iterative interactive compiler. Workshop Proceedings. Retrieved from https:\/\/www.researchgate.net\/profile\/Chuck."},{"key":"e_1_2_1_100_1","doi-asserted-by":"publisher","DOI":"10.1007\/11587514_4"},{"key":"e_1_2_1_101_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10766-010-0161-2"},{"key":"e_1_2_1_102_1","doi-asserted-by":"publisher","DOI":"10.5555\/2971808.2972009"},{"key":"e_1_2_1_103_1","volume-title":"A collective knowledge workflow for collaborative research into multi-objective autotuning and machine learning techniques. arXiv preprint arXiv:1801.08024","author":"Fursin Grigori","year":"2018","unstructured":"Grigori Fursin, Anton Lokhmotov, Dmitry Savenko, and Eben Upton. 2018. A collective knowledge workflow for collaborative research into multi-objective autotuning and machine learning techniques. arXiv preprint arXiv:1801.08024 (2018)."},{"key":"e_1_2_1_104_1","volume-title":"Collective mind, part II: Towards performance-and cost-aware software engineering as a natural science. arXiv preprint arXiv:1506.06256","author":"Fursin Grigori","year":"2015","unstructured":"Grigori Fursin, Abdul Memon, Christophe Guillon, and Anton Lokhmotov. 2015. Collective mind, part II: Towards performance-and cost-aware software engineering as a natural science. arXiv preprint arXiv:1506.06256 (2015)."},{"key":"e_1_2_1_105_1","volume-title":"Proceedings of the GCC Summit.","author":"Fursin G.","unstructured":"G. Fursin, C. Miranda, and O. Temam. 2008. MILEPOST GCC: Machine learning based research compiler. Proceedings of the GCC Summit. Retrieved from https:\/\/hal.inria.fr\/inria-00294704\/."},{"key":"e_1_2_1_106_1","doi-asserted-by":"publisher","DOI":"10.1007\/11596110_24"},{"key":"e_1_2_1_107_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-92990-1_5"},{"key":"e_1_2_1_108_1","doi-asserted-by":"publisher","DOI":"10.1145\/1880043.1880047"},{"key":"e_1_2_1_109_1","doi-asserted-by":"publisher","DOI":"10.23919\/DATE.2018.8342183"},{"key":"e_1_2_1_110_1","doi-asserted-by":"publisher","DOI":"10.1145\/2908961.2931696"},{"key":"e_1_2_1_111_1","doi-asserted-by":"publisher","DOI":"10.1145\/3207719.3207727"},{"key":"e_1_2_1_112_1","volume-title":"Proceedings of the Workshop on Compilers for Parallel Computing (CPC\u201918)","author":"Gong Zhangxiaowen","unstructured":"Zhangxiaowen Gong, Zhi Chen, Justin Josef Szaday, David C. Wong, Zehra Sura, Neftali Watkinson, Saeed Maleki, David Padua, Alexandru Nicolau, Alexander V. Veidenbaum et al. 2018. An empirical study of the effect of source-level transformations on compiler stability. In Proceedings of the Workshop on Compilers for Parallel Computing (CPC\u201918)."},{"key":"e_1_2_1_113_1","volume-title":"Handbook of Multivariate Experimental Psychology","author":"Gorsuch Richard L.","unstructured":"Richard L. Gorsuch. 1988. Exploratory factor analysis. In Handbook of Multivariate Experimental Psychology. Springer, 231--258."},{"key":"e_1_2_1_114_1","doi-asserted-by":"publisher","DOI":"10.1109\/InPar.2012.6339595"},{"key":"e_1_2_1_115_1","doi-asserted-by":"publisher","unstructured":"M. Hall D. Padua and K. Pingali. 2009. Compiler research: The next 50 years. Commun. ACM (2009). Retrieved from http:\/\/dl.acm.org\/citation.cfm?id&equals;1461946. 10.1145\/1461928.1461946","DOI":"10.1145\/1461928.1461946"},{"key":"e_1_2_1_116_1","doi-asserted-by":"publisher","DOI":"10.1145\/1062261.1062293"},{"key":"e_1_2_1_117_1","volume-title":"The Elements of Statistical Learning","author":"Hastie Trevor","unstructured":"Trevor Hastie, Robert Tibshirani, and Jerome Friedman. 2009. Unsupervised learning. In The Elements of Statistical Learning. Springer, 485--585."},{"key":"e_1_2_1_118_1","doi-asserted-by":"publisher","DOI":"10.1109\/2.940014"},{"key":"e_1_2_1_119_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSP.2012.2205597"},{"key":"e_1_2_1_120_1","doi-asserted-by":"publisher","DOI":"10.1145\/2807591.2807644"},{"key":"e_1_2_1_121_1","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2007.56"},{"key":"e_1_2_1_122_1","doi-asserted-by":"publisher","DOI":"10.1145\/1356058.1356080"},{"key":"e_1_2_1_123_1","doi-asserted-by":"publisher","DOI":"10.1145\/1772954.1772965"},{"key":"e_1_2_1_124_1","doi-asserted-by":"publisher","DOI":"10.1287\/mnsc.12.5.317"},{"key":"e_1_2_1_125_1","volume-title":"OMAP4430 SoC Dev. Board 2","author":"Instruments Texas","year":"2012","unstructured":"Texas Instruments. 2012. Pandaboard. OMAP4430 SoC Dev. Board 2 (2012)."},{"key":"e_1_2_1_126_1","doi-asserted-by":"publisher","DOI":"10.1109\/PACT.2015.46"},{"key":"e_1_2_1_127_1","doi-asserted-by":"publisher","DOI":"10.5555\/2555729.2555736"},{"key":"e_1_2_1_128_1","unstructured":"Sverre Jarp. 2002. A Methodology for Using the Itanium 2 Performance Counters for Bottleneck Analysis. Technical report HP Labs."},{"key":"e_1_2_1_129_1","doi-asserted-by":"publisher","DOI":"10.1145\/2228360.2228569"},{"key":"e_1_2_1_130_1","volume-title":"Wichern","author":"Johnson Richard Arnold","year":"2002","unstructured":"Richard Arnold Johnson and Dean W. Wichern. 2002. Applied Multivariate Statistical Analysis. Vol. 5. Prentice Hall, Upper Saddle River, NJ."},{"key":"e_1_2_1_131_1","unstructured":"Bill Joy Guy Steele James Gosling and Gilad Bracha. 2000. Java (TM) Language Specification. Addisson-Wesley."},{"key":"e_1_2_1_132_1","doi-asserted-by":"publisher","DOI":"10.7494\/csci.2016.17.3.407"},{"key":"e_1_2_1_133_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW.2012.69"},{"key":"e_1_2_1_134_1","doi-asserted-by":"publisher","DOI":"10.5555\/2691166.2691170"},{"key":"e_1_2_1_135_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00607-016-0535-4"},{"key":"e_1_2_1_136_1","unstructured":"William Killian Renato Miceli Eunjung Park Marco Alvarez and John Cavazos. 2014. Performance improvement in kernels by guiding compiler auto-vectorization heuristics. PRACE-RI.EU. Retrieved from http:\/\/www.prace-ri.eu\/IMG\/pdf\/WP183.pdf."},{"key":"e_1_2_1_137_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0925-2312(03)00372-2"},{"key":"e_1_2_1_139_1","volume-title":"Proceedings of the Conference on Compilers for Parallel Computers (CPC\u201910)","author":"Kisuki Toru","unstructured":"Toru Kisuki, P. Knijnenburg, M. O\u2019Boyle, and H. Wijshoff. 2000. Iterative compilation in program optimization. In Proceedings of the Conference on Compilers for Parallel Computers (CPC\u201910). Citeseer, 35--44."},{"key":"e_1_2_1_140_1","doi-asserted-by":"publisher","DOI":"10.5555\/646347.690219"},{"key":"e_1_2_1_141_1","doi-asserted-by":"publisher","DOI":"10.5555\/517554.825767"},{"key":"e_1_2_1_142_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1020989410030"},{"key":"e_1_2_1_143_1","doi-asserted-by":"publisher","DOI":"10.5555\/850986.855645"},{"key":"e_1_2_1_144_1","doi-asserted-by":"publisher","DOI":"10.5555\/2999134.2999257"},{"key":"e_1_2_1_145_1","volume-title":"Ku and Giovanni De Micheli","author":"David","year":"1992","unstructured":"David C. Ku and Giovanni De Micheli. 1992. Design space exploration. In High Level Synthesis of ASICs under Timing and Synchronization Constraints. Springer, 83--111."},{"key":"e_1_2_1_146_1","doi-asserted-by":"publisher","DOI":"10.1145\/996893.996863"},{"key":"e_1_2_1_147_1","doi-asserted-by":"publisher","DOI":"10.1145\/1755951.1755903"},{"key":"e_1_2_1_148_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2007.9"},{"key":"e_1_2_1_149_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2006.15"},{"key":"e_1_2_1_150_1","doi-asserted-by":"publisher","DOI":"10.1145\/1509864.1509865"},{"key":"e_1_2_1_151_1","doi-asserted-by":"publisher","unstructured":"S. Kulkarni and J. Cavazos. 2012. Mitigating the compiler optimization phase-ordering problem using machine learning. ACM SIGPLAN Notices (2012). Retrieved from http:\/\/dl.acm.org\/citation.cfm?id&equals;2384628. 10.1145\/2398857.2384628","DOI":"10.1145\/2398857.2384628"},{"key":"e_1_2_1_152_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2013.6495004"},{"key":"e_1_2_1_153_1","first-page":"30831","article-title":"Compiler phase ordering and optimizing MPI runtime parameters using heuristic algorithms on SMPs","volume":"9","author":"Kumar T. Satish","year":"2014","unstructured":"T. Satish Kumar, S. Sakthivel, S. Sushil Kumar, and N. Arun. 2014. Compiler phase ordering and optimizing MPI runtime parameters using heuristic algorithms on SMPs. Int. J. Appl. Eng. Res. 9, 24 (2014), 30831--30851.","journal-title":"Int. J. Appl. Eng. Res."},{"key":"e_1_2_1_154_1","doi-asserted-by":"publisher","DOI":"10.5555\/977395.977673"},{"key":"e_1_2_1_155_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2009.21"},{"key":"e_1_2_1_156_1","volume-title":"Steven O. Hobbs, Joseph M. Newcomer, Andrew H. Reiner, Bruce R. Schatz, and William A. Wulf.","author":"Leverett Bruce W.","year":"1979","unstructured":"Bruce W. Leverett, Roderic Geoffrey Galton Cattell, Steven O. Hobbs, Joseph M. Newcomer, Andrew H. Reiner, Bruce R. Schatz, and William A. Wulf. 1979. An Overview of the Production Quality Compiler-Compiler Project. Carnegie Mellon University, Department of Computer Science."},{"key":"e_1_2_1_157_1","doi-asserted-by":"publisher","DOI":"10.1109\/IMIS.2014.26"},{"key":"e_1_2_1_158_1","doi-asserted-by":"publisher","DOI":"10.1145\/1669112.1669172"},{"key":"e_1_2_1_159_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-01970-8_89"},{"key":"e_1_2_1_160_1","volume-title":"ALIC: A low overhead compiler optimization prediction model. Wireless Personal Commun","author":"Liu Hui","year":"2018","unstructured":"Hui Liu, Rongcai Zhao, Qi Wang, and Yingying Li. 2018. ALIC: A low overhead compiler optimization prediction model. Wireless Personal Commun. Springer."},{"key":"e_1_2_1_161_1","unstructured":"Vincent Loechner. 1999. PolyLib: A library for manipulating parameterized polyhedra. Retrieved from online at http:\/\/icps.u-strasbg.fr\/PolyLib\/."},{"key":"e_1_2_1_162_1","volume-title":"Proceedings of the 3rd Workshop on Statistical and Machine Learning Approaches to Architectures and Compilation (SMART\u201909)","author":"Lokuciejewski P.","unstructured":"P. Lokuciejewski and F. Gedikli. 2009. Automatic WCET reduction by machine learning based heuristics for function inlining. Proceedings of the 3rd Workshop on Statistical and Machine Learning Approaches to Architectures and Compilation (SMART\u201909). 1--15. Retrieved from https:\/\/www.researchgate.net\/profile\/Peter."},{"key":"e_1_2_1_163_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISORC.2010.15"},{"key":"e_1_2_1_164_1","doi-asserted-by":"publisher","DOI":"10.1145\/321992.322000"},{"key":"e_1_2_1_165_1","doi-asserted-by":"publisher","DOI":"10.1145\/1065010.1065034"},{"key":"e_1_2_1_166_1","unstructured":"L. Luo Y. Chen C. Wu S. Long and G. Fursin. 2014. Finding representative sets of optimizations for adaptive multiversioning applications. arXiv preprint arXiv:1407.4075 (2014). Retrieved from http:\/\/arxiv.org\/abs\/1407.4075."},{"key":"e_1_2_1_167_1","doi-asserted-by":"publisher","DOI":"10.1145\/144965.144998"},{"key":"e_1_2_1_168_1","doi-asserted-by":"publisher","DOI":"10.1145\/1837274.1837307"},{"key":"e_1_2_1_169_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2009.24"},{"key":"e_1_2_1_170_1","doi-asserted-by":"publisher","DOI":"10.1145\/2883614"},{"key":"e_1_2_1_171_1","doi-asserted-by":"publisher","DOI":"10.1145\/2666357.2597821"},{"key":"e_1_2_1_172_1","doi-asserted-by":"publisher","DOI":"10.5555\/897538"},{"key":"e_1_2_1_173_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1017976211990"},{"key":"e_1_2_1_174_1","volume-title":"Proceedings of the PARCO Mini-Symposium on Application Autotuning for HPC (Architectures).","author":"Memon Abdul Wahid","year":"2013","unstructured":"Abdul Wahid Memon and Grigori Fursin. 2013. Crowdtuning: Systematizing auto-tuning using predictive modeling and crowdsourcing. In Proceedings of the PARCO Mini-Symposium on Application Autotuning for HPC (Architectures)."},{"key":"e_1_2_1_175_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-36803-5_24"},{"key":"e_1_2_1_176_1","unstructured":"MinIR 2011. MINimal IR space. Retrieved from http:\/\/www.assembla.com\/wiki\/show\/minir-dev."},{"key":"e_1_2_1_177_1","doi-asserted-by":"publisher","DOI":"10.5555\/2371238"},{"key":"e_1_2_1_178_1","doi-asserted-by":"publisher","DOI":"10.5555\/646053.677574"},{"key":"e_1_2_1_179_1","unstructured":"Thierry Moreau Anton Lokhmotov and Grigori Fursin. 2018. Introducing ReQuEST: An open platform for reproducible and quality-efficient systems-ML tournaments. CoRR abs\/1801.06378. arxiv:1801.06378. Retrieved from http:\/\/arxiv.org\/abs\/1801.06378."},{"key":"e_1_2_1_180_1","doi-asserted-by":"publisher","DOI":"10.5555\/302528.302858"},{"key":"e_1_2_1_181_1","unstructured":"Paschalis Mpeis Pavlos Petoumenos and Hugh Leather. 2015. Iterative compilation on mobile devices. CoRR abs\/1511.02603. Retrieved from http:\/\/arxiv.org\/abs\/1511.02603."},{"key":"e_1_2_1_182_1","volume-title":"Proceedings of the Department of Defense HPCMP Users Group Conference. 7--10","author":"Mucci Philip J.","year":"1999","unstructured":"Philip J. Mucci, Shirley Browne, Christine Deane, and George Ho. 1999. PAPI: A portable interface to hardware performance counters. In Proceedings of the Department of Defense HPCMP Users Group Conference. 7--10."},{"key":"e_1_2_1_183_1","doi-asserted-by":"publisher","DOI":"10.1145\/1878921.1878951"},{"key":"e_1_2_1_184_1","volume-title":"Proceedings of the 19th Workshop on Compilers for Parallel Computing (CPC\u201916)","author":"Nobre Ricardo","unstructured":"Ricardo Nobre, Reis Luis, and M. P. Cardoso Joao. 2016. Compiler phase ordering as an orthogonal approach for reducing energy consumption. In Proceedings of the 19th Workshop on Compilers for Parallel Computing (CPC\u201916)."},{"key":"e_1_2_1_185_1","doi-asserted-by":"publisher","DOI":"10.1145\/2764967.2764978"},{"key":"e_1_2_1_186_1","doi-asserted-by":"publisher","DOI":"10.1145\/2907950.2907959"},{"key":"e_1_2_1_187_1","volume-title":"Proceedings of the Parallel Processing Workshops (Euro-Par\u201917)","author":"Nobre Ricardo","unstructured":"Ricardo Nobre, Lu\u00eds Reis, and Jo\u00e3o M. P. Cardoso. 2018. Impact of compiler phase ordering when targeting GPUs. In Proceedings of the Parallel Processing Workshops (Euro-Par\u201917), Dora B. Heras and Luc Boug\u00e9 (Eds.). Springer International Publishing, Cham, 427--438."},{"key":"e_1_2_1_188_1","doi-asserted-by":"publisher","DOI":"10.5555\/646157.679859"},{"key":"e_1_2_1_189_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2017.7863744"},{"key":"e_1_2_1_191_1","doi-asserted-by":"publisher","DOI":"10.1145\/7902.7904"},{"key":"e_1_2_1_192_1","doi-asserted-by":"publisher","DOI":"10.1145\/764808.764833"},{"key":"e_1_2_1_193_1","doi-asserted-by":"publisher","DOI":"10.5555\/1233748.1233750"},{"key":"e_1_2_1_194_1","doi-asserted-by":"publisher","DOI":"10.5555\/1233748.1233750"},{"key":"e_1_2_1_195_1","doi-asserted-by":"publisher","DOI":"10.1093\/comjnl\/bxt129"},{"key":"e_1_2_1_196_1","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2004.47"},{"key":"e_1_2_1_197_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2006.38"},{"key":"e_1_2_1_198_1","doi-asserted-by":"publisher","DOI":"10.1145\/2259016.2259042"},{"key":"e_1_2_1_199_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10766-013-0241-1"},{"key":"e_1_2_1_200_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPP.2014.26"},{"key":"e_1_2_1_201_1","doi-asserted-by":"publisher","DOI":"10.1145\/2038698.2038711"},{"key":"e_1_2_1_203_1","doi-asserted-by":"publisher","DOI":"10.5555\/2568134"},{"key":"e_1_2_1_204_1","volume-title":"Bayesian Networks: A Model of Self-activated Memory for Evidential Reasoning. UCLA Technical report no. CSD-850017)","author":"Pearl Judea","year":"1985","unstructured":"Judea Pearl. 1985. Bayesian Networks: A Model of Self-activated Memory for Evidential Reasoning. UCLA Technical report no. CSD-850017); Proceedings of the 7th Conference of the Cognitive Science Society, vol. 3, 329--334."},{"key":"e_1_2_1_205_1","volume-title":"Automatic configuration of GCC using irace","author":"C\u00e1ceres Leslie P\u00e9rez","unstructured":"Leslie P\u00e9rez C\u00e1ceres, Federico Pagnozzi, Alberto Franzin, and Thomas St\u00fctzle. 2018. Automatic configuration of GCC using irace. In Artificial Evolution, Evelyne Lutton, Pierrick Legrand, Pierre Parrend, Nicolas Monmarch\u00e9, and Marc Schoenauer (Eds.). Springer International Publishing, Cham, 202--216."},{"key":"e_1_2_1_206_1","doi-asserted-by":"publisher","DOI":"10.5555\/1032659.1034235"},{"key":"e_1_2_1_207_1","volume-title":"Proceedings of the International Conference on Computer Languages. 281--290","author":"Pollock L. L.","unstructured":"L. L. Pollock and M. L. Soffa. 1990. Incremental global optimization for faster recompilations. In Proceedings of the International Conference on Computer Languages. 281--290."},{"key":"e_1_2_1_208_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2007.21"},{"key":"e_1_2_1_209_1","doi-asserted-by":"publisher","DOI":"10.1145\/1379022.1375594"},{"key":"e_1_2_1_210_1","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2010.14"},{"key":"e_1_2_1_211_1","volume-title":"Polybench: The polyhedral benchmark suite.","author":"Pouchet Louis-No\u00ebl","year":"2012","unstructured":"Louis-No\u00ebl Pouchet. 2012. Polybench: The polyhedral benchmark suite. Retrieved from http:\/\/www.cs.ucla.edu\/p\u00f6ouchet\/software\/polybench\/."},{"key":"e_1_2_1_212_1","doi-asserted-by":"publisher","DOI":"10.1145\/2400682.2400715"},{"key":"e_1_2_1_214_1","volume-title":"Recommender Systems Handbook","author":"Ricci Francesco","unstructured":"Francesco Ricci, Lior Rokach, and Bracha Shapira. 2011. Introduction to recommender systems handbook. In Recommender Systems Handbook. Springer, 1--35."},{"key":"e_1_2_1_215_1","volume-title":"Design of Experiments Using the Taguchi Approach: 16 Steps to Product and Process Improvement","author":"Roy Ranjit K.","unstructured":"Ranjit K. Roy. 2001. Design of Experiments Using the Taguchi Approach: 16 Steps to Product and Process Improvement. Wiley-Interscience."},{"key":"e_1_2_1_216_1","volume-title":"Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW\u201917)","author":"Rusira T.","unstructured":"T. Rusira, M. Hall, and P. Basu. 2017. Automating compiler-directed autotuning for phased performance behavior. In Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW\u201917). 1362--1371."},{"key":"e_1_2_1_217_1","doi-asserted-by":"publisher","DOI":"10.5555\/2190025.2190072"},{"key":"e_1_2_1_218_1","doi-asserted-by":"publisher","DOI":"10.1147\/rd.413.0233"},{"key":"e_1_2_1_219_1","doi-asserted-by":"publisher","DOI":"10.1145\/335231.335246"},{"key":"e_1_2_1_220_1","doi-asserted-by":"publisher","DOI":"10.1109\/6.591665"},{"key":"e_1_2_1_221_1","doi-asserted-by":"publisher","unstructured":"E. Schkufza R. Sharma and A. Aiken. 2014. Stochastic optimization of floating-point programs with tunable precision. ACM SIGPLAN Notices. Retrieved from http:\/\/dl.acm.org\/citation.cfm?id&equals;2594302. 10.1145\/2666356.2594302","DOI":"10.1145\/2666356.2594302"},{"key":"e_1_2_1_222_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2014.09.003"},{"key":"e_1_2_1_223_1","doi-asserted-by":"publisher","DOI":"10.1145\/800192.805690"},{"key":"e_1_2_1_224_1","unstructured":"Bernhard Sch\u00f6lkopf. 2001. The kernel trick for distances. In Advances in Neural Information Processing Systems. Curran Associates inc. 301--307."},{"key":"e_1_2_1_225_1","doi-asserted-by":"publisher","DOI":"10.1109\/CSE.2015.58"},{"key":"e_1_2_1_226_1","doi-asserted-by":"publisher","DOI":"10.5555\/2971808.2971971"},{"key":"e_1_2_1_227_1","doi-asserted-by":"publisher","DOI":"10.1145\/2903150.2903470"},{"key":"e_1_2_1_228_1","volume-title":"Proceedings of the International Conference on Embedded Computer Systems: Architecture, Modeling, and Simulation (SAMOS\u201917)","author":"Silvano Cristina","unstructured":"Cristina Silvano, Andrea Bartolini, Andrea Beccari, Candida Manelfi, Carlo Cavazzoni, Davide Gadioli, Erven Rohou, Gianluca Palermo, Giovanni Agosta, Jan Martinovi\u010d et al. 2017. The ANTAREX tool flow for monitoring and autotuning energy efficient HPC systems. In Proceedings of the International Conference on Embedded Computer Systems: Architecture, Modeling, and Simulation (SAMOS\u201917)."},{"key":"e_1_2_1_229_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISVLSI.2010.67"},{"key":"e_1_2_1_230_1","doi-asserted-by":"publisher","DOI":"10.1145\/3203217.3205338"},{"key":"e_1_2_1_231_1","volume-title":"Density estimation for statistics and data analysis","author":"Silverman Bernard W.","unstructured":"Bernard W. Silverman. 1986. Density estimation for statistics and data analysis. Vol. 26. CRC press."},{"key":"e_1_2_1_232_1","unstructured":"Richard Stallman. 2001. Using and porting the GNU compiler collection. In MIT Artificial Intelligence Laboratory. Citeseer."},{"key":"e_1_2_1_233_1","volume-title":"Stallman et al","author":"Richard","year":"2003","unstructured":"Richard M. Stallman et al. 2003. Using GCC: the GNU compiler collection reference manual. GNU Press."},{"key":"e_1_2_1_234_1","volume-title":"Proceedings of the Genetic and Evolutionary Computation Conference (GECCO\u201902)","author":"Stanley Kenneth O.","year":"2002","unstructured":"Kenneth O. Stanley. 2002. Efficient reinforcement learning through evolving neural network topologies. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO\u201902). Citeseer."},{"key":"e_1_2_1_235_1","unstructured":"M. W. Stephenson. 2006. Automating the construction of compiler heuristics using machine learning. Retrieved from http:\/\/groups.csail.mit.edu\/commit\/papers\/2006\/stephenson_phdthesis.pdf."},{"key":"e_1_2_1_236_1","doi-asserted-by":"publisher","unstructured":"M. Stephenson and S. Amarasinghe. 2003. Meta optimization: Improving compiler heuristics with machine learning. 38 5 (2003) 77--90. Retrieved from http:\/\/dl.acm.org\/citation.cfm?id&equals;781141. 10.1145\/780822.781141","DOI":"10.1145\/780822.781141"},{"key":"e_1_2_1_237_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2005.29"},{"key":"e_1_2_1_238_1","doi-asserted-by":"publisher","DOI":"10.5555\/1762668.1762691"},{"key":"e_1_2_1_239_1","volume-title":"Multiple Criteria Optimization: Theory, Computation, and Applications","author":"Steuer Ralph E.","unstructured":"Ralph E. Steuer. 1986. Multiple Criteria Optimization: Theory, Computation, and Applications. Wiley."},{"key":"e_1_2_1_240_1","doi-asserted-by":"publisher","DOI":"10.1145\/2086696.2086729"},{"key":"e_1_2_1_241_1","doi-asserted-by":"publisher","DOI":"10.1147\/sj.391.0175"},{"key":"e_1_2_1_242_1","doi-asserted-by":"publisher","DOI":"10.5555\/762761.762771"},{"key":"e_1_2_1_243_1","doi-asserted-by":"publisher","DOI":"10.1145\/2287016.2287023"},{"key":"e_1_2_1_244_1","doi-asserted-by":"publisher","DOI":"10.1145\/2400682.2400705"},{"key":"e_1_2_1_245_1","doi-asserted-by":"publisher","DOI":"10.1145\/2400682.2400705"},{"key":"e_1_2_1_246_1","doi-asserted-by":"publisher","DOI":"10.5555\/2998981.2999131"},{"key":"e_1_2_1_247_1","doi-asserted-by":"publisher","DOI":"10.1002\/j.1556-6678.2002.tb00167.x"},{"key":"e_1_2_1_248_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-13374-9_28"},{"key":"e_1_2_1_249_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2009.5161054"},{"key":"e_1_2_1_250_1","doi-asserted-by":"publisher","DOI":"10.1145\/1543135.1542496"},{"key":"e_1_2_1_251_1","doi-asserted-by":"publisher","DOI":"10.5555\/776261.776284"},{"key":"e_1_2_1_252_1","doi-asserted-by":"publisher","DOI":"10.5555\/2721682"},{"key":"e_1_2_1_253_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2007.25"},{"key":"e_1_2_1_254_1","doi-asserted-by":"publisher","DOI":"10.5555\/1014194.800942"},{"key":"e_1_2_1_255_1","doi-asserted-by":"publisher","DOI":"10.1177\/1094342004041293"},{"key":"e_1_2_1_256_1","doi-asserted-by":"crossref","unstructured":"Richard W. Vuduc. 2011. Autotuning. Springer Boston MA 102--105.","DOI":"10.1007\/978-0-387-09766-4_68"},{"key":"e_1_2_1_257_1","doi-asserted-by":"publisher","DOI":"10.1145\/106975.106991"},{"key":"e_1_2_1_258_1","volume-title":"Proceedings of the Workshop on Polyhedral Compilation Techniques.","author":"Wang Wei","year":"2014","unstructured":"Wei Wang, John Cavazos, and Allan Porterfield. 2014. Energy auto-tuning using the polyhedral approach. In Proceedings of the Workshop on Polyhedral Compilation Techniques."},{"key":"e_1_2_1_259_1","doi-asserted-by":"publisher","DOI":"10.1145\/1594835.1504189"},{"key":"e_1_2_1_261_1","doi-asserted-by":"publisher","DOI":"10.1145\/113446.113456"},{"key":"e_1_2_1_262_1","doi-asserted-by":"publisher","DOI":"10.1145\/99163.99179"},{"key":"e_1_2_1_263_1","doi-asserted-by":"publisher","DOI":"10.1145\/267959.267960"},{"key":"e_1_2_1_264_1","unstructured":"Doran K. Wilde. 1993. A Library for Doing Polyhedral Operations. Technical report no. 785. IRISA."},{"key":"e_1_2_1_265_1","doi-asserted-by":"publisher","DOI":"10.1145\/1498765.1498785"},{"key":"e_1_2_1_266_1","doi-asserted-by":"publisher","DOI":"10.1145\/193209.193217"},{"issue":"6","key":"e_1_2_1_267_1","first-page":"078","article-title":"Method and apparatus for improving compiler performance during subsequent compilations of a source program","author":"Wolczko M. I.","year":"2000","unstructured":"M. I. Wolczko and D. M. Ungar. 2000. Method and apparatus for improving compiler performance during subsequent compilations of a source program. U.S. Patent No. 6,078,744. Retrieved from https:\/\/www.google.com\/patents\/US6078744.","journal-title":"U.S. Patent"},{"key":"e_1_2_1_268_1","doi-asserted-by":"publisher","DOI":"10.1109\/FPT.2008.4762420"},{"key":"e_1_2_1_269_1","doi-asserted-by":"publisher","DOI":"10.5555\/578752"},{"key":"e_1_2_1_270_1","volume-title":"Alphaz: A system for analysis, transformation, and code generation in the polyhedral equational model.","author":"Yuki T.","year":"2012","unstructured":"T. Yuki, V. Basupalli, G. Gupta, G. Iooss, and D. Kim. 2012. Alphaz: A system for analysis, transformation, and code generation in the polyhedral equational model. Retrieved from http:\/\/www.cs.colostate.edu\/TechReports\/Reports\/2012\/tr12-101.pdf."},{"key":"e_1_2_1_271_1","volume-title":"Proceedings of the International Workshop on Languages and Compilers for Parallel Computing. 17--31","author":"Yuki T.","year":"2012","unstructured":"T. Yuki, G. Gupta, D. G. Kim, T. Pathan, and S. Rajopadhye. 2012. AlphaZ: A system for design space exploration in the polyhedral model, In Proceedings of the International Workshop on Languages and Compilers for Parallel Computing. 17--31. Retrieved from http:\/\/people.rennes.inria.fr\/Tomofumi.Yuki\/papers\/yuki-lcpc2012.pdf."},{"key":"e_1_2_1_272_1","volume-title":"Proceedings of the 23rd International Conference on Architecture of Computing Systems (ARCS\u201910)","author":"Zaccaria Vittorio","year":"2010","unstructured":"Vittorio Zaccaria, Gianluca Palermo, Fabrizio Castro, Cristina Silvano, and Giovanni Mariani. 2010. Multicube explorer: An open source framework for design space exploration of chip multi-processors. In Proceedings of the 23rd International Conference on Architecture of Computing Systems (ARCS\u201910). VDE, 1--7."},{"key":"e_1_2_1_273_1","doi-asserted-by":"publisher","DOI":"10.1145\/780732.780734"},{"key":"e_1_2_1_274_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2005.2"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3197978","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3197978","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T17:49:23Z","timestamp":1750268963000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3197978"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,9,18]]},"references-count":268,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2019,9,30]]}},"alternative-id":["10.1145\/3197978"],"URL":"https:\/\/doi.org\/10.1145\/3197978","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,9,18]]},"assertion":[{"value":"2016-11-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-03-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-09-18","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}