{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,1]],"date-time":"2025-07-01T11:33:38Z","timestamp":1751369618830,"version":"3.41.0"},"reference-count":36,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2023,7,22]],"date-time":"2023-07-22T00:00:00Z","timestamp":1689984000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004359","name":"Swedish Research Council","doi-asserted-by":"crossref","award":["2019-04929"],"award-info":[{"award-number":["2019-04929"]}],"id":[{"id":"10.13039\/501100004359","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Swedish Foundation for Strategic Research","award":["CHI19-0048"],"award-info":[{"award-number":["CHI19-0048"]}]},{"name":"European Union","award":["EU-101051997"],"award-info":[{"award-number":["EU-101051997"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Archit. Code Optim."],"published-print":{"date-parts":[[2023,9,30]]},"abstract":"<jats:p>\n            Reducing energy consumption while providing performance and quality guarantees is crucial for computing systems ranging from battery-powered embedded systems to data centers. This article considers approximate iterative applications executing on heterogeneous multi-core platforms under user-specified performance and quality targets. We note that allowing a slight yet bounded relaxation in solution quality can considerably reduce the required iteration count and thereby can save significant amounts of energy. 
To this end, this article proposes\n            <jats:italic>Approx-RM<\/jats:italic>\n            , a resource management scheme that reduces energy expenditure while guaranteeing a specified performance\n            <jats:italic>as well as<\/jats:italic>\n            accuracy target.\n            <jats:italic>Approx-RM<\/jats:italic>\n            predicts the number of iterations required to meet the relaxed accuracy target at runtime. The time saved generates execution-time slack, which allows\n            <jats:italic>Approx-RM<\/jats:italic>\n            to allocate fewer resources on a heterogeneous multi-core platform in terms of DVFS, core type, and core count to save energy while meeting the performance target.\n            <jats:italic>Approx-RM<\/jats:italic>\n            contributes lightweight methods for predicting the iteration count needed to meet the accuracy target and the resources needed to meet the performance target.\n            <jats:italic>Approx-RM<\/jats:italic>\n            uses the aforementioned predictions to allocate\n            <jats:italic>just enough<\/jats:italic>\n            resources to comply with quality-of-service constraints and thereby save energy. 
Our evaluation shows energy savings of 31.6%, on average, compared to\n            <jats:italic>Race-to-idle<\/jats:italic>\n            when the accuracy is only relaxed by 1%.\n            <jats:italic>Approx-RM<\/jats:italic>\n            incurs timing and energy overheads of less than 0.1%.\n          <\/jats:p>","DOI":"10.1145\/3605214","type":"journal-article","created":{"date-parts":[[2023,6,22]],"date-time":"2023-06-22T20:10:46Z","timestamp":1687464646000},"page":"1-25","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Approx-RM: Reducing Energy on Heterogeneous Multicore Processors under Accuracy and Timing Constraints"],"prefix":"10.1145","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0477-4540","authenticated-orcid":false,"given":"Muhammad Waqar","family":"Azhar","sequence":"first","affiliation":[{"name":"Chalmers University of Technology, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9783-8357","authenticated-orcid":false,"given":"Madhavan","family":"Manivannan","sequence":"additional","affiliation":[{"name":"Chalmers University of Technology, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7441-8245","authenticated-orcid":false,"given":"Per","family":"Stenstr\u00f6m","sequence":"additional","affiliation":[{"name":"Chalmers University of Technology, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,7,22]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/SIES.2015.7185039"},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1145\/2556953"},{"key":"e_1_3_1_4_2","unstructured":"M. Waqar Azhar. 2021. Workloads for Approx-RM. 
https:\/\/github.com\/waqarazhar\/Approx-RM-Workloads."},{"key":"e_1_3_1_5_2","volume-title":"Proceedings of the 48th International Conference on Parallel Processing (ICPP\u201919)","author":"Azhar M. Waqar","year":"2019","unstructured":"M. Waqar Azhar, Miquel Peric\u00e0s, and Per Stenstr\u00f6m. 2019. SaC: Exploiting execution-time slack to save energy in heterogeneous multicore systems. In Proceedings of the 48th International Conference on Parallel Processing (ICPP\u201919). ACM, Article 26, 12 pages. 10.1145\/3337821.3337865"},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1145\/3494537"},{"issue":"4","key":"e_1_3_1_7_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3148053","article-title":"SLOOP: QoS-supervised loop execution to reduce energy on heterogeneous architectures","volume":"14","author":"Azhar M. Waqar","year":"2017","unstructured":"M. Waqar Azhar, Per Stenstr\u00f6m, and Vassilis Papaefstathiou. 2017. SLOOP: QoS-supervised loop execution to reduce energy on heterogeneous architectures. ACM Transactions on Architecture and Code Optimization (TACO) 14, 4 (2017), 1\u201325.","journal-title":"ACM Transactions on Architecture and Code Optimization (TACO)"},{"key":"e_1_3_1_8_2","unstructured":"Barcelona Supercomputing Center. 2021. BSC application repository. Retrieved from https:\/\/pm.bsc.es\/projects\/bar"},{"key":"e_1_3_1_9_2","unstructured":"Hongsuk Chung. 2013. Heterogeneous Multi-Processing Solution of Exynos 5 Octa with ARM big.LITTLE Technology. https:\/\/www.semanticscholar.org\/paper\/Heterogeneous-MultiProcessing-Solution-of-Exynos-5-Chung\/0"},{"key":"e_1_3_1_10_2","doi-asserted-by":"crossref","unstructured":"Sai Santosh Dayapule, Fan Yao, and Guru Venkataramani. 2019. PowerStar: Improving power efficiency in heterogenous processors for bursty workloads with approximate computing. In 2019 IEEE International Conference on Cloud Computing Technology and Science (CloudCom). 175\u2013182. 
DOI:10.1109\/CloudCom.2019.00035","DOI":"10.1109\/CloudCom.2019.00035"},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1145\/3004054"},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1145\/2556583"},{"key":"e_1_3_1_13_2","first-page":"1","volume-title":"2016 International Conference on Hardware\/Software Codesign and System Synthesis (CODES+ISSS\u201916)","author":"Donyanavard B.","year":"2016","unstructured":"B. Donyanavard, T. M\u00fcck, S. Sarma, and N. Dutt. 2016. SPARTA: Runtime task allocation for energy efficient heterogeneous manycores. In 2016 International Conference on Hardware\/Software Codesign and System Synthesis (CODES+ISSS\u201916). 1\u201310."},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1145\/1952998.1952999"},{"key":"e_1_3_1_15_2","first-page":"421","volume-title":"2016 USENIX Annual Technical Conference (USENIX ATC\u201916)","author":"Farrell Anne","year":"2016","unstructured":"Anne Farrell and Henry Hoffmann. 2016. MEANTIME: Achieving both minimal energy and timeliness with approximate computing. In 2016 USENIX Annual Technical Conference (USENIX ATC\u201916). USENIX Association, Denver, CO, 421\u2013435. https:\/\/www.usenix.org\/conference\/atc16\/technical-sessions\/presentation\/farrell."},{"key":"e_1_3_1_16_2","doi-asserted-by":"crossref","first-page":"198","DOI":"10.1145\/2815400.2815403","volume-title":"Proceedings of the 25th Symposium on Operating Systems Principles (SOSP\u201915)","author":"Hoffmann Henry","year":"2015","unstructured":"Henry Hoffmann. 2015. JouleGuard: Energy guarantees for approximate applications. In Proceedings of the 25th Symposium on Operating Systems Principles (SOSP\u201915). Association for Computing Machinery, New York, NY, 198\u2013214. 
10.1145\/2815400.2815403"},{"key":"e_1_3_1_17_2","first-page":"199","volume-title":"Proceedings of the 16th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS XVI)","author":"Hoffmann Henry","year":"2011","unstructured":"Henry Hoffmann, Stelios Sidiroglou, Michael Carbin, Sasa Misailovic, Anant Agarwal, and Martin Rinard. 2011. Dynamic knobs for responsive power-aware computing. In Proceedings of the 16th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS XVI). Association for Computing Machinery, New York, NY, 199\u2013212. 10.1145\/1950365.1950390"},{"key":"e_1_3_1_18_2","doi-asserted-by":"crossref","first-page":"250","DOI":"10.1109\/MICRO.2001.991123","volume-title":"Proceedings 34th ACM\/IEEE International Symposium on Microarchitecture (MICRO-34)","author":"Hughes C. J.","year":"2001","unstructured":"C. J. Hughes, J. Srinivasan, and S. V. Adve. 2001. Saving energy with architectural and frequency adaptations for multimedia applications. In Proceedings 34th ACM\/IEEE International Symposium on Microarchitecture (MICRO-34). 250\u2013261. 10.1109\/MICRO.2001.991123"},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1145\/2724942.2724950"},{"issue":"1","key":"e_1_3_1_20_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/978-3-031-01721-6","article-title":"Computer architecture techniques for power-efficiency","volume":"3","author":"Kaxiras Stefanos","year":"2008","unstructured":"Stefanos Kaxiras and Margaret Martonosi. 2008. Computer architecture techniques for power-efficiency. 
Synthesis Lectures on Computer Architecture 3, 1 (2008), 1\u2013207.","journal-title":"Synthesis Lectures on Computer Architecture"},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.1109\/CPSNA.2015.23"},{"key":"e_1_3_1_22_2","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1109\/HPCA.2019.00035","volume-title":"2019 IEEE International Symposium on High Performance Computer Architecture (HPCA\u201919)","author":"Kulkarni Neeraj","year":"2019","unstructured":"Neeraj Kulkarni, Feng Qi, and Christina Delimitrou. 2019. Pliant: Leveraging approximation to improve datacenter resource efficiency. In 2019 IEEE International Symposium on High Performance Computer Architecture (HPCA\u201919). IEEE, 159\u2013171."},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2006.1598114"},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/VLSI-DAT.2016.7482541"},{"key":"e_1_3_1_25_2","article-title":"Hardware thread scheduling algorithms for single-ISA asymmetric CMPs","author":"Markovic Nikola","year":"2015","unstructured":"Nikola Markovic. 2015. Hardware thread scheduling algorithms for single-ISA asymmetric CMPs. TDX (Tesis Doctorals en Xarxa). https:\/\/upcommons.upc.edu\/handle\/2117\/96039.","journal-title":"TDX (Tesis Doctorals en Xarxa)"},{"key":"e_1_3_1_26_2","first-page":"23","volume-title":"Proceedings of the 1996 Annual Conference on USENIX Annual Technical Conference (ATEC\u201996)","author":"McVoy Larry","year":"1996","unstructured":"Larry McVoy and Carl Staelin. 1996. Lmbench: Portable tools for performance analysis. In Proceedings of the 1996 Annual Conference on USENIX Annual Technical Conference (ATEC\u201996). 
USENIX Association, 23."},{"key":"e_1_3_1_27_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2020.05.006"},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2019.00040"},{"key":"e_1_3_1_29_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCAD.2012.2235126"},{"key":"e_1_3_1_30_2","first-page":"2825","article-title":"Scikit-learn: Machine learning in Python","volume":"12","author":"Pedregosa F.","year":"2011","unstructured":"F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. 2011. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 12 (2011), 2825\u20132830.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2015.7056037"},{"key":"e_1_3_1_32_2","first-page":"262","volume-title":"Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP\u201921)","author":"Sharif Hashim","year":"2021","unstructured":"Hashim Sharif, Yifan Zhao, Maria Kotsifakou, Akash Kothari, Ben Schreiber, Elizabeth Wang, Yasmin Sarita, Nathan Zhao, Keyur Joshi, Vikram S. Adve, Sasa Misailovic, and Sarita Adve. 2021. ApproxTuner: A compiler and runtime system for adaptive approximations. In Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP\u201921). Association for Computing Machinery, New York, NY, 262\u2013277. 
10.1145\/3437801.3446108"},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","DOI":"10.1145\/2714575"},{"issue":"5","key":"e_1_3_1_34_2","doi-asserted-by":"crossref","first-page":"1078","DOI":"10.1007\/s10766-016-0409-6","article-title":"Exploiting significance of computations for energy-constrained approximate computing","volume":"44","author":"Vassiliadis Vassilis","year":"2016","unstructured":"Vassilis Vassiliadis, Charalampos Chalios, Konstantinos Parasyris, Christos D. Antonopoulos, Spyros Lalis, Nikolaos Bellas, Hans Vandierendonck, and Dimitrios S. Nikolopoulos. 2016. Exploiting significance of computations for energy-constrained approximate computing. International Journal of Parallel Programming 44, 5 (2016), 1078\u20131098.","journal-title":"International Journal of Parallel Programming"},{"key":"e_1_3_1_35_2","first-page":"119","volume-title":"20th USENIX Symposium on Networked Systems Design and Implementation (NSDI\u201923)","author":"You Jie","year":"2023","unstructured":"Jie You, Jae-Won Chung, and Mosharaf Chowdhury. 2023. Zeus: Understanding and optimizing GPU energy consumption of DNN training. In 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI\u201923). 119\u2013139."},{"key":"e_1_3_1_36_2","first-page":"701","volume-title":"2015 Design, Automation & Test in Europe Conference & Exhibition (DATE\u201915)","author":"Zhang Qian","year":"2015","unstructured":"Qian Zhang, Ting Wang, Ye Tian, Feng Yuan, and Qiang Xu. 2015. ApproxANN: An approximate computing framework for artificial neural network. In 2015 Design, Automation & Test in Europe Conference & Exhibition (DATE\u201915). IEEE, 701\u2013706."},{"key":"e_1_3_1_37_2","first-page":"1","volume-title":"Proceedings of the 51st Annual Design Automation Conference (DAC\u201914)","author":"Zhang Qian","year":"2014","unstructured":"Qian Zhang, Feng Yuan, Rong Ye, and Qiang Xu. 2014. ApproxIt: An approximate computing framework for iterative methods. 
In Proceedings of the 51st Annual Design Automation Conference (DAC\u201914). Association for Computing Machinery, New York, NY, 1\u20136. 10.1145\/2593069.2593092"}],"container-title":["ACM Transactions on Architecture and Code Optimization"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3605214","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3605214","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:03:55Z","timestamp":1750291435000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3605214"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,22]]},"references-count":36,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2023,9,30]]}},"alternative-id":["10.1145\/3605214"],"URL":"https:\/\/doi.org\/10.1145\/3605214","relation":{},"ISSN":["1544-3566","1544-3973"],"issn-type":[{"type":"print","value":"1544-3566"},{"type":"electronic","value":"1544-3973"}],"subject":[],"published":{"date-parts":[[2023,7,22]]},"assertion":[{"value":"2022-10-05","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-06-15","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-07-22","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}