{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T05:00:25Z","timestamp":1755838825845,"version":"3.41.0"},"reference-count":42,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2019,6,17]],"date-time":"2019-06-17T00:00:00Z","timestamp":1560729600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Energy-Efficiency Tools for High-Performance Multi- and Many-core Applications"},{"name":"AWE William Penney Fellow"},{"name":"University of Warwick and Allinea Software Ltd."},{"name":"UK Technology Strategy Board project","award":["131197"],"award-info":[{"award-number":["131197"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Archit. Code Optim."],"published-print":{"date-parts":[[2019,9,30]]},"abstract":"<jats:p>Advances in processor design have delivered performance improvements for decades. As physical limits are reached, refinements to the same basic technologies are beginning to yield diminishing returns. Unsustainable increases in energy consumption are forcing hardware manufacturers to prioritise energy efficiency in their designs. Research suggests that software modifications may be needed to exploit the resulting improvements in current and future hardware. New tools are required to capitalise on this new class of optimisation.<\/jats:p>\n          <jats:p>In this article, we present the Power Optimised Software Envelope (POSE) model, which allows developers to assess the potential benefits of power optimisation for their applications. 
The POSE model is metric agnostic and in this article, we provide derivations using the established Energy-Delay Product metric and the novel Energy-Delay Sum and Energy-Delay Distance metrics that we believe are more appropriate for energy-aware optimisation efforts. We demonstrate POSE on three platforms by studying the optimisation characteristics of applications from the Mantevo benchmark suite. Our results show that the Pathfinder application has very little scope for power optimisation while TeaLeaf has the most, with all other applications in the benchmark suite falling between the two.<\/jats:p>\n          <jats:p>Finally, we extend our POSE model with a formulation known as System Summary POSE\u2014a meta-heuristic that allows developers to assess the scope a system has for energy-aware software optimisation independent of the code being run.<\/jats:p>","DOI":"10.1145\/3321551","type":"journal-article","created":{"date-parts":[[2019,6,18]],"date-time":"2019-06-18T12:14:26Z","timestamp":1560860066000},"page":"1-27","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":9,"title":["The Power-optimised Software Envelope"],"prefix":"10.1145","volume":"16","author":[{"given":"Stephen I.","family":"Roberts","sequence":"first","affiliation":[{"name":"Arm Ltd., UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7133-8533","authenticated-orcid":false,"given":"Steven A.","family":"Wright","sequence":"additional","affiliation":[{"name":"University of York, UK"}]},{"given":"Suhaib A.","family":"Fahmy","sequence":"additional","affiliation":[{"name":"University of Warwick, UK"}]},{"given":"Stephen A.","family":"Jarvis","sequence":"additional","affiliation":[{"name":"University of Warwick, UK"}]}],"member":"320","published-online":{"date-parts":[[2019,6,17]]},"reference":[{"unstructured":"2012. Advanced Micro Devices. BIOS and Kernel Developer\u2019s Guide (BKDG) for AMD Family 15h Models 00h-0Fh Processors. 
Retrieved from https:\/\/www.amd.com\/system\/files\/TechDocs\/42301_15h_Mod_00h-0Fh_BKDG.pdf.  2012. Advanced Micro Devices. BIOS and Kernel Developer\u2019s Guide (BKDG) for AMD Family 15h Models 00h-0Fh Processors. Retrieved from https:\/\/www.amd.com\/system\/files\/TechDocs\/42301_15h_Mod_00h-0Fh_BKDG.pdf.","key":"e_1_2_1_1_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_2_1","DOI":"10.5555\/271559.271565"},{"doi-asserted-by":"publisher","key":"e_1_2_1_3_1","DOI":"10.1145\/1465482.1465560"},{"volume-title":"Proceedings of the IEEE SoutheastCon. 479--484","author":"Bedard D.","unstructured":"D. Bedard , M. Y. Lim , R. Fowler , and A. Porterfield . 2010. Powermon: Fine-grained and integrated power monitoring for commodity computer systems . In Proceedings of the IEEE SoutheastCon. 479--484 . D. Bedard, M. Y. Lim, R. Fowler, and A. Porterfield. 2010. Powermon: Fine-grained and integrated power monitoring for commodity computer systems. In Proceedings of the IEEE SoutheastCon. 479--484.","key":"e_1_2_1_4_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_5_1","DOI":"10.1007\/s00450-010-0119-z"},{"doi-asserted-by":"publisher","key":"e_1_2_1_6_1","DOI":"10.1109\/ISPA.2008.127"},{"volume-title":"Proceedings of the International Symposium on Computer Architecture (ISCA\u201900)","author":"Brooks D.","unstructured":"D. Brooks , V. Tiwari , and M. Martonosi . 2000. Wattch: A framework for architectural-level power analysis and optimizations . In Proceedings of the International Symposium on Computer Architecture (ISCA\u201900) . 83--94. D. Brooks, V. Tiwari, and M. Martonosi. 2000. Wattch: A framework for architectural-level power analysis and optimizations. In Proceedings of the International Symposium on Computer Architecture (ISCA\u201900). 
83--94.","key":"e_1_2_1_7_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_8_1","DOI":"10.1145\/2588768.2576783"},{"volume-title":"Proceedings of the IEEE International Performance Computing and Communications Conference (IPCCC\u201900)","author":"Cao J.","unstructured":"J. Cao , D. Kerbyson , E. Papaefstathiou , and G. Nudd . 2000. Performance modelling of parallel and distributed computing using PACE . In Proceedings of the IEEE International Performance Computing and Communications Conference (IPCCC\u201900) . 485--492. J. Cao, D. Kerbyson, E. Papaefstathiou, and G. Nudd. 2000. Performance modelling of parallel and distributed computing using PACE. In Proceedings of the IEEE International Performance Computing and Communications Conference (IPCCC\u201900). 485--492.","key":"e_1_2_1_9_1"},{"volume-title":"Proceedings of the IEEE International Symposium on Parallel & Distributed Processing (IPDPS\u201913)","author":"Choi J. W.","unstructured":"J. W. Choi , D. Bedard , R. Fowler , and R. Vuduc . 2013. A roofline model of energy . In Proceedings of the IEEE International Symposium on Parallel & Distributed Processing (IPDPS\u201913) . 661--672. J. W. Choi, D. Bedard, R. Fowler, and R. Vuduc. 2013. A roofline model of energy. In Proceedings of the IEEE International Symposium on Parallel & Distributed Processing (IPDPS\u201913). 661--672.","key":"e_1_2_1_10_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_11_1","DOI":"10.1145\/155332.155333"},{"volume-title":"Proceedings of the International Symposium on Low-Power Electronics and Design (ISLPED\u201910)","author":"David H.","unstructured":"H. David , E. Gorbatov , U. R. Hanebutte , R. Khanna , and C. Le . 2010. RAPL: Memory power estimation and capping . In Proceedings of the International Symposium on Low-Power Electronics and Design (ISLPED\u201910) . 189--194. H. David, E. Gorbatov, U. R. Hanebutte, R. Khanna, and C. Le. 2010. RAPL: Memory power estimation and capping. 
In Proceedings of the International Symposium on Low-Power Electronics and Design (ISLPED\u201910). 189--194.","key":"e_1_2_1_12_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_13_1","DOI":"10.1007\/978-3-319-58667-0_21"},{"doi-asserted-by":"publisher","key":"e_1_2_1_14_1","DOI":"10.1109\/CloudCom.2015.60"},{"doi-asserted-by":"publisher","key":"e_1_2_1_15_1","DOI":"10.1109\/TPDS.2007.1026"},{"doi-asserted-by":"publisher","key":"e_1_2_1_16_1","DOI":"10.1109\/TPDS.2009.76"},{"doi-asserted-by":"publisher","key":"e_1_2_1_17_1","DOI":"10.1109\/4.535411"},{"volume-title":"Proceedings of the Energy Efficient Supercomputing Workshop (E2SC\u201914)","author":"Hackenberg D.","unstructured":"D. Hackenberg , T. Ilsche , J. Schuchart , R. Sch\u00f6ne , W. E. Nagel , M. Simon , and Y. Georgiou . 2014. HDEEM: High definition energy efficiency monitoring . In Proceedings of the Energy Efficient Supercomputing Workshop (E2SC\u201914) . 1--10. D. Hackenberg, T. Ilsche, J. Schuchart, R. Sch\u00f6ne, W. E. Nagel, M. Simon, and Y. Georgiou. 2014. HDEEM: High definition energy efficiency monitoring. In Proceedings of the Energy Efficient Supercomputing Workshop (E2SC\u201914). 1--10.","key":"e_1_2_1_18_1"},{"volume-title":"Proceedings of the International Green Computing Conference (IGCC\u201913)","author":"Hackenberg D.","unstructured":"D. Hackenberg , R. Oldenburg , D. Molka , and R. Sch\u00f6ne . 2013. Introducing FIRESTARTER: A processor stress test utility . In Proceedings of the International Green Computing Conference (IGCC\u201913) . 1--9. D. Hackenberg, R. Oldenburg, D. Molka, and R. Sch\u00f6ne. 2013. Introducing FIRESTARTER: A processor stress test utility. In Proceedings of the International Green Computing Conference (IGCC\u201913). 1--9.","key":"e_1_2_1_19_1"},{"volume-title":"Proceedings of the IEEE Conference on Thermal and Thermomechanical Phenomena in Electronic Systems. 395--403","author":"Hackenberg D.","unstructured":"D. Hackenberg and M. K. Patterson . 2016. 
Evaluation of a new data center air-cooling architecture: The down-flow plenum . In Proceedings of the IEEE Conference on Thermal and Thermomechanical Phenomena in Electronic Systems. 395--403 . D. Hackenberg and M. K. Patterson. 2016. Evaluation of a new data center air-cooling architecture: The down-flow plenum. In Proceedings of the IEEE Conference on Thermal and Thermomechanical Phenomena in Electronic Systems. 395--403.","key":"e_1_2_1_20_1"},{"volume-title":"Proceedings of the International Conference on Simulation Tools and Techniques (Simutools\u201909)","author":"Hammond S. D.","unstructured":"S. D. Hammond , G. R. Mudalige , J. A. Smith , S. A. Jarvis , J. A. Herdman , and A. Vadgama . 2009. WARPP: A toolkit for simulating high-performance parallel scientific codes . In Proceedings of the International Conference on Simulation Tools and Techniques (Simutools\u201909) . 19:1--19:10. S. D. Hammond, G. R. Mudalige, J. A. Smith, S. A. Jarvis, J. A. Herdman, and A. Vadgama. 2009. WARPP: A toolkit for simulating high-performance parallel scientific codes. In Proceedings of the International Conference on Simulation Tools and Techniques (Simutools\u201909). 19:1--19:10.","key":"e_1_2_1_21_1"},{"volume-title":"Proceedings of the International Symposium on Software Metrics. 58--69","author":"Harman M.","unstructured":"M. Harman and J. Clark . 2004. Metrics are fitness functions too . In Proceedings of the International Symposium on Software Metrics. 58--69 . M. Harman and J. Clark. 2004. Metrics are fitness functions too. In Proceedings of the International Symposium on Software Metrics. 58--69.","key":"e_1_2_1_22_1"},{"volume-title":"SAND2009-5574","author":"Heroux M. A.","unstructured":"M. A. Heroux , D. W. Doerfler , P. S. Crozier , J. M. Willenbring , H. C. Edwards , A. Williams , M. Rajan , E. R. Keiter , H. K. Thornquist , and R. W. Numrich . 2009. Improving performance via mini-applications . SAND2009-5574 . Tech Report. 
Retrieved from http:\/\/www.cs.sandia.gov\/~maherou\/docs\/MantevoOverview.pdf. M. A. Heroux, D. W. Doerfler, P. S. Crozier, J. M. Willenbring, H. C. Edwards, A. Williams, M. Rajan, E. R. Keiter, H. K. Thornquist, and R. W. Numrich. 2009. Improving performance via mini-applications. SAND2009-5574. Tech Report. Retrieved from http:\/\/www.cs.sandia.gov\/~maherou\/docs\/MantevoOverview.pdf.","key":"e_1_2_1_23_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_24_1","DOI":"10.1145\/263272.263310"},{"doi-asserted-by":"crossref","unstructured":"R. Karp and V. Ramachandran. 1990. Parallel algorithms for shared-memory machines. In Handbook of Theoretical Computer Science. MIT Press Cambridge MA 869--941.   R. Karp and V. Ramachandran. 1990. Parallel algorithms for shared-memory machines. In Handbook of Theoretical Computer Science. MIT Press Cambridge MA 869--941.","key":"e_1_2_1_25_1","DOI":"10.1016\/B978-0-444-88071-0.50022-9"},{"doi-asserted-by":"crossref","unstructured":"J. H. Laros K. Pedretti S. M. Kelly Wei Shu K. Ferreira J. Vandyke and C. Vaughan. 2013. Energy delay product. In Energy-Efficient High Performance Computing: Measurement and Tuning. Springer 51--55.  J. H. Laros K. Pedretti S. M. Kelly Wei Shu K. Ferreira J. Vandyke and C. Vaughan. 2013. Energy delay product. In Energy-Efficient High Performance Computing: Measurement and Tuning. Springer 51--55.","key":"e_1_2_1_26_1","DOI":"10.1007\/978-1-4471-4492-2_8"},{"volume-title":"Proceedings of the International Green Computing Conference (IGCC\u201913)","author":"Laros J. H.","unstructured":"J. H. Laros , P. Pokorny , and D. DeBonis . 2013. PowerInsight\u2014A commodity power measurement capability . In Proceedings of the International Green Computing Conference (IGCC\u201913) . 1--6. J. H. Laros, P. Pokorny, and D. DeBonis. 2013. PowerInsight\u2014A commodity power measurement capability. In Proceedings of the International Green Computing Conference (IGCC\u201913). 
1--6.","key":"e_1_2_1_27_1"},{"key":"e_1_2_1_28_1","first-page":"1","article-title":"Modeling performance and energy for applications offloaded to Intel Xeon Phi","volume":"7","author":"Lawson G.","year":"2015","unstructured":"G. Lawson , V. Sundriyal , M. Sosonkina , and Y. Shen . 2015 . Modeling performance and energy for applications offloaded to Intel Xeon Phi . In Proceedings of the International Workshop on Hardware-Software Co-Design for High Performance Computing. 7 : 1 -- 7 :8. G. Lawson, V. Sundriyal, M. Sosonkina, and Y. Shen. 2015. Modeling performance and energy for applications offloaded to Intel Xeon Phi. In Proceedings of the International Workshop on Hardware-Software Co-Design for High Performance Computing. 7:1--7:8.","journal-title":"Proceedings of the International Workshop on Hardware-Software Co-Design for High Performance Computing."},{"volume-title":"Proceedings of the IEEE\/ACM International Symposium on Microarchitecture (MICRO\u201909)","author":"Li S.","unstructured":"S. Li , J. Ahn , R. D. Strong , J. B. Brockman , D. M. Tullsen , and N. P. Jouppi . 2009. McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures . In Proceedings of the IEEE\/ACM International Symposium on Microarchitecture (MICRO\u201909) . 469--480. S. Li, J. Ahn, R. D. Strong, J. B. Brockman, D. M. Tullsen, and N. P. Jouppi. 2009. McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures. In Proceedings of the IEEE\/ACM International Symposium on Microarchitecture (MICRO\u201909). 469--480.","key":"e_1_2_1_29_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_30_1","DOI":"10.1109\/SBAC-PAD.2012.38"},{"doi-asserted-by":"crossref","unstructured":"A. J. Martin M. Nystr\u00f6m and P. I. P\u00e9nzes. 2002. ET<sup>2<\/sup>: A metric for time and energy efficiency of computation. In Power Aware Computing. Springer 293--315.   A. J. Martin M. Nystr\u00f6m and P. I. P\u00e9nzes. 2002. 
ET <sup>2<\/sup>: A metric for time and energy efficiency of computation. In Power Aware Computing. Springer 293--315.","key":"e_1_2_1_31_1","DOI":"10.1007\/978-1-4757-6217-4_15"},{"doi-asserted-by":"publisher","key":"e_1_2_1_32_1","DOI":"10.1145\/1281700.1281716"},{"volume-title":"Proceedings of the Energy Efficient Supercomputing Workshop (E2SC\u201915)","author":"Pedretti K.","unstructured":"K. Pedretti , S. L. Olivier , K. B. Ferreira , G. Shipman , and W. Shu . 2015. Early experiences with node-level power capping on the Cray XC40 platform . In Proceedings of the Energy Efficient Supercomputing Workshop (E2SC\u201915) . 1--10. K. Pedretti, S. L. Olivier, K. B. Ferreira, G. Shipman, and W. Shu. 2015. Early experiences with node-level power capping on the Cray XC40 platform. In Proceedings of the Energy Efficient Supercomputing Workshop (E2SC\u201915). 1--10.","key":"e_1_2_1_33_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_34_1","DOI":"10.1007\/978-3-319-58667-0_22"},{"volume-title":"Proceedings of the International Green and Sustainable Computing Conference (IGSC\u201915)","author":"Roberts S. I.","unstructured":"S. I. Roberts , S. A. Wright , D. Lecomber , C. January , J. Byrd , X. Or\u00f3 , and S. A. Jarvis . 2015. POSE: A mathematical and visual modelling tool to guide energy aware code optimisation . In Proceedings of the International Green and Sustainable Computing Conference (IGSC\u201915) . S. I. Roberts, S. A. Wright, D. Lecomber, C. January, J. Byrd, X. Or\u00f3, and S. A. Jarvis. 2015. POSE: A mathematical and visual modelling tool to guide energy aware code optimisation. 
In Proceedings of the International Green and Sustainable Computing Conference (IGSC\u201915).","key":"e_1_2_1_35_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_36_1","DOI":"10.1007\/s10723-012-9219-2"},{"doi-asserted-by":"publisher","key":"e_1_2_1_37_1","DOI":"10.1145\/1964218.1964225"},{"doi-asserted-by":"publisher","key":"e_1_2_1_38_1","DOI":"10.1109\/L-CA.2012.32"},{"volume-title":"Proceedings of the International Symposium on Microarchitecture (MICRO\u201902)","author":"Srinivasan V.","unstructured":"V. Srinivasan , D. Brooks , M. Gschwind , P. Bose , V. Zyuban , P. N. Strenski , and P. G. Emma . 2002. Optimizing pipelines for power and performance . In Proceedings of the International Symposium on Microarchitecture (MICRO\u201902) . 333--344. V. Srinivasan, D. Brooks, M. Gschwind, P. Bose, V. Zyuban, P. N. Strenski, and P. G. Emma. 2002. Optimizing pipelines for power and performance. In Proceedings of the International Symposium on Microarchitecture (MICRO\u201902). 333--344.","key":"e_1_2_1_39_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_40_1","DOI":"10.1145\/1498765.1498785"},{"doi-asserted-by":"publisher","key":"e_1_2_1_41_1","DOI":"10.1109\/MC.2016.311"},{"doi-asserted-by":"publisher","key":"e_1_2_1_42_1","DOI":"10.1109\/MC.2011.96"}],"container-title":["ACM Transactions on Architecture and Code 
Optimization"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3321551","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3321551","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:54:38Z","timestamp":1750204478000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3321551"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,6,17]]},"references-count":42,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2019,9,30]]}},"alternative-id":["10.1145\/3321551"],"URL":"https:\/\/doi.org\/10.1145\/3321551","relation":{},"ISSN":["1544-3566","1544-3973"],"issn-type":[{"type":"print","value":"1544-3566"},{"type":"electronic","value":"1544-3973"}],"subject":[],"published":{"date-parts":[[2019,6,17]]},"assertion":[{"value":"2018-08-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-03-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-06-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}