{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,3,3]],"date-time":"2024-03-03T01:19:22Z","timestamp":1709428762338},"reference-count":21,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2011,8,31]],"date-time":"2011-08-31T00:00:00Z","timestamp":1314748800000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Comput Sci Res Dev"],"published-print":{"date-parts":[[2012,11]]},"DOI":"10.1007\/s00450-011-0191-z","type":"journal-article","created":{"date-parts":[[2011,8,30]],"date-time":"2011-08-30T13:26:27Z","timestamp":1314710787000},"page":"277-287","source":"Crossref","is-referenced-by-count":18,"title":["Profiling high performance dense linear algebra algorithms on multicore architectures for power and energy efficiency"],"prefix":"10.1007","volume":"27","author":[{"given":"Hatem","family":"Ltaief","sequence":"first","affiliation":[]},{"given":"Piotr","family":"Luszczek","sequence":"additional","affiliation":[]},{"given":"Jack","family":"Dongarra","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2011,8,31]]},"reference":[{"key":"191_CR1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/1654059.1654080","volume-title":"SC \u201909: proceedings of the conference on high performance computing networking, storage and analysis","author":"E Agullo","year":"2009","unstructured":"Agullo E, Hadri B, Ltaief H, Dongarra J (2009) Comparative study of one-sided factorizations with multiple software packages on multi-core hardware. In: SC \u201909: proceedings of the conference on high performance computing networking, storage and analysis. ACM, New York, pp 1\u201312. 
http:\/\/doi.acm.org\/10.1145\/1654059.1654080"},{"key":"191_CR2","doi-asserted-by":"crossref","DOI":"10.1137\/1.9780898719604","volume-title":"LAPACK user\u2019s guide","author":"E Anderson","year":"1999","unstructured":"Anderson E, Bai Z, Bischof C, Blackford SL, Demmel JW, Dongarra JJ, Croz JD, Greenbaum A, Hammarling S, McKenney A, Sorensen DC (1999) LAPACK user\u2019s guide, 3rd edn. Society for Industrial and Applied Mathematics, Philadelphia","edition":"3"},{"issue":"3\u20134","key":"191_CR3","first-page":"141","volume":"25","author":"H Anzt","year":"2010","unstructured":"Anzt H, Rocker B, Heuveline V (2010) Energy efficiency of mixed precision iterative refinement methods using hybrid hardware platforms\u2014an evaluation of different solver and hardware configurations. Comput Sci 25(3\u20134):141\u2013148. doi: 10.1007\/s00450-010-0124-2","journal-title":"Comput Sci"},{"issue":"3\u20134","key":"191_CR4","first-page":"187","volume":"25","author":"C Bekas","year":"2010","unstructured":"Bekas C, Curioni A (2010) A\u00a0new energy aware performance metric. Comput Sci 25(3\u20134):187\u2013195. doi: 10.1007\/s00450-010-0119-z","journal-title":"Comput Sci"},{"issue":"4","key":"191_CR5","doi-asserted-by":"crossref","first-page":"602","DOI":"10.1145\/365723.365736","volume":"26","author":"CH Bischof","year":"2000","unstructured":"Bischof CH, Lang B, Sun X (2000) Algorithm 807: the SBR toolbox\u2014software for successive band reduction. ACM Trans Math Softw 26(4):602\u2013616. http:\/\/doi.acm.org\/10.1145\/365723.365736","journal-title":"ACM Trans Math Softw"},{"issue":"4","key":"191_CR6","doi-asserted-by":"crossref","first-page":"457","DOI":"10.1177\/1094342007084026","volume":"21","author":"A Buttari","year":"2007","unstructured":"Buttari A, Dongarra J, Langou J, Langou J, Luszczek P, Kurzak J (2007) Mixed precision iterative refinement techniques for the solution of dense linear systems. Int J High Perform Comput Appl 21(4):457\u2013466. 
doi: 10.1177\/1094342007084026","journal-title":"Int J High Perform Comput Appl"},{"issue":"1","key":"191_CR7","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1016\/j.parco.2008.10.002","volume":"35","author":"A Buttari","year":"2009","unstructured":"Buttari A, Langou J, Kurzak J, Dongarra J (2009) A class of parallel tiled linear algebra algorithms for multicore architectures. Parallel Comput 35(1):38\u201353","journal-title":"Parallel Comput"},{"key":"191_CR8","volume-title":"IPDPS","author":"G Chen","year":"2005","unstructured":"Chen G, Malkowski K, Kandemir MT, Raghavan P (2005) Reducing power with performance constraints for parallel sparse applications. In: IPDPS. IEEE Comput Soc, Los Alamitos. http:\/\/doi.ieeecomputersociety.org\/10.1109\/IPDPS.2005.378"},{"key":"191_CR9","first-page":"1","volume-title":"IPDPS","author":"Y Ding","year":"2008","unstructured":"Ding Y, Malkowski K, Raghavan P, Kandemir MT (2008) Towards energy efficient scaling of scientific codes. In: IPDPS. IEEE Press, New York, pp 1\u20138. doi: 10.1109\/IPDPS.2008.4536217"},{"key":"191_CR10","series-title":"ACM SIGPLAN Notices","doi-asserted-by":"crossref","first-page":"164","DOI":"10.1145\/1065944.1065967","volume-title":"Proceedings of the ACM SIGPLAN symposium on principles and practice of parallel programming (10th PPOPP\u20192005)","author":"VW Freeh","year":"2005","unstructured":"Freeh VW, Lowenthal DK (2005) Using multiple energy gears in MPI programs on a power-scalable cluster. In: Pingali K, Yelick KA, Grimshaw AS (eds) Proceedings of the ACM SIGPLAN symposium on principles and practice of parallel programming (10th PPOPP\u20192005), Chicago, IL, USA. 
ACM SIGPLAN Notices, vol 40, pp 164\u2013173"},{"issue":"5","key":"191_CR11","doi-asserted-by":"crossref","first-page":"658","DOI":"10.1109\/TPDS.2009.76","volume":"PDS-21","author":"R Ge","year":"2010","unstructured":"Ge R, Feng X, Song S, Chang HC, Li D, Cameron KW (2010) Powerpack: Energy profiling and analysis of high-performance systems and applications. IEEE Trans Parallel Distrib Syst PDS-21(5):658\u2013671","journal-title":"IEEE Trans Parallel Distrib Syst"},{"key":"191_CR12","series-title":"Johns Hopkins studies in the mathematical sciences","volume-title":"Matrix computations","author":"GH Golub","year":"1996","unstructured":"Golub GH, Van Loan CF (1996) Matrix computations, 3rd edn. Johns Hopkins studies in the mathematical sciences. Johns Hopkins University Press, Baltimore","edition":"3"},{"key":"191_CR13","doi-asserted-by":"crossref","first-page":"563","DOI":"10.1007\/s10543-008-0180-1","volume":"48","author":"B K\u00e5gstr\u00f6m","year":"2008","unstructured":"K\u00e5gstr\u00f6m B, Kressner D, Quintana-Ort\u00ed E, Quintana-Ort\u00ed G (2008) Blocked algorithms for the reduction to Hessenberg-triangular form revisited. BIT Numer Math 48:563\u2013584","journal-title":"BIT Numer Math"},{"key":"191_CR14","first-page":"33","volume-title":"SC","author":"N Kappiah","year":"2005","unstructured":"Kappiah N, Freeh VW, Lowenthal DK (2005) Just in time dynamic voltage scaling: exploiting inter-node slack to save energy in MPI programs. In: SC. IEEE Comput Soc, Los Alamitos, p\u00a033. http:\/\/doi.acm.org\/10.1145\/1105760.1105797"},{"key":"191_CR15","unstructured":"Kogge P, Bergman K, Borkar S, Campbell D, Carlson W, Dally W, Denneau M, Franzon P, Harrod W, Hill K, Hiller J, Karp S, Keckler S, Klein D, Lucas R, Richards M, Scarpelli A, Scott S, Snavely A, Sterling T, Williams RS, Yelick K (2008) Exascale computing study: technology challenges in achieving exascale systems. Tech Rep TR-2008-13, Department of Computer Science and Engineering. 
University of Notre Dame"},{"key":"191_CR16","unstructured":"Ltaief H, Luszczek P, Dongarra J (2011, submitted) High performance bidiagonal reduction using tile algorithms on homogeneous multicore architectures. ACM Trans Math Softw"},{"key":"191_CR17","volume-title":"Proceedings of IPDPS 2011","author":"P Luszczek","year":"2011","unstructured":"Luszczek P, Ltaief H, Dongarra J (2011) Two-stage tridiagonal reduction for dense symmetric matrices using tile algorithms on multicore architectures. In: Proceedings of IPDPS 2011. ACM, Anchorage"},{"key":"191_CR18","unstructured":"Multicore application modeling infrastructure (MuMI) project. http:\/\/www.mumi-tool.org"},{"key":"191_CR19","unstructured":"Sutter H (2005) The free lunch is over: a\u00a0fundamental turn toward concurrency in software. Dr Dobb\u2019s Journal 30(3). http:\/\/www.ddj.com\/184405990"},{"key":"191_CR20","doi-asserted-by":"crossref","DOI":"10.1137\/1.9780898719574","volume-title":"Numerical linear algebra","author":"LN Trefethen","year":"1997","unstructured":"Trefethen LN, Bau D (1997) Numerical linear algebra. SIAM, Philadelphia. http:\/\/www.siam.org\/books\/OT50\/Index.htm"},{"key":"191_CR21","unstructured":"University of Tennessee Knoxville (2010) PLASMA users\u2019 guide, parallel linear algebra software for multicore architectures, version 2.3. 
Available electronically at http:\/\/icl.cs.utk.edu\/projectsfiles\/plasma\/pdf\/users_guide.pdf"}],"container-title":["Computer Science - Research and Development"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s00450-011-0191-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s00450-011-0191-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s00450-011-0191-z","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,5,29]],"date-time":"2019-05-29T13:32:48Z","timestamp":1559136768000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s00450-011-0191-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,8,31]]},"references-count":21,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2012,11]]}},"alternative-id":["191"],"URL":"https:\/\/doi.org\/10.1007\/s00450-011-0191-z","relation":{},"ISSN":["1865-2034","1865-2042"],"issn-type":[{"value":"1865-2034","type":"print"},{"value":"1865-2042","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,8,31]]}}}