{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,1,14]],"date-time":"2023-01-14T22:26:17Z","timestamp":1673735177599},"reference-count":50,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2011,10,6]],"date-time":"2011-10-06T00:00:00Z","timestamp":1317859200000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J Supercomput"],"published-print":{"date-parts":[[2013,3]]},"DOI":"10.1007\/s11227-011-0699-9","type":"journal-article","created":{"date-parts":[[2011,10,5]],"date-time":"2011-10-05T13:56:02Z","timestamp":1317822962000},"page":"691-709","source":"Crossref","is-referenced-by-count":11,"title":["Designing energy efficient communication runtime systems: a view from PGAS models"],"prefix":"10.1007","volume":"63","author":[{"given":"Abhinav","family":"Vishnu","sequence":"first","affiliation":[]},{"given":"Shuaiwen","family":"Song","sequence":"additional","affiliation":[]},{"given":"Andres","family":"Marquez","sequence":"additional","affiliation":[]},{"given":"Kevin","family":"Barker","sequence":"additional","affiliation":[]},{"given":"Darren","family":"Kerbyson","sequence":"additional","affiliation":[]},{"given":"Kirk","family":"Cameron","sequence":"additional","affiliation":[]},{"given":"Pavan","family":"Balaji","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2011,10,6]]},"reference":[{"key":"699_CR1","unstructured":"Crosscutting Technologies for Computing at the Exascale. http:\/\/extremecomputing.labworks.org (2010)"},{"issue":"6","key":"699_CR2","doi-asserted-by":"crossref","first-page":"789","DOI":"10.1016\/0167-8191(96)00024-5","volume":"22","author":"W Gropp","year":"1996","unstructured":"Gropp W, Lusk E, Doss N, Skjellum A (1996) A high-performance, portable implementation of the MPI message passing interface standard. Parallel Comput 22(6):789\u2013828","journal-title":"Parallel Comput"},{"key":"699_CR3","first-page":"128","volume-title":"Euro-Par","author":"A Geist","year":"1996","unstructured":"Geist A, Gropp W, Huss-Lederman S, Lumsdaine A, Lusk EL, Saphir W, Skjellum T, Snir M (1996) MPI-2: Extending the message-passing interface. In: Euro-Par, vol\u00a0I, pp 128\u2013135"},{"key":"699_CR4","first-page":"63","volume-title":"International conference on supercomputing","author":"P Husbands","year":"2003","unstructured":"Husbands P, Iancu C, Yelick KA (2003) A performance analysis of the Berkeley UPC compiler. In: International conference on supercomputing, pp 63\u201373"},{"issue":"2","key":"699_CR5","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1007\/BF00130708","volume":"10","author":"J Nieplocha","year":"1996","unstructured":"Nieplocha J, Harrison RJ, Littlefield RJ (1996) Global arrays: a nonuniform memory access programming model for high-performance computers. J Supercomput 10(2):169\u2013189","journal-title":"J Supercomput"},{"key":"699_CR6","doi-asserted-by":"crossref","first-page":"519","DOI":"10.1145\/1094811.1094852","volume-title":"OOPSLA \u201905: Proceedings of the 20th annual ACM SIGPLAN conference on object-oriented programming, systems, languages, and applications","author":"P Charles","year":"2005","unstructured":"Charles P, Grothoff C, Saraswat V, Donawa C, Kielstra A, Ebcioglu K, von Praun C, Sarkar V (2005) X10: an object-oriented approach to non-uniform cluster computing. In: OOPSLA \u201905: Proceedings of the 20th annual ACM SIGPLAN conference on object-oriented programming, systems, languages, and applications. ACM, New York, pp\u00a0519\u2013538"},{"issue":"3","key":"699_CR7","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1177\/1094342007078442","volume":"21","author":"BL Chamberlain","year":"2007","unstructured":"Chamberlain BL, Callahan D, Zima HP (2007) Parallel programmability and the Chapel language. Int J High Perform Comput Appl 21(3):291\u2013312","journal-title":"Int J High Perform Comput Appl"},{"key":"699_CR8","unstructured":"InfiniBand Trade Association (2004) InfiniBand Architecture Specification, Release 1.2, October 2004"},{"key":"699_CR9","doi-asserted-by":"crossref","first-page":"116","DOI":"10.1145\/1188455.1188576","volume-title":"SC \u201906: Proceedings of the 2006 ACM\/IEEE conference on supercomputing","author":"H Yu","year":"2006","unstructured":"Yu H, Chung I-H, Moreira J (2006) Blue gene system software\u2014topology mapping for blue gene\/l supercomputer. In: SC \u201906: Proceedings of the 2006 ACM\/IEEE conference on supercomputing. ACM, New York, p 116"},{"issue":"1","key":"699_CR10","doi-asserted-by":"crossref","first-page":"46","DOI":"10.1109\/40.988689","volume":"22","author":"F Petrini","year":"2002","unstructured":"Petrini F, Feng W, Hoisie A, Coll S, Frachtenberg E (2002) The quadrics network: high-performance clustering technology. IEEE MICRO 22(1):46\u201357","journal-title":"IEEE MICRO"},{"key":"699_CR11","volume-title":"Aggregate remote memory copy interface","author":"M Krishnan","year":"2010","unstructured":"Krishnan M, Vishnu A, Palmer B (2010) Aggregate remote memory copy interface"},{"key":"699_CR12","first-page":"260","volume-title":"IPPS\/SPDP","author":"G Shah","year":"1998","unstructured":"Shah G, Nieplocha J, Mirza JH, Kim C, Harrison RJ, Govindaraju R, Gildea KJ, DiNicola P, Bender CA (1998) Performance and experience with LAPI\u2014a new high-performance communication library for the IBM RS\/6000 SP. In: IPPS\/SPDP, pp 260\u2013266"},{"key":"699_CR13","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1145\/1375527.1375544","volume-title":"ICS \u201908: Proceedings of the 22nd annual international conference on supercomputing","author":"S Kumar","year":"2008","unstructured":"Kumar S, Dozsa G, Almasi G, Heidelberger P, Chen D, Giampapa ME, Blocksome M, Faraj A, Parker J, Ratterman J, Smith B, Archer CJ (2008) The deep computing messaging framework: generalized scalable message passing on the Blue Gene\/P supercomputer. In: ICS \u201908: Proceedings of the 22nd annual international conference on supercomputing, pp 94\u2013103"},{"issue":"1","key":"699_CR14","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1109\/40.342015","volume":"15","author":"NJ Boden","year":"1995","unstructured":"Boden NJ, Cohen D, Felderman RE, Kulawik AE, Seitz CL, Seizovic JN, Su W (1995) Myrinet: a gigabit-per-second local area network. IEEE MICRO 15(1):29\u201336","journal-title":"IEEE MICRO"},{"key":"699_CR15","volume-title":"Proceedings of third international workshop on system management techniques, processes, and services, held in conjunction with IPDPS\u201907","author":"A Vishnu","year":"2007","unstructured":"Vishnu A, Mamidala A, Narravula S, Panda DK (2007) Automatic path migration over InfiniBand: early experiences. In: Proceedings of third international workshop on system management techniques, processes, and services, held in conjunction with IPDPS\u201907, March 2007"},{"key":"699_CR16","volume-title":"Proceedings of first international workshop on system management techniques, processes, and services, held in conjunction with IPDPS\u201907","author":"A Vishnu","year":"2005","unstructured":"Vishnu A, Mamidala AR, Jin H-W, Panda DK (2005) Performance modeling of subnet management on fat tree InfiniBand networks using OpenSM. In: Proceedings of first international workshop on system management techniques, processes, and services, held in conjunction with IPDPS\u201907"},{"key":"699_CR17","first-page":"583","volume-title":"CCGRID","author":"S Narravula","year":"2007","unstructured":"Narravula S, Marnidala A, Vishnu A, Vaidyanathan K, Panda DK (2007) High performance distributed lock management services using network-based remote atomic operations. In: CCGRID, pp 583\u2013590"},{"key":"699_CR18","volume-title":"International conference on parallel processing","author":"S Narravula","year":"2007","unstructured":"Narravula S, Mamidala A, Vishnu A, Santhanaraman G, Panda DK (2007) High performance MPI over iWARP: early experiences. In: International conference on parallel processing"},{"issue":"1\/2","key":"699_CR19","first-page":"199","volume":"52","author":"IBM BlueGene Team","year":"2008","unstructured":"IBM BlueGene Team (2008) Overview of the IBM Blue Gene\/P project. IBM J Res Dev 52(1\/2):199\u2013220","journal-title":"IBM J Res Dev"},{"key":"699_CR20","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1109\/CLUSTR.2002.1137753","volume-title":"IEEE international conference on cluster computing","author":"W Feng","year":"2002","unstructured":"Feng W, Warren M, Weigle E (2002) The bladed beowulf: A cost-effective alternative to traditional beowulfs. In: IEEE international conference on cluster computing, p 245"},{"issue":"11","key":"699_CR21","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1109\/MC.2005.380","volume":"38","author":"KW Cameron","year":"2005","unstructured":"Cameron KW, Ge R, Feng X (2005) High-performance, power-aware distributed computing for scientific applications. Computer 38(11):40\u201347","journal-title":"Computer"},{"key":"699_CR22","first-page":"1","volume-title":"SC \u201907: Proceedings of the ACM\/IEEE conference on supercomputing","author":"B Rountree","year":"2007","unstructured":"Rountree B, Lowenthal DK, Funk S, Freeh VW, de Supinski BR, Schulz M (2007) Bounding energy consumption in large-scale mpi programs. In: SC \u201907: Proceedings of the ACM\/IEEE conference on supercomputing. ACM, New York, pp\u00a01\u20139"},{"key":"699_CR23","doi-asserted-by":"crossref","first-page":"34","DOI":"10.1109\/IPDPS.2005.346","volume-title":"IPDPS \u201905: Proceedings of the 19th IEEE international parallel and distributed processing symposium (IPDPS\u201905)\u2014papers","author":"X Feng","year":"2005","unstructured":"Feng X, Ge R, Cameron KW (2005) Power and energy profiling of scientific applications on distributed systems. In: IPDPS \u201905: Proceedings of the 19th IEEE international parallel and distributed processing symposium (IPDPS\u201905)\u2014papers. IEEE Computer Society, Washington, p\u00a034"},{"key":"699_CR24","doi-asserted-by":"crossref","first-page":"275","DOI":"10.1145\/383082.383165","volume-title":"ISLPED \u201901: Proceedings of the international symposium on low power electronics and design","author":"C-H Hsu","year":"2001","unstructured":"Hsu C-H, Kremer U, Hsiao M (2001) Compiler-directed dynamic voltage\/frequency scheduling for energy reduction in microprocessors. In: ISLPED \u201901: Proceedings of the international symposium on low power electronics and design. ACM, New York, pp\u00a0275\u2013278"},{"key":"699_CR25","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1145\/344166.344181","volume-title":"ISLPED \u201900: Proceedings of the 2000 international symposium on low power electronics and design","author":"TD Burd","year":"2000","unstructured":"Burd TD, Brodersen RW (2000) Design issues for dynamic voltage scaling. In: ISLPED \u201900: Proceedings of the 2000 international symposium on low power electronics and design. ACM, New York, pp\u00a09\u201314"},{"issue":"2","key":"699_CR26","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1145\/335043.335044","volume":"5","author":"L Benini","year":"2000","unstructured":"Benini L, de Micheli G (2000) System-level power optimization: techniques and tools. ACM Trans Des Autom Electron Syst 5(2):115\u2013192","journal-title":"ACM Trans Des Autom Electron Syst"},{"key":"699_CR27","first-page":"479","volume-title":"Cluster computing and grid","author":"A Vishnu","year":"2007","unstructured":"Vishnu A, Koop MJ, Moody A, Mamidala AR, Narravula S, Panda DK (2007) Hot-spot avoidance with multi-pathing over InfiniBand: an MPI perspective. In: Cluster computing and grid, pp 479\u2013486"},{"key":"699_CR28","unstructured":"LBNL (2003) Data Center Energy Benchmarking Case Study: Data Center Facility 5"},{"key":"699_CR29","unstructured":"IBM (2007) PowerExecutive"},{"key":"699_CR30","volume-title":"Energy 2002 workshop and exposition","author":"AM Bailey","year":"2002","unstructured":"Bailey AM (2002) Accelerated strategic computing initiative (asci): Driving the need for the terascale simulation facility (tsf). In: Energy 2002 workshop and exposition. IEEE Computer Society, Los Alamitos"},{"key":"699_CR31","doi-asserted-by":"crossref","unstructured":"Ye W, Vijaykrishnan N, Kandemir M, Irwin MJ (2000) The design and use of simple power: A cycle-accurate energy estimation tool, pp 340\u2013345","DOI":"10.1145\/337292.337436"},{"key":"699_CR32","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1145\/339647.339657","volume-title":"ISCA \u201900: proceedings of the 27th annual international symposium on computer architecture","author":"D Brooks","year":"2000","unstructured":"Brooks D, Tiwari V, Martonosi M (2000) Wattch: a framework for architectural-level power analysis and optimizations. In: ISCA \u201900: proceedings of the 27th annual international symposium on computer architecture. ACM, New York, pp\u00a083\u201394"},{"key":"699_CR33","first-page":"217","volume-title":"FAST \u201903: proceedings of the 2nd USENIX conference on file and storage technologies","author":"J Zedlewski","year":"2003","unstructured":"Zedlewski J, Sobti S, Garg N, Zheng F, Krishnamurthy A, Wang R (2003) Modeling hard-disk power consumption. In: FAST \u201903: proceedings of the 2nd USENIX conference on file and storage technologies. USENIX Association, Berkeley, pp\u00a0217\u2013230"},{"key":"699_CR34","doi-asserted-by":"crossref","first-page":"130","DOI":"10.1145\/236387.236423","volume-title":"MobiCom \u201996: Proceedings of the 2nd annual international conference on mobile computing and networking","author":"DP Helmbold","year":"1996","unstructured":"Helmbold DP, Long DDE, Sherrod B (1996) A dynamic disk spin-down technique for mobile computing. In: MobiCom \u201996: Proceedings of the 2nd annual international conference on mobile computing and networking. ACM, New York, pp\u00a0130\u2013142"},{"key":"699_CR35","first-page":"121","volume-title":"MLICS \u201995: Proceedings of the 2nd symposium on mobile and location-independent computing","author":"F Douglis","year":"1995","unstructured":"Douglis F, Krishnan P, Bershad BN (1995) Adaptive disk spin-down policies for mobile computers. In: MLICS \u201995: Proceedings of the 2nd symposium on mobile and location-independent computing. USENIX Association, Berkeley, pp\u00a0121\u2013137"},{"key":"699_CR36","first-page":"658","volume":"99","author":"R Ge","year":"2009","unstructured":"Ge R, Feng X, Song S, Chang H-C, Li D, Cameron KW (2009) Powerpack: Energy profiling and analysis of high-performance systems and applications. IEEE Trans Parallel Distrib Syst 99:658\u2013671","journal-title":"IEEE Trans Parallel Distrib Syst"},{"key":"699_CR37","first-page":"5","volume-title":"ATEC \u201905: Proceedings of the annual conference on USENIX annual technical conference","author":"J Moore","year":"2005","unstructured":"Moore J, Chase J, Ranganathan P, Sharma R (2005) Making scheduling \u201ccool\u201d: temperature-aware workload placement in data centers. In: ATEC \u201905: Proceedings of the annual conference on USENIX annual technical conference, USENIX Association, Berkeley, p\u00a05"},{"key":"699_CR38","unstructured":"Xinping H-SW, Wang HS, Zhu X, Peh LS, Malik S (2002) Orion: A power-performance simulator for interconnection networks, pp 294\u2013305"},{"key":"699_CR39","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1145\/1188455.1188677","volume-title":"SC \u201906: Proceedings of the 2006 ACM\/IEEE conference on supercomputing","author":"PR Luszczek","year":"2006","unstructured":"Luszczek PR, Bailey DH, Dongarra JJ, Kepner J, Lucas RF, Rabenseifner R, Takahashi D (2006) The hpc challenge (hpcc) benchmark suite. In: SC \u201906: Proceedings of the 2006 ACM\/IEEE conference on supercomputing. ACM, New York, p\u00a0213"},{"issue":"3","key":"699_CR40","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1177\/1094342009106193","volume":"23","author":"S Song","year":"2009","unstructured":"Song S, Ge R, Feng X, Cameron KW (2009) Energy profiling and analysis of the hpc challenge benchmarks. Int J High Perform Comput Appl 23(3):265\u2013276","journal-title":"Int J High Perform Comput Appl"},{"key":"699_CR41","doi-asserted-by":"crossref","unstructured":"Kandalla SSK, Mancini EP, Panda DK (2010) Designing power-aware collective communication algorithms for InfiniBand clusters. Technical Report, June 2010","DOI":"10.1109\/ICPP.2010.78"},{"issue":"6","key":"699_CR42","doi-asserted-by":"crossref","first-page":"835","DOI":"10.1109\/TPDS.2007.1026","volume":"18","author":"VW Freeh","year":"2007","unstructured":"Freeh VW, Lowenthal DK, Pan F, Kappiah N, Springer R, Rountree BL, Femal ME (2007) Analyzing the energy-time trade-off in high-performance computing applications. IEEE Trans Parallel Distrib Syst 18(6):835\u2013848","journal-title":"IEEE Trans Parallel Distrib Syst"},{"key":"699_CR43","first-page":"4.1","volume-title":"IPDPS \u201905: Proceedings of the 19th IEEE international parallel and distributed processing symposium (IPDPS\u201905)\u2014papers","author":"VW Freeh","year":"2005","unstructured":"Freeh VW, Pan F, Kappiah N, Lowenthal DK, Springer R (2005) Exploring the energy-time tradeoff in mpi programs on a power-scalable cluster. In: IPDPS \u201905: Proceedings of the 19th IEEE international parallel and distributed processing symposium (IPDPS\u201905)\u2014papers. IEEE Computer Society, Washington, DC, p\u00a04.1"},{"key":"699_CR44","doi-asserted-by":"crossref","first-page":"250","DOI":"10.1145\/1454115.1454151","volume-title":"PACT \u201908: Proceedings of the 17th international conference on parallel architectures and compilation techniques","author":"M Curtis-Maury","year":"2008","unstructured":"Curtis-Maury M, Shah A, Blagojevic F, Nikolopoulos DS, de Supinski BR, Schulz M (2008) Prediction models for multi-dimensional power-performance optimization on many cores. In: PACT \u201908: Proceedings of the 17th international conference on parallel architectures and compilation techniques. ACM, New York, pp\u00a0250\u2013259"},{"key":"699_CR45","unstructured":"NAS (2010) NAS Parallel Benchmark"},{"key":"699_CR46","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1109\/ICPP.2007.29","volume-title":"ICPP \u201907: Proceedings of the 2007 international conference on parallel processing","author":"R Ge","year":"2007","unstructured":"Ge R, Feng X, Feng W-C, Cameron KW (2007) Cpu miser: A performance-directed, run-time system for power-aware clusters. In: ICPP \u201907: Proceedings of the 2007 international conference on parallel processing. IEEE Computer Society, Washington, DC, p\u00a018"},{"key":"699_CR47","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1109\/CLUSTR.2007.4629224","volume-title":"CLUSTER \u201907: Proceedings of the 2007 IEEE international conference on cluster computing","author":"R Zamani","year":"2007","unstructured":"Zamani R, Afsahi A, Qian Y, Hamacher C (2007) A feasibility analysis of power-awareness and energy minimization in modern interconnects for high-performance computing. In: CLUSTER \u201907: Proceedings of the 2007 IEEE international conference on cluster computing. IEEE Computer Society, Washington, DC, pp\u00a0118\u2013128"},{"key":"699_CR48","doi-asserted-by":"crossref","first-page":"326","DOI":"10.1145\/1542275.1542322","volume-title":"ICS \u201909: Proceedings of the 23rd international conference on supercomputing","author":"J Liu","year":"2009","unstructured":"Liu J, Poff D, Abali B (2009) Evaluating high performance communication: a power perspective. In: ICS \u201909: Proceedings of the 23rd international conference on supercomputing. ACM, New York, pp\u00a0326\u2013337"},{"issue":"1\u20132","key":"699_CR49","doi-asserted-by":"crossref","first-page":"260","DOI":"10.1016\/S0010-4655(00)00065-5","volume":"128","author":"RA Kendall","year":"2000","unstructured":"Kendall RA, Apr\u00e0 E, Bernholdt DE, Bylaska EJ, Dupuis M, Fann GI, Harrison RJ, Ju J, Nichols JA, Nieplocha J, Straatsma TP, Windus TL, Wong AT (2000) High performance computational chemistry: an overview of NWChem, a distributed parallel application. Comput Phys Commun 128(1\u20132):260\u2013283","journal-title":"Comput Phys Commun"},{"key":"699_CR50","unstructured":"Subsurface Transport over Multiple Phases. STOMP. http:\/\/stomp.pnl.gov\/"}],"container-title":["The Journal of Supercomputing"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-011-0699-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s11227-011-0699-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-011-0699-9","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,12,9]],"date-time":"2021-12-09T02:13:30Z","timestamp":1639016010000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s11227-011-0699-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,10,6]]},"references-count":50,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2013,3]]}},"alternative-id":["699"],"URL":"https:\/\/doi.org\/10.1007\/s11227-011-0699-9","relation":{},"ISSN":["0920-8542","1573-0484"],"issn-type":[{"value":"0920-8542","type":"print"},{"value":"1573-0484","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,10,6]]}}}