{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,8]],"date-time":"2026-01-08T11:38:35Z","timestamp":1767872315436,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":33,"publisher":"ACM","license":[{"start":{"date-parts":[[2011,11,12]],"date-time":"2011-11-12T00:00:00Z","timestamp":1321056000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000105","name":"Office of Cyberinfrastructure","doi-asserted-by":"publisher","award":["OCI 07-25070"],"award-info":[{"award-number":["OCI 07-25070"]}],"id":[{"id":"10.13039\/100000105","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2011,11,12]]},"DOI":"10.1145\/2063348.2063356","type":"proceedings-article","created":{"date-parts":[[2011,11,16]],"date-time":"2011-11-16T10:40:21Z","timestamp":1321440021000},"page":"1-12","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":63,"title":["Performance modeling for systematic performance tuning"],"prefix":"10.1145","author":[{"given":"Torsten","family":"Hoefler","sequence":"first","affiliation":[{"name":"University of Illinois at Urbana-Champaign, Urbana, IL"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"William","family":"Gropp","sequence":"additional","affiliation":[{"name":"University of Illinois at Urbana-Champaign, Urbana, IL"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"William","family":"Kramer","sequence":"additional","affiliation":[{"name":"University of Illinois at Urbana-Champaign, Urbana, IL"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Marc","family":"Snir","sequence":"additional","affiliation":[{"name":"University of Illinois at Urbana-Champaign, Urbana, IL"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2011,11,12]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1054907.1054910"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1188455.1188618"},{"key":"e_1_3_2_1_3_1","volume-title":"A preliminary investigation of emulating applications that use petabytes of memory on petascale machines. Master's thesis","author":"Mei Chao","year":"2007","unstructured":"Chao Mei . A preliminary investigation of emulating applications that use petabytes of memory on petascale machines. Master's thesis , University of Illinois at Urbana-Champaign , 2007 . Chao Mei. A preliminary investigation of emulating applications that use petabytes of memory on petascale machines. Master's thesis, University of Illinois at Urbana-Champaign, 2007."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPADS.2010.98"},{"key":"e_1_3_2_1_5_1","volume-title":"Workshop on Grid Applications and Programming Tools (GGF '03)","author":"Badia Rosa M","year":"2003","unstructured":"Rosa M Badia , Jess Labarta , and Judit Gimenez . Dimemas : Predicting mpi applications behavior in grid environments . In Workshop on Grid Applications and Programming Tools (GGF '03) , 2003 . Rosa M Badia, Jess Labarta, and Judit Gimenez. Dimemas: Predicting mpi applications behavior in grid environments. In Workshop on Grid Applications and Programming Tools (GGF '03), 2003."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-03869-3_16"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1851476.1851564"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2010.12"},{"key":"e_1_3_2_1_9_1","volume-title":"Parallel scaling of Teter's minimization for Ab Initio calculations. 11","author":"Hoefler T.","year":"2006","unstructured":"T. Hoefler , R. Janisch , and W. Rehm . Parallel scaling of Teter's minimization for Ab Initio calculations. 11 2006 . HPC Nano'06 in conjunction with the International Conference on High Performance Computing, Networking, Storage and Analysis, SC 06. T. Hoefler, R. Janisch, and W. Rehm. Parallel scaling of Teter's minimization for Ab Initio calculations. 11 2006. HPC Nano'06 in conjunction with the International Conference on High Performance Computing, Networking, Storage and Analysis, SC06."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/582034.582071"},{"key":"e_1_3_2_1_11_1","volume-title":"A general performance model for parallel sweeps on orthogonal grids for particle transport calculations. Technical report","author":"Mathis Mark M.","year":"2000","unstructured":"Mark M. Mathis , Nancy M. Amato , and Marvin L. Adams . A general performance model for parallel sweeps on orthogonal grids for particle transport calculations. Technical report , College Station, TX, USA , 2000 . Mark M. Mathis, Nancy M. Amato, and Marvin L. Adams. A general performance model for parallel sweeps on orthogonal grids for particle transport calculations. Technical report, College Station, TX, USA, 2000."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.5555\/370049.370424"},{"key":"e_1_3_2_1_13_1","volume-title":"Parallel Processing Symposium, International, 0: 249","author":"Gheith","year":"1996","unstructured":"Gheith A. Abandah and Edward S. Davidson. Modeling the Communication Performance of the IBM SP2 . Parallel Processing Symposium, International, 0: 249 , 1996 . Gheith A. Abandah and Edward S. Davidson. Modeling the Communication Performance of the IBM SP2. Parallel Processing Symposium, International, 0:249, 1996."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1498765.1498785"},{"key":"e_1_3_2_1_16_1","volume-title":"http:\/\/www.ncsa.illinois.edu\/BlueWaters\/","author":"Sustained Petascale Computing Blue Waters","year":"2011","unstructured":"Blue Waters Sustained Petascale Computing , Project Office . http:\/\/www.ncsa.illinois.edu\/BlueWaters\/ , 2011 . accessed June 2011. Blue Waters Sustained Petascale Computing, Project Office. http:\/\/www.ncsa.illinois.edu\/BlueWaters\/, 2011. accessed June 2011."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1016\/0010-4655(96)00009-4"},{"key":"e_1_3_2_1_18_1","first-page":"659","volume-title":"Third International Conference, HPCC 2007, Houston, USA, September 26-28, 2007, Proceedings","volume":"4782","author":"Hoefler T.","year":"2007","unstructured":"T. Hoefler , T. Mehlan , A. Lumsdaine , and W. Rehm . Netgauge: A Network Performance Measurement Framework. In High Performance Computing and Communications , Third International Conference, HPCC 2007, Houston, USA, September 26-28, 2007, Proceedings , volume 4782 , pages 659 -- 671 . Springer, 9 2007 . T. Hoefler, T. Mehlan, A. Lumsdaine, and W. Rehm. Netgauge: A Network Performance Measurement Framework. In High Performance Computing and Communications, Third International Conference, HPCC 2007, Houston, USA, September 26-28, 2007, Proceedings, volume 4782, pages 659--671. Springer, 9 2007."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2007.370593"},{"key":"e_1_3_2_1_20_1","volume-title":"Pallas MPI Benchmarks - PMB, Part MPI-1. Technical report","author":"Pallas","year":"2000","unstructured":"Pallas GmbH. Pallas MPI Benchmarks - PMB, Part MPI-1. Technical report , 2000 . Pallas GmbH. Pallas MPI Benchmarks - PMB, Part MPI-1. Technical report, 2000."},{"key":"e_1_3_2_1_21_1","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1007\/978-3-540-39924-7_10","volume-title":"Recent Advances in Parallel Virtual Machine and Message Passing Interface, 10th European PVM\/MPI Users' Group Meeting, Venice, Italy, September 29 -","author":"Turner Dave","year":"2003","unstructured":"Dave Turner , Adam Oline , Xuehua Chen , and Troy Benjegerdes . Integrating new capabilities into netpipe . In Jack Dongarra, Domenico Laforenza, and Salvatore Orlando, editors, Recent Advances in Parallel Virtual Machine and Message Passing Interface, 10th European PVM\/MPI Users' Group Meeting, Venice, Italy, September 29 - October 2, 2003 , Proceedings, volume 2840 of Lecture Notes in Computer Science , pages 37 -- 44 . Springer , 2003. Dave Turner, Adam Oline, Xuehua Chen, and Troy Benjegerdes. Integrating new capabilities into netpipe. In Jack Dongarra, Domenico Laforenza, and Salvatore Orlando, editors, Recent Advances in Parallel Virtual Machine and Message Passing Interface, 10th European PVM\/MPI Users' Group Meeting, Venice, Italy, September 29 - October 2, 2003, Proceedings, volume 2840 of Lecture Notes in Computer Science, pages 37--44. Springer, 2003."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1155\/2002\/202839"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/155332.155333"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLUSTR.2008.4663762"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/HOTI.2010.16"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1007\/978-3-642-15646-5_3","volume-title":"Recent Advances in the Message Passing Interface (EuroMPI'10)","author":"Hoefler T.","year":"2010","unstructured":"T. Hoefler , W. Gropp , R. Thakur , and J. L. Traeff . Toward Performance Models of MPI Implementations for Understanding Application Scaling Issues . In Recent Advances in the Message Passing Interface (EuroMPI'10) , volume LNCS 6305 , pages 21 -- 30 . Springer , Sep. 2010 . T. Hoefler, W. Gropp, R. Thakur, and J. L. Traeff. Toward Performance Models of MPI Implementations for Understanding Application Scaling Issues. In Recent Advances in the Message Passing Interface (EuroMPI'10), volume LNCS 6305, pages 21--30. Springer, Sep. 2010."},{"key":"e_1_3_2_1_27_1","unstructured":"Greg Bauer Steven Gottlieb and Torsten Hoefler. Performance Modeling and Comparative Analysis of the MILC Lattice QCD Application su3_rmd. to appear.  Greg Bauer Steven Gottlieb and Torsten Hoefler. Performance Modeling and Comparative Analysis of the MILC Lattice QCD Application su3_rmd. to appear."},{"key":"e_1_3_2_1_28_1","unstructured":"Steven Gottlieb. Personal communication about MILC code structure and main functions.  Steven Gottlieb. Personal communication about MILC code structure and main functions."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1177\/109434209100500406"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/63404.63407"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF01379320"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"crossref","first-page":"132","DOI":"10.1007\/978-3-642-15646-5_14","volume-title":"Recent Advances in the Message Passing Interface (EuroMPI'10)","author":"Hoefler T.","year":"2010","unstructured":"T. Hoefler and S. Gottlieb . Parallel Zero-Copy Algorithms for Fast Fourier Transform and Conjugate Gradient using MPI Datatypes . In Recent Advances in the Message Passing Interface (EuroMPI'10) , volume LNCS 6305 , pages 132 -- 141 . Springer , Sep. 2010 . T. Hoefler and S. Gottlieb. Parallel Zero-Copy Algorithms for Fast Fourier Transform and Conjugate Gradient using MPI Datatypes. In Recent Advances in the Message Passing Interface (EuroMPI'10), volume LNCS 6305, pages 132--141. Springer, Sep. 2010."},{"key":"e_1_3_2_1_33_1","volume-title":"Proceedings of Workshop on Productivity and Performance (PROPER 2010","author":"Hoefler T.","year":"2010","unstructured":"T. Hoefler . Bridging Performance Analysis Tools and Analytic Performance Modeling for HPC . In Proceedings of Workshop on Productivity and Performance (PROPER 2010 ). Springer , Dec. 2010 . T. Hoefler. Bridging Performance Analysis Tools and Analytic Performance Modeling for HPC. In Proceedings of Workshop on Productivity and Performance (PROPER 2010). Springer, Dec. 2010."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/800076.802486"}],"event":{"name":"SC '11: International Conference for High Performance Computing, Networking, Storage and Analysis","location":"Seattle Washington","acronym":"SC '11","sponsor":["SIGARCH ACM Special Interest Group on Computer Architecture","IEEE-CS Computer Society"]},"container-title":["State of the Practice Reports"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2063348.2063356","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2063348.2063356","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T09:54:25Z","timestamp":1750240465000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2063348.2063356"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,11,12]]},"references-count":33,"alternative-id":["10.1145\/2063348.2063356","10.1145\/2063348"],"URL":"https:\/\/doi.org\/10.1145\/2063348.2063356","relation":{},"subject":[],"published":{"date-parts":[[2011,11,12]]},"assertion":[{"value":"2011-11-12","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}