{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,30]],"date-time":"2025-10-30T06:53:43Z","timestamp":1761807223207},"reference-count":11,"publisher":"Association for Computing Machinery (ACM)","issue":"5","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGARCH Comput. Archit. News"],"published-print":{"date-parts":[[2005,12]]},"abstract":"<jats:p>Dynamic binary instrumentation for performance analysis on new, large scale architectures such as the IBM Blue Gene\/L system (BG\/L) poses new challenges. Their scale---with potentially hundreds of thousands of compute nodes---requires new, more scalable mechanisms to deploy and to organize binary instrumentation and to collect the resulting data gathered by the inserted probes. Further, many of these new machines don't support full operating systems on the compute nodes; rather, they rely on light-weight custom compute kernels that do not support daemon-based implementations. We describe the design and current status of a new implementation of the DPCL (Dynamic Probe Class Library) API for BG\/L. DPCL provides an easy to use layer for dynamic instrumentation on parallel MPI applications based on the DynInst dynamic instrumentation mechanism for sequential platforms. Our work includes modifying DynInst to control instrumentation from remote I\/O nodes and porting DPCL's communication to use MRNet, a scalable data reduction network for collecting performance data. We describe extensions to the DPCL API that support instrumentation of task subsets and aggregation of collected performance data. 
Overall, our implementation provides a scalable infrastructure that provides efficient binary instrumentation on BG\/L.<\/jats:p>","DOI":"10.1145\/1127577.1127581","type":"journal-article","created":{"date-parts":[[2006,5,8]],"date-time":"2006-05-08T22:51:53Z","timestamp":1147128713000},"page":"9-14","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":9,"title":["Scalable dynamic binary instrumentation for Blue Gene\/L"],"prefix":"10.1145","volume":"33","author":[{"given":"Martin","family":"Schulz","sequence":"first","affiliation":[{"name":"Lawrence Livermore National Laboratory, Livermore, CA"}]},{"given":"Dong","family":"Ahn","sequence":"additional","affiliation":[{"name":"Lawrence Livermore National Laboratory, Livermore, CA"}]},{"given":"Andrew","family":"Bernat","sequence":"additional","affiliation":[{"name":"University of Wisconsin, Madison, WI"}]},{"given":"Bronis R.","family":"de Supinski","sequence":"additional","affiliation":[{"name":"Lawrence Livermore National Laboratory, Livermore, CA"}]},{"given":"Steven Y.","family":"Ko","sequence":"additional","affiliation":[{"name":"University of Illinois, Urbana-Champaign, IL"}]},{"given":"Gregory","family":"Lee","sequence":"additional","affiliation":[{"name":"University of California, San Diego, CA"}]},{"given":"Barry","family":"Rountree","sequence":"additional","affiliation":[{"name":"University of Georgia, GA"}]}],"member":"320","published-online":{"date-parts":[[2005,12]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Sept.","author":"Debugging Interface MPI","year":"2005","unstructured":"MPI Debugging Interface . http:\/\/www-unix.mcs.anl.gov\/mpi\/mpi-debug\/ , Sept. 2005 . MPI Debugging Interface. http:\/\/www-unix.mcs.anl.gov\/mpi\/mpi-debug\/, Sept. 2005."},{"key":"e_1_2_1_2_1","unstructured":"SLURM\n  : Simple Linux Utility for Resource Management. http:\/\/www.llnl.gov\/linux\/slurm\/ June 2005.  
SLURM: Simple Linux Utility for Resource Management. http:\/\/www.llnl.gov\/linux\/slurm\/ June 2005."},{"key":"e_1_2_1_3_1","volume-title":"Proceedings of IEEE\/ACM Supercomputing '02","author":"Adiga N.","year":"2002","unstructured":"N. Adiga and et al. An overview of the bluegene\/l supercomputer . In Proceedings of IEEE\/ACM Supercomputing '02 , Nov. 2002 . N. Adiga and et al. An overview of the bluegene\/l supercomputer. In Proceedings of IEEE\/ACM Supercomputing '02, Nov. 2002."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1177\/109434200001400404"},{"key":"e_1_2_1_5_1","volume-title":"Architecture and Software Workshop\", presentation","author":"DelSignore J.","unstructured":"J. DelSignore . TotalView on Blue Gene\/L. Presented at \"Blue Gene\/L: Applications , Architecture and Software Workshop\", presentation available at http:\/\/www.llnl.gov\/asci\/platforms\/bluegene\/papers\/26delsignore.pdf. J. DelSignore. TotalView on Blue Gene\/L. Presented at \"Blue Gene\/L: Applications, Architecture and Software Workshop\", presentation available at http:\/\/www.llnl.gov\/asci\/platforms\/bluegene\/papers\/26delsignore.pdf."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.5555\/645609.662971"},{"key":"e_1_2_1_7_1","unstructured":"IBM. An Overview of the BlueGene\/L Supercomputer. Whitepaper available at http:\/\/www-fp.mcs.anl.gov\/bgconsortium.  IBM. An Overview of the BlueGene\/L Supercomputer. Whitepaper available at http:\/\/www-fp.mcs.anl.gov\/bgconsortium."},{"key":"e_1_2_1_8_1","series-title":"LRR-TUM Research Report Series","volume-title":"OMIS --- On-line Monitoring Interface Specification (Version 2.0)","author":"Ludwig T.","year":"1997","unstructured":"T. Ludwig , R. Wism\u00fcller , V. Sunderam , and A. Bode . OMIS --- On-line Monitoring Interface Specification (Version 2.0) , volume 9 of LRR-TUM Research Report Series . Shaker Verlag , Aachen, Germany , 1997 . ISBN 3-8265-3035-7. T. Ludwig, R. Wism\u00fcller, V. 
Sunderam, and A. Bode. OMIS --- On-line Monitoring Interface Specification (Version 2.0), volume 9 of LRR-TUM Research Report Series. Shaker Verlag, Aachen, Germany, 1997. ISBN 3-8265-3035-7."},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of the 2003 International Conference on Parallel and Distributed Techniques and Applications","author":"May J.","year":"2003","unstructured":"J. May and J. Gyllenhaal . Tool Gear: Infrastructure for Parallel Tools . In Proceedings of the 2003 International Conference on Parallel and Distributed Techniques and Applications , June 2003 . J. May and J. Gyllenhaal. Tool Gear: Infrastructure for Parallel Tools. In Proceedings of the 2003 International Conference on Parallel and Distributed Techniques and Applications, June 2003."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/2.471178"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1048935.1050172"}],"container-title":["ACM SIGARCH Computer Architecture News"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1127577.1127581","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T21:23:03Z","timestamp":1672262583000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1127577.1127581"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,12]]},"references-count":11,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2005,12]]}},"alternative-id":["10.1145\/1127577.1127581"],"URL":"https:\/\/doi.org\/10.1145\/1127577.1127581","relation":{},"ISSN":["0163-5964"],"issn-type":[{"value":"0163-5964","type":"print"}],"subject":[],"published":{"date-parts":[[2005,12]]},"assertion":[{"value":"2005-12-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}