{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,10]],"date-time":"2026-01-10T07:44:45Z","timestamp":1768031085792,"version":"3.49.0"},"reference-count":25,"publisher":"IGI Global","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,4,1]]},"abstract":"<p>Efficient design of hardware and software for large-scale parallel execution requires detailed understanding of the interactions between the application, computer, and network. The authors have developed a macro-scale simulator (SST\/macro) that permits the coarse-grained study of distributed-memory applications. In the presented work, applications using the Message Passing Interface (MPI) are simulated; however, the simulator is designed to allow inclusion of other programming models. The simulator is driven from either a trace file or a skeleton application. Trace files can be either a standard format (Open Trace Format) or a more detailed custom format (DUMPI). The simulator architecture is modular, allowing it to easily be extended with additional network models, trace file formats, and more detailed processor models. 
This paper describes the design of the simulator, provides performance results, and presents studies showing how application performance is affected by machine characteristics.<\/p>","DOI":"10.4018\/jdst.2010040104","type":"journal-article","created":{"date-parts":[[2010,4,19]],"date-time":"2010-04-19T23:15:42Z","timestamp":1271718942000},"page":"57-73","source":"Crossref","is-referenced-by-count":85,"title":["A Simulator for Large-Scale Parallel Computer Architectures"],"prefix":"10.4018","volume":"1","author":[{"given":"Curtis L.","family":"Janssen","sequence":"first","affiliation":[{"name":"Sandia National Laboratories, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Helgi","family":"Adalsteinsson","sequence":"additional","affiliation":[{"name":"Sandia National Laboratories, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Scott","family":"Cranford","sequence":"additional","affiliation":[{"name":"Sandia National Laboratories, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Joseph P.","family":"Kenny","sequence":"additional","affiliation":[{"name":"Sandia National Laboratories, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ali","family":"Pinar","sequence":"additional","affiliation":[{"name":"Sandia National Laboratories, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David A.","family":"Evensky","sequence":"additional","affiliation":[{"name":"Sandia National Laboratories, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jackson","family":"Mayo","sequence":"additional","affiliation":[{"name":"Sandia National Laboratories, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"2432","reference":[{"key":"jdst.2010040104-0","unstructured":"M5Sim. (n.d.). The M5 Simulator System: A modular platform for computer system architecture research. 
Retrieved September 9, 2009 from http:\/\/www.m5sim.org"},{"key":"jdst.2010040104-1","doi-asserted-by":"publisher","DOI":"10.1006\/jpdc.2001.1800"},{"key":"jdst.2010040104-2","unstructured":"AMG benchmark summary. (n.d.). Retrieved September 11, 2009 from https:\/\/asc.llnl.gov\/sequoia\/benchmarks\/AMG_summary_v1.0.pdf"},{"key":"jdst.2010040104-3","doi-asserted-by":"crossref","unstructured":"Benveniste, C., & Heidelberger, P. (1995). Parallel simulation of the IBM SP2 interconnection network. Paper presented at the 1995 Winter Simulation Conference, New York, NY.","DOI":"10.1145\/224401.224689"},{"key":"jdst.2010040104-4","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2006.82"},{"key":"jdst.2010040104-5","author":"W.Dally","year":"2004","journal-title":"Principles and Practices of Interconnection Networks"},{"key":"jdst.2010040104-6","first-page":"41","article-title":"BoomerAMG: A Parallel Algebraic Multigrid Solver and Preconditioner.","author":"V. E.Henson","year":"2002","journal-title":"Applied Numerical Mathematics"},{"key":"jdst.2010040104-7","unstructured":"Jacob, B. (n.d.). DRAMsim: A Detailed Memory-System Simulation Framework. Retrieved September 9, 2009 from http:\/\/www.ece.umd.edu\/dramsim\/"},{"issue":"1","key":"jdst.2010040104-8","article-title":"Communication Requirements and Interconnect Optimization for High-End Scientific Applications.","volume":"99","author":"S.Kamil","year":"2009","journal-title":"IEEE Transactions on Parallel and Distributed Systems"},{"key":"jdst.2010040104-9","doi-asserted-by":"crossref","unstructured":"Knupfer, A., Brendel, R., Brunst, H., Mix, H., & Nagel, W. E. (2006). Introducing the open trace format (OTF). In V. N. Alexandrov, G. D. van Albada, P. M. A. Sloot, & J. Dongarra (Eds.), Proceedings, Part II Computational Science-ICCS 2006, 6th International Conference (LNCS 3992).","DOI":"10.1007\/11758525_71"},{"key":"jdst.2010040104-10","unstructured":"Mantevo. (n.d.). 
Retrieved September 9, 2009 from https:\/\/software.sandia.gov\/mantevo\/"},{"key":"jdst.2010040104-11","unstructured":"Message Passing Interface Forum. (2008). MPI: A Message-Passing Interface Standard: Version 2.1. Retrieved September 9, 2009 from http:\/\/www.mpi-forum.org"},{"key":"jdst.2010040104-12","unstructured":"ns-3. (n.d.). The ns-3 network simulator. Retrieved September 9, 2009 from http:\/\/www.nsnam.org\/"},{"key":"jdst.2010040104-13","unstructured":"PAPI. (n.d.). Retrieved September 9, 2009 from http:\/\/icl.cs.utk.edu\/papi\/"},{"key":"jdst.2010040104-14","unstructured":"Petrini, F., & Vanneschi, M. (1997). SMART: A simulator of massive architectures and topologies. In Proceedings of the Euro-PDS (pp. 185-191)."},{"key":"jdst.2010040104-15","doi-asserted-by":"publisher","DOI":"10.1109\/32.846297"},{"key":"jdst.2010040104-16","doi-asserted-by":"crossref","unstructured":"Riesen, R. (2006). A Hybrid MPI Simulator. In Proceedings of the IEEE International Conference on Cluster Computing (CLUSTER'06).","DOI":"10.1109\/CLUSTR.2006.311852"},{"key":"jdst.2010040104-17","doi-asserted-by":"crossref","unstructured":"Rodrigues, A., Murphy, R., Kogge, P., Brockman, J., Brightwell, R., & Underwood, K. (2003). Implications of a PIM architectural model for MPI. Paper presented at the IEEE International Conference on Cluster Computing, Los Alamitos, CA, USA.","DOI":"10.1109\/CLUSTR.2003.1253323"},{"key":"jdst.2010040104-18","unstructured":"ASC Sequoia Benchmark Codes. (n.d.). Retrieved September 11, 2009 from https:\/\/asc.llnl.gov\/sequoia\/benchmarks\/"},{"key":"jdst.2010040104-19","unstructured":"Shalf, J., Kamil, S., Oliker, L., & Skinner, D. (2005). Analyzing Ultra-Scale Application Communication Requirements for a Reconfigurable Hybrid Interconnect."},{"key":"jdst.2010040104-20","unstructured":"Susukita, R., Ando, H., Aoyagi, M., Honda, H., Inadomi, Y., Inoue, K., et al. (2008). 
Performance prediction of large-scale parallel system and application using macro-level simulation. Paper presented at the SC '08: Proceedings of the 2008 ACM\/IEEE conference on Supercomputing, Piscataway, NJ, USA."},{"key":"jdst.2010040104-21","doi-asserted-by":"crossref","unstructured":"Underwood, K., Levenhagen, M., & Rodrigues, A. (2007). Simulating Red Storm: challenges and successes in building a system simulation. Paper presented at the 2007 IEEE International Parallel and Distributed Processing Symposium (IEEE Cat. No.07TH8938), Piscataway, NJ, USA.","DOI":"10.1109\/IPDPS.2007.370235"},{"key":"jdst.2010040104-22","unstructured":"VampirTrace. (n.d.). Retrieved September 9, 2009 from http:\/\/www.tu-dresden.de\/zih\/vampirtrace"},{"key":"jdst.2010040104-23","doi-asserted-by":"publisher","DOI":"10.1145\/1105734.1105748"},{"key":"jdst.2010040104-24","doi-asserted-by":"publisher","DOI":"10.1007\/s10766-005-3582-6"}],"container-title":["International Journal of Distributed Systems and Technologies"],"original-title":[],"language":"ng","link":[{"URL":"https:\/\/www.igi-global.com\/viewtitle.aspx?TitleId=42976","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,6,1]],"date-time":"2022-06-01T18:54:06Z","timestamp":1654109646000},"score":1,"resource":{"primary":{"URL":"https:\/\/services.igi-global.com\/resolvedoi\/resolve.aspx?doi=10.4018\/jdst.2010040104"}},"subtitle":[""],"short-title":[],"issued":{"date-parts":[[2010,4,1]]},"references-count":25,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2010,4]]}},"URL":"https:\/\/doi.org\/10.4018\/jdst.2010040104","relation":{},"ISSN":["1947-3532","1947-3540"],"issn-type":[{"value":"1947-3532","type":"print"},{"value":"1947-3540","type":"electronic"}],"subject":[],"published":{"date-parts":[[2010,4,1]]}}}