{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,21]],"date-time":"2025-11-21T11:27:36Z","timestamp":1763724456970,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":25,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,9,30]],"date-time":"2019-09-30T00:00:00Z","timestamp":1569801600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,9,30]]},"DOI":"10.1145\/3357526.3357571","type":"proceedings-article","created":{"date-parts":[[2019,11,6]],"date-time":"2019-11-06T14:25:56Z","timestamp":1573050356000},"page":"261-271","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":9,"title":["Towards a scatter-gather architecture"],"prefix":"10.1145","author":[{"given":"Arun","family":"Rodrigues","sequence":"first","affiliation":[{"name":"Sandia National Labs, Albuquerque, New Mexico"}]},{"given":"Maya","family":"Gokhale","sequence":"additional","affiliation":[{"name":"Lawrence Livermore National Laboratory, Livermore"}]},{"given":"Gwendolyn","family":"Voskuilen","sequence":"additional","affiliation":[{"name":"Sandia National Labs, Albuquerque, New Mexico"}]}],"member":"320","published-online":{"date-parts":[[2019,9,30]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Hpgmg 1.0: A benchmark for ranking high performance computing systems. Tech. rep., hpgmg.org","author":"Adams M. F.","year":"2014","unstructured":"Adams, M. F., Brown, J., Shalf, J., Straalen, B.
V., Strohmaier, E., and Williams, S. Hpgmg 1.0: A benchmark for ranking high performance computing systems. Tech. rep., hpgmg.org, 2014. https:\/\/hpgmg.org\/static\/hpgmg-tr-1.0.pdf."},{"key":"e_1_3_2_1_2_1","first-page":"1","volume-title":"Proceedings of the 1997 ACM\/IEEE Conference on Supercomputing (New York, NY, USA, 1997), SC '97, ACM","author":"Anderson E.","unstructured":"Anderson, E., Brooks, J., Grassl, C., and Scott, S. Performance of the cray t3e multiprocessor. In Proceedings of the 1997 ACM\/IEEE Conference on Supercomputing (New York, NY, USA, 1997), SC '97, ACM, pp. 1--17."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3132402.3132431"},{"key":"e_1_3_2_1_5_1","volume-title":"Attack of the killer micros. Presentation at Supercomputing","author":"Brooks E.","year":"1990","unstructured":"Brooks, E. Attack of the killer micros. Presentation at Supercomputing 1990, November 1990."},{"key":"e_1_3_2_1_6_1","first-page":"15","volume-title":"Proceedings of the 19th ACM\/SIGDA International Symposium on Field Programmable Gate Arrays (New York, NY, USA, 2011), FPGA '11, ACM","author":"Chou C. H.","unstructured":"Chou, C. H., Severance, A., Brant, A. D., Liu, Z., Sant, S., and Lemieux, G. G. Vegas: Soft vector processor with scratchpad memory.
In Proceedings of the 19th ACM\/SIGDA International Symposium on Field Programmable Gate Arrays (New York, NY, USA, 2011), FPGA '11, ACM, pp. 15--24."},{"key":"e_1_3_2_1_7_1","first-page":"12","article-title":"Kokkos: Enabling manycore performance portability through polymorphic memory access patterns","volume":"74","author":"Edwards H. C.","year":"2014","unstructured":"Edwards, H. C., Trott, C. R., and Sunderland, D. Kokkos: Enabling manycore performance portability through polymorphic memory access patterns. Journal of Parallel and Distributed Computing 74, 12 (2014), 3202--3216. Domain-Specific Languages and High-Level Frameworks for High-Performance Computing.","journal-title":"Journal of Parallel and Distributed Computing"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW.2010.5470685"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2818950.2818986"},{"key":"e_1_3_2_1_10_1","volume-title":"A PIM-based Data-Intensive Architecture. In Supercomputing","author":"Hall M.","year":"1999","unstructured":"Hall, M., Kogge, P., Koller, J., Diniz, P., Chame, J., Draper, J., LaCoss, J., Granacki, J., Srivastava, A., Athas, W., Brockman, J., Freeh, V., Park, J., and Shin, J. Mapping Irregular Applications to DIVA, A PIM-based Data-Intensive Architecture.
In Supercomputing, Portland, OR (November 1999)."},{"key":"e_1_3_2_1_12_1","volume-title":"Kripke - a massively parallel transport mini-app","author":"Kunen A. J.","year":"2015","unstructured":"Kunen, A. J., Bailey, T. S., and Brown, P. N. Kripke - a massively parallel transport mini-app. In American Nuclear Society M&C (April 2015)."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-8191(05)80035-3"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1137\/0909019"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3132402.3132434"},{"key":"e_1_3_2_1_17_1","volume-title":"PLDI","author":"Luk C.-K.","year":"2005","unstructured":"Luk, C.-K., Cohn, R., Muth, R., Patil, H., Klauser, A., Lowney, G., Wallace, S., Reddi, V. J., and Hazelwood, K. Pin: Building customized program analysis tools with dynamic instrumentation. In PLDI (2005)."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/977091.977115"},{"key":"e_1_3_2_1_19_1","volume-title":"Occa: A unified approach to multi-threading languages. arXiv preprint arXiv:1403.0968","author":"Medina D. S.","year":"2014","unstructured":"Medina, D. S., St-Cyr, A., and Warburton, T. Occa: A unified approach to multi-threading languages.
arXiv preprint arXiv:1403.0968 (2014)."},{"key":"e_1_3_2_1_20_1","first-page":"332","volume-title":"Proceedings of the 19th Annual International Conference on Supercomputing (New York, NY, USA, 2005), ICS '05, ACM","author":"Murphy R.","unstructured":"Murphy, R., Rodrigues, A., Kogge, P., and Underwood, K. The implications of working set analysis on supercomputing memory hierarchy design. In Proceedings of the 19th Annual International Conference on Supercomputing (New York, NY, USA, 2005), ICS '05, ACM, pp. 332--340."},{"key":"e_1_3_2_1_21_1","article-title":"Active Memory Cube: A processing-in-memory architecture for exascale systems","volume":"59","author":"Nair R.","unstructured":"Nair, R., Antao, S., Bertolli, C., Bose, P., et al. Active Memory Cube: A processing-in-memory architecture for exascale systems. IBM Journal of Research and Development 59, 2\/3 (March-May 2015), 17:1--17:14.","journal-title":"IBM Journal of Research and Development"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/1964218.1964225"},{"key":"e_1_3_2_1_23_1","first-page":"66","volume-title":"Proceedings of the 20th Annual International Conference on Supercomputing (New York, NY, USA, 2006), ICS '06, ACM","author":"Rupnow K.","unstructured":"Rupnow, K., Rodrigues, A., Underwood, K., and Compton, K. Scientific applications vs. spec-fp: A comparison of program behavior.
In Proceedings of the 20th Annual International Conference on Supercomputing (New York, NY, USA, 2006), ICS '06, ACM, pp. 66--74."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3149704.3149766"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2830772.2830820"},{"key":"e_1_3_2_1_26_1","volume-title":"PHYSOR 2014 - The Role of Reactor Physics toward a Sustainable Future (Kyoto","author":"Tramm J. R.","year":"2014","unstructured":"Tramm, J. R., Siegel, A. R., Islam, T., and Schulz, M. Bench - the development and verification of a performance abstraction for monte carlo reactor analysis.
In PHYSOR 2014 - The Role of Reactor Physics toward a Sustainable Future (Kyoto, 2014)."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/1105734.1105748"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/76263.1379809"}],"event":{"name":"MEMSYS '19: The International Symposium on Memory Systems","acronym":"MEMSYS '19","location":"Washington District of Columbia USA"},"container-title":["Proceedings of the International Symposium on Memory Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3357526.3357571","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3357526.3357571","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:23:22Z","timestamp":1750202602000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3357526.3357571"}},"subtitle":["hardware and software issues"],"short-title":[],"issued":{"date-parts":[[2019,9,30]]},"references-count":25,"alternative-id":["10.1145\/3357526.3357571","10.1145\/3357526"],"URL":"https:\/\/doi.org\/10.1145\/3357526.3357571","relation":{},"subject":[],"published":{"date-parts":[[2019,9,30]]},"assertion":[{"value":"2019-09-30","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}