{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:27:49Z","timestamp":1750220869319,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":24,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,4,13]],"date-time":"2019-04-13T00:00:00Z","timestamp":1555113600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,4,13]]},"DOI":"10.1145\/3300053.3319420","type":"proceedings-article","created":{"date-parts":[[2019,4,10]],"date-time":"2019-04-10T19:07:28Z","timestamp":1554923248000},"page":"53-62","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Quantifying the NUMA Behavior of Partitioned GPGPU Applications"],"prefix":"10.1145","author":[{"given":"Alexander","family":"Matz","sequence":"first","affiliation":[{"name":"Heidelberg University, Heidelberg, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Holger","family":"Fr\u00f6ning","sequence":"additional","affiliation":[{"name":"Heidelberg University, Heidelberg, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2019,4,13]]},"reference":[{"volume-title":"2016 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). 307--317","author":"Adhinarayanan V.","key":"e_1_3_2_1_1_1","unstructured":"V. Adhinarayanan and W. Feng. 2016. 
An automated framework for characterizing and subsetting GPGPU workloads. In 2016 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). 307--317."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3079856.3080231"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2009.5306797"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1735688.1735702"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2012.44"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.5555\/2606265.2606953"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2010.5649549"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2009.5306801"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2014.55"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.5555\/977395.977673"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.5555\/646102.681186"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/1816038.1816021"},{"volume-title":"2018 IEEE International Symposium on Workload Characterization (IISWC). 191--202","author":"Li A.","key":"e_1_3_2_1_13_1","unstructured":"A. Li, S. L. Song, J. Chen, X. Liu, N. Tallent, and K. Barker. 2018. Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite. In 2018 IEEE International Symposium on Workload Characterization (IISWC). 
191--202."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3037697.3037709"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123939.3124534"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/HOTCHIPS.2009.7478342"},{"volume-title":"2014 IEEE International Symposium on Workload Characterization (IISWC). 130--139","author":"O'Neil M. A.","key":"e_1_3_2_1_17_1","unstructured":"M. A. O'Neil and M. Burtscher. 2014. Microarchitectural performance characterization of irregular GPU kernels. In 2014 IEEE International Symposium on Workload Characterization (IISWC). 130--139."},{"key":"e_1_3_2_1_18_1","volume-title":"Performance Modeling and Evaluation of Distributed Deep Learning Frameworks on GPUs. CoRR abs\/1711.05979","author":"Shi Shaohuai","year":"2017","unstructured":"Shaohuai Shi and Xiaowen Chu. 2017. Performance Modeling and Evaluation of Distributed Deep Learning Frameworks on GPUs. CoRR abs\/1711.05979 (2017). arXiv:1711.05979 http:\/\/arxiv.org\/abs\/1711.05979"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1964179.1964194"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2749469.2750375"},{"key":"e_1_3_2_1_22_1","volume-title":"Article arXiv:1811.02884 (Oct.","author":"Sun Yifan","year":"2018","unstructured":"Yifan Sun, Trinayan Baruah, Saiful A. Mojumder, Shi Dong, Rafael Ubal, Xiang Gong, Shane Treadway, Yuhui Bao, Vincent Zhao, Jos\u00e9 L. Abell\u00e1n, John Kim, Ajay Joshi, and David Kaeli. 2018. 
MGSim + MGMark: A Framework for Multi-GPU System Research. arXiv e-prints, Article arXiv:1811.02884 (Oct. 2018), arXiv:1811.02884 pages. arXiv:cs.DC\/1811.02884"},{"volume-title":"Evaluating On-Node GPU Interconnects for Deep Learning Workloads","author":"Tallent Nathan R.","key":"e_1_3_2_1_23_1","unstructured":"Nathan R. Tallent, Nitin A. Gawande, Charles Siegel, Abhinav Vishnu, and Adolfy Hoisie. 2018. Evaluating On-Node GPU Interconnects for Deep Learning Workloads. In High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, Stephen Jarvis, Steven Wright, and Simon Hammond (Eds.). 
Springer International Publishing, Cham, 3--21."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2018.00108"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2854038.2854041"}],"event":{"name":"ASPLOS '19: Architectural Support for Programming Languages and Operating Systems","sponsor":["SIGPLAN ACM Special Interest Group on Programming Languages","SIGOPS ACM Special Interest Group on Operating Systems","SIGARCH ACM Special Interest Group on Computer Architecture","SIGBED ACM Special Interest Group on Embedded Systems"],"location":"Providence RI USA","acronym":"ASPLOS '19"},"container-title":["Proceedings of the 12th Workshop on General Purpose Processing Using GPUs"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3300053.3319420","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3300053.3319420","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:23:51Z","timestamp":1750202631000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3300053.3319420"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,4,13]]},"references-count":24,"alternative-id":["10.1145\/3300053.3319420","10.1145\/3300053"],"URL":"https:\/\/doi.org\/10.1145\/3300053.3319420","relation":{},"subject":[],"published":{"date-parts":[[2019,4,13]]},"assertion":[{"value":"2019-04-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}