{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,5]],"date-time":"2026-03-05T15:42:31Z","timestamp":1772725351720,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":43,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,8]],"date-time":"2022-10-08T00:00:00Z","timestamp":1665187200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"MCIN\/AEI\/10.13039\/501100011033","award":["PID2020-112827GB-I00"],"award-info":[{"award-number":["PID2020-112827GB-I00"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,8]]},"DOI":"10.1145\/3559009.3569666","type":"proceedings-article","created":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T14:02:50Z","timestamp":1674828170000},"page":"333-345","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["NaviSim"],"prefix":"10.1145","author":[{"given":"Yuhui","family":"Bao","sequence":"first","affiliation":[{"name":"Northeastern University"}]},{"given":"Yifan","family":"Sun","sequence":"additional","affiliation":[{"name":"William & Mary"}]},{"given":"Zlatan","family":"Feric","sequence":"additional","affiliation":[{"name":"Northeastern University"}]},{"given":"Michael Tian","family":"Shen","sequence":"additional","affiliation":[{"name":"Northeastern University"}]},{"given":"Micah","family":"Weston","sequence":"additional","affiliation":[{"name":"Northeastern University"}]},{"given":"Jos\u00e9 L.","family":"Abell\u00e1n","sequence":"additional","affiliation":[{"name":"Universidad Cat\u00f3lica de Murcia, Murcia, 
Spain"}]},{"given":"Trinayan","family":"Baruah","sequence":"additional","affiliation":[{"name":"AMD"}]},{"given":"John","family":"Kim","sequence":"additional","affiliation":[{"name":"KAIST, Daejeon, South Korea"}]},{"given":"Ajay","family":"Joshi","sequence":"additional","affiliation":[{"name":"Boston University"}]},{"given":"David","family":"Kaeli","sequence":"additional","affiliation":[{"name":"Northeastern University"}]}],"member":"320","published-online":{"date-parts":[[2023,1,27]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA52012.2021.00034"},{"key":"e_1_3_2_1_2_1","unstructured":"AMD Inc. 2012. AMD Graphics Core Next Architecture. https:\/\/www.techpowerup.com\/gpu-specs\/docs\/amd-gcn1-architecture.pdf  AMD Inc. 2012. AMD Graphics Core Next Architecture. https:\/\/www.techpowerup.com\/gpu-specs\/docs\/amd-gcn1-architecture.pdf"},{"key":"e_1_3_2_1_3_1","unstructured":"AMD Inc. 2019. Introducing RDNA Architecture The all new Radeon gaming architecture powering \"Navi\". https:\/\/www.amd.com\/system\/files\/documents\/rdna-whitepaper.pdf  AMD Inc. 2019. Introducing RDNA Architecture The all new Radeon gaming architecture powering \"Navi\". https:\/\/www.amd.com\/system\/files\/documents\/rdna-whitepaper.pdf"},{"key":"e_1_3_2_1_4_1","unstructured":"AMD Inc. 2020. \"RDNA 1.0\" Instruction Set Architecture Reference Guide. https:\/\/developer.amd.com\/wp-content\/resources\/RDNA_Shader_ISA.pdf  AMD Inc. 2020. \"RDNA 1.0\" Instruction Set Architecture Reference Guide. https:\/\/developer.amd.com\/wp-content\/resources\/RDNA_Shader_ISA.pdf"},{"key":"e_1_3_2_1_5_1","unstructured":"AMD Inc. 2022. HIP Programming Guide. https:\/\/rocmdocs.amd.com\/en\/latest\/Programming_Guides\/HIP-GUIDE.html  AMD Inc. 2022. HIP Programming Guide. 
https:\/\/rocmdocs.amd.com\/en\/latest\/Programming_Guides\/HIP-GUIDE.html"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123939.3123975"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPASS.2009.4919648"},{"key":"e_1_3_2_1_8_1","volume-title":"Design-Process-Technology Co-optimization for Manufacturability X","author":"Chiou Tsann-Bim","unstructured":"Tsann-Bim Chiou , Alek C Chen , Mircea Dusa , and Shih-En Tseng . 2016. Impact of EUV patterning scenario on different design styles and their ground rules for 7nm\/5nm node BEOL layers . In Design-Process-Technology Co-optimization for Manufacturability X , Vol. 9781 . International Society for Optics and Photonics, SPIE , Bellingham, Washington USA , 978107. Tsann-Bim Chiou, Alek C Chen, Mircea Dusa, and Shih-En Tseng. 2016. Impact of EUV patterning scenario on different design styles and their ground rules for 7nm\/5nm node BEOL layers. In Design-Process-Technology Co-optimization for Manufacturability X, Vol. 9781. 
International Society for Optics and Photonics, SPIE, Bellingham, Washington USA, 978107."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/MASCOTS.2010.43"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1735688.1735702"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3038228.3038239"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/1964179.1964192"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS47924.2020.00054"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPASS.2017.7975298"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2018.00058"},{"key":"e_1_3_2_1_16_1","volume-title":"Mengchi Zhang, Yechen Liu, Tim Rogers, and Robert J Hoekstra.","author":"Hughes Clayton","year":"2021","unstructured":"Clayton Hughes , Simon David Hammond , Mengchi Zhang, Yechen Liu, Tim Rogers, and Robert J Hoekstra. 2021 . SST-GPU: A Scalable SST GPU Component for Performance Modeling and Profiling. Technical Report. Sandia National Lab.(SNLNM), Albuquerque, NM (United States) . Clayton Hughes, Simon David Hammond, Mengchi Zhang, Yechen Liu, Tim Rogers, and Robert J Hoekstra. 2021. SST-GPU: A Scalable SST GPU Component for Performance Modeling and Profiling. Technical Report. Sandia National Lab.(SNLNM), Albuquerque, NM (United States)."},{"key":"e_1_3_2_1_17_1","unstructured":"Open Source Intiative. 1980. The MIT License.  Open Source Intiative. 1980. The MIT License."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jmmm.2015.10.054"},{"key":"e_1_3_2_1_19_1","volume-title":"Graphics double data rate 6 (GDDR6) SGRAM standard","author":"JEDEC","unstructured":"JEDEC JESD250. 2017. Graphics double data rate 6 (GDDR6) SGRAM standard . JEDEC Solid State Technology Association . JEDEC JESD250. 2017. Graphics double data rate 6 (GDDR6) SGRAM standard. 
JEDEC Solid State Technology Association."},{"key":"e_1_3_2_1_20_1","volume-title":"Heterogeneous computing with OpenCL 2.0. Morgan Kaufmann","author":"Kaeli David R","unstructured":"David R Kaeli , Perhaad Mistry , Dana Schaa , and Dong Ping Zhang . 2015. Heterogeneous computing with OpenCL 2.0. Morgan Kaufmann , Burlington, MA, USA . David R Kaeli, Perhaad Mistry, Dana Schaa, and Dong Ping Zhang. 2015. Heterogeneous computing with OpenCL 2.0. Morgan Kaufmann, Burlington, MA, USA."},{"key":"e_1_3_2_1_21_1","volume-title":"GPU Computing Gems Jade Edition","author":"Kerr Andrew","unstructured":"Andrew Kerr , Gregory Diamos , and Sudhakar Yalamanchili . 2012. Gpu application development, debugging, and performance tuning with gpu ocelot . In GPU Computing Gems Jade Edition . Elsevier , Amsterdam, Netherlands , 409--427. Andrew Kerr, Gregory Diamos, and Sudhakar Yalamanchili. 2012. Gpu application development, debugging, and performance tuning with gpu ocelot. In GPU Computing Gems Jade Edition. Elsevier, Amsterdam, Netherlands, 409--427."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA45697.2020.00047"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2014.55"},{"key":"e_1_3_2_1_24_1","volume-title":"Macsim: A cpu-gpu heterogeneous simulation framework user guide","author":"Kim Hyesoon","year":"2012","unstructured":"Hyesoon Kim , Jaekyu Lee , Nagesh B Lakshminarayana , Jaewoong Sim , Jieun Lim , and Tri Pho . 2012 . Macsim: A cpu-gpu heterogeneous simulation framework user guide . Georgia Institute of Technology , Atlanta, GA . Hyesoon Kim, Jaekyu Lee, Nagesh B Lakshminarayana, Jaewoong Sim, Jieun Lim, and Tri Pho. 2012. Macsim: A cpu-gpu heterogeneous simulation framework user guide. Georgia Institute of Technology, Atlanta, GA."},{"key":"e_1_3_2_1_25_1","volume-title":"2014 IEEE Hot Chips 26 Symposium (HCS). IEEE, IEEE","author":"Kim Joonyoung","year":"2014","unstructured":"Joonyoung Kim and Younsu Kim . 2014 . 
HBM: Memory solution for bandwidth-hungry processors . In 2014 IEEE Hot Chips 26 Symposium (HCS). IEEE, IEEE , Cupertino, CA, 1--24. Joonyoung Kim and Younsu Kim. 2014. HBM: Memory solution for bandwidth-hungry processors. In 2014 IEEE Hot Chips 26 Symposium (HCS). IEEE, IEEE, Cupertino, CA, 1--24."},{"key":"e_1_3_2_1_26_1","volume-title":"Proceedings of the 25th International Conference on Neural Information Processing Systems -","volume":"1","author":"Krizhevsky Alex","unstructured":"Alex Krizhevsky , Ilya Sutskever , and Geoffrey E. Hinton . 2012. ImageNet Classification with Deep Convolutional Neural Networks . In Proceedings of the 25th International Conference on Neural Information Processing Systems - Volume 1 (Lake Tahoe, Nevada) (NIPS'12). Curran Associates Inc., Red Hook, NY, USA, 1097--1105. Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In Proceedings of the 25th International Conference on Neural Information Processing Systems - Volume 1 (Lake Tahoe, Nevada) (NIPS'12). Curran Associates Inc., Red Hook, NY, USA, 1097--1105."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/LCA.2020.2973991"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/384266.299683"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/3037697.3037707"},{"key":"e_1_3_2_1_30_1","volume-title":"Polybench: The polyhedral benchmark suite.","author":"Pouchet Louis-No\u00ebl","year":"2012","unstructured":"Louis-No\u00ebl Pouchet 2012 . Polybench: The polyhedral benchmark suite. Louis-No\u00ebl Pouchet et al. 2012. Polybench: The polyhedral benchmark suite."},{"key":"e_1_3_2_1_31_1","unstructured":"AMD Staff. 2014. Opencl and the AMD App SDK v2. 4.  AMD Staff. 2014. Opencl and the AMD App SDK v2. 4."},{"key":"e_1_3_2_1_32_1","volume-title":"Shi Dong, and David R. 
Kaeli.","author":"Sun Yifan","year":"2019","unstructured":"Yifan Sun , Nicolas Bohm Agostini , Shi Dong, and David R. Kaeli. 2019 . Summarizing CPU and GPU Design Trends with Product Data. CoRR abs\/1911.11313 (2019), 1--5. Yifan Sun, Nicolas Bohm Agostini, Shi Dong, and David R. Kaeli. 2019. Summarizing CPU and GPU Design Trends with Product Data. CoRR abs\/1911.11313 (2019), 1--5."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/3307650.3322230"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2016.7581262"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPASS.2018.00034"},{"key":"e_1_3_2_1_36_1","volume-title":"Daisen: A Framework for Visualizing Detailed GPU Execution. Eurographics Conference on Visualization 40","author":"Sun Yifan","year":"2021","unstructured":"Yifan Sun , Yixuan Zhang , Ali Mosallaei , Michael D Shah , Cody Dunne , and David Kaeli . 2021 . Daisen: A Framework for Visualizing Detailed GPU Execution. Eurographics Conference on Visualization 40 , 3 (2021), 239--250. Yifan Sun, Yixuan Zhang, Ali Mosallaei, Michael D Shah, Cody Dunne, and David Kaeli. 2021. Daisen: A Framework for Visualizing Detailed GPU Execution. Eurographics Conference on Visualization 40, 3 (2021), 239--250."},{"key":"e_1_3_2_1_37_1","unstructured":"The Go Project. 2019. Effective Go. https:\/\/golang.org\/doc\/effective_go.html.  The Go Project. 2019. Effective Go. https:\/\/golang.org\/doc\/effective_go.html."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/2370816.2370865"},{"key":"e_1_3_2_1_39_1","series-title":"SIAM Journal on scientific and Statistical Computing 13, 2","volume-title":"Bi-CGSTAB: A fast and smoothly converging variant of Bi-CG for the solution of nonsymmetric linear systems","author":"Van der Vorst Henk A","year":"1992","unstructured":"Henk A Van der Vorst . 1992. 
Bi-CGSTAB: A fast and smoothly converging variant of Bi-CG for the solution of nonsymmetric linear systems . SIAM Journal on scientific and Statistical Computing 13, 2 ( 1992 ), 631--644. Henk A Van der Vorst. 1992. Bi-CGSTAB: A fast and smoothly converging variant of Bi-CG for the solution of nonsymmetric linear systems. SIAM Journal on scientific and Statistical Computing 13, 2 (1992), 631--644."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA51647.2021.00077"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2008.5214359"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2018.00030"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCSim.2011.5999803"}],"event":{"name":"PACT '22: International Conference on Parallel Architectures and Compilation Techniques","location":"Chicago Illinois","acronym":"PACT '22","sponsor":["SIGARCH ACM Special Interest Group on Computer Architecture","IFIP WG 10.3 IFIP WG 10.3","IEEE CS"]},"container-title":["Proceedings of the International Conference on Parallel Architectures and Compilation Techniques"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3559009.3569666","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3559009.3569666","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:38Z","timestamp":1750186958000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3559009.3569666"}},"subtitle":["A Highly Accurate GPU Simulator for AMD RDNA 
GPUs"],"short-title":[],"issued":{"date-parts":[[2022,10,8]]},"references-count":43,"alternative-id":["10.1145\/3559009.3569666","10.1145\/3559009"],"URL":"https:\/\/doi.org\/10.1145\/3559009.3569666","relation":{},"subject":[],"published":{"date-parts":[[2022,10,8]]},"assertion":[{"value":"2023-01-27","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}