{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,12]],"date-time":"2026-06-12T10:02:28Z","timestamp":1781258548834,"version":"3.54.1"},"publisher-location":"New York, NY, USA","reference-count":49,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,10,28]],"date-time":"2023-10-28T00:00:00Z","timestamp":1698451200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,10,28]]},"DOI":"10.1145\/3613424.3623782","type":"proceedings-article","created":{"date-parts":[[2023,12,8]],"date-time":"2023-12-08T17:22:15Z","timestamp":1702056135000},"page":"728-741","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Snake: A Variable-length Chain-based Prefetching for GPUs"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0000-3485-0036","authenticated-orcid":false,"given":"Saba","family":"Mostofi","sequence":"first","affiliation":[{"name":"Department of Computer Engineering, Sharif University of Technology, Iran"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8375-3339","authenticated-orcid":false,"given":"Hajar","family":"Falahati","sequence":"additional","affiliation":[{"name":"School of Computer Science, Institute for Research in Fundamental Sciences (IPM), Iran"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5232-3539","authenticated-orcid":false,"given":"Negin","family":"Mahani","sequence":"additional","affiliation":[{"name":"Department of Computer Engineering, Shahid Bahonar University, Iran and School of Computer Science, Institute for Research in Fundamental Sciences (IPM), Iran"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3293-8274","authenticated-orcid":false,"given":"Pejman","family":"Lotfi-Kamran","sequence":"additional","affiliation":[{"name":"School of Computer Science, Institute for Research in Fundamental Sciences (IPM), Iran"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4079-8603","authenticated-orcid":false,"given":"Hamid","family":"Sarbazi-Azad","sequence":"additional","affiliation":[{"name":"Department of Computer Engineering, Sharif University of Technology, Iran and School of Computer Science, Institute for Research in Fundamental Sciences (IPM), Iran"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2023,12,8]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"2000. Design Compiler Synopsys inc.  2000. Design Compiler Synopsys inc."},{"key":"e_1_3_2_1_2_1","unstructured":"2015. Cadence SoC Encounter. https:\/\/www.cadence.com\/.  2015. Cadence SoC Encounter. https:\/\/www.cadence.com\/."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/PACT.2015.38"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123939.3123975"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPASS.2009.4919648"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2018.00021"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2019.00053"},{"key":"e_1_3_2_1_8_1","volume-title":"Nonvolatile Memory Workshop (NVMW).","author":"Chakraborttii Chandranil","year":"2022","unstructured":"Chandranil Chakraborttii and Heiner Litz . 2022 . Deep Learning based Prefetching for Flash . In Nonvolatile Memory Workshop (NVMW). Chandranil Chakraborttii and Heiner Litz. 2022. Deep Learning based Prefetching for Flash. In Nonvolatile Memory Workshop (NVMW)."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2014.16"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/3508036"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2022.3154315"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CADS.2013.6714230"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11227-014-1331-6"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/LCA.2021.3096191"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3307650.3322224"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3570638"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2485922.2485951"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/2980983.2908117"},{"key":"e_1_3_2_1_19_1","volume-title":"AccelWattch: A Power Modeling Framework for Modern GPUs. In MICRO-54: 54th Annual IEEE\/ACM International Symposium on Microarchitecture. 738\u2013753","author":"Kandiah Vijay","year":"2021","unstructured":"Vijay Kandiah , Scott Peverelle , Mahmoud Khairy , Junrui Pan , Amogh Manjunath , Timothy\u00a0 G Rogers , Tor\u00a0 M Aamodt , and Nikos Hardavellas . 2021 . AccelWattch: A Power Modeling Framework for Modern GPUs. In MICRO-54: 54th Annual IEEE\/ACM International Symposium on Microarchitecture. 738\u2013753 . Vijay Kandiah, Scott Peverelle, Mahmoud Khairy, Junrui Pan, Amogh Manjunath, Timothy\u00a0G Rogers, Tor\u00a0M Aamodt, and Nikos Hardavellas. 2021. AccelWattch: A Power Modeling Framework for Modern GPUs. In MICRO-54: 54th Annual IEEE\/ACM International Symposium on Microarchitecture. 738\u2013753."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1049\/iet-cdt.2014.0092"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA45697.2020.00047"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3291051"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2016.7783763"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2016.7446062"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2018.00024"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3079856.3080239"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2014.6835970"},{"key":"e_1_3_2_1_28_1","volume-title":"Special Issue","author":"Lam D","year":"1991","unstructured":"Monica\u00a0 D Lam , Edward\u00a0 E Rothberg , and Michael\u00a0 E Wolf . 1991. The cache performance and optimizations of blocked algorithms. ACM SIGOPS Operating Systems Review 25 , Special Issue ( 1991 ), 63\u201374. Monica\u00a0D Lam, Edward\u00a0E Rothberg, and Michael\u00a0E Wolf. 1991. The cache performance and optimizations of blocked algorithms. ACM SIGOPS Operating Systems Review 25, Special Issue (1991), 63\u201374."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2010.44"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/3322127"},{"key":"e_1_3_2_1_31_1","volume-title":"configuration, and break-up history of Rodinia: a synthesis. Precambrian research 160, 1-2","author":"Li Zheng-Xiang","year":"2008","unstructured":"Zheng-Xiang Li , S Vb Bogdanova , AS Collins , Anthony Davidson , Bert De\u00a0Waele , RE Ernst , ICW Fitzsimons , RA Fuck , DP Gladkochub , J Jacobs , 2008. Assembly , configuration, and break-up history of Rodinia: a synthesis. Precambrian research 160, 1-2 ( 2008 ), 179\u2013210. Zheng-Xiang Li, SVb Bogdanova, AS Collins, Anthony Davidson, Bert De\u00a0Waele, RE Ernst, ICW Fitzsimons, RA Fuck, DP Gladkochub, J Jacobs, 2008. Assembly, configuration, and break-up history of Rodinia: a synthesis. Precambrian research 160, 1-2 (2008), 179\u2013210."},{"key":"e_1_3_2_1_32_1","volume-title":"Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis. 1\u201310","author":"Hung Tzu-Han","year":"2009","unstructured":"Shih-wei Liao, Tzu-Han Hung , Donald Nguyen , Chinyen Chou , Chiaheng Tu , and Hucheng Zhou . 2009 . Machine learning-based prefetch optimization for data center applications . In Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis. 1\u201310 . Shih-wei Liao, Tzu-Han Hung, Donald Nguyen, Chinyen Chou, Chiaheng Tu, and Hucheng Zhou. 2009. Machine learning-based prefetch optimization for data center applications. In Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis. 1\u201310."},{"key":"e_1_3_2_1_33_1","volume-title":"Introduction to data prefetching. Data Prefetching Techniques in Computer Systems","author":"Lotfi-Kamrana Pejman","year":"2022","unstructured":"Pejman Lotfi-Kamrana and Hamid Sarbazi-Azadb . 2022. Introduction to data prefetching. Data Prefetching Techniques in Computer Systems ( 2022 ), 1. Pejman Lotfi-Kamrana and Hamid Sarbazi-Azadb. 2022. Introduction to data prefetching. Data Prefetching Techniques in Computer Systems (2022), 1."},{"key":"e_1_3_2_1_34_1","first-page":"1","article-title":"Efficient Nearest-Neighbor Data Sharing in GPUs","volume":"18","author":"Nematollahi Negin","year":"2020","unstructured":"Negin Nematollahi , Mohammad Sadrosadati , Hajar Falahati , Marzieh Barkhordar , Mario\u00a0Paulo Drumond , Hamid Sarbazi-Azad , and Babak Falsafi . 2020 . Efficient Nearest-Neighbor Data Sharing in GPUs . ACM Transactions on Architecture and Code Optimization (TACO) 18 , 1 (2020), 1 \u2013 26 . Negin Nematollahi, Mohammad Sadrosadati, Hajar Falahati, Marzieh Barkhordar, Mario\u00a0Paulo Drumond, Hamid Sarbazi-Azad, and Babak Falsafi. 2020. Efficient Nearest-Neighbor Data Sharing in GPUs. ACM Transactions on Architecture and Code Optimization (TACO) 18, 1 (2020), 1\u201326.","journal-title":"ACM Transactions on Architecture and Code Optimization (TACO)"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/LCA.2018.2873679"},{"key":"e_1_3_2_1_36_1","unstructured":"Tesla NVIDIA. [n. d.]. V100 Volta Architecture. URL http:\/\/www. nvidia. com\/object\/volta-architecture-whitepaper. html ([n. d.]).  Tesla NVIDIA. [n. d.]. V100 Volta Architecture. URL http:\/\/www. nvidia. com\/object\/volta-architecture-whitepaper. html ([n. d.])."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3007787.3001158"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/170036.170081"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA45697.2020.00021"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2925426.2926282"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3291606"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/3419973"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/2830772.2830793"},{"key":"e_1_3_2_1_44_1","first-page":"27","article-title":"Parboil: A revised benchmark suite for scientific and commercial throughput computing","volume":"127","author":"Stratton A","year":"2012","unstructured":"John\u00a0 A Stratton , Christopher Rodrigues , I- Jui Sung , Nady Obeid , Li-Wen Chang , Nasser Anssari , Geng\u00a0Daniel Liu , and Wen-mei\u00a0 W Hwu . 2012 . Parboil: A revised benchmark suite for scientific and commercial throughput computing . Center for Reliable and High-Performance Computing 127 (2012), 27 . John\u00a0A Stratton, Christopher Rodrigues, I-Jui Sung, Nady Obeid, Li-Wen Chang, Nasser Anssari, Geng\u00a0Daniel Liu, and Wen-mei\u00a0W Hwu. 2012. Parboil: A revised benchmark suite for scientific and commercial throughput computing. Center for Reliable and High-Performance Computing 127 (2012), 27.","journal-title":"Center for Reliable and High-Performance Computing"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/2.596622"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/2967938.2967947"},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2018.00030"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/4.509850"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2014.6835939"}],"event":{"name":"MICRO '23: 56th Annual IEEE\/ACM International Symposium on Microarchitecture","location":"Toronto ON Canada","acronym":"MICRO '23","sponsor":["SIGMICRO ACM Special Interest Group on Microarchitectural Research and Processing"]},"container-title":["56th Annual IEEE\/ACM International Symposium on Microarchitecture"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3613424.3623782","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3613424.3623782","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:36:30Z","timestamp":1750178190000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3613424.3623782"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,28]]},"references-count":49,"alternative-id":["10.1145\/3613424.3623782","10.1145\/3613424"],"URL":"https:\/\/doi.org\/10.1145\/3613424.3623782","relation":{},"subject":[],"published":{"date-parts":[[2023,10,28]]},"assertion":[{"value":"2023-12-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}