{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:12:19Z","timestamp":1750219939450,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":28,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,2,25]],"date-time":"2023-02-25T00:00:00Z","timestamp":1677283200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"STINT","award":["MG2018-8007"],"award-info":[{"award-number":["MG2018-8007"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,2,25]]},"DOI":"10.1145\/3589236.3589243","type":"proceedings-article","created":{"date-parts":[[2023,6,21]],"date-time":"2023-06-21T00:32:47Z","timestamp":1687307567000},"page":"7-13","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["LATOA: Load-Aware Task Offloading and Adoption in GPU"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7135-5130","authenticated-orcid":false,"given":"Hossein","family":"Bitalebi","sequence":"first","affiliation":[{"name":"KTH Royal Institute of Technology, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6996-608X","authenticated-orcid":false,"given":"Vahid","family":"Geraeinejad","sequence":"additional","affiliation":[{"name":"KTH Royal Institute of Technology, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8546-3148","authenticated-orcid":false,"given":"Farshad","family":"Safaei","sequence":"additional","affiliation":[{"name":"Faculty of Computer Science and Engineering, Shahid Beheshti University, Iran"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7877-6712","authenticated-orcid":false,"given":"Masoumeh","family":"Ebrahimi","sequence":"additional","affiliation":[{"name":"KTH Royal Institute of Technology, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,6,20]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPASS.2009.4919648"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3530390.3532726"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11227-021-03854-w"},{"key":"e_1_3_2_1_4_1","volume-title":"Criticality-aware priority to accelerate GPU memory access. The Journal of Supercomputing","author":"Bitalebi Hossein","year":"2022","unstructured":"Hossein Bitalebi and Farshad Safaei. 2022. Criticality-aware priority to accelerate GPU memory access. The Journal of Supercomputing (2022), 1\u201326."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2009.5306797"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3508036"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11227-019-03091-2"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2018.00040"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3330345.3330390"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1555754.1555775"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2896377.2901468"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2018.00073"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2018.00041"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3093337.3037709"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2021.06.021"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3326124"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394885.3431535"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2022.3144614"},{"key":"e_1_3_2_1_19_1","volume-title":"Euromicro Conference on ECRTS, Vol.\u00a023","author":"Pujol Roger","year":"2019","unstructured":"Roger Pujol, Hamid Tabani, Leonidas Kosmidis, Enrico Mezzetti, Jaume Abella\u00a0Ferrer, and Francisco\u00a0J Cazorla. 2019. Generating and exploiting deep learning variants to increase heterogeneous resource utilization in the nvidia xavier. In Euromicro Conference on ECRTS, Vol.\u00a023."},{"key":"e_1_3_2_1_20_1","volume-title":"Automation & Test in Europe Conference & Exhibition (DATE). IEEE, 1275\u20131280","author":"Singh Jayati","year":"2022","unstructured":"Jayati Singh, Ignacio\u00a0Sa\u00f1udo Olmedo, Nicola Capodieci, Andrea Marongiu, and Marco Caccamo. 2022. Reconciling QoS and concurrency in NVIDIA GPUs via warp-level scheduling. In Design, Automation & Test in Europe Conference & Exhibition (DATE). IEEE, 1275\u20131280."},{"key":"e_1_3_2_1_21_1","first-page":"27","article-title":"Parboil: A revised benchmark suite for scientific and commercial throughput computing","volume":"127","author":"Stratton A","year":"2012","unstructured":"John\u00a0A Stratton, Christopher Rodrigues, I-Jui Sung, Nady Obeid, Li-Wen Chang, Nasser Anssari, Geng\u00a0Daniel Liu, and Wen-mei\u00a0W Hwu. 2012. Parboil: A revised benchmark suite for scientific and commercial throughput computing. Center for Reliable and High-Performance Computing 127 (2012), 27.","journal-title":"Center for Reliable and High-Performance Computing"},{"key":"e_1_3_2_1_22_1","first-page":"1","article-title":"Paver: Locality graph-based thread block scheduling for gpus","volume":"18","author":"Tripathy Devashree","year":"2021","unstructured":"Devashree Tripathy, Amirali Abdolrashidi, Laxmi\u00a0Narayan Bhuyan, Liang Zhou, and Daniel Wong. 2021. Paver: Locality graph-based thread block scheduling for gpus. ACM Transactions on TACO 18, 3 (2021), 1\u201326.","journal-title":"ACM Transactions on TACO"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/NAS51552.2021.9605411"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2016.7783718"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3007787.3001201"},{"key":"e_1_3_2_1_26_1","first-page":"1","article-title":"Improving thread-level parallelism in GPUs through expanding register file to scratchpad memory","volume":"15","author":"Yu Chao","year":"2018","unstructured":"Chao Yu, Yuebin Bai, Qingxiao Sun, and Hailong Yang. 2018. Improving thread-level parallelism in GPUs through expanding register file to scratchpad memory. ACM Transactions on TACO 15, 4 (2018), 1\u201324.","journal-title":"ACM Transactions on TACO"},{"volume-title":"Coordinated page prefetch and eviction for memory oversubscription management in gpus","author":"Yu Qi","key":"e_1_3_2_1_27_1","unstructured":"Qi Yu, Bruce Childers, Libo Huang, Cheng Qian, Hui Guo, and Zhiying Wang. 2020. Coordinated page prefetch and eviction for memory oversubscription management in gpus. In IEEE IPDPS. IEEE, 472\u2013482."},{"key":"e_1_3_2_1_28_1","volume-title":"Cuda c\/c++ basics. NVIDIA Coporation","author":"Zeller Cyril","year":"2011","unstructured":"Cyril Zeller. 2011. Cuda c\/c++ basics. NVIDIA Coporation (2011)."}],"event":{"name":"GPGPU '23: 15th Workshop on General Purpose Processing Using GPU","acronym":"GPGPU '23","location":"Montreal Canada"},"container-title":["Proceedings of the 15th Workshop on General Purpose Processing Using GPU"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3589236.3589243","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3589236.3589243","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:48:53Z","timestamp":1750182533000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3589236.3589243"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,25]]},"references-count":28,"alternative-id":["10.1145\/3589236.3589243","10.1145\/3589236"],"URL":"https:\/\/doi.org\/10.1145\/3589236.3589243","relation":{},"subject":[],"published":{"date-parts":[[2023,2,25]]},"assertion":[{"value":"2023-06-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}