{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,21]],"date-time":"2026-04-21T12:30:20Z","timestamp":1776774620125,"version":"3.51.2"},"reference-count":41,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2019,11,18]],"date-time":"2019-11-18T00:00:00Z","timestamp":1574035200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Archit. Code Optim."],"published-print":{"date-parts":[[2019,12,31]]},"abstract":"<jats:p>To prevent information leakage during program execution, modern software cryptographic implementations target constant-time function, where the number of instructions executed remains the same when program inputs change. However, the underlying microarchitecture behaves differently when processing different data inputs, impacting the execution time of the same instructions. These differences in execution time can covertly leak confidential information through a timing channel.<\/jats:p>\n          <jats:p>Given the recent reports of covert channels present on commercial microprocessors, a number of microarchitectural features on CPUs have been re-examined from a timing leakage perspective. Unfortunately, a similar microarchitectural evaluation of the potential attack surfaces on GPUs has not been adequately performed. Several prior work has considered a timing channel based on the behavior of a GPU\u2019s coalescing unit. In this article, we identify a second finer-grained microarchitectural timing channel, related to the banking structure of the GPU\u2019s Shared Memory. By considering the timing channel caused by Shared Memory bank conflicts, we have developed a differential timing attack that can compromise table-based cryptographic algorithms. We implement our timing attack on an Nvidia Kepler K40 GPU and successfully recover the complete 128-bit encryption key of an Advanced Encryption Standard (AES) GPU implementation using 900,000 timing samples. We also evaluate the scalability of our attack method by attacking an implementation of the AES encryption algorithm that fully occupies the compute resources of the GPU. We extend our timing analysis onto other Nvidia architectures: Maxwell, Pascal, Volta, and Turing GPUs. We also discuss countermeasures and experiment with a novel multi-key implementation, evaluating its resistance to our side-channel timing attack and its associated performance overhead.<\/jats:p>","DOI":"10.1145\/3361870","type":"journal-article","created":{"date-parts":[[2019,11,18]],"date-time":"2019-11-18T13:01:53Z","timestamp":1574082113000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":16,"title":["Exploiting Bank Conflict-based Side-channel Timing Leakage of GPUs"],"prefix":"10.1145","volume":"16","author":[{"given":"Zhen Hang","family":"Jiang","sequence":"first","affiliation":[{"name":"Facebook, Seattle,WA, USA"}]},{"given":"Yunsi","family":"Fei","sequence":"additional","affiliation":[{"name":"Electrical 8 Computer Engineering Department, Northeastern University, Boston, MA, USA"}]},{"given":"David","family":"Kaeli","sequence":"additional","affiliation":[{"name":"Electrical 8 Computer Engineering Department, Northeastern University, Boston, MA, USA"}]}],"member":"320","published-online":{"date-parts":[[2019,11,18]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the 2017 Computing Conference. IEEE, 1077--1085","author":"Abdelrahman Ahmed A."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3196494.3196501"},{"key":"e_1_2_1_3_1","volume-title":"Proceedings of the 2010 IEEE International Symposium on Parallel Distributed Processing, Workshops and Phd Forum (IPDPSW\u201910)","author":"Mei Chonglei","year":"2010"},{"key":"e_1_2_1_4_1","first-page":"35","article-title":"Non-monopolizable caches: Low-complexity mitigation of cache side channel attacks","volume":"8","author":"Domnitser Leonid","year":"2012","journal-title":"ACM Trans. Arch. Code Optimiz."},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/DAC.2014.6881536"},{"key":"e_1_2_1_6_1","doi-asserted-by":"crossref","unstructured":"Benedict Gaster Lee Howes David R. Kaeli Perhaad Mistry and Dana Schaa. 2012. Heterogeneous Computing with OpenCL: Revised OpenCL 1. Newnes.  Benedict Gaster Lee Howes David R. Kaeli Perhaad Mistry and Dana Schaa. 2012. Heterogeneous Computing with OpenCL: Revised OpenCL 1. Newnes.","DOI":"10.1016\/B978-0-12-387766-6.00024-4"},{"key":"e_1_2_1_7_1","doi-asserted-by":"crossref","unstructured":"B. Gierlichs L. Batina P. Tuyls and B. Preneel. 2008. Mutual information analysis. In Cryptographic Hardware and Embedded Systems (CHES\u201908). Springer Berlin Heidelberg 426\u2013442.  B. Gierlichs L. Batina P. Tuyls and B. Preneel. 2008. Mutual information analysis. In Cryptographic Hardware and Embedded Systems (CHES\u201908). Springer Berlin Heidelberg 426\u2013442.","DOI":"10.1007\/978-3-540-85053-3_27"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-33383-5_21"},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of the USENIX Security Symposium.","author":"Gruss Daniel","year":"2017"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-40667-1_14"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-21476-4_8"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCAD.2017.8203771"},{"key":"e_1_2_1_13_1","volume-title":"Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA\u201916)","author":"Jiang Z. H.","year":"2016"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3060403.3060462"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2018.00023"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCD.2018.00020"},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of the USENIX Security Symposium. 189--204","author":"Kim Taesoo","year":"2012"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2018.00083"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-48405-1_25"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2009.4798277"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCC.2012.119"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3300053.3319415"},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of the IEEE International Symposium on High Performance Computer Architecture. IEEE, 406--418","author":"Liu Fangfei"},{"key":"e_1_2_1_24_1","volume-title":"Proceedings of the 47th Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO\u201914)","author":"Liu Fangfei"},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the IEEE Symposium on Security 8 Privacy.","author":"Liu Fangfei"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240765.3240812"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCD.2015.7357115"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123939.3124538"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/3243734.3243831"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.15803\/ijnc.2.2_251"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1587\/transinf.E97.D.1506"},{"key":"e_1_2_1_32_1","unstructured":"Nvidia. 2015. Nvidia CUDA Toolkit v7.0 Documentation. Retrieved from http:\/\/docs.nvidia.com\/cuda\/index.html.  Nvidia. 2015. Nvidia CUDA Toolkit v7.0 Documentation. Retrieved from http:\/\/docs.nvidia.com\/cuda\/index.html."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-13858-4_5"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/1655008.1655019"},{"key":"e_1_2_1_35_1","doi-asserted-by":"crossref","unstructured":"Matthieu Rivain. 2009. On the exact success rate of side channel analysis in the Gaussian model. In Selected Areas in Cryptography. 165--183.  Matthieu Rivain. 2009. On the exact success rate of side channel analysis in the Gaussian model. In Selected Areas in Cryptography. 165--183.","DOI":"10.1007\/978-3-642-04159-4_11"},{"key":"e_1_2_1_36_1","volume-title":"Proceedings of the International Workshop on Fast Software Encryption. Springer, 191--204","author":"Schneier Bruce","year":"1993"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00145-009-9049-y"},{"key":"e_1_2_1_38_1","volume-title":"Lee","author":"Wang Zhenghong","year":"2007"},{"key":"e_1_2_1_39_1","volume-title":"Proceedings of the USENIX Security Symposium. 719--732","author":"Yarom Yuval","year":"2014"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1007\/s13389-017-0152-y"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/2976749.2978324"}],"container-title":["ACM Transactions on Architecture and Code Optimization"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3361870","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3361870","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:23:10Z","timestamp":1750202590000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3361870"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,11,18]]},"references-count":41,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2019,12,31]]}},"alternative-id":["10.1145\/3361870"],"URL":"https:\/\/doi.org\/10.1145\/3361870","relation":{},"ISSN":["1544-3566","1544-3973"],"issn-type":[{"value":"1544-3566","type":"print"},{"value":"1544-3973","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,11,18]]},"assertion":[{"value":"2019-02-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-09-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-11-18","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}