{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,22]],"date-time":"2026-04-22T13:40:26Z","timestamp":1776865226775,"version":"3.51.2"},"publisher-location":"New York, NY, USA","reference-count":90,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,6,11]],"date-time":"2022-06-11T00:00:00Z","timestamp":1654905600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"National Research Foundation of Korea (NRF)","award":["NRF-2021R1A2C2091753, NRF-2020M3H6A1085498"],"award-info":[{"award-number":["NRF-2021R1A2C2091753, NRF-2020M3H6A1085498"]}]},{"name":"Samsung Electronics Co., Ltd","award":["IO201210-07974-01"],"award-info":[{"award-number":["IO201210-07974-01"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,6,18]]},"DOI":"10.1145\/3470496.3527391","type":"proceedings-article","created":{"date-parts":[[2022,5,31]],"date-time":"2022-05-31T19:06:01Z","timestamp":1654023961000},"page":"932-945","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":54,"title":["SmartSAGE"],"prefix":"10.1145","author":[{"given":"Yunjae","family":"Lee","sequence":"first","affiliation":[{"name":"KAIST"}]},{"given":"Jinha","family":"Chung","sequence":"additional","affiliation":[{"name":"KAIST"}]},{"given":"Minsoo","family":"Rhu","sequence":"additional","affiliation":[{"name":"KAIST"}]}],"member":"320","published-online":{"date-parts":[[2022,6,11]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/291069.291026"},{"key":"e_1_3_2_1_2_1","volume-title":"Compute Caches. In Proceedings of the International Symposium on High-Performance Computer Architecture (HPCA).","author":"Aga Shaizeen","year":"2017","unstructured":"Shaizeen Aga , Supreet Jeloka , Arun Subramaniyan , Satish Narayanasamy , David Blaauw , and Reetuparna Das . 2017 . Compute Caches. In Proceedings of the International Symposium on High-Performance Computer Architecture (HPCA). Shaizeen Aga, Supreet Jeloka, Arun Subramaniyan, Satish Narayanasamy, David Blaauw, and Reetuparna Das. 2017. Compute Caches. In Proceedings of the International Symposium on High-Performance Computer Architecture (HPCA)."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2018.00070"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2016.7783753"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/2505515.2507847"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/2959100.2959180"},{"key":"e_1_3_2_1_7_1","volume-title":"Scalable Realistic Recommendation Datasets through Fractal Expansions. arXiv preprint arXiv:1901.08910","author":"Belletti Francois","year":"2019","unstructured":"Francois Belletti , Karthik Lakshmanan , Walid Krichene , Yi-Fan Chen , and John Anderson . 2019. Scalable Realistic Recommendation Datasets through Fractal Expansions. arXiv preprint arXiv:1901.08910 ( 2019 ). Francois Belletti, Karthik Lakshmanan, Walid Krichene, Yi-Fan Chen, and John Anderson. 2019. Scalable Realistic Recommendation Datasets through Fractal Expansions. arXiv preprint arXiv:1901.08910 (2019)."},{"key":"e_1_3_2_1_8_1","volume-title":"Proceedings of the International Solid State Circuits Conference (ISSCC).","author":"Cheong Wooseong","year":"2018","unstructured":"Wooseong Cheong , Chanho Yoon , Seonghoon Woo , Kyuwook Han , Daehyun Kim , Chulseung Lee , Youra Choi , Shine Kim , Dongku Kang , Geunyeong Yu , Jaehong Kim , Jaechun Park , Ki-Whan Song , Ki-Tae Park , Sangyeun Cho , Hwaseok Oh , Daniel DG Lee , Jin-Hyeok Choi , and Jaeheon Jeong . 2018 . A Flash Memory Controller for 15us Ultra-Low-Latency SSD Using High-Speed 3D NAND Flash with 3us Read Time . In Proceedings of the International Solid State Circuits Conference (ISSCC). Wooseong Cheong, Chanho Yoon, Seonghoon Woo, Kyuwook Han, Daehyun Kim, Chulseung Lee, Youra Choi, Shine Kim, Dongku Kang, Geunyeong Yu, Jaehong Kim, Jaechun Park, Ki-Whan Song, Ki-Tae Park, Sangyeun Cho, Hwaseok Oh, Daniel DG Lee, Jin-Hyeok Choi, and Jaeheon Jeong. 2018. A Flash Memory Controller for 15us Ultra-Low-Latency SSD Using High-Speed 3D NAND Flash with 3us Read Time. In Proceedings of the International Solid State Circuits Conference (ISSCC)."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3007787.3001140"},{"key":"e_1_3_2_1_10_1","volume-title":"Proceedings of the 1st Workshop on Near-Data Processing.","author":"Cho Benjamin Y","year":"2013","unstructured":"Benjamin Y Cho , Won Seob Jeong , Doohwan Oh , and Won Woo Ro . 2013 . XSD: Accelerating MapReduce by Harnessing the GPU Inside an SSD . In Proceedings of the 1st Workshop on Near-Data Processing. Benjamin Y Cho, Won Seob Jeong, Doohwan Oh, and Won Woo Ro. 2013. XSD: Accelerating MapReduce by Harnessing the GPU Inside an SSD. In Proceedings of the 1st Workshop on Near-Data Processing."},{"key":"e_1_3_2_1_11_1","unstructured":"M. Cho T. Le U. Finkler H. Imai Y. Negishi T. Sekiyama S. Vinod V. Zolotov K. Kawachiya D. Kung and H. Hunter. 2018. Large Model Support for Deep Learning in Caffe and Chainer. In SysML.  M. Cho T. Le U. Finkler H. Imai Y. Negishi T. Sekiyama S. Vinod V. Zolotov K. Kawachiya D. Kung and H. Hunter. 2018. Large Model Support for Deep Learning in Caffe and Chainer. In SysML."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2464996.2465003"},{"key":"e_1_3_2_1_13_1","volume-title":"Proceedings of the International Symposium on Memory Systems (MEMSYS).","author":"Stephen Choi I","year":"2015","unstructured":"I Stephen Choi and Yang-Suk Kee . 2015 . Energy Efficient Scale-In Clusters with In-Storage Processing for Big-Data Analytics . In Proceedings of the International Symposium on Memory Systems (MEMSYS). I Stephen Choi and Yang-Suk Kee. 2015. Energy Efficient Scale-In Clusters with In-Storage Processing for Big-Data Analytics. In Proceedings of the International Symposium on Memory Systems (MEMSYS)."},{"key":"e_1_3_2_1_14_1","unstructured":"Hanjun Dai Zornitsa Kozareva Bo Dai Alex Smola and Le Song. 2018. Learning Steady-States of Iterative Algorithms over Graphs. In ICML. 1114--1122.  Hanjun Dai Zornitsa Kozareva Bo Dai Alex Smola and Le Song. 2018. Learning Steady-States of Iterative Algorithms over Graphs. In ICML. 1114--1122."},{"key":"e_1_3_2_1_15_1","volume-title":"Priscila M. V. Lima, Felipe M. G. Franca, and Vladimir Alves.","author":"Do Jaeyoung","year":"2020","unstructured":"Jaeyoung Do , Victor C. Ferreira , Hossein Bobarshad , Mahdi Torabzadehkashi , Siavash Rezaei , Ali Heydarigorji , Diego Souza , Brunno F. Goldstein , Leandro Santiago , Min Soo Kim , Priscila M. V. Lima, Felipe M. G. Franca, and Vladimir Alves. 2020 . Cost-Effective, Energy- Efficient , and Scalable Storage Computing for Large-Scale AI Applications. ACM Transactions on Storage ( 2020). Jaeyoung Do, Victor C. Ferreira, Hossein Bobarshad, Mahdi Torabzadehkashi, Siavash Rezaei, Ali Heydarigorji, Diego Souza, Brunno F. Goldstein, Leandro Santiago, Min Soo Kim, Priscila M. V. Lima, Felipe M. G. Franca, and Vladimir Alves. 2020. Cost-Effective, Energy-Efficient, and Scalable Storage Computing for Large-Scale AI Applications. ACM Transactions on Storage (2020)."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/2463676.2465295"},{"key":"e_1_3_2_1_17_1","volume-title":"Proceedings of the International Conference on Neural Information Processing Systems (NIPS).","author":"Duvenaud David","unstructured":"David Duvenaud , Dougal Maclaurin , Jorge Aguilera-Iparraguirre , Rafael Gomez-Bombarelli , Timothy Hirzel , Alan Aspuru-Guzik , and Ryan P. Adams . 2015. Convolutional Networks on Graphs for Learning Molecular Fingerprints . In Proceedings of the International Conference on Neural Information Processing Systems (NIPS). David Duvenaud, Dougal Maclaurin, Jorge Aguilera-Iparraguirre, Rafael Gomez-Bombarelli, Timothy Hirzel, Alan Aspuru-Guzik, and Ryan P. Adams. 2015. Convolutional Networks on Graphs for Learning Molecular Fingerprints. In Proceedings of the International Conference on Neural Information Processing Systems (NIPS)."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2018.00040"},{"key":"e_1_3_2_1_19_1","unstructured":"Eideticom 2021. NoLoad CSP. https:\/\/www.eideticom.com\/products.html  Eideticom 2021. NoLoad CSP. https:\/\/www.eideticom.com\/products.html"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2015.7056040"},{"key":"e_1_3_2_1_21_1","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR).","author":"Fey Matthias","unstructured":"Matthias Fey and Jan E. Lenssen . 2019. Fast Graph Representation Learning with PyTorch Geometric . In Proceedings of the International Conference on Learning Representations (ICLR). Matthias Fey and Jan E. Lenssen. 2019. Fast Graph Representation Learning with PyTorch Geometric. In Proceedings of the International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_1_22_1","volume-title":"Proceedings of the International Conference on Neural Information Processing Systems (NIPS).","author":"Fout Alex","year":"2017","unstructured":"Alex Fout , Jonathon Byrd , Basir Shariat , and Asa Ben-Hur . 2017 . Protein Interface Prediction Using Graph Convolutional Networks . In Proceedings of the International Conference on Neural Information Processing Systems (NIPS). Alex Fout, Jonathon Byrd, Basir Shariat, and Asa Ben-Hur. 2017. Protein Interface Prediction Using Graph Convolutional Networks. In Proceedings of the International Conference on Neural Information Processing Systems (NIPS)."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/3037697.3037702"},{"key":"e_1_3_2_1_24_1","volume-title":"Proceedings of the International Symposium on Microarchitecture (MICRO).","author":"Geng Tong","unstructured":"Tong Geng , Ang Li , Runbin Shi , Chunshu Wu , T. Wang , Yanfei Li , Pouya Haghi , Antonino Tumeo , Shuai Che , S. Reinhardt , and M. Herbordt . 2020. AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload Rebalancing . In Proceedings of the International Symposium on Microarchitecture (MICRO). Tong Geng, Ang Li, Runbin Shi, Chunshu Wu, T. Wang, Yanfei Li, Pouya Haghi, Antonino Tumeo, Shuai Che, S. Reinhardt, and M. Herbordt. 2020. AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload Rebalancing. In Proceedings of the International Symposium on Microarchitecture (MICRO)."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2016.23"},{"key":"e_1_3_2_1_26_1","volume-title":"Proceedings of the International Conference on Neural Information Processing Systems (NIPS).","author":"Hamilton William L.","year":"2017","unstructured":"William L. Hamilton , Rex Ying , and Jure Leskovec . 2017 . Inductive Representation Learning on Large Graphs . In Proceedings of the International Conference on Neural Information Processing Systems (NIPS). William L. Hamilton, Rex Ying, and Jure Leskovec. 2017. Inductive Representation Learning on Large Graphs. In Proceedings of the International Conference on Neural Information Processing Systems (NIPS)."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3352460.3358282"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3373376.3378530"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA45697.2020.00083"},{"key":"e_1_3_2_1_30_1","volume-title":"Layer-Centric Memory Reuse and Data Migration for Extreme-Scale Deep Learning on Many-Core Architectures. ACM Transactions on Architecture and Code Optimization","author":"Jin Hai","year":"2018","unstructured":"Hai Jin , Bo Liu , Wenbin Jiang , Yang Ma , Xuanhua Shi , Bingsheng He , and Shaofeng Zhao . 2018. Layer-Centric Memory Reuse and Data Migration for Extreme-Scale Deep Learning on Many-Core Architectures. ACM Transactions on Architecture and Code Optimization ( 2018 ). Hai Jin, Bo Liu, Wenbin Jiang, Yang Ma, Xuanhua Shi, Bingsheng He, and Shaofeng Zhao. 2018. Layer-Centric Memory Reuse and Data Migration for Extreme-Scale Deep Learning on Many-Core Architectures. ACM Transactions on Architecture and Code Optimization (2018)."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/2749469.2750412"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2018.00042"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSST.2013.6558444"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA45697.2020.00070"},{"key":"e_1_3_2_1_35_1","volume-title":"A Case for Intelligent Disks (IDISKs). Acm Sigmod Record","author":"Keeton Kimberly","year":"1998","unstructured":"Kimberly Keeton , David A Patterson , and Joseph M Hellerstein . 1998. A Case for Intelligent Disks (IDISKs). Acm Sigmod Record ( 1998 ). Kimberly Keeton, David A Patterson, and Joseph M Hellerstein. 1998. A Case for Intelligent Disks (IDISKs). Acm Sigmod Record (1998)."},{"key":"e_1_3_2_1_36_1","volume-title":"TRiM: Tensor Reduction in Memory","author":"Kim Byeongho","unstructured":"Byeongho Kim , Jaehyun Park , Eojin Lee , Minsoo Rhu , and Jung Ho Ahn . 2020. TRiM: Tensor Reduction in Memory . In IEEE Computer Architecture Letters . Byeongho Kim, Jaehyun Park, Eojin Lee, Minsoo Rhu, and Jung Ho Ahn. 2020. TRiM: Tensor Reduction in Memory. In IEEE Computer Architecture Letters."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3007787.3001178"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3409963.3410501"},{"key":"e_1_3_2_1_39_1","volume-title":"In-Storage Processing of Database Scans and Joins. Information Sciences","author":"Kim Sungchan","year":"2016","unstructured":"Sungchan Kim , Hyunok Oh , Chanik Park , Sangyeun Cho , Sang-Won Lee , and Bongki Moon . 2016. In-Storage Processing of Database Scans and Joins. Information Sciences ( 2016 ). Sungchan Kim, Hyunok Oh, Chanik Park, Sangyeun Cho, Sang-Won Lee, and Bongki Moon. 2016. In-Storage Processing of Database Scans and Joins. Information Sciences (2016)."},{"key":"e_1_3_2_1_40_1","volume-title":"GRIP: A Graph Neural Network Accelerator Architecture. arXiv preprint arXiv:2007.13828","author":"Kiningham Kevin","year":"2020","unstructured":"Kevin Kiningham , Christopher Re , and Philip Levis . 2020 . GRIP: A Graph Neural Network Accelerator Architecture. arXiv preprint arXiv:2007.13828 (2020). Kevin Kiningham, Christopher Re, and Philip Levis. 2020. GRIP: A Graph Neural Network Accelerator Architecture. arXiv preprint arXiv:2007.13828 (2020)."},{"key":"e_1_3_2_1_41_1","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR).","author":"Thomas","unstructured":"Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks . In Proceedings of the International Conference on Learning Representations (ICLR). Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In Proceedings of the International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_1_42_1","volume-title":"Proceedings of the International Symposium on Microarchitecture (MICRO).","author":"Koo Gunjae","unstructured":"Gunjae Koo , Kiran Kumar Matam , Te I., H. V. Krishna Giri Narra, Jing Li, Hung- Wei Tseng, Steven Swanson, and Murali Annavaram. 2017. Summarizer: Trading Communication with Computing Near Storage . In Proceedings of the International Symposium on Microarchitecture (MICRO). Gunjae Koo, Kiran Kumar Matam, Te I., H.V. Krishna Giri Narra, Jing Li, Hung- Wei Tseng, Steven Swanson, and Murali Annavaram. 2017. Summarizer: Trading Communication with Computing Near Storage. In Proceedings of the International Symposium on Microarchitecture (MICRO)."},{"key":"e_1_3_2_1_43_1","volume-title":"Rapid Prototype for Flash Storage Systems. ACM Transactions on Storage","author":"Kwak Jaewook","year":"2020","unstructured":"Jaewook Kwak , Sangjin Lee , Kibin Park , Jinwoo Jeong , and Yong Ho Song . 2020. Cosmos+ OpenSSD : Rapid Prototype for Flash Storage Systems. ACM Transactions on Storage ( 2020 ). Jaewook Kwak, Sangjin Lee, Kibin Park, Jinwoo Jeong, and Yong Ho Song. 2020. Cosmos+ OpenSSD: Rapid Prototype for Flash Storage Systems. ACM Transactions on Storage (2020)."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/3352460.3358284"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA51647.2021.00029"},{"key":"e_1_3_2_1_46_1","volume-title":"A Case for Memory-Centric HPC System Architecture for Training Deep Neural Networks","author":"Kwon Youngeun","unstructured":"Youngeun Kwon and Minsoo Rhu . 2018. A Case for Memory-Centric HPC System Architecture for Training Deep Neural Networks . In IEEE Computer Architecture Letters . Youngeun Kwon and Minsoo Rhu. 2018. A Case for Memory-Centric HPC System Architecture for Training Deep Neural Networks. In IEEE Computer Architecture Letters."},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2018.00021"},{"key":"e_1_3_2_1_48_1","volume-title":"A Disaggregated Memory System for Deep Learning","author":"Kwon Youngeun","unstructured":"Youngeun Kwon and Minsoo Rhu . 2019. A Disaggregated Memory System for Deep Learning . In IEEE Micro . Youngeun Kwon and Minsoo Rhu. 2019. A Disaggregated Memory System for Deep Learning. In IEEE Micro."},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.14778\/3137765.3137776"},{"key":"e_1_3_2_1_50_1","volume-title":"Proceedings of the International Symposium on Computer Architecture (ISCA).","author":"Lee Sukhan","year":"2021","unstructured":"Sukhan Lee , Shin-haeng Kang, Jaehoon Lee , Hyeonsu Kim , Eojin Lee , Seungwoo Seo , Hosang Yoon , Seungwon Lee , Kyounghwan Lim , Hyunsung Shin , Jinhyun Kim , O Seongil , Anand Iyer , David Wang , Kyomin Sohn , and Nam Sung Kim . 2021 . Hardware Architecture and Software Stack for PIM Based on Commercial DRAM Technology . In Proceedings of the International Symposium on Computer Architecture (ISCA). Sukhan Lee, Shin-haeng Kang, Jaehoon Lee, Hyeonsu Kim, Eojin Lee, Seungwoo Seo, Hosang Yoon, Seungwon Lee, Kyounghwan Lim, Hyunsung Shin, Jinhyun Kim, O Seongil, Anand Iyer, David Wang, Kyomin Sohn, and Nam Sung Kim. 2021. Hardware Architecture and Software Stack for PIM Based on Commercial DRAM Technology. In Proceedings of the International Symposium on Computer Architecture (ISCA)."},{"key":"e_1_3_2_1_51_1","volume-title":"Understanding the Implication of Non-Volatile Memory for Large-Scale Graph Neural Network Training","author":"Lee Yunjae","unstructured":"Yunjae Lee , Youngeun Kwon , and Minsoo Rhu . 2021. Understanding the Implication of Non-Volatile Memory for Large-Scale Graph Neural Network Training . In IEEE Computer Architecture Letters . Yunjae Lee, Youngeun Kwon, and Minsoo Rhu. 2021. Understanding the Implication of Non-Volatile Memory for Large-Scale Graph Neural Network Training. In IEEE Computer Architecture Letters."},{"key":"e_1_3_2_1_52_1","volume-title":"Sang-Hoon Kim, Jin-Soo Kim, and Seungryoul Maeng.","author":"Lee Young-Sik","year":"2016","unstructured":"Young-Sik Lee , Luis Cavazos Quero , Sang-Hoon Kim, Jin-Soo Kim, and Seungryoul Maeng. 2016 . ActiveSort: Efficient External Sorting Using Active SSDs in the MapReduce Framework. Future Generation Computer Systems ( 2016). Young-Sik Lee, Luis Cavazos Quero, Sang-Hoon Kim, Jin-Soo Kim, and Seungryoul Maeng. 2016. ActiveSort: Efficient External Sorting Using Active SSDs in the MapReduce Framework. Future Generation Computer Systems (2016)."},{"key":"e_1_3_2_1_53_1","volume-title":"Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD).","author":"Leskovec J.","unstructured":"J. Leskovec , J. Kleinberg , and C. Faloutsos . 2005. Graphs over Time: Densification Laws, Shrinking Diameters and Possible Explanations . In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD). J. Leskovec, J. Kleinberg, and C. Faloutsos. 2005. Graphs over Time: Densification Laws, Shrinking Diameters and Possible Explanations. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD)."},{"key":"e_1_3_2_1_54_1","volume-title":"Proceedings of USENIX Conference on Annual Technical Conference (ATC).","author":"Li Cangyuan","year":"2021","unstructured":"Cangyuan Li , Ying Wang , Cheng Liu , Shengwen Liang , Huawei Li , and Xiaowei Li . 2021 . GLIST: Towards In-Storage Graph Learning . In Proceedings of USENIX Conference on Annual Technical Conference (ATC). Cangyuan Li, Ying Wang, Cheng Liu, Shengwen Liang, Huawei Li, and Xiaowei Li. 2021. GLIST: Towards In-Storage Graph Learning. In Proceedings of USENIX Conference on Annual Technical Conference (ATC)."},{"key":"e_1_3_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA51647.2021.00070"},{"key":"e_1_3_2_1_56_1","volume-title":"EnGN: A High-Throughput and Energy-Efficient Accelerator for Large Graph Neural Networks","author":"Liang Shengwen","year":"2020","unstructured":"Shengwen Liang , Ying Wang , Cheng Liu , Lei He , Huawei Li , Dawen Xu , and Xiao-Wei Li. 2020. EnGN: A High-Throughput and Energy-Efficient Accelerator for Large Graph Neural Networks . IEEE Trans. Comput . ( 2020 ). Shengwen Liang, Ying Wang, Cheng Liu, Lei He, Huawei Li, Dawen Xu, and Xiao-Wei Li. 2020. EnGN: A High-Throughput and Energy-Efficient Accelerator for Large Graph Neural Networks. IEEE Trans. Comput. (2020)."},{"key":"e_1_3_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/3419111.3421281"},{"key":"e_1_3_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/3307650.3322275"},{"key":"e_1_3_2_1_59_1","volume-title":"Proceedings of USENIX Symposium on Operating Systems Design and Implementation (OSDI).","author":"Mohoney Jason","year":"2021","unstructured":"Jason Mohoney , Roger Waleffe , Henry Xu , Theodoros Rekatsinas , and Shivaram Venkataraman . 2021 . Marius: Learning Massive Graph Embeddings on a Single Machine . In Proceedings of USENIX Symposium on Operating Systems Design and Implementation (OSDI). Jason Mohoney, Roger Waleffe, Henry Xu, Theodoros Rekatsinas, and Shivaram Venkataraman. 2021. Marius: Learning Massive Graph Embeddings on a Single Machine. In Proceedings of USENIX Symposium on Operating Systems Design and Implementation (OSDI)."},{"key":"e_1_3_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/3466752.3480080"},{"key":"e_1_3_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/3373376.3378505"},{"key":"e_1_3_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA51647.2021.00057"},{"key":"e_1_3_2_1_63_1","volume-title":"Proceedings of the International Symposium on Microarchitecture (MICRO).","author":"Rhu Minsoo","unstructured":"Minsoo Rhu , Natalia Gimelshein , Jason Clemons , Arslan Zulfiqar , and Stephen W. Keckler . 2016. vDNN: Virtualized Deep Neural Networks for Scalable, Memory-Efficient Neural Network Design . In Proceedings of the International Symposium on Microarchitecture (MICRO). Minsoo Rhu, Natalia Gimelshein, Jason Clemons, Arslan Zulfiqar, and Stephen W. Keckler. 2016. vDNN: Virtualized Deep Neural Networks for Scalable, Memory-Efficient Neural Network Design. In Proceedings of the International Symposium on Microarchitecture (MICRO)."},{"key":"e_1_3_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2018.00017"},{"key":"e_1_3_2_1_65_1","volume-title":"Proceedings of Conference on Very Large Databases (VLDB).","author":"Riedel Erik","year":"1998","unstructured":"Erik Riedel , Garth Gibson , and Christos Faloutsos . 1998 . Active Storage for Large-Scale Data Mining and Multimedia Applications . In Proceedings of Conference on Very Large Databases (VLDB). Erik Riedel, Garth Gibson, and Christos Faloutsos. 1998. Active Storage for Large-Scale Data Mining and Multimedia Applications. In Proceedings of Conference on Very Large Databases (VLDB)."},{"key":"e_1_3_2_1_66_1","unstructured":"Samsung 2021. SmartSSD. https:\/\/www.xilinx.com\/applications\/data-center\/computational-storage\/smartssd.html  Samsung 2021. SmartSSD. https:\/\/www.xilinx.com\/applications\/data-center\/computational-storage\/smartssd.html"},{"key":"e_1_3_2_1_67_1","volume-title":"Proceedings of USENIX Symposium on Operating Systems Design and Implementation (OSDI).","author":"Seshadri Sudharsan","year":"2014","unstructured":"Sudharsan Seshadri , Mark Gahagan , Sundaram Bhaskaran , Trevor Bunker , Arup De , Yanqin Jin , Yang Liu , and Steven Swanson . 2014 . Willow: A User-Programmable SSD . In Proceedings of USENIX Symposium on Operating Systems Design and Implementation (OSDI). Sudharsan Seshadri, Mark Gahagan, Sundaram Bhaskaran, Trevor Bunker, Arup De, Yanqin Jin, Yang Liu, and Steven Swanson. 2014. Willow: A User-Programmable SSD. In Proceedings of USENIX Symposium on Operating Systems Design and Implementation (OSDI)."},{"key":"e_1_3_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1145\/3007787.3001139"},{"key":"e_1_3_2_1_69_1","volume-title":"Cambricon-G: A Polyvalent Energy-Efficient Accelerator for Dynamic Graph Neural Networks","author":"Song Xinkai","year":"2021","unstructured":"Xinkai Song , Tian Zhi , Zhe Fan , Zhenxing Zhang , Xi Zeng , Wei Li , Xing Hu , Zidong Du , Qi Guo , and Yunji Chen . 2021. Cambricon-G: A Polyvalent Energy-Efficient Accelerator for Dynamic Graph Neural Networks . IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems ( 2021 ). Xinkai Song, Tian Zhi, Zhe Fan, Zhenxing Zhang, Xi Zeng, Wei Li, Xing Hu, Zidong Du, Qi Guo, and Yunji Chen. 2021. Cambricon-G: A Polyvalent Energy-Efficient Accelerator for Dynamic Graph Neural Networks. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (2021)."},{"key":"e_1_3_2_1_70_1","volume-title":"Proceedings of MLSys Workshop on Graph Neural Networks and Systems (GNNSys).","author":"Su Qidong","year":"2021","unstructured":"Qidong Su , Minjie Wang , Da Zheng , and Zheng Zhang . 2021 . Adaptive Load Balancing for Parallel GNN Training . In Proceedings of MLSys Workshop on Graph Neural Networks and Systems (GNNSys). Qidong Su, Minjie Wang, Da Zheng, and Zheng Zhang. 2021. Adaptive Load Balancing for Parallel GNN Training. In Proceedings of MLSys Workshop on Graph Neural Networks and Systems (GNNSys)."},{"key":"e_1_3_2_1_71_1","unstructured":"NGD Systems. 2021. Newport Platform. https:\/\/www.ngdsystems.com\/solutions#NewportSection  NGD Systems. 2021. Newport Platform. https:\/\/www.ngdsystems.com\/solutions#NewportSection"},{"key":"e_1_3_2_1_72_1","volume-title":"Proceedings of USENIX Conference on File and Storage Technologies (FAST).","author":"Tiwari Devesh","year":"2013","unstructured":"Devesh Tiwari , Simona Boboila , Sudharshan Vazhkudai , Youngjae Kim , Xiaosong Ma , Peter Desnoyers , and Yan Solihin . 2013 . Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines . In Proceedings of USENIX Conference on File and Storage Technologies (FAST). Devesh Tiwari, Simona Boboila, Sudharshan Vazhkudai, Youngjae Kim, Xiaosong Ma, Peter Desnoyers, and Yan Solihin. 2013. Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines. In Proceedings of USENIX Conference on File and Storage Technologies (FAST)."},{"key":"e_1_3_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1109\/EMPDP.2019.8671589"},{"key":"e_1_3_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.1145\/3007787.3001143"},{"key":"e_1_3_2_1_75_1","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR).","author":"Velickovic Petar","year":"2018","unstructured":"Petar Velickovic , Guillem Cucurull , Arantxa Casanova , Adriana Romero , Pietro Lio , and Yoshua Bengio . 2018 . Graph Attention Networks . In Proceedings of the International Conference on Learning Representations (ICLR). Petar Velickovic, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Lio, and Yoshua Bengio. 2018. Graph Attention Networks. In Proceedings of the International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1145\/2933349.2933353"},{"key":"e_1_3_2_1_77_1","doi-asserted-by":"publisher","DOI":"10.1145\/3178487.3178491"},{"key":"e_1_3_2_1_78_1","volume-title":"Highly-Performant Package for Graph Neural Networks. arXiv preprint arXiv:1909.01315","author":"Wang Minjie","year":"2019","unstructured":"Minjie Wang , Da Zheng , Zihao Ye , Quan Gan , Mufei Li , Xiang Song , Jinjing Zhou , Chao Ma , Lingfan Yu , Yu Gai , Tianjun Xiao , Tong He , George Karypis , Jinyang Li , and Zheng Zhang . 2019. Deep Graph Library: A Graph-Centric , Highly-Performant Package for Graph Neural Networks. arXiv preprint arXiv:1909.01315 ( 2019 ). Minjie Wang, Da Zheng, Zihao Ye, Quan Gan, Mufei Li, Xiang Song, Jinjing Zhou, Chao Ma, Lingfan Yu, Yu Gai, Tianjun Xiao, Tong He, George Karypis, Jinyang Li, and Zheng Zhang. 2019. Deep Graph Library: A Graph-Centric, Highly-Performant Package for Graph Neural Networks. arXiv preprint arXiv:1909.01315 (2019)."},{"key":"e_1_3_2_1_79_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2019.00029"},{"key":"e_1_3_2_1_80_1","volume-title":"Proceedings of USENIX Symposium on Operating Systems Design and Implementation (OSDI).","author":"Wang Yuke","year":"2021","unstructured":"Yuke Wang , Boyuan Feng , Gushu Li , Shuangchen Li , Lei Deng , Yuan Xie , and Yufei Ding . 2021 . GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs . In Proceedings of USENIX Symposium on Operating Systems Design and Implementation (OSDI). Yuke Wang, Boyuan Feng, Gushu Li, Shuangchen Li, Lei Deng, Yuan Xie, and Yufei Ding. 2021. GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs. In Proceedings of USENIX Symposium on Operating Systems Design and Implementation (OSDI)."},{"key":"e_1_3_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1145\/3445814.3446763"},{"key":"e_1_3_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.14778\/2732967.2732972"},{"key":"e_1_3_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO50266.2020.00041"},{"key":"e_1_3_2_1_84_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA47549.2020.00012"},{"key":"e_1_3_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1145\/3219819.3219890"},{"key":"e_1_3_2_1_86_1","volume-title":"Proceedings of the International Conference on Neural Information Processing Systems (NIPS).","author":"Ying Rex","year":"2018","unstructured":"Rex Ying , Jiaxuan You , Christopher Morris , Xiang Ren , William L. Hamilton , and Jure Leskovec . 2018 . Hierarchical Graph Representation Learning with Differentiable Pooling . In Proceedings of the International Conference on Neural Information Processing Systems (NIPS). Rex Ying, Jiaxuan You, Christopher Morris, Xiang Ren, William L. Hamilton, and Jure Leskovec. 2018. Hierarchical Graph Representation Learning with Differentiable Pooling. In Proceedings of the International Conference on Neural Information Processing Systems (NIPS)."},{"key":"e_1_3_2_1_87_1","doi-asserted-by":"publisher","DOI":"10.1145\/3373087.3375312"},{"key":"e_1_3_2_1_88_1","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR).","author":"Zeng Hanqing","year":"2020","unstructured":"Hanqing Zeng , Hongkuan Zhou , Ajitesh Srivastava , Rajgopal Kannan , and Viktor Prasanna . 2020 . GraphSAINT: Graph Sampling Based Inductive Learning Method . In Proceedings of the International Conference on Learning Representations (ICLR). Hanqing Zeng, Hongkuan Zhou, Ajitesh Srivastava, Rajgopal Kannan, and Viktor Prasanna. 2020. GraphSAINT: Graph Sampling Based Inductive Learning Method. In Proceedings of the International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_1_89_1","volume-title":"DistDGL: Distributed Graph Neural Network Training for Billion-Scale Graphs. arXiv preprint arXiv:2010.05337","author":"Zheng Da","year":"2021","unstructured":"Da Zheng , Chao Ma , Minjie Wang , Jinjing Zhou , Qidong Su , Xiang Song , Quan Gan , Zheng Zhang , and George Karypis . 2021. DistDGL: Distributed Graph Neural Network Training for Billion-Scale Graphs. arXiv preprint arXiv:2010.05337 ( 2021 ). Da Zheng, Chao Ma, Minjie Wang, Jinjing Zhou, Qidong Su, Xiang Song, Quan Gan, Zheng Zhang, and George Karypis. 2021. DistDGL: Distributed Graph Neural Network Training for Billion-Scale Graphs. arXiv preprint arXiv:2010.05337 (2021)."},{"key":"e_1_3_2_1_90_1","doi-asserted-by":"publisher","DOI":"10.14778\/3352063.3352127"}],"event":{"name":"ISCA '22: The 49th Annual International Symposium on Computer Architecture","location":"New York New York","acronym":"ISCA '22","sponsor":["SIGARCH ACM Special Interest Group on Computer Architecture","IEEE CS TCAA IEEE CS technical committee on architectural acoustics"]},"container-title":["Proceedings of the 49th Annual International Symposium on Computer Architecture"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3470496.3527391","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3470496.3527391","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:30:28Z","timestamp":1750188628000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3470496.3527391"}},"subtitle":["training large-scale graph neural networks using in-storage processing architectures"],"short-title":[],"issued":{"date-parts":[[2022,6,11]]},"references-count":90,"alternative-id":["10.1145\/3470496.3527391","10.1145\/3470496"],"URL":"https:\/\/doi.org\/10.1145\/3470496.3527391","relation":{},"subject":[],"published":{"date-parts":[[2022,6,11]]},"assertion":[{"value":"2022-06-11","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}