{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T13:14:32Z","timestamp":1775913272376,"version":"3.50.1"},"reference-count":77,"publisher":"Association for Computing Machinery (ACM)","issue":"6","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2024,2]]},"abstract":"<jats:p>Many Graph Neural Network (GNN) training systems have emerged recently to support efficient GNN training. Since GNNs embody complex data dependencies between training samples, the training of GNNs should address distinct challenges different from DNN training in data management, such as data partitioning, batch preparation for mini-batch training, and data transferring between CPUs and GPUs. These factors, which take up a large proportion of training time, make data management in GNN training more significant. This paper reviews GNN training from a data management perspective and provides a comprehensive analysis and evaluation of the representative approaches. We conduct extensive experiments on various benchmark datasets and show many interesting and valuable results. We also provide some practical tips learned from these experiments, which are helpful for designing GNN training systems in the future.<\/jats:p>","DOI":"10.14778\/3648160.3648167","type":"journal-article","created":{"date-parts":[[2024,5,3]],"date-time":"2024-05-03T21:52:53Z","timestamp":1714773173000},"page":"1241-1254","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":8,"title":["Comprehensive Evaluation of GNN Training Systems: A Data Management Perspective"],"prefix":"10.14778","volume":"17","author":[{"given":"Hao","family":"Yuan","sequence":"first","affiliation":[{"name":"Northeastern University, China"}]},{"given":"Yajiong","family":"Liu","sequence":"additional","affiliation":[{"name":"Northeastern University, China"}]},{"given":"Yanfeng","family":"Zhang","sequence":"additional","affiliation":[{"name":"Northeastern University, China"}]},{"given":"Xin","family":"Ai","sequence":"additional","affiliation":[{"name":"Northeastern University, China"}]},{"given":"Qiange","family":"Wang","sequence":"additional","affiliation":[{"name":"National University of Singapore, Singapore"}]},{"given":"Chaoyi","family":"Chen","sequence":"additional","affiliation":[{"name":"Northeastern University, China"}]},{"given":"Yu","family":"Gu","sequence":"additional","affiliation":[{"name":"Northeastern University, China"}]},{"given":"Ge","family":"Yu","sequence":"additional","affiliation":[{"name":"Northeastern University, China"}]}],"member":"320","published-online":{"date-parts":[[2024,5,3]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the 36th International Conference on Machine Learning, ICML'19","author":"Abu-El-Haija Sami","year":"2019","unstructured":"Sami Abu-El-Haija, Bryan Perozzi, Amol Kapoor, Nazanin Alipourfard, Kristina Lerman, Hrayr Harutyunyan, Greg Ver Steeg, and Aram Galstyan. 2019. MixHop: Higher-Order Graph Convolutional Architectures via Sparsified Neighborhood Mixing. In Proceedings of the 36th International Conference on Machine Learning, ICML'19, Long Beach, California, USA. 21--29."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0169-7552(98)00110-X"},{"key":"e_1_2_1_3_1","volume-title":"Sixteenth European Conference on Computer Systems, EuroSys'21","author":"Cai Zhenkun","year":"2021","unstructured":"Zhenkun Cai, Xiao Yan, Yidi Wu, Kaihao Ma, James Cheng, and Fan Yu. 2021. DGCL: an efficient communication library for distributed GNN training. In Sixteenth European Conference on Computer Systems, EuroSys'21, Online Event, United Kingdom. 130--144."},{"key":"e_1_2_1_4_1","volume-title":"Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, PPoPP'23","author":"Cai Zhenkun","year":"2023","unstructured":"Zhenkun Cai, Qihui Zhou, Xiao Yan, Da Zheng, Xiang Song, Chenguang Zheng, James Cheng, and George Karypis. 2023. DSP: Efficient GNN Training with Multiple GPUs. In Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, PPoPP'23, Montreal, QC, Canada. 392--404."},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.14778\/3632093.3632108"},{"key":"e_1_2_1_6_1","volume-title":"6th International Conference on Learning Representations, ICLR'18","author":"Chen Jie","year":"2018","unstructured":"Jie Chen, Tengfei Ma, and Cao Xiao. 2018. FastGCN: Fast Learning with Graph Convolutional Networks via Importance Sampling. In 6th International Conference on Learning Representations, ICLR'18, Vancouver, BC, Canada."},{"key":"e_1_2_1_7_1","volume-title":"Proceedings of the 35th International Conference on Machine Learning, ICML'18, Stockholmsm\u00e4ssan","author":"Chen Jianfei","year":"2018","unstructured":"Jianfei Chen, Jun Zhu, and Le Song. 2018. Stochastic Training of Graph Convolutional Networks with Variance Reduction. In Proceedings of the 35th International Conference on Machine Learning, ICML'18, Stockholmsm\u00e4ssan, Stockholm, Sweden. 941--949."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330925"},{"key":"e_1_2_1_9_1","unstructured":"CUDA 2023. CUDA. https:\/\/docs.nvidia.com\/cuda\/cuda-c-programming-guide\/index.html."},{"key":"e_1_2_1_10_1","unstructured":"Euler 2019. Euler. https:\/\/github.com\/alibaba\/euler\/wiki\/System-Introduction."},{"key":"e_1_2_1_11_1","volume-title":"Fast Graph Representation Learning with PyTorch Geometric. In ICLR Workshop on Representation Learning on Graphs and Manifolds.","author":"Fey Matthias","unstructured":"Matthias Fey and Jan E. Lenssen. 2019. Fast Graph Representation Learning with PyTorch Geometric. In ICLR Workshop on Representation Learning on Graphs and Manifolds."},{"key":"e_1_2_1_12_1","volume-title":"15th USENIX Symposium on Operating Systems Design and Implementation, OSDI'21","author":"Gandhi Swapnil","year":"2021","unstructured":"Swapnil Gandhi and Anand Padmanabha Iyer. 2021. P3: Distributed Deep Graph Learning at Scale. In 15th USENIX Symposium on Operating Systems Design and Implementation, OSDI'21, 2021. 551--568."},{"key":"e_1_2_1_13_1","volume-title":"Inductive Representation Learning on Large Graphs. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems","author":"Hamilton William L.","year":"2017","unstructured":"William L. Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive Representation Learning on Large Graphs. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, NeurIPS'17 Long Beach, CA, USA. 1024--1034."},{"key":"e_1_2_1_14_1","volume-title":"Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR'16","author":"He Kaiming","year":"2016","unstructured":"Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR'16, Las Vegas, NV, USA. 770--778."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3437801.3441585"},{"key":"e_1_2_1_16_1","volume-title":"Adaptive Sampling Towards Fast Graph Representation Learning. In Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems","author":"Zhang Tong","year":"2018","unstructured":"Wen-bing Huang, Tong Zhang, Yu Rong, and Junzhou Huang. 2018. Adaptive Sampling Towards Fast Graph Representation Learning. In Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS'18, Montr\u00e9al, Canada. 4563--4572."},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of Machine Learning and Systems","author":"Jia Zhihao","year":"2020","unstructured":"Zhihao Jia, Sina Lin, Mingyu Gao, Matei Zaharia, and Alex Aiken. 2020. Improving the Accuracy, Scalability, and Performance of Graph Neural Networks with Roc. In Proceedings of Machine Learning and Systems 2020, MLSys'20, Austin, TX, USA."},{"key":"e_1_2_1_18_1","volume-title":"Communication-Efficient Graph Neural Networks with Probabilistic Neighborhood Expansion Analysis and Caching. CoRR abs\/2305.03152","author":"Kaler Tim","year":"2023","unstructured":"Tim Kaler, Alexandros-Stavros Iliopoulos, Philip Murzynowski, Tao B. Schardl, Charles E. Leiserson, and Jie Chen. 2023. Communication-Efficient Graph Neural Networks with Probabilistic Neighborhood Expansion Analysis and Caching. CoRR abs\/2305.03152 (2023)."},{"key":"e_1_2_1_19_1","volume-title":"Proceedings of Machine Learning and Systems","author":"Kaler Tim","year":"2022","unstructured":"Tim Kaler, Nickolas Stathas, Anne Ouyang, Alexandros-Stavros Iliopoulos, Tao B. Schardl, Charles E. Leiserson, and Jie Chen. 2022. Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and Pipelining. In Proceedings of Machine Learning and Systems 2022, MLSys'22, Santa Clara, CA, USA."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1137\/S1064827595287997"},{"key":"e_1_2_1_21_1","volume-title":"Semi-Supervised Classification with Graph Convolutional Networks. In 5th International Conference on Learning Representations, ICLR'17","author":"Thomas","unstructured":"Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In 5th International Conference on Learning Representations, ICLR'17, Toulon, France, Conference Track Proceedings. OpenReview.net."},{"key":"e_1_2_1_22_1","unstructured":"KONECT 2017. LiveJournal links network dataset. http:\/\/konect.unikoblenz.de\/networks\/livejournal-links."},{"key":"e_1_2_1_23_1","unstructured":"KONECT 2017. Wikipedia links English network dataset. http:\/\/konect.cc\/networks\/wikipedia_link_en."},{"key":"e_1_2_1_24_1","volume-title":"A Comprehensive Survey on Distributed Training of Graph Neural Networks. CoRR abs\/2211.05368","author":"Lin Haiyang","year":"2022","unstructured":"Haiyang Lin, Mingyu Yan, Xiaochun Ye, Dongrui Fan, Shirui Pan, Wenguang Chen, and Yuan Xie. 2022. A Comprehensive Survey on Distributed Training of Graph Neural Networks. CoRR abs\/2211.05368 (2022)."},{"key":"e_1_2_1_25_1","volume-title":"ACM Symposium on Cloud Computing, SoCC'20","author":"Lin Zhiqi","year":"2020","unstructured":"Zhiqi Lin, Cheng Li, Youshan Miao, Yunxin Liu, and Yinlong Xu. 2020. PaGraph: Scaling GNN training on large graphs via computation-aware caching. In ACM Symposium on Cloud Computing, SoCC'20, Virtual Event, USA. 401--415."},{"key":"e_1_2_1_26_1","volume-title":"20th USENIX Symposium on Networked Systems Design and Implementation, NSDI'23","author":"Liu Tianfeng","year":"2023","unstructured":"Tianfeng Liu, Yangrui Chen, Dan Li, Chuan Wu, Yibo Zhu, Jun He, Yanghua Peng, Hongzheng Chen, Hongzhi Chen, and Chuanxiong Guo. 2023. BGL: GPU-Efficient GNN Training by Optimizing Graph Data I\/O and Preprocessing. In 20th USENIX Symposium on Networked Systems Design and Implementation, NSDI'23, Boston, MA. 103--118."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/JAS.2021.1004311"},{"key":"e_1_2_1_28_1","volume-title":"NeuGraph: Parallel Deep Neural Network Computation on Large Graphs. In 2019 USENIX Annual Technical Conference, ATC'19","author":"Ma Lingxiao","year":"2019","unstructured":"Lingxiao Ma, Zhi Yang, Youshan Miao, Jilong Xue, Ming Wu, Lidong Zhou, and Yafei Dai. 2019. NeuGraph: Parallel Deep Neural Network Computation on Large Graphs. In 2019 USENIX Annual Technical Conference, ATC'19, Renton, WA, USA. 443--458."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/3458817.3480856"},{"key":"e_1_2_1_30_1","volume-title":"An Experimental Comparison of Partitioning Strategies for Distributed Graph Neural Network Training. CoRR abs\/2308.15602","author":"Merkel Nikolai","year":"2023","unstructured":"Nikolai Merkel, Daniel Stoll, Ruben Mayer, and Hans-Arno Jacobsen. 2023. An Experimental Comparison of Partitioning Strategies for Distributed Graph Neural Network Training. CoRR abs\/2308.15602 (2023)."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.14778\/3476249.3476264"},{"key":"e_1_2_1_32_1","volume-title":"Proceedings of the 7th ACM SIGCOMM Internet Measurement Conference, IMC'07","author":"Mislove Alan","year":"2007","unstructured":"Alan Mislove, Massimiliano Marcon, P. Krishna Gummadi, Peter Druschel, and Bobby Bhattacharjee. 2007. Measurement and analysis of online social networks. In Proceedings of the 7th ACM SIGCOMM Internet Measurement Conference, IMC'07, San Diego, California, USA. 29--42."},{"key":"e_1_2_1_33_1","unstructured":"NVIDIA. 2017. Mark Harris. Unified Memory for CUDA Beginners---nvidia developer blog. https:\/\/developer.nvidia.com\/blog\/unified-memory-cuda-beginners."},{"key":"e_1_2_1_34_1","unstructured":"NVIDIA. 2022. Zero Copy. https:\/\/docs.nvidia.com\/cuda\/cuda-c-best-practices-guide\/index.html#zero-copy."},{"key":"e_1_2_1_35_1","unstructured":"Open Graph Benchmark 2021. The ogbn-arxiv dataset. https:\/\/ogb.stanford.edu\/docs\/nodeprop\/#ogbn-arxiv."},{"key":"e_1_2_1_36_1","unstructured":"Open Graph Benchmark 2021. The ogbn-papers100M dataset. https:\/\/ogb.stanford.edu\/docs\/nodeprop\/#ogbn-papers100M."},{"key":"e_1_2_1_37_1","unstructured":"Open Graph Benchmark 2021. The ogbn-products dataset. https:\/\/ogb.stanford.edu\/docs\/nodeprop\/#ogbn-products."},{"key":"e_1_2_1_38_1","volume-title":"PyTorch: An Imperative Style","author":"Paszke Adam","unstructured":"Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems 32. Curran Associates, Inc., 8024--8035. http:\/\/papers.neurips.cc\/paper\/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.14778\/3538598.3538614"},{"key":"e_1_2_1_40_1","volume-title":"Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems","author":"Ramezani Morteza","year":"2020","unstructured":"Morteza Ramezani, Weilin Cong, Mehrdad Mahdavi, Anand Sivasubramaniam, and Mahmut T. Kandemir. 2020. GCN meets GPU: Decoupling \"When to Sample\" from \"How to Sample\". In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS'20, virtual, Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, and Hsuan-Tien Lin (Eds.)."},{"key":"e_1_2_1_41_1","volume-title":"Few-Shot Learning with Graph Neural Networks. In 6th International Conference on Learning Representations, ICLR'18","author":"Satorras Victor Garcia","year":"2018","unstructured":"Victor Garcia Satorras and Joan Bruna Estrach. 2018. Few-Shot Learning with Graph Neural Networks. In 6th International Conference on Learning Representations, ICLR'18, Vancouver, BC, Canada."},{"key":"e_1_2_1_42_1","volume-title":"Distributed Graph Neural Network Training: A Survey. CoRR abs\/2211.00216","author":"Shao Yingxia","year":"2022","unstructured":"Yingxia Shao, Hongzheng Li, Xizhi Gu, Hongbo Yin, Yawen Li, Xupeng Miao, Wentao Zhang, Bin Cui, and Lei Chen. 2022. Distributed Graph Neural Network Training: A Survey. CoRR abs\/2211.00216 (2022)."},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/3503221.3508435"},{"key":"e_1_2_1_44_1","volume-title":"Legion: Automatically Pushing the Envelope of Multi-GPU System for Billion-Scale GNN Training. In Annual Technical Conference, ATC'23","author":"Sun Jie","year":"2023","unstructured":"Jie Sun, Li Su, Zuocheng Shi, Wenting Shen, Zeke Wang, Lei Wang, Jie Zhang, Yong Li, Wenyuan Yu, Jingren Zhou, and Fei Wu. 2023. Legion: Automatically Pushing the Envelope of Multi-GPU System for Billion-Scale GNN Training. In Annual Technical Conference, ATC'23, Boston, MA, USA. 165--179."},{"key":"e_1_2_1_45_1","volume-title":"15th USENIX Symposium on Operating Systems Design and Implementation, OSDI'21","author":"Thorpe John","year":"2021","unstructured":"John Thorpe, Yifan Qiao, Jonathan Eyolfson, Shen Teng, Guanzhou Hu, Zhihao Jia, Jinliang Wei, Keval Vora, Ravi Netravali, Miryung Kim, and Guoqing Harry Xu. 2021. Dorylus: Affordable, Scalable, and Accurate GNN Training with Distributed CPU Servers and Serverless Threads. In 15th USENIX Symposium on Operating Systems Design and Implementation, OSDI'21. 495--514."},{"key":"e_1_2_1_46_1","volume-title":"The Evolution of Distributed Systems for Graph Neural Networks and Their Origin in Graph Processing and Deep Learning: A Survey. ACM Comput. Surv. 56, 1","author":"Vatter Jana","year":"2024","unstructured":"Jana Vatter, Ruben Mayer, and Hans-Arno Jacobsen. 2024. The Evolution of Distributed Systems for Graph Neural Networks and Their Origin in Graph Processing and Deep Learning: A Survey. ACM Comput. Surv. 56, 1 (2024), 6:1--6:37."},{"key":"e_1_2_1_47_1","volume-title":"Graph Attention Networks. In 6th International Conference on Learning Representations, ICLR'18","author":"Velickovic Petar","year":"2018","unstructured":"Petar Velickovic, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Li\u00f2, and Yoshua Bengio. 2018. Graph Attention Networks. In 6th International Conference on Learning Representations, ICLR'18, Vancouver, BC, Canada."},{"key":"e_1_2_1_48_1","volume-title":"MariusGNN: Resource-Efficient Out-of-Core Training of Graph Neural Networks. In Eighteenth European Conference on Computer Systems, EuroSys'23","author":"Waleffe Roger","year":"2023","unstructured":"Roger Waleffe, Jason Mohoney, Theodoros Rekatsinas, and Shivaram Venkataraman. 2023. MariusGNN: Resource-Efficient Out-of-Core Training of Graph Neural Networks. In Eighteenth European Conference on Computer Systems, EuroSys'23, Rome, Italy. 144--161."},{"key":"e_1_2_1_49_1","volume-title":"Proceedings of Machine Learning and Systems","author":"Wan Cheng","year":"2022","unstructured":"Cheng Wan, Youjie Li, Ang Li, Nam Sung Kim, and Yingyan Lin. 2022. BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node Sampling. In Proceedings of Machine Learning and Systems 2022, MLSys'22, Santa Clara, CA, USA."},{"key":"e_1_2_1_50_1","first-page":"2","volume-title":"Proc. ACM Manag. Data 1","author":"Wan Xinchen","year":"2023","unstructured":"Xinchen Wan, Kaiqiang Xu, Xudong Liao, Yilun Jin, Kai Chen, and Xin Jin. 2023. Scalable and Efficient Full-Graph GNN Training for Large Graphs. Proc. ACM Manag. Data 1, 2 (2023), 143:1--143:23."},{"key":"e_1_2_1_51_1","volume-title":"Sixteenth European Conference on Computer Systems, EuroSys'21","author":"Wang Lei","year":"2021","unstructured":"Lei Wang, Qiang Yin, Chao Tian, Jianbang Yang, Rong Chen, Wenyuan Yu, Zihang Yao, and Jingren Zhou. 2021. FlexGraph: a flexible and efficient distributed framework for GNN training. In Sixteenth European Conference on Computer Systems, EuroSys'21, Online Event, United Kingdom. 67--82."},{"key":"e_1_2_1_52_1","volume-title":"Highly-Performant Package for Graph Neural Networks. arXiv preprint arXiv:1909.01315","author":"Wang Minjie","year":"2019","unstructured":"Minjie Wang, Da Zheng, Zihao Ye, Quan Gan, Mufei Li, Xiang Song, Jinjing Zhou, Chao Ma, Lingfan Yu, Yu Gai, Tianjun Xiao, Tong He, George Karypis, Jinyang Li, and Zheng Zhang. 2019. Deep Graph Library: A Graph-Centric, Highly-Performant Package for Graph Neural Networks. arXiv preprint arXiv:1909.01315 (2019)."},{"key":"e_1_2_1_53_1","volume-title":"HyTGraph: GPU-Accelerated Graph Processing with Hybrid Transfer Management. In 39th IEEE International Conference on Data Engineering, ICDE'23","author":"Wang Qiange","year":"2023","unstructured":"Qiange Wang, Xin Ai, Yanfeng Zhang, Jing Chen, and Ge Yu. 2023. HyTGraph: GPU-Accelerated Graph Processing with Hybrid Transfer Management. In 39th IEEE International Conference on Data Engineering, ICDE'23, Anaheim, CA, USA. 558--571."},{"key":"e_1_2_1_54_1","volume-title":"HongTu: Scalable Full-Graph GNN Training on Multiple GPUs (via communication-optimized CPU data offloading). CoRR abs\/2311.14898","author":"Wang Qiange","year":"2023","unstructured":"Qiange Wang, Yao Chen, Weng-Fai Wong, and Bingsheng He. 2023. HongTu: Scalable Full-Graph GNN Training on Multiple GPUs (via communication-optimized CPU data offloading). CoRR abs\/2311.14898 (2023)."},{"key":"e_1_2_1_55_1","volume-title":"NeutronStar: Distributed GNN Training with Hybrid Dependency Management. In International Conference on Management of Data, SIGMOD'22","author":"Wang Qiange","year":"2022","unstructured":"Qiange Wang, Yanfeng Zhang, Hao Wang, Chaoyi Chen, Xiaodong Zhang, and Ge Yu. 2022. NeutronStar: Distributed GNN Training with Hybrid Dependency Management. In International Conference on Management of Data, SIGMOD'22, Philadelphia, PA, USA. 1301--1315."},{"key":"e_1_2_1_56_1","unstructured":"Yuke Wang Boyuan Feng Zheng Wang Tong Geng Kevin J. Barker Ang Li and Yufei Ding. 2023. MGG: Accelerating Graph Neural Networks with Fine-Grained Intra-Kernel Communication-Computation Pipelining on Multi-GPU Platforms. In 17th USENIX Symposium on Operating Systems Design and Implementation OSDI'23 Boston MA USA."},{"key":"e_1_2_1_57_1","volume-title":"Empirical analysis of performance bottlenecks in graph neural network training and inference with GPUs. Neurocomputing","author":"Wang Zhaokang","year":"2021","unstructured":"Zhaokang Wang, Yunpan Wang, Chunfeng Yuan, Rong Gu, and Yihua Huang. 2021. Empirical analysis of performance bottlenecks in graph neural network training and inference with GPUs. Neurocomputing (2021), 165--191."},{"key":"e_1_2_1_58_1","volume-title":"Collective dynamics of 'small-world'networks. nature 393, 6684","author":"Watts Duncan J","year":"1998","unstructured":"Duncan J Watts and Steven H Strogatz. 1998. Collective dynamics of 'small-world'networks. nature 393, 6684 (1998), 440--442."},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2020.2978386"},{"key":"e_1_2_1_60_1","volume-title":"SUGAR: Efficient Subgraph-level Training via Resource-aware Graph Partitioning. CoRR abs\/2202.00075","author":"Xue Zihui","year":"2022","unstructured":"Zihui Xue, Yuedong Yang, Mengtian Yang, and Radu Marculescu. 2022. SUGAR: Efficient Subgraph-level Training via Resource-aware Graph Partitioning. CoRR abs\/2202.00075 (2022)."},{"key":"e_1_2_1_61_1","volume-title":"Defining and Evaluating Network Communities Based on Ground-Truth. In 12th IEEE International Conference on Data Mining, ICDM'12","author":"Yang Jaewon","year":"2012","unstructured":"Jaewon Yang and Jure Leskovec. 2012. Defining and Evaluating Network Communities Based on Ground-Truth. In 12th IEEE International Conference on Data Mining, ICDM'12, Brussels, Belgium. 745--754."},{"key":"e_1_2_1_62_1","volume-title":"Seventeenth European Conference on Computer Systems, EuroSys'22","author":"Yang Jianbang","year":"2022","unstructured":"Jianbang Yang, Dahai Tang, Xiaoniu Song, Lei Wang, Qiang Yin, Rong Chen, Wenyuan Yu, and Jingren Zhou. 2022. GNNLab: a factored system for sample-based GNN training over GPUs. In Seventeenth European Conference on Computer Systems, EuroSys'22, Rennes, France. 417--434."},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/3575693.3575725"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1145\/3219819.3219890"},{"key":"e_1_2_1_65_1","volume-title":"Hierarchical Graph Representation Learning with Differentiable Pooling. In Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems","author":"Ying Zhitao","year":"2018","unstructured":"Zhitao Ying, Jiaxuan You, Christopher Morris, Xiang Ren, William L. Hamilton, and Jure Leskovec. 2018. Hierarchical Graph Representation Learning with Differentiable Pooling. In Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS'18, Montr\u00e9al, Canada. 4805--4815."},{"key":"e_1_2_1_66_1","volume-title":"GraphSAINT: Graph Sampling Based Inductive Learning Method. In 8th International Conference on Learning Representations, ICLR'20","author":"Zeng Hanqing","unstructured":"Hanqing Zeng, Hongkuan Zhou, Ajitesh Srivastava, Rajgopal Kannan, and Viktor K. Prasanna. 2020. GraphSAINT: Graph Sampling Based Inductive Learning Method. In 8th International Conference on Learning Representations, ICLR'20, Addis Ababa, Ethiopia."},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.14778\/3415478.3415539"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1109\/TC.2023.3305077"},{"key":"e_1_2_1_69_1","volume-title":"Link Prediction Based on Graph Neural Networks. In Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems","author":"Zhang Muhan","year":"2018","unstructured":"Muhan Zhang and Yixin Chen. 2018. Link Prediction Based on Graph Neural Networks. In Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS'18, Montr\u00e9al, Canada."},{"key":"e_1_2_1_70_1","volume-title":"The ACM Web Conference 2022, WWW'22, Virtual Event","author":"Zhang Wentao","year":"2022","unstructured":"Wentao Zhang, Yu Shen, Zheyu Lin, Yang Li, Xiaosen Li, Wen Ouyang, Yangyu Tao, Zhi Yang, and Bin Cui. 2022. PaSca: A Graph Neural Architecture Search System under the Scalable Paradigm. In The ACM Web Conference 2022, WWW'22, Virtual Event, Lyon, France."},{"key":"e_1_2_1_71_1","first-page":"2","volume-title":"Proc. ACM Manag. Data 1","author":"Zhang Xin","year":"2023","unstructured":"Xin Zhang, Yanyan Shen, Yingxia Shao, and Lei Chen. 2023. DUCATI: A DualCache Training System for Graph Neural Networks on Giant Graphs with the GPU. Proc. ACM Manag. Data 1, 2 (2023), 166:1--166:24."},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.14778\/3514061.3514069"},{"key":"e_1_2_1_73_1","volume-title":"DistDGL: Distributed Graph Neural Network Training for Billion-Scale Graphs. In 10th IEEE\/ACM Workshop on Irregular Applications: Architectures and Algorithms, IA3'20","author":"Zheng Da","year":"2020","unstructured":"Da Zheng, Chao Ma, Minjie Wang, Jinjing Zhou, Qidong Su, Xiang Song, Quan Gan, Zheng Zhang, and George Karypis. 2020. DistDGL: Distributed Graph Neural Network Training for Billion-Scale Graphs. In 10th IEEE\/ACM Workshop on Irregular Applications: Architectures and Algorithms, IA3'20, Atlanta, GA, USA. 36--44."},{"key":"e_1_2_1_74_1","volume-title":"The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD'22","author":"Zheng Da","year":"2022","unstructured":"Da Zheng, Xiang Song, Chengru Yang, Dominique LaSalle, and George Karypis. 2022. Distributed Hybrid CPU and GPU training for Graph Neural Networks on Billion-Scale Heterogeneous Graphs. In The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD'22, Washington, DC, USA. 4582--4591."},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.14778\/3352063.3352127"},{"key":"e_1_2_1_76_1","volume-title":"Gemini: A Computation-Centric Distributed Graph Processing System. In 12th USENIX Symposium on Operating Systems Design and Implementation, OSDI' 2016","author":"Zhu Xiaowei","year":"2016","unstructured":"Xiaowei Zhu, Wenguang Chen, Weimin Zheng, and Xiaosong Ma. 2016. Gemini: A Computation-Centric Distributed Graph Processing System. In 12th USENIX Symposium on Operating Systems Design and Implementation, OSDI' 2016, Savannah, GA, USA, November 2-4, 2016, Kimberly Keeton and Timothy Roscoe (Eds.). 301--316."},{"key":"e_1_2_1_77_1","volume-title":"Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems","author":"Zou Difan","year":"2019","unstructured":"Difan Zou, Ziniu Hu, Yewen Wang, Song Jiang, Yizhou Sun, and Quanquan Gu. 2019. Layer-Dependent Importance Sampling for Training Deep and Large Graph Convolutional Networks. In Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS'19, Vancouver, BC, Canada. 11247--11256."}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3648160.3648167","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,3]],"date-time":"2024-05-03T21:58:04Z","timestamp":1714773484000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3648160.3648167"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,2]]},"references-count":77,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2024,2]]}},"alternative-id":["10.14778\/3648160.3648167"],"URL":"https:\/\/doi.org\/10.14778\/3648160.3648167","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2024,2]]},"assertion":[{"value":"2024-05-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}