{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T18:44:58Z","timestamp":1776105898088,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":45,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,8,20]],"date-time":"2020-08-20T00:00:00Z","timestamp":1597881600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Okawa Foundation Grant"},{"name":"NSF","award":["III-1705169 CAREER Award 1741634 1937599"],"award-info":[{"award-number":["III-1705169 CAREER Award 1741634 1937599"]}]},{"name":"DARPA","award":["HR00112090027 N660011924032"],"award-info":[{"award-number":["HR00112090027 N660011924032"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,8,23]]},"DOI":"10.1145\/3394486.3403237","type":"proceedings-article","created":{"date-parts":[[2020,8,20]],"date-time":"2020-08-20T23:03:59Z","timestamp":1597964639000},"page":"1857-1867","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":361,"title":["GPT-GNN"],"prefix":"10.1145","author":[{"given":"Ziniu","family":"Hu","sequence":"first","affiliation":[{"name":"University of California, Los Angeles, Los Angeles, CA, USA"}]},{"given":"Yuxiao","family":"Dong","sequence":"additional","affiliation":[{"name":"Microsoft Research, Redmond, WA, USA"}]},{"given":"Kuansan","family":"Wang","sequence":"additional","affiliation":[{"name":"Microsoft Research, Redmond, WA, USA"}]},{"given":"Kai-Wei","family":"Chang","sequence":"additional","affiliation":[{"name":"University of California, Los Angeles, Los Angeles, CA, USA"}]},{"given":"Yizhou","family":"Sun","sequence":"additional","affiliation":[{"name":"University of California, Los Angeles, Los Angeles, CA, USA"}]}],"member":"320","published-online":{"date-parts":[[2020,8,20]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"Spectral networks and locally connected networks on graphs. arXiv:1312.6203","author":"Bruna Joan","year":"2013","unstructured":"Joan Bruna , Wojciech Zaremba , Arthur Szlam , and Yann LeCun . 2013. Spectral networks and locally connected networks on graphs. arXiv:1312.6203 ( 2013 ). Joan Bruna, Wojciech Zaremba, Arthur Szlam, and Yann LeCun. 2013. Spectral networks and locally connected networks on graphs. arXiv:1312.6203 (2013)."},{"key":"e_1_3_2_2_2_1","volume-title":"Hinton","author":"Chen Ting","year":"2020","unstructured":"Ting Chen , Simon Kornblith , Mohammad Norouzi , and Geoffrey E . Hinton . 2020 . A Simple Framework for Contrastive Learning of Visual Representations . arxiv:2002.05709 (2020). Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geoffrey E. Hinton. 2020. A Simple Framework for Contrastive Learning of Visual Representations. arxiv:2002.05709 (2020)."},{"key":"e_1_3_2_2_3_1","volume-title":"Imagenet: A large-scale hierarchical image database.","author":"Deng Jia","year":"2009","unstructured":"Jia Deng , Wei Dong , Richard Socher , Li-Jia Li , Kai Li , and Li Fei-Fei . 2009 . Imagenet: A large-scale hierarchical image database. (2009). Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. Imagenet: A large-scale hierarchical image database. (2009)."},{"key":"e_1_3_2_2_4_1","volume-title":"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL 2019 .","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2019 . BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL 2019 . Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL 2019 ."},{"key":"e_1_3_2_2_5_1","volume-title":"ICML 2014 .","author":"Donahue Jeff","year":"2014","unstructured":"Jeff Donahue , Yangqing Jia , Oriol Vinyals , Judy Hoffman , Ning Zhang , Eric Tzeng , and Trevor Darrell . 2014 . Decaf: A deep convolutional activation feature for generic visual recognition . In ICML 2014 . Jeff Donahue, Yangqing Jia, Oriol Vinyals, Judy Hoffman, Ning Zhang, Eric Tzeng, and Trevor Darrell. 2014. Decaf: A deep convolutional activation feature for generic visual recognition. In ICML 2014 ."},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3097983.3098036"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"crossref","unstructured":"Yuxiao Dong Ziniu Hu Kuansan Wang Yizhou Sun and Jie Tang. 2020. Heterogeneous Network Representation Learning. In IJCAI .  Yuxiao Dong Ziniu Hu Kuansan Wang Yizhou Sun and Jie Tang. 2020. Heterogeneous Network Representation Learning. In IJCAI .","DOI":"10.24963\/ijcai.2020\/677"},{"key":"e_1_3_2_2_8_1","volume-title":"Fast Graph Representation Learning with PyTorch Geometric. ICLR Workshop","author":"Fey Matthias","year":"2019","unstructured":"Matthias Fey and Jan Eric Lenssen . 2019 . Fast Graph Representation Learning with PyTorch Geometric. ICLR Workshop (2019). Matthias Fey and Jan Eric Lenssen. 2019. Fast Graph Representation Learning with PyTorch Geometric. ICLR Workshop (2019)."},{"key":"e_1_3_2_2_9_1","volume-title":"Neural Message Passing for Quantum Chemistry. In ICML 2017 .","author":"Gilmer Justin","unstructured":"Justin Gilmer , Samuel S. Schoenholz , Patrick F. Riley , Oriol Vinyals , and George E. Dahl . 2017 . Neural Message Passing for Quantum Chemistry. In ICML 2017 . Justin Gilmer, Samuel S. Schoenholz, Patrick F. Riley, Oriol Vinyals, and George E. Dahl. 2017. Neural Message Passing for Quantum Chemistry. In ICML 2017 ."},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.81"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939754"},{"key":"e_1_3_2_2_12_1","unstructured":"William L. Hamilton Zhitao Ying and Jure Leskovec. 2017. Inductive Representation Learning on Large Graphs. In NeurIPS 2017 .  William L. Hamilton Zhitao Ying and Jure Leskovec. 2017. Inductive Representation Learning on Large Graphs. In NeurIPS 2017 ."},{"key":"e_1_3_2_2_13_1","volume-title":"Momentum contrast for unsupervised visual representation learning. arXiv:1911.05722","author":"He Kaiming","year":"2019","unstructured":"Kaiming He , Haoqi Fan , Yuxin Wu , Saining Xie , and Ross Girshick . 2019. Momentum contrast for unsupervised visual representation learning. arXiv:1911.05722 ( 2019 ). Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross Girshick. 2019. Momentum contrast for unsupervised visual representation learning. arXiv:1911.05722 (2019)."},{"key":"e_1_3_2_2_14_1","volume-title":"Strategies for Pre-training Graph Neural Networks. In ICLR 2020 .","author":"Hu Weihua","year":"2020","unstructured":"Weihua Hu , Bowen Liu , Joseph Gomes , Marinka Zitnik , Percy Liang , Vijay S. Pande , and Jure Leskovec . 2020 b . Strategies for Pre-training Graph Neural Networks. In ICLR 2020 . Weihua Hu, Bowen Liu, Joseph Gomes, Marinka Zitnik, Percy Liang, Vijay S. Pande, and Jure Leskovec. 2020 b. Strategies for Pre-training Graph Neural Networks. In ICLR 2020 ."},{"key":"e_1_3_2_2_15_1","volume-title":"Heterogeneous Graph Transformer. In WWW 2020 .","author":"Hu Ziniu","year":"2020","unstructured":"Ziniu Hu , Yuxiao Dong , Kuansan Wang , and Yizhou Sun . 2020 a . Heterogeneous Graph Transformer. In WWW 2020 . Ziniu Hu, Yuxiao Dong, Kuansan Wang, and Yizhou Sun. 2020 a. Heterogeneous Graph Transformer. In WWW 2020 ."},{"key":"e_1_3_2_2_16_1","volume-title":"Kipf and Max Welling","author":"Thomas","year":"2016","unstructured":"Thomas N. Kipf and Max Welling . 2016 . Variational Graph Auto-Encoders . arXiv:1611.07308 (2016). Thomas N. Kipf and Max Welling. 2016. Variational Graph Auto-Encoders. arXiv:1611.07308 (2016)."},{"key":"e_1_3_2_2_17_1","volume-title":"Semi-Supervised Classification with Graph Convolutional Networks. In ICLR 2017 .","author":"Thomas","unstructured":"Thomas N. Kipf and Max Welling. 2017 . Semi-Supervised Classification with Graph Convolutional Networks. In ICLR 2017 . Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In ICLR 2017 ."},{"key":"e_1_3_2_2_18_1","volume-title":"Zemel","author":"Liao Renjie","year":"2019","unstructured":"Renjie Liao , Yujia Li , Yang Song , Shenlong Wang , William L. Hamilton , David Duvenaud , Raquel Urtasun , and Richard S . Zemel . 2019 . Efficient Graph Generation with Graph Recurrent Attention Networks. In NeurIPS 2019 . Renjie Liao, Yujia Li, Yang Song, Shenlong Wang, William L. Hamilton, David Duvenaud, Raquel Urtasun, and Richard S. Zemel. 2019. Efficient Graph Generation with Graph Recurrent Attention Networks. In NeurIPS 2019 ."},{"key":"e_1_3_2_2_19_1","volume-title":"Learning to Rank for Information Retrieval","author":"Liu Tie-Yan","unstructured":"Tie-Yan Liu . 2011. Learning to Rank for Information Retrieval . Springer . Tie-Yan Liu. 2011. Learning to Rank for Information Retrieval .Springer."},{"key":"e_1_3_2_2_20_1","volume-title":"SGDR: Stochastic Gradient Descent with Warm Restarts. In ICLR 2017 .","author":"Loshchilov Ilya","year":"2017","unstructured":"Ilya Loshchilov and Frank Hutter . 2017 . SGDR: Stochastic Gradient Descent with Warm Restarts. In ICLR 2017 . Ilya Loshchilov and Frank Hutter. 2017. SGDR: Stochastic Gradient Descent with Warm Restarts. In ICLR 2017 ."},{"key":"e_1_3_2_2_21_1","volume-title":"Decoupled Weight Decay Regularization. In ICLR 2019 .","author":"Loshchilov Ilya","year":"2019","unstructured":"Ilya Loshchilov and Frank Hutter . 2019 . Decoupled Weight Decay Regularization. In ICLR 2019 . Ilya Loshchilov and Frank Hutter. 2019. Decoupled Weight Decay Regularization. In ICLR 2019 ."},{"key":"e_1_3_2_2_22_1","volume-title":"NIPS 2013 .","author":"Mikolov Tomas","year":"2013","unstructured":"Tomas Mikolov , Ilya Sutskever , Kai Chen , Greg S Corrado , and Jeff Dean . 2013 . Distributed representations of words and phrases and their compositionality . In NIPS 2013 . Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In NIPS 2013 ."},{"key":"e_1_3_2_2_23_1","volume-title":"EMNLP 2019 .","author":"Ni Jianmo","unstructured":"Jianmo Ni , Jiacheng Li , and Julian J . McAuley. 2019. Justifying Recommendations using Distantly-Labeled Reviews and Fine-Grained Aspects . In EMNLP 2019 . Jianmo Ni, Jiacheng Li, and Julian J. McAuley. 2019. Justifying Recommendations using Distantly-Labeled Reviews and Fine-Grained Aspects. In EMNLP 2019 ."},{"key":"e_1_3_2_2_24_1","volume-title":"CVPR 2016 .","author":"Pathak Deepak","unstructured":"Deepak Pathak , Philipp Kr\"a henb\u00fc hl, Jeff Donahue , Trevor Darrell , and Alexei A. Efros . 2016. Context Encoders: Feature Learning by Inpainting . In CVPR 2016 . Deepak Pathak, Philipp Kr\"a henb\u00fc hl, Jeff Donahue, Trevor Darrell, and Alexei A. Efros. 2016. Context Encoders: Feature Learning by Inpainting. In CVPR 2016 ."},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1162"},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3159652.3159706"},{"key":"e_1_3_2_2_27_1","unstructured":"Alec Radford Jeff Wu Rewon Child David Luan Dario Amodei and Ilya Sutskever. 2019. Language Models are Unsupervised Multitask Learners. (2019).  Alec Radford Jeff Wu Rewon Child David Luan Dario Amodei and Ilya Sutskever. 2019. Language Models are Unsupervised Multitask Learners. (2019)."},{"key":"e_1_3_2_2_28_1","volume-title":"Modeling Relational Data with Graph Convolutional Networks. In ESWC 2018 .","author":"Schlichtkrull Michael Sejr","unstructured":"Michael Sejr Schlichtkrull , Thomas N. Kipf , Peter Bloem , Rianne van den Berg, Ivan Titov, and Max Welling. 2018 . Modeling Relational Data with Graph Convolutional Networks. In ESWC 2018 . Michael Sejr Schlichtkrull, Thomas N. Kipf, Peter Bloem, Rianne van den Berg, Ivan Titov, and Max Welling. 2018. Modeling Relational Data with Graph Convolutional Networks. In ESWC 2018 ."},{"key":"e_1_3_2_2_29_1","volume-title":"ICLR 2020 .","author":"Sun Fan-Yun","year":"2020","unstructured":"Fan-Yun Sun , Jordan Hoffmann , Vikas Verma , and Jian Tang . 2020 . InfoGraph: Unsupervised and Semi-supervised Graph-Level Representation Learning via Mutual Information Maximization . In ICLR 2020 . Fan-Yun Sun, Jordan Hoffmann, Vikas Verma, and Jian Tang. 2020. InfoGraph: Unsupervised and Semi-supervised Graph-Level Representation Learning via Mutual Information Maximization. In ICLR 2020 ."},{"key":"e_1_3_2_2_30_1","volume-title":"Mining Heterogeneous Information Networks: Principles and Methodologies","author":"Sun Yizhou","unstructured":"Yizhou Sun and Jiawei Han . 2012. Mining Heterogeneous Information Networks: Principles and Methodologies . Morgan & Claypool Publishers . Yizhou Sun and Jiawei Han. 2012. Mining Heterogeneous Information Networks: Principles and Methodologies .Morgan & Claypool Publishers."},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.14778\/3402707.3402736"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2339530.2339738"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2736277.2741093"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/1401890.1402008"},{"key":"e_1_3_2_2_35_1","volume-title":"Representation Learning with Contrastive Predictive Coding. arXiv:1807.03748","author":"Li Yazhe","year":"2018","unstructured":"A\"a ron van den Oord, Yazhe Li , and Oriol Vinyals . 2018. Representation Learning with Contrastive Predictive Coding. arXiv:1807.03748 ( 2018 ). A\"a ron van den Oord, Yazhe Li, and Oriol Vinyals. 2018. Representation Learning with Contrastive Predictive Coding. arXiv:1807.03748 (2018)."},{"key":"e_1_3_2_2_36_1","volume-title":"Graph Attention Networks. In ICLR 2018 .","author":"Velickovic Petar","year":"2018","unstructured":"Petar Velickovic , Guillem Cucurull , Arantxa Casanova , Adriana Romero , Pietro Li\u00f2 , and Yoshua Bengio . 2018 . Graph Attention Networks. In ICLR 2018 . Petar Velickovic, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Li\u00f2 , and Yoshua Bengio. 2018. Graph Attention Networks. In ICLR 2018 ."},{"key":"e_1_3_2_2_37_1","volume-title":"Deep Graph Infomax. In ICLR 2019 .","author":"Velickovic Petar","unstructured":"Petar Velickovic , William Fedus , William L. Hamilton , Pietro Li\u00f2 , Yoshua Bengio , and R. Devon Hjelm . 2019 . Deep Graph Infomax. In ICLR 2019 . Petar Velickovic, William Fedus, William L. Hamilton, Pietro Li\u00f2, Yoshua Bengio, and R. Devon Hjelm. 2019. Deep Graph Infomax. In ICLR 2019 ."},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1162\/qss_a_00021"},{"key":"e_1_3_2_2_39_1","volume-title":"Yu","author":"Wang Xiao","year":"2019","unstructured":"Xiao Wang , Houye Ji , Chuan Shi , Bai Wang , Yanfang Ye , Peng Cui , and Philip S . Yu . 2019 . Heterogeneous Graph Attention Network. In WWW 2019 . Xiao Wang, Houye Ji, Chuan Shi, Bai Wang, Yanfang Ye, Peng Cui, and Philip S. Yu. 2019. Heterogeneous Graph Attention Network. In WWW 2019 ."},{"key":"e_1_3_2_2_40_1","volume-title":"Transformers: State-of-the-art Natural Language Processing. arxiv: cs.CL\/1910.03771","author":"Wolf Thomas","year":"2019","unstructured":"Thomas Wolf , Lysandre Debut , Victor Sanh , Julien Chaumond , Clement Delangue , Anthony Moi , Pierric Cistac , Tim Rault , R\u00e9mi Louf , Morgan Funtowicz , and Jamie Brew . 2019 . Transformers: State-of-the-art Natural Language Processing. arxiv: cs.CL\/1910.03771 Thomas Wolf, Lysandre Debut, Victor Sanh, Julien Chaumond, Clement Delangue, Anthony Moi, Pierric Cistac, Tim Rault, R\u00e9mi Louf, Morgan Funtowicz, and Jamie Brew. 2019. Transformers: State-of-the-art Natural Language Processing. arxiv: cs.CL\/1910.03771"},{"key":"e_1_3_2_2_41_1","volume-title":"Le","author":"Yang Zhilin","year":"2019","unstructured":"Zhilin Yang , Zihang Dai , Yiming Yang , Jaime G. Carbonell , Ruslan Salakhutdinov , and Quoc V . Le . 2019 . XLNet: Generalized Autoregressive Pretraining for Language Understanding. In NeurIPS 2019 . Zhilin Yang, Zihang Dai, Yiming Yang, Jaime G. Carbonell, Ruslan Salakhutdinov, and Quoc V. Le. 2019. XLNet: Generalized Autoregressive Pretraining for Language Understanding. In NeurIPS 2019 ."},{"key":"e_1_3_2_2_42_1","volume-title":"Graph Convolutional Neural Networks for Web-Scale Recommender Systems. In KDD 2018 .","author":"Ying Rex","year":"2018","unstructured":"Rex Ying , Ruining He , Kaifeng Chen , Pong Eksombatchai , William L. Hamilton , and Jure Leskovec . 2018 . Graph Convolutional Neural Networks for Web-Scale Recommender Systems. In KDD 2018 . Rex Ying, Ruining He, Kaifeng Chen, Pong Eksombatchai, William L. Hamilton, and Jure Leskovec. 2018. Graph Convolutional Neural Networks for Web-Scale Recommender Systems. In KDD 2018 ."},{"key":"e_1_3_2_2_43_1","volume-title":"GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models. In ICML 2018 .","author":"You Jiaxuan","year":"2018","unstructured":"Jiaxuan You , Rex Ying , Xiang Ren , William L. Hamilton , and Jure Leskovec . 2018 . GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models. In ICML 2018 . Jiaxuan You, Rex Ying, Xiang Ren, William L. Hamilton, and Jure Leskovec. 2018. GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models. In ICML 2018 ."},{"key":"e_1_3_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330785"},{"key":"e_1_3_2_2_45_1","unstructured":"Difan Zou Ziniu Hu Yewen Wang Song Jiang Yizhou Sun and Quanquan Gu. 2019. Layer-Dependent Importance Sampling for Training Deep and Large Graph Convolutional Networks. In NeurIPS 2019 .  Difan Zou Ziniu Hu Yewen Wang Song Jiang Yizhou Sun and Quanquan Gu. 2019. Layer-Dependent Importance Sampling for Training Deep and Large Graph Convolutional Networks. In NeurIPS 2019 ."}],"event":{"name":"KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining","location":"Virtual Event CA USA","acronym":"KDD '20","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data"]},"container-title":["Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery &amp; Data Mining"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3394486.3403237","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3394486.3403237","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3394486.3403237","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:01:47Z","timestamp":1750197707000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3394486.3403237"}},"subtitle":["Generative Pre-Training of Graph Neural Networks"],"short-title":[],"issued":{"date-parts":[[2020,8,20]]},"references-count":45,"alternative-id":["10.1145\/3394486.3403237","10.1145\/3394486"],"URL":"https:\/\/doi.org\/10.1145\/3394486.3403237","relation":{},"subject":[],"published":{"date-parts":[[2020,8,20]]},"assertion":[{"value":"2020-08-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}