{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T05:49:04Z","timestamp":1777873744535,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":42,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,8,3]]},"DOI":"10.1145\/3711896.3736922","type":"proceedings-article","created":{"date-parts":[[2025,8,3]],"date-time":"2025-08-03T21:05:41Z","timestamp":1754255141000},"page":"3449-3460","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Efficient End-to-end Language Model Fine-tuning on Graphs"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0003-4975-1585","authenticated-orcid":false,"given":"Rui","family":"Xue","sequence":"first","affiliation":[{"name":"North Carolina State University, Raleigh, NC, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3599-8010","authenticated-orcid":false,"given":"Xipeng","family":"Shen","sequence":"additional","affiliation":[{"name":"North Carolina State University, Raleigh, NC, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0905-5158","authenticated-orcid":false,"given":"Ruozhou","family":"Yu","sequence":"additional","affiliation":[{"name":"North Carolina State University, Raleigh, NC, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8217-5688","authenticated-orcid":false,"given":"Xiaorui","family":"Liu","sequence":"additional","affiliation":[{"name":"North Carolina State University, Raleigh, NC, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,8,3]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"Advances in Neural Information Processing Systems","volume":"32","author":"Bai Shaojie","year":"2019","unstructured":"Shaojie Bai, J Zico Kolter, and Vladlen Koltun. 2019. Deep equilibrium models. Advances in Neural Information Processing Systems, Vol. 32 (2019)."},{"key":"e_1_3_2_2_2_1","unstructured":"Jianfei Chen Jun Zhu and Le Song. 2017. Stochastic training of graph convolutional networks with variance reduction. arXiv preprint arXiv:1710.10568(2017)."},{"key":"e_1_3_2_2_3_1","volume-title":"Llaga: Large language and graph assistant. arXiv preprint arXiv:2402.08170(2024).","author":"Chen Runjin","year":"2024","unstructured":"Runjin Chen, Tong Zhao, Ajay Jaiswal, Neil Shah, and Zhangyang Wang. 2024. Llaga: Large language and graph assistant. arXiv preprint arXiv:2402.08170(2024)."},{"key":"e_1_3_2_2_4_1","volume-title":"Neural ordinary differential equations. Advances in neural information processing systems","author":"Chen Ricky TQ","year":"2018","unstructured":"Ricky TQ Chen, Yulia Rubanova, Jesse Bettencourt, and David K Duvenaud. 2018. Neural ordinary differential equations. Advances in neural information processing systems, Vol. 31 (2018)."},{"key":"e_1_3_2_2_5_1","unstructured":"Zhikai Chen Haitao Mao Hang Li Wei Jin Hongzhi Wen Xiaochi Wei Shuaiqiang Wang Dawei Yin Wenqi Fan Hui Liu et al. 2023. Exploring the potential of large language models (llms) in learning on graphs. arXiv preprint arXiv:2307.03393(2023)."},{"key":"e_1_3_2_2_6_1","unstructured":"Eli Chien Wei-Cheng Chang Cho-Jui Hsieh Hsiang-Fu Yu Jiong Zhang Olgica Milenkovic and Inderjit S Dhillon. 2021. Node feature extraction by self-supervised multi-scale neighborhood prediction. arXiv preprint arXiv:2111.00064(2021)."},{"key":"e_1_3_2_2_7_1","volume-title":"Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805(2018).","author":"Devlin Jacob","year":"2018","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805(2018)."},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"crossref","unstructured":"Tu Anh Dinh Jeroen den Boef Joran Cornelisse and Paul Groth. 2022. E2EG: End-to-End Node Classification Using Graph Topology and Text-based Node Attributes. arXiv preprint arXiv:2208.04609(2022).","DOI":"10.1109\/ICDMW60847.2023.00142"},{"key":"e_1_3_2_2_9_1","volume-title":"Qizhe Xie, and Junxian He.","author":"Duan Keyu","year":"2023","unstructured":"Keyu Duan, Qian Liu, Tat-Seng Chua, Shuicheng Yan, Wei Tsang Ooi, Qizhe Xie, and Junxian He. 2023. Simteg: A frustratingly simple approach improves textual graph learning. arXiv preprint arXiv:2308.02565(2023)."},{"key":"e_1_3_2_2_10_1","unstructured":"Matthias Fey and Jan Eric Lenssen. 2019. Fast graph representation learning with PyTorch Geometric. arXiv preprint arXiv:1903.02428(2019)."},{"key":"e_1_3_2_2_11_1","volume-title":"International conference on machine learning. PMLR, 3294-3304","author":"Fey Matthias","year":"2021","unstructured":"Matthias Fey, Jan E Lenssen, Frank Weichert, and Jure Leskovec. 2021. Gnnautoscale: Scalable and expressive graph neural networks via historical embeddings. In International conference on machine learning. PMLR, 3294-3304."},{"key":"e_1_3_2_2_12_1","first-page":"11984","article-title":"Implicit graph neural networks","volume":"33","author":"Gu Fangda","year":"2020","unstructured":"Fangda Gu, Heng Chang, Wenwu Zhu, Somayeh Sojoudi, and Laurent El Ghaoui. 2020. Implicit graph neural networks. Advances in Neural Information Processing Systems, Vol. 33 (2020), 11984-11995.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_2_13_1","volume-title":"Inductive representation learning on large graphs. Advances in neural information processing systems","author":"Hamilton Will","year":"2017","unstructured":"Will Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. Advances in neural information processing systems, Vol. 30 (2017)."},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-01588-5"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-94-009-8467-7_1"},{"key":"e_1_3_2_2_16_1","volume-title":"Deberta: Decoding-enhanced bert with disentangled attention. arXiv preprint arXiv:2006.03654(2020).","author":"He Pengcheng","year":"2020","unstructured":"Pengcheng He, Xiaodong Liu, Jianfeng Gao, and Weizhu Chen. 2020. Deberta: Decoding-enhanced bert with disentangled attention. arXiv preprint arXiv:2006.03654(2020)."},{"key":"e_1_3_2_2_17_1","unstructured":"Xiaoxin He Xavier Bresson Thomas Laurent and Bryan Hooi. 2023. Explanations as Features: LLM-Based Features for Text-Attributed Graphs. arXiv preprint arXiv:2305.19523(2023)."},{"key":"e_1_3_2_2_18_1","volume-title":"Open graph benchmark: Datasets for machine learning on graphs. Advances in neural information processing systems","author":"Hu Weihua","year":"2020","unstructured":"Weihua Hu, Matthias Fey, Marinka Zitnik, Yuxiao Dong, Hongyu Ren, Bowen Liu, Michele Catasta, and Jure Leskovec. 2020. Open graph benchmark: Datasets for machine learning on graphs. Advances in neural information processing systems, Vol. 33 (2020), 22118-22133."},{"key":"e_1_3_2_2_19_1","volume-title":"Patton: Language model pretraining on text-rich networks. arXiv preprint arXiv:2305.12268(2023).","author":"Jin Bowen","year":"2023","unstructured":"Bowen Jin, Wentao Zhang, Yu Zhang, Yu Meng, Xinyang Zhang, Qi Zhu, and Jiawei Han. 2023b. Patton: Language model pretraining on text-rich networks. arXiv preprint arXiv:2305.12268(2023)."},{"key":"e_1_3_2_2_20_1","unstructured":"Wei Jin Haitao Mao Zheng Li Haoming Jiang Chen Luo Hongzhi Wen Haoyu Han Hanqing Lu Zhengyang Wang Ruirui Li et al. 2023a. Amazon-M2: A Multilingual Multi-locale Shopping Session Dataset for Recommendation and Text Generation. arXiv preprint arXiv:2307.09688(2023)."},{"key":"e_1_3_2_2_21_1","first-page":"14582","article-title":"Pure transformers are powerful graph learners","volume":"35","author":"Kim Jinwoo","year":"2022","unstructured":"Jinwoo Kim, Dat Nguyen, Seonwoo Min, Sungjun Cho, Moontae Lee, Honglak Lee, and Seunghoon Hong. 2022. Pure transformers are powerful graph learners. Advances in Neural Information Processing Systems, Vol. 35 (2022), 14582-14595.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_2_22_1","unstructured":"Thomas N Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907(2016)."},{"key":"e_1_3_2_2_23_1","unstructured":"Johannes Klicpera Aleksandar Bojchevski and Stephan G\u00fcnnemann. 2018. Predict then propagate: Graph neural networks meet personalized pagerank. arXiv preprint arXiv:1810.05997(2018)."},{"key":"e_1_3_2_2_24_1","volume-title":"International conference on machine learning. PMLR, 6437-6449","author":"Li Guohao","year":"2021","unstructured":"Guohao Li, Matthias M\u00fcller, Bernard Ghanem, and Vladlen Koltun. 2021. Training graph neural networks with 1000 layers. In International conference on machine learning. PMLR, 6437-6449."},{"key":"e_1_3_2_2_25_1","volume-title":"GRENADE: Graph-Centric Language Model for Self-Supervised Representation Learning on Text-Attributed Graphs. arXiv preprint arXiv:2310.15109(2023).","author":"Li Yichuan","year":"2023","unstructured":"Yichuan Li, Kaize Ding, and Kyumin Lee. 2023. GRENADE: Graph-Centric Language Model for Self-Supervised Representation Learning on Text-Attributed Graphs. arXiv preprint arXiv:2310.15109(2023)."},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3459637.3482225"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"crossref","unstructured":"Costas Mavromatis Vassilis N Ioannidis Shen Wang Da Zheng Soji Adeshina Jun Ma Han Zhao Christos Faloutsos and George Karypis. 2023. Train Your Own GNN Teacher: Graph-Aware Distillation on Textual Graphs. arXiv preprint arXiv:2304.10668(2023).","DOI":"10.1007\/978-3-031-43418-1_10"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1009953814988"},{"key":"e_1_3_2_2_29_1","unstructured":"Tomas Mikolov Kai Chen Greg Corrado and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781(2013)."},{"key":"e_1_3_2_2_30_1","volume-title":"Grpe: Relative positional encoding for graph transformer. arXiv preprint arXiv:2201.12787(2022).","author":"Park Wonpyo","year":"2022","unstructured":"Wonpyo Park, Woonggi Chang, Donggeon Lee, Juntae Kim, and Seung-won Hwang. 2022. Grpe: Relative positional encoding for graph transformer. arXiv preprint arXiv:2201.12787(2022)."},{"key":"e_1_3_2_2_31_1","volume-title":"Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint arXiv:1908.10084(2019).","author":"Reimers Nils","year":"2019","unstructured":"Nils Reimers and Iryna Gurevych. 2019. Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint arXiv:1908.10084(2019)."},{"key":"e_1_3_2_2_32_1","unstructured":"Victor Sanh Lysandre Debut Julien Chaumond and Thomas Wolf. 2019. DistilBERT a distilled version of BERT: smaller faster cheaper and lighter. arXiv preprint arXiv:1910.01108(2019)."},{"key":"e_1_3_2_2_33_1","volume-title":"Collective classification in network data. AI magazine","author":"Sen Prithviraj","year":"2008","unstructured":"Prithviraj Sen, Galileo Namata, Mustafa Bilgic, Lise Getoor, Brian Galligher, and Tina Eliassi-Rad. 2008. Collective classification in network data. AI magazine, Vol. 29, 3 (2008), 93-93."},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/3626772.3657775"},{"key":"e_1_3_2_2_35_1","unstructured":"Petar Veli\u010dkovi\u0107 Guillem Cucurull Arantxa Casanova Adriana Romero Pietro Lio and Yoshua Bengio. 2017. Graph attention networks. arXiv preprint arXiv:1710.10903(2017)."},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"crossref","unstructured":"Luyu Wang Yujia Li Ozlem Aslan and Oriol Vinyals. 2021. WikiGraphs: A Wikipedia text-knowledge graph paired dataset. arXiv preprint arXiv:2107.09556(2021).","DOI":"10.18653\/v1\/2021.textgraphs-1.7"},{"key":"e_1_3_2_2_37_1","unstructured":"Rui Xue Haoyu Han MohamadAli Torkamani Jian Pei and Xiaorui Liu. 2023. LazyGNN: Large-Scale Graph Neural Networks via Lazy Propagation. arXiv preprint arXiv:2302.01503(2023)."},{"key":"e_1_3_2_2_38_1","unstructured":"Rui Xue Tong Zhao Neil Shah and Xiaorui Liu. 2024. Haste Makes Waste: A Simple Approach for Scaling Graph Neural Networks. arXiv preprint arXiv:2410.05416(2024)."},{"key":"e_1_3_2_2_39_1","first-page":"28798","article-title":"GraphFormers: GNN-nested transformers for representation learning on textual graph","volume":"34","author":"Yang Junhan","year":"2021","unstructured":"Junhan Yang, Zheng Liu, Shitao Xiao, Chaozhuo Li, Defu Lian, Sanjay Agrawal, Amit Singh, Guangzhong Sun, and Xing Xie. 2021. GraphFormers: GNN-nested transformers for representation learning on textual graph. Advances in Neural Information Processing Systems, Vol. 34 (2021), 28798-28810.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/3534678.3539121"},{"key":"e_1_3_2_2_41_1","unstructured":"Jianan Zhao Meng Qu Chaozhuo Li Hao Yan Qian Liu Rui Li Xing Xie and Jian Tang. 2022. Learning on large-scale text-attributed graphs via variational inference. arXiv preprint arXiv:2210.14709(2022)."},{"key":"e_1_3_2_2_42_1","unstructured":"Yun Zhu Yaoke Wang Haizhou Shi and Siliang Tang. 2024. Efficient tuning and inference for large language models on textual graphs. arXiv preprint arXiv:2401.15569(2024). endthebibl"}],"event":{"name":"KDD '25: The 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining","location":"Toronto ON Canada","acronym":"KDD '25","sponsor":["SIGKDD ACM Special Interest Group on Knowledge Discovery in Data","SIGMOD ACM Special Interest Group on Management of Data"]},"container-title":["Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3711896.3736922","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,30]],"date-time":"2026-04-30T18:18:10Z","timestamp":1777573090000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3711896.3736922"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,3]]},"references-count":42,"alternative-id":["10.1145\/3711896.3736922","10.1145\/3711896"],"URL":"https:\/\/doi.org\/10.1145\/3711896.3736922","relation":{},"subject":[],"published":{"date-parts":[[2025,8,3]]},"assertion":[{"value":"2025-08-03","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}