{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T16:06:03Z","timestamp":1776096363302,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":69,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,8,4]],"date-time":"2023-08-04T00:00:00Z","timestamp":1691107200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Shanghai Municipal Science and Technology Major Project","award":["2021SHZDZX0102"],"award-info":[{"award-number":["2021SHZDZX0102"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62177033"],"award-info":[{"award-number":["62177033"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,8,6]]},"DOI":"10.1145\/3580305.3599422","type":"proceedings-article","created":{"date-parts":[[2023,8,4]],"date-time":"2023-08-04T18:13:58Z","timestamp":1691172838000},"page":"1384-1395","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":31,"title":["MAP: A Model-agnostic Pretraining Framework for Click-through Rate Prediction"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8953-3203","authenticated-orcid":false,"given":"Jianghao","family":"Lin","sequence":"first","affiliation":[{"name":"Shanghai Jiao Tong University, Shanghai, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7664-0421","authenticated-orcid":false,"given":"Yanru","family":"Qu","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University, Shanghai, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8616-0221","authenticated-orcid":false,"given":"Wei","family":"Guo","sequence":"additional","affiliation":[{"name":"Huawei Noah's Ark Lab, Shenzhen, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3351-5401","authenticated-orcid":false,"given":"Xinyi","family":"Dai","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University, Shanghai, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9224-2431","authenticated-orcid":false,"given":"Ruiming","family":"Tang","sequence":"additional","affiliation":[{"name":"Huawei Noah's Ark Lab, Shenzhen, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0281-8271","authenticated-orcid":false,"given":"Yong","family":"Yu","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University, Shanghai, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0127-2425","authenticated-orcid":false,"given":"Weinan","family":"Zhang","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University, Shanghai, China"}]}],"member":"320","published-online":{"date-parts":[[2023,8,4]]},"reference":[{"key":"e_1_3_2_2_1_1","unstructured":"2020. MindSpore. https:\/\/www.mindspore.cn\/ 2020. MindSpore. https:\/\/www.mindspore.cn\/"},{"key":"e_1_3_2_2_2_1","volume-title":"Scarf: Self-supervised contrastive learning using random feature corruption. arXiv preprint arXiv:2106.15147","author":"Bahri Dara","year":"2021","unstructured":"Dara Bahri , Heinrich Jiang , Yi Tay , and Donald Metzler . 2021 . Scarf: Self-supervised contrastive learning using random feature corruption. arXiv preprint arXiv:2106.15147 (2021). Dara Bahri, Heinrich Jiang, Yi Tay, and Donald Metzler. 2021. Scarf: Self-supervised contrastive learning using random feature corruption. arXiv preprint arXiv:2106.15147 (2021)."},{"key":"e_1_3_2_2_3_1","unstructured":"Tom Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared D Kaplan Prafulla Dhariwal Arvind Neelakantan Pranav Shyam Girish Sastry Amanda Askell etal 2020. Language models are few-shot learners. Advances in neural information processing systems Vol. 33 (2020) 1877--1901. Tom Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared D Kaplan Prafulla Dhariwal Arvind Neelakantan Pranav Shyam Girish Sastry Amanda Askell et al. 2020. Language models are few-shot learners. Advances in neural information processing systems Vol. 33 (2020) 1877--1901."},{"key":"e_1_3_2_2_4_1","article-title":"Training and testing low-degree polynomial data mappings via linear SVM","volume":"11","author":"Chang Yin-Wen","year":"2010","unstructured":"Yin-Wen Chang , Cho-Jui Hsieh , Kai-Wei Chang , Michael Ringgaard , and Chih-Jen Lin . 2010 . Training and testing low-degree polynomial data mappings via linear SVM . Journal of Machine Learning Research , Vol. 11 , 4 (2010). Yin-Wen Chang, Cho-Jui Hsieh, Kai-Wei Chang, Michael Ringgaard, and Chih-Jen Lin. 2010. Training and testing low-degree polynomial data mappings via linear SVM. Journal of Machine Learning Research, Vol. 11, 4 (2010).","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_2_5_1","volume-title":"International conference on machine learning. PMLR, 1597--1607","author":"Chen Ting","year":"2020","unstructured":"Ting Chen , Simon Kornblith , Mohammad Norouzi , and Geoffrey Hinton . 2020 b. A simple framework for contrastive learning of visual representations . In International conference on machine learning. PMLR, 1597--1607 . Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton. 2020b. A simple framework for contrastive learning of visual representations. In International conference on machine learning. PMLR, 1597--1607."},{"key":"e_1_3_2_2_6_1","volume-title":"Big self-supervised models are strong semi-supervised learners. Advances in neural information processing systems","author":"Chen Ting","year":"2020","unstructured":"Ting Chen , Simon Kornblith , Kevin Swersky , Mohammad Norouzi , and Geoffrey E Hinton . 2020c. Big self-supervised models are strong semi-supervised learners. Advances in neural information processing systems , Vol. 33 ( 2020 ), 22243--22255. Ting Chen, Simon Kornblith, Kevin Swersky, Mohammad Norouzi, and Geoffrey E Hinton. 2020c. Big self-supervised models are strong semi-supervised learners. Advances in neural information processing systems, Vol. 33 (2020), 22243--22255."},{"key":"e_1_3_2_2_7_1","volume-title":"Improved baselines with momentum contrastive learning. arXiv preprint arXiv:2003.04297","author":"Chen Xinlei","year":"2020","unstructured":"Xinlei Chen , Haoqi Fan , Ross Girshick , and Kaiming He. 2020a. Improved baselines with momentum contrastive learning. arXiv preprint arXiv:2003.04297 ( 2020 ). Xinlei Chen, Haoqi Fan, Ross Girshick, and Kaiming He. 2020a. Improved baselines with momentum contrastive learning. arXiv preprint arXiv:2003.04297 (2020)."},{"key":"e_1_3_2_2_8_1","volume-title":"Electra: Pre-training text encoders as discriminators rather than generators. arXiv preprint arXiv:2003.10555","author":"Clark Kevin","year":"2020","unstructured":"Kevin Clark , Minh-Thang Luong , Quoc V Le , and Christopher D Manning . 2020 . Electra: Pre-training text encoders as discriminators rather than generators. arXiv preprint arXiv:2003.10555 (2020). Kevin Clark, Minh-Thang Luong, Quoc V Le, and Christopher D Manning. 2020. Electra: Pre-training text encoders as discriminators rather than generators. arXiv preprint arXiv:2003.10555 (2020)."},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3442381.3449913"},{"key":"e_1_3_2_2_10_1","volume-title":"Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805","author":"Devlin Jacob","year":"2018","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2018 . Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018). Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)."},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00945"},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/3539597.3570365"},{"key":"e_1_3_2_2_13_1","volume-title":"Zhaohan Guo, Mohammad Gheshlaghi Azar, et al.","author":"Grill Jean-Bastien","year":"2020","unstructured":"Jean-Bastien Grill , Florian Strub , Florent Altch\u00e9 , Corentin Tallec , Pierre Richemond , Elena Buchatskaya , Carl Doersch , Bernardo Avila Pires , Zhaohan Guo, Mohammad Gheshlaghi Azar, et al. 2020 . Bootstrap your own latent-a new approach to self-supervised learning. Advances in neural information processing systems, Vol. 33 (2020), 21271--21284. Jean-Bastien Grill, Florian Strub, Florent Altch\u00e9, Corentin Tallec, Pierre Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Avila Pires, Zhaohan Guo, Mohammad Gheshlaghi Azar, et al. 2020. Bootstrap your own latent-a new approach to self-supervised learning. Advances in neural information processing systems, Vol. 33 (2020), 21271--21284."},{"key":"e_1_3_2_2_14_1","unstructured":"Huifeng Guo Ruiming Tang Yunming Ye Zhenguo Li and Xiuqiang He. 2017. Deepfm: a factorization-machine based neural network for ctr prediction. In IJCAI. Huifeng Guo Ruiming Tang Yunming Ye Zhenguo Li and Xiuqiang He. 2017. Deepfm: a factorization-machine based neural network for ctr prediction. In IJCAI."},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE53745.2022.00059"},{"key":"e_1_3_2_2_16_1","volume-title":"Proceedings of the thirteenth international conference on artificial intelligence and statistics. JMLR Workshop and Conference Proceedings, 297--304","author":"Gutmann Michael","year":"2010","unstructured":"Michael Gutmann and Aapo Hyv\u00e4rinen . 2010 . Noise-contrastive estimation: A new estimation principle for unnormalized statistical models . In Proceedings of the thirteenth international conference on artificial intelligence and statistics. JMLR Workshop and Conference Proceedings, 297--304 . Michael Gutmann and Aapo Hyv\u00e4rinen. 2010. Noise-contrastive estimation: A new estimation principle for unnormalized statistical models. In Proceedings of the thirteenth international conference on artificial intelligence and statistics. JMLR Workshop and Conference Proceedings, 297--304."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3437963.3441738"},{"key":"e_1_3_2_2_18_1","volume-title":"MixGen: A New Multi-Modal Data Augmentation. arXiv preprint arXiv:2206.08358","author":"Hao Xiaoshuai","year":"2022","unstructured":"Xiaoshuai Hao , Yi Zhu , Srikar Appalaraju , Aston Zhang , Wanqian Zhang , Bo Li , and Mu Li. 2022. MixGen: A New Multi-Modal Data Augmentation. arXiv preprint arXiv:2206.08358 ( 2022 ). Xiaoshuai Hao, Yi Zhu, Srikar Appalaraju, Aston Zhang, Wanqian Zhang, Bo Li, and Mu Li. 2022. MixGen: A New Multi-Modal Data Augmentation. arXiv preprint arXiv:2206.08358 (2022)."},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01553"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00975"},{"key":"e_1_3_2_2_21_1","unstructured":"Xiangnan He and Tat-Seng Chua. 2017. Neural factorization machines for sparse predictive analytics. In SIGIR. 355--364. Xiangnan He and Tat-Seng Chua. 2017. Neural factorization machines for sparse predictive analytics. In SIGIR. 355--364."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"crossref","unstructured":"Xiangnan He Lizi Liao Hanwang Zhang Liqiang Nie Xia Hu and Tat-Seng Chua. 2017. Neural collaborative filtering. In WWW. 173--182. Xiangnan He Lizi Liao Hanwang Zhang Liqiang Nie Xia Hu and Tat-Seng Chua. 2017. Neural collaborative filtering. In WWW. 173--182.","DOI":"10.1145\/3038912.3052569"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00745"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3298689.3347043"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3477495.3531762"},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"crossref","unstructured":"Yuchin Juan Yong Zhuang Wei-Sheng Chin and Chih-Jen Lin. 2016. Field-aware factorization machines for CTR prediction. In RecSys. 43--50. Yuchin Juan Yong Zhuang Wei-Sheng Chin and Chih-Jen Lin. 2016. Field-aware factorization machines for CTR prediction. In RecSys. 43--50.","DOI":"10.1145\/2959100.2959134"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3336191.3371785"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3357384.3357951"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"crossref","unstructured":"Jianxun Lian Xiaohuan Zhou Fuzheng Zhang Zhongxia Chen Xing Xie and Guangzhong Sun. 2018. xdeepfm: Combining explicit and implicit feature interactions for recommender systems. In KDD. 1754--1763. Jianxun Lian Xiaohuan Zhou Fuzheng Zhang Zhongxia Chen Xing Xie and Guangzhong Sun. 2018. xdeepfm: Combining explicit and implicit feature interactions for recommender systems. In KDD. 1754--1763.","DOI":"10.1145\/3219819.3220023"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3462895"},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"crossref","unstructured":"Bin Liu Ruiming Tang Yingzhi Chen Jinkai Yu Huifeng Guo and Yuzhou Zhang. 2019b. Feature generation by convolutional neural network for click-through rate prediction. In WWW. 1119--1129. Bin Liu Ruiming Tang Yingzhi Chen Jinkai Yu Huifeng Guo and Yuzhou Zhang. 2019b. Feature generation by convolutional neural network for click-through rate prediction. In WWW. 1119--1129.","DOI":"10.1145\/3308558.3313497"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2806416.2806603"},{"key":"e_1_3_2_2_33_1","volume-title":"Self-supervised learning: Generative or contrastive","author":"Liu Xiao","year":"2021","unstructured":"Xiao Liu , Fanjin Zhang , Zhenyu Hou , Li Mian , Zhaoyu Wang , Jing Zhang , and Jie Tang . 2021c. Self-supervised learning: Generative or contrastive . IEEE Transactions on Knowledge and Data Engineering ( 2021 ). Xiao Liu, Fanjin Zhang, Zhenyu Hou, Li Mian, Zhaoyu Wang, Jing Zhang, and Jie Tang. 2021c. Self-supervised learning: Generative or contrastive. IEEE Transactions on Knowledge and Data Engineering (2021)."},{"key":"e_1_3_2_2_34_1","volume-title":"Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692","author":"Liu Yinhan","year":"2019","unstructured":"Yinhan Liu , Myle Ott , Naman Goyal , Jingfei Du , Mandar Joshi , Danqi Chen , Omer Levy , Mike Lewis , Luke Zettlemoyer , and Veselin Stoyanov . 2019 a. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019). Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019a. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019)."},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475709"},{"key":"e_1_3_2_2_36_1","volume-title":"Contrastive self-supervised sequential recommendation with robust augmentation. arXiv preprint arXiv:2108.06479","author":"Liu Zhiwei","year":"2021","unstructured":"Zhiwei Liu , Yongjun Chen , Jia Li , Philip S Yu , Julian McAuley , and Caiming Xiong . 2021a. Contrastive self-supervised sequential recommendation with robust augmentation. arXiv preprint arXiv:2108.06479 ( 2021 ). Zhiwei Liu, Yongjun Chen, Jia Li, Philip S Yu, Julian McAuley, and Caiming Xiong. 2021a. Contrastive self-supervised sequential recommendation with robust augmentation. arXiv preprint arXiv:2108.06479 (2021)."},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3512527.3531378"},{"key":"e_1_3_2_2_38_1","volume-title":"Graph neural pre-training for enhancing recommendations using side information. arXiv preprint arXiv:2107.03936","author":"Meng Zaiqiao","year":"2021","unstructured":"Zaiqiao Meng , Siwei Liu , Craig Macdonald , and Iadh Ounis . 2021. Graph neural pre-training for enhancing recommendations using side information. arXiv preprint arXiv:2107.03936 ( 2021 ). Zaiqiao Meng, Siwei Liu, Craig Macdonald, and Iadh Ounis. 2021. Graph neural pre-training for enhancing recommendations using side information. arXiv preprint arXiv:2107.03936 (2021)."},{"key":"e_1_3_2_2_39_1","volume-title":"Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781","author":"Mikolov Tomas","year":"2013","unstructured":"Tomas Mikolov , Kai Chen , Greg Corrado , and Jeffrey Dean . 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 ( 2013 ). Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)."},{"key":"e_1_3_2_2_40_1","volume-title":"A fast and simple algorithm for training neural probabilistic language models. arXiv preprint arXiv:1206.6426","author":"Mnih Andriy","year":"2012","unstructured":"Andriy Mnih and Yee Whye Teh . 2012. A fast and simple algorithm for training neural probabilistic language models. arXiv preprint arXiv:1206.6426 ( 2012 ). Andriy Mnih and Yee Whye Teh. 2012. A fast and simple algorithm for training neural probabilistic language models. arXiv preprint arXiv:1206.6426 (2012)."},{"key":"e_1_3_2_2_41_1","volume-title":"Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748","author":"van den Oord Aaron","year":"2018","unstructured":"Aaron van den Oord , Yazhe Li , and Oriol Vinyals . 2018. Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748 ( 2018 ). Aaron van den Oord, Yazhe Li, and Oriol Vinyals. 2018. Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748 (2018)."},{"key":"e_1_3_2_2_42_1","volume-title":"Click-through Rate Prediction with Auto-Quantized Contrastive Learning. arXiv preprint arXiv:2109.13921","author":"Pan Yujie","year":"2021","unstructured":"Yujie Pan , Jiangchao Yao , Bo Han , Kunyang Jia , Ya Zhang , and Hongxia Yang . 2021. Click-through Rate Prediction with Auto-Quantized Contrastive Learning. arXiv preprint arXiv:2109.13921 ( 2021 ). Yujie Pan, Jiangchao Yao, Bo Han, Kunyang Jia, Ya Zhang, and Hongxia Yang. 2021. Click-through Rate Prediction with Auto-Quantized Contrastive Learning. arXiv preprint arXiv:2109.13921 (2021)."},{"key":"e_1_3_2_2_43_1","unstructured":"Jiarui Qin W. Zhang Xin Wu Jiarui Jin Yuchen Fang and Y. Yu. 2020. User Behavior Retrieval for Click-Through Rate Prediction. In SIGIR. Jiarui Qin W. Zhang Xin Wu Jiarui Jin Yuchen Fang and Y. Yu. 2020. User Behavior Retrieval for Click-Through Rate Prediction. In SIGIR."},{"key":"e_1_3_2_2_44_1","unstructured":"Yanru Qu Han Cai Kan Ren Weinan Zhang Yong Yu Ying Wen and Jun Wang. 2016. Product-based neural networks for user response prediction. In ICDM. Yanru Qu Han Cai Kan Ren Weinan Zhang Yong Yu Ying Wen and Jun Wang. 2016. Product-based neural networks for user response prediction. In ICDM."},{"key":"e_1_3_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/3233770"},{"key":"e_1_3_2_2_46_1","doi-asserted-by":"crossref","unstructured":"Steffen Rendle. 2010. Factorization machines. In ICDM. Steffen Rendle. 2010. Factorization machines. In ICDM.","DOI":"10.1109\/ICDM.2010.127"},{"key":"e_1_3_2_2_47_1","volume-title":"Factorization machines with libfm. TIST","author":"Rendle Steffen","year":"2012","unstructured":"Steffen Rendle . 2012. Factorization machines with libfm. TIST ( 2012 ). Steffen Rendle. 2012. Factorization machines with libfm. TIST (2012)."},{"key":"e_1_3_2_2_48_1","doi-asserted-by":"crossref","unstructured":"Matthew Richardson Ewa Dominowska and Robert Ragno. 2007. Predicting clicks: estimating the click-through rate for new ads. In WWW. ACM 521--530. Matthew Richardson Ewa Dominowska and Robert Ragno. 2007. Predicting clicks: estimating the click-through rate for new ads. In WWW. ACM 521--530.","DOI":"10.1145\/1242572.1242643"},{"key":"e_1_3_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/3357384.3357925"},{"key":"e_1_3_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/3357384.3357895"},{"key":"e_1_3_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58621-8_45"},{"key":"e_1_3_2_2_52_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems. 5998--6008. Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems. 5998--6008."},{"key":"e_1_3_2_2_53_1","volume-title":"Pre-training Graph Neural Network for Cross Domain Recommendation. In 2021 IEEE Third International Conference on Cognitive Machine Intelligence","author":"Wang Chen","year":"2021","unstructured":"Chen Wang , Yueqing Liang , Zhiwei Liu , Tao Zhang , and S Yu Philip . 2021 a. Pre-training Graph Neural Network for Cross Domain Recommendation. In 2021 IEEE Third International Conference on Cognitive Machine Intelligence ( CogMI). IEEE, 140--145. Chen Wang, Yueqing Liang, Zhiwei Liu, Tao Zhang, and S Yu Philip. 2021a. Pre-training Graph Neural Network for Cross Domain Recommendation. In 2021 IEEE Third International Conference on Cognitive Machine Intelligence (CogMI). IEEE, 140--145."},{"key":"e_1_3_2_2_54_1","volume-title":"CL4CTR: A Contrastive Learning Framework for CTR Prediction. arXiv preprint arXiv:2212.00522","author":"Wang Fangye","year":"2022","unstructured":"Fangye Wang , Yingxu Wang , Dongsheng Li , Hansu Gu , Tun Lu , Peng Zhang , and Ning Gu. 2022a. CL4CTR: A Contrastive Learning Framework for CTR Prediction. arXiv preprint arXiv:2212.00522 ( 2022 ). Fangye Wang, Yingxu Wang, Dongsheng Li, Hansu Gu, Tun Lu, Peng Zhang, and Ning Gu. 2022a. CL4CTR: A Contrastive Learning Framework for CTR Prediction. arXiv preprint arXiv:2212.00522 (2022)."},{"key":"e_1_3_2_2_55_1","volume-title":"Enhancing CTR Prediction with Context-Aware Feature Representation Learning. arXiv preprint arXiv:2204.08758","author":"Wang Fangye","year":"2022","unstructured":"Fangye Wang , Yingxu Wang , Dongsheng Li , Hansu Gu , Tun Lu , Peng Zhang , and Ning Gu. 2022b. Enhancing CTR Prediction with Context-Aware Feature Representation Learning. arXiv preprint arXiv:2204.08758 ( 2022 ). Fangye Wang, Yingxu Wang, Dongsheng Li, Hansu Gu, Tun Lu, Peng Zhang, and Ning Gu. 2022b. Enhancing CTR Prediction with Context-Aware Feature Representation Learning. arXiv preprint arXiv:2204.08758 (2022)."},{"key":"e_1_3_2_2_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/3340531.3412726"},{"key":"e_1_3_2_2_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/3124749.3124754"},{"key":"e_1_3_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/3442381.3450078"},{"key":"e_1_3_2_2_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/3511808.3557268"},{"key":"e_1_3_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00393"},{"key":"e_1_3_2_2_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/3539597.3570399"},{"key":"e_1_3_2_2_62_1","volume-title":"Attentional factorization machines: Learning the weight of feature interactions via attention networks. IJCAI","author":"Xiao Jun","year":"2017","unstructured":"Jun Xiao , Hao Ye , Xiangnan He , Hanwang Zhang , Fei Wu , and Tat-Seng Chua . 2017. Attentional factorization machines: Learning the weight of feature interactions via attention networks. IJCAI ( 2017 ). Jun Xiao, Hao Ye, Xiangnan He, Hanwang Zhang, Fei Wu, and Tat-Seng Chua. 2017. Attentional factorization machines: Learning the weight of feature interactions via attention networks. IJCAI (2017)."},{"key":"e_1_3_2_2_63_1","volume-title":"Contrastive pre-training for sequential recommendation. arXiv preprint arXiv:2010.14395","author":"Xie Xu","year":"2020","unstructured":"Xu Xie , Fei Sun , Zhaoyang Liu , Jinyang Gao , Bolin Ding , and Bin Cui . 2020. Contrastive pre-training for sequential recommendation. arXiv preprint arXiv:2010.14395 ( 2020 ). Xu Xie, Fei Sun, Zhaoyang Liu, Jinyang Gao, Bolin Ding, and Bin Cui. 2020. Contrastive pre-training for sequential recommendation. arXiv preprint arXiv:2010.14395 (2020)."},{"key":"e_1_3_2_2_64_1","volume-title":"International Conference on Machine Learning. PMLR, 10524--10533","author":"Xiong Ruibin","year":"2020","unstructured":"Ruibin Xiong , Yunchang Yang , Di He , Kai Zheng , Shuxin Zheng , Chen Xing , Huishuai Zhang , Yanyan Lan , Liwei Wang , and Tieyan Liu . 2020 . On layer normalization in the transformer architecture . In International Conference on Machine Learning. PMLR, 10524--10533 . Ruibin Xiong, Yunchang Yang, Di He, Kai Zheng, Shuxin Zheng, Chen Xing, Huishuai Zhang, Yanyan Lan, Liwei Wang, and Tieyan Liu. 2020. On layer normalization in the transformer architecture. In International Conference on Machine Learning. PMLR, 10524--10533."},{"key":"e_1_3_2_2_65_1","volume-title":"Xlnet: Generalized autoregressive pretraining for language understanding. Advances in neural information processing systems","author":"Yang Zhilin","year":"2019","unstructured":"Zhilin Yang , Zihang Dai , Yiming Yang , Jaime Carbonell , Russ R Salakhutdinov , and Quoc V Le . 2019 . Xlnet: Generalized autoregressive pretraining for language understanding. Advances in neural information processing systems , Vol. 32 (2019). Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Russ R Salakhutdinov, and Quoc V Le. 2019. Xlnet: Generalized autoregressive pretraining for language understanding. Advances in neural information processing systems, Vol. 32 (2019)."},{"key":"e_1_3_2_2_66_1","doi-asserted-by":"publisher","DOI":"10.1145\/3459637.3481952"},{"key":"e_1_3_2_2_67_1","first-page":"11033","article-title":"Vime: Extending the success of self-and semi-supervised learning to tabular domain","volume":"33","author":"Yoon Jinsung","year":"2020","unstructured":"Jinsung Yoon , Yao Zhang , James Jordon , and Mihaela van der Schaar . 2020 . Vime: Extending the success of self-and semi-supervised learning to tabular domain . Advances in Neural Information Processing Systems , Vol. 33 (2020), 11033 -- 11043 . Jinsung Yoon, Yao Zhang, James Jordon, and Mihaela van der Schaar. 2020. Vime: Extending the success of self-and semi-supervised learning to tabular domain. Advances in Neural Information Processing Systems, Vol. 33 (2020), 11033--11043.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_2_68_1","volume-title":"mixup: Beyond empirical risk minimization. arXiv preprint arXiv:1710.09412","author":"Zhang Hongyi","year":"2017","unstructured":"Hongyi Zhang , Moustapha Cisse , Yann N Dauphin , and David Lopez-Paz . 2017. mixup: Beyond empirical risk minimization. arXiv preprint arXiv:1710.09412 ( 2017 ). Hongyi Zhang, Moustapha Cisse, Yann N Dauphin, and David Lopez-Paz. 2017. mixup: Beyond empirical risk minimization. arXiv preprint arXiv:1710.09412 (2017)."},{"key":"e_1_3_2_2_69_1","volume-title":"Deep learning for click-through rate estimation. IJCAI","author":"Zhang Weinan","year":"2021","unstructured":"Weinan Zhang , Jiarui Qin , Wei Guo , Ruiming Tang , and Xiuqiang He. 2021. Deep learning for click-through rate estimation. IJCAI ( 2021 ). Weinan Zhang, Jiarui Qin, Wei Guo, Ruiming Tang, and Xiuqiang He. 2021. Deep learning for click-through rate estimation. IJCAI (2021)."}],"event":{"name":"KDD '23: The 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining","location":"Long Beach CA USA","acronym":"KDD '23","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data"]},"container-title":["Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3580305.3599422","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3580305.3599422","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:37:36Z","timestamp":1750178256000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3580305.3599422"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,4]]},"references-count":69,"alternative-id":["10.1145\/3580305.3599422","10.1145\/3580305"],"URL":"https:\/\/doi.org\/10.1145\/3580305.3599422","relation":{},"subject":[],"published":{"date-parts":[[2023,8,4]]},"assertion":[{"value":"2023-08-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}