{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T10:06:33Z","timestamp":1775815593456,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":67,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,9,13]],"date-time":"2021-09-13T00:00:00Z","timestamp":1631491200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,9,13]]},"DOI":"10.1145\/3460231.3474255","type":"proceedings-article","created":{"date-parts":[[2021,9,13]],"date-time":"2021-09-13T21:45:04Z","timestamp":1631569504000},"page":"143-153","source":"Crossref","is-referenced-by-count":136,"title":["Transformers4Rec: Bridging the Gap between NLP and Sequential \/ Session-Based Recommendation"],"prefix":"10.1145","author":[{"given":"Gabriel","family":"de Souza Pereira Moreira","sequence":"first","affiliation":[{"name":"NVIDIA, Brazil"}]},{"given":"Sara","family":"Rabhi","sequence":"additional","affiliation":[{"name":"NVIDIA, Canada"}]},{"given":"Jeong Min","family":"Lee","sequence":"additional","affiliation":[{"name":"Facebook AI, United States"}]},{"given":"Ronay","family":"Ak","sequence":"additional","affiliation":[{"name":"NVIDIA, United States"}]},{"given":"Even","family":"Oldridge","sequence":"additional","affiliation":[{"name":"NVIDIA, Canada"}]}],"member":"320","published-online":{"date-parts":[[2021,9,13]]},"reference":[{"key":"e_1_3_2_2_1_1","unstructured":"2021. Transformers4Rec paper online appendix. https:\/\/github.com\/NVIDIA-Merlin\/publications\/tree\/main\/2021_acm_recsys_transformers4rec 2021. Transformers4Rec paper online appendix. https:\/\/github.com\/NVIDIA-Merlin\/publications\/tree\/main\/2021_acm_recsys_transformers4rec"},{"key":"e_1_3_2_2_2_1","unstructured":"Jimmy\u00a0Lei Ba Jamie\u00a0Ryan Kiros and Geoffrey\u00a0E Hinton. 2016. Layer normalization. arXiv preprint arXiv:1607.06450(2016). Jimmy\u00a0Lei Ba Jamie\u00a0Ryan Kiros and Geoffrey\u00a0E Hinton. 2016. Layer normalization. arXiv preprint arXiv:1607.06450(2016)."},{"key":"e_1_3_2_2_3_1","unstructured":"Dzmitry Bahdanau Kyunghyun Cho and Yoshua Bengio. 2016. Neural Machine Translation by Jointly Learning to Align and Translate. arxiv:1409.0473\u00a0[cs.CL] Dzmitry Bahdanau Kyunghyun Cho and Yoshua Bengio. 2016. Neural Machine Translation by Jointly Learning to Align and Translate. arxiv:1409.0473\u00a0[cs.CL]"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3159652.3159727"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3326937.3341261"},{"key":"e_1_3_2_2_6_1","volume-title":"Proceedings of the 27th ACM International Conference on Multimedia. 2597\u20132601","author":"Chen Xusong","year":"2019","unstructured":"Xusong Chen , Dong Liu , Chenyi Lei , Rui Li , Zheng-Jun Zha , and Zhiwei Xiong . 2019 . BERT4SessRec: Content-Based Video Relevance Prediction with Bidirectional Encoder Representations from Transformer . In Proceedings of the 27th ACM International Conference on Multimedia. 2597\u20132601 . Xusong Chen, Dong Liu, Chenyi Lei, Rui Li, Zheng-Jun Zha, and Zhiwei Xiong. 2019. BERT4SessRec: Content-Based Video Relevance Prediction with Bidirectional Encoder Representations from Transformer. In Proceedings of the 27th ACM International Conference on Multimedia. 2597\u20132601."},{"key":"e_1_3_2_2_7_1","unstructured":"Kyunghyun Cho Bart Van\u00a0Merri\u00ebnboer Caglar Gulcehre Dzmitry Bahdanau Fethi Bougares Holger Schwenk and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078(2014). Kyunghyun Cho Bart Van\u00a0Merri\u00ebnboer Caglar Gulcehre Dzmitry Bahdanau Fethi Bougares Holger Schwenk and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078(2014)."},{"key":"e_1_3_2_2_8_1","volume-title":"ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators. arxiv:2003.10555\u00a0[cs.CL]","author":"Clark Kevin","year":"2020","unstructured":"Kevin Clark , Minh-Thang Luong , Quoc\u00a0 V. Le , and Christopher\u00a0 D. Manning . 2020 . ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators. arxiv:2003.10555\u00a0[cs.CL] Kevin Clark, Minh-Thang Luong, Quoc\u00a0V. Le, and Christopher\u00a0D. Manning. 2020. ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators. arxiv:2003.10555\u00a0[cs.CL]"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1285"},{"key":"e_1_3_2_2_10_1","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","volume":"1","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2019 . BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding . In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , Volume 1 . Association for Computational Linguistics, Minneapolis, Minnesota, 4171\u20134186. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, Minneapolis, Minnesota, 4171\u20134186."},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3426723"},{"key":"e_1_3_2_2_12_1","unstructured":"Jun Fang. 2021. Session-based Recommendation with Self-Attention Networks. arXiv preprint arXiv:2102.01922(2021). Jun Fang. 2021. Session-based Recommendation with Self-Attention Networks. arXiv preprint arXiv:2102.01922(2021)."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3331184.3331322"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2783258.2788627"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3106426.3109436"},{"key":"e_1_3_2_2_16_1","volume-title":"International Conference on Machine Learning. PMLR, 1321\u20131330","author":"Guo Chuan","year":"2017","unstructured":"Chuan Guo , Geoff Pleiss , Yu Sun , and Kilian\u00a0 Q Weinberger . 2017 . On calibration of modern neural networks . In International Conference on Machine Learning. PMLR, 1321\u20131330 . Chuan Guo, Geoff Pleiss, Yu Sun, and Kilian\u00a0Q Weinberger. 2017. On calibration of modern neural networks. In International Conference on Machine Learning. PMLR, 1321\u20131330."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3269206.3271761"},{"key":"e_1_3_2_2_18_1","unstructured":"Bal\u00e1zs Hidasi Alexandros Karatzoglou Linas Baltrunas and Domonkos Tikk. 2015. Session-based recommendations with recurrent neural networks. arXiv preprint arXiv:1511.06939(2015). Bal\u00e1zs Hidasi Alexandros Karatzoglou Linas Baltrunas and Domonkos Tikk. 2015. Session-based recommendations with recurrent neural networks. arXiv preprint arXiv:1511.06939(2015)."},{"key":"e_1_3_2_2_19_1","volume-title":"Proceedings of Fourth International Conference on Learning Representations (ICLR\u201916)","author":"Hidasi Bal\u00e1zs","year":"2016","unstructured":"Bal\u00e1zs Hidasi , Alexandros Karatzoglou , Linas Baltrunas , and Domonkos Tikk . 2016 . Session-based recommendations with recurrent neural networks . In Proceedings of Fourth International Conference on Learning Representations (ICLR\u201916) . Bal\u00e1zs Hidasi, Alexandros Karatzoglou, Linas Baltrunas, and Domonkos Tikk. 2016. Session-based recommendations with recurrent neural networks. In Proceedings of Fourth International Conference on Learning Representations (ICLR\u201916)."},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2959100.2959167"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240508.3240609"},{"key":"e_1_3_2_2_22_1","unstructured":"Hakan Inan Khashayar Khosravi and Richard Socher. 2016. Tying word vectors and word classifiers: A loss framework for language modeling. arXiv preprint arXiv:1611.01462(2016). Hakan Inan Khashayar Khosravi and Richard Socher. 2016. Tying word vectors and word classifiers: A loss framework for language modeling. arXiv preprint arXiv:1611.01462(2016)."},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/3109859.3109872"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2018.00035"},{"key":"e_1_3_2_2_25_1","volume-title":"Le and Tomas Mikolov","author":"V.","year":"2014","unstructured":"Quoc\u00a0 V. Le and Tomas Mikolov . 2014 . Distributed Representations of Sentences and Documents . arxiv:1405.4053\u00a0[cs.CL] Quoc\u00a0V. Le and Tomas Mikolov. 2014. Distributed Representations of Sentences and Documents. arxiv:1405.4053\u00a0[cs.CL]"},{"key":"e_1_3_2_2_26_1","unstructured":"Jing Li Pengjie Ren Zhumin Chen Zhaochun Ren and Jun Ma. 2017. Neural Attentive Session-based Recommendation. arxiv:1711.04725\u00a0[cs.IR] Jing Li Pengjie Ren Zhumin Chen Zhaochun Ren and Jun Ma. 2017. Neural Attentive Session-based Recommendation. arxiv:1711.04725\u00a0[cs.IR]"},{"key":"e_1_3_2_2_27_1","unstructured":"Yang Li Nan Du and Samy Bengio. 2017. Time-dependent representation for neural event sequence prediction. arXiv preprint arXiv:1708.00065(2017). Yang Li Nan Du and Samy Bengio. 2017. Time-dependent representation for neural event sequence prediction. arXiv preprint arXiv:1708.00065(2017)."},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3219819.3219950"},{"key":"e_1_3_2_2_29_1","unstructured":"Yinhan Liu Myle Ott Naman Goyal Jingfei Du Mandar Joshi Danqi Chen Omer Levy Mike Lewis Luke Zettlemoyer and Veselin Stoyanov. 2019. RoBERTa: A Robustly Optimized BERT Pretraining Approach. arxiv:1907.11692\u00a0[cs.CL] Yinhan Liu Myle Ott Naman Goyal Jingfei Du Mandar Joshi Danqi Chen Omer Levy Mike Lewis Luke Zettlemoyer and Veselin Stoyanov. 2019. RoBERTa: A Robustly Optimized BERT Pretraining Approach. arxiv:1907.11692\u00a0[cs.CL]"},{"key":"e_1_3_2_2_30_1","volume-title":"Decoupled Weight Decay Regularization. In 7th International Conference on Learning Representations, ICLR 2019","author":"Loshchilov Ilya","year":"2019","unstructured":"Ilya Loshchilov and Frank Hutter . 2019 . Decoupled Weight Decay Regularization. In 7th International Conference on Learning Representations, ICLR 2019 , New Orleans, LA, USA , May 6-9, 2019. OpenReview.net. https:\/\/openreview.net\/forum?id=Bkg6RiCqY7 Ilya Loshchilov and Frank Hutter. 2019. Decoupled Weight Decay Regularization. In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019. OpenReview.net. https:\/\/openreview.net\/forum?id=Bkg6RiCqY7"},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11257-018-9209-6"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/3298689.3347041"},{"key":"e_1_3_2_2_33_1","unstructured":"Anjing Luo Pengpeng Zhao Yanchi Liu Fuzhen Zhuang Deqing Wang Jiajie Xu Junhua Fang and Victor\u00a0S Sheng. [n.d.]. Collaborative Self-Attention Network for Session-based Recommendation. ([n.\u00a0d.]). Anjing Luo Pengpeng Zhao Yanchi Liu Fuzhen Zhuang Deqing Wang Jiajie Xu Junhua Fang and Victor\u00a0S Sheng. [n.d.]. Collaborative Self-Attention Network for Session-based Recommendation. ([n.\u00a0d.])."},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/3357384.3357818"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"crossref","unstructured":"Ludewig Malte Noemi Mauro Latifi Sara Jannach Dietmar 2020. Empirical analysis of session-based recommendation algorithms. (2020). Ludewig Malte Noemi Mauro Latifi Sara Jannach Dietmar 2020. Empirical analysis of session-based recommendation algorithms. (2020).","DOI":"10.1007\/s11257-020-09277-1"},{"key":"e_1_3_2_2_36_1","unstructured":"Tomas Mikolov Kai Chen Greg Corrado and Jeffrey Dean. 2013. Efficient Estimation of Word Representations in Vector Space. arxiv:1301.3781\u00a0[cs.CL] Tomas Mikolov Kai Chen Greg Corrado and Jeffrey Dean. 2013. Efficient Estimation of Word Representations in Vector Space. arxiv:1301.3781\u00a0[cs.CL]"},{"key":"e_1_3_2_2_37_1","unstructured":"Tomas Mikolov Ilya Sutskever Kai Chen Greg Corrado and Jeffrey Dean. 2013. Distributed Representations of Words and Phrases and their Compositionality. arxiv:1310.4546\u00a0[cs.CL] Tomas Mikolov Ilya Sutskever Kai Chen Greg Corrado and Jeffrey Dean. 2013. Distributed Representations of Words and Phrases and their Compositionality. arxiv:1310.4546\u00a0[cs.CL]"},{"key":"e_1_3_2_2_38_1","unstructured":"Sarai Mizrachi and Pavel Levin. 2019. Combining Context Features in Sequence-Aware Recommender Systems. In RecSys (Late-Breaking Results). 11\u201315. Sarai Mizrachi and Pavel Levin. 2019. Combining Context Features in Sequence-Aware Recommender Systems. In RecSys (Late-Breaking Results). 11\u201315."},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3270323.3270328"},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2954957"},{"key":"e_1_3_2_2_41_1","volume-title":"Proceedings of the 7th International Workshop on News Recommendation and Analytics (INRA 2019). CEUR Workshop Proceedings Vol-2554","author":"de Souza\u00a0Pereira Moreira Gabriel","year":"2019","unstructured":"Gabriel de Souza\u00a0Pereira Moreira , Dietmar Jannach , and Adilson\u00a0Marques da Cunha . 2019 . On the importance of news content representation in hybrid neural session-based recommender systems , In Proceedings of the 7th International Workshop on News Recommendation and Analytics (INRA 2019). CEUR Workshop Proceedings Vol-2554 . Gabriel de Souza\u00a0Pereira Moreira, Dietmar Jannach, and Adilson\u00a0Marques da Cunha. 2019. On the importance of news content representation in hybrid neural session-based recommender systems, In Proceedings of the 7th International Workshop on News Recommendation and Analytics (INRA 2019). CEUR Workshop Proceedings Vol-2554."},{"key":"e_1_3_2_2_42_1","volume-title":"Proceedings of the Fifth SIGIR eCommerce Workshop","author":"de Souza\u00a0Pereira Moreira Gabriel","year":"2021","unstructured":"Gabriel de Souza\u00a0Pereira Moreira , Sara Rabhi , Ronay Ak , Md\u00a0Yasin Kabir , and Even Oldridge . 2021 . Transformers with multi-modal features and post-fusion context for e-commerce session-based recommendation . In Proceedings of the Fifth SIGIR eCommerce Workshop 2021. Gabriel de Souza\u00a0Pereira Moreira, Sara Rabhi, Ronay Ak, Md\u00a0Yasin Kabir, and Even Oldridge. 2021. Transformers with multi-modal features and post-fusion context for e-commerce session-based recommendation. In Proceedings of the Fifth SIGIR eCommerce Workshop 2021."},{"key":"e_1_3_2_2_43_1","volume-title":"Advances in Neural Information Processing Systems, H.\u00a0Wallach, H.\u00a0Larochelle, A.\u00a0Beygelzimer, F.\u00a0d'Alch\u00e9-Buc, E.\u00a0Fox, and R.\u00a0Garnett (Eds.), Vol.\u00a032. Curran Associates","author":"M\u00fcller Rafael","year":"2019","unstructured":"Rafael M\u00fcller , Simon Kornblith , and Geoffrey\u00a0 E Hinton . 2019. When does label smoothing help? . In Advances in Neural Information Processing Systems, H.\u00a0Wallach, H.\u00a0Larochelle, A.\u00a0Beygelzimer, F.\u00a0d'Alch\u00e9-Buc, E.\u00a0Fox, and R.\u00a0Garnett (Eds.), Vol.\u00a032. Curran Associates , Inc .https:\/\/proceedings.neurips.cc\/paper\/ 2019 \/file\/f1748d6b0fd9d439f71450117eba2725-Paper.pdf Rafael M\u00fcller, Simon Kornblith, and Geoffrey\u00a0E Hinton. 2019. When does label smoothing help?. In Advances in Neural Information Processing Systems, H.\u00a0Wallach, H.\u00a0Larochelle, A.\u00a0Beygelzimer, F.\u00a0d'Alch\u00e9-Buc, E.\u00a0Fox, and R.\u00a0Garnett (Eds.), Vol.\u00a032. Curran Associates, Inc.https:\/\/proceedings.neurips.cc\/paper\/2019\/file\/f1748d6b0fd9d439f71450117eba2725-Paper.pdf"},{"key":"e_1_3_2_2_44_1","volume-title":"Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 1837\u20131840","author":"Pan Zhiqiang","year":"2020","unstructured":"Zhiqiang Pan , Fei Cai , Yanxiang Ling , and Maarten de Rijke . 2020 . Rethinking Item Importance in Session-based Recommendation . In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 1837\u20131840 . Zhiqiang Pan, Fei Cai, Yanxiang Ling, and Maarten de Rijke. 2020. Rethinking Item Importance in Session-based Recommendation. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 1837\u20131840."},{"key":"e_1_3_2_2_45_1","doi-asserted-by":"crossref","unstructured":"Jeffrey Pennington Richard Socher and Christopher\u00a0D. Manning. 2014. GloVe: Global Vectors for Word Representation. In Empirical Methods in Natural Language Processing (EMNLP). 1532\u20131543. http:\/\/www.aclweb.org\/anthology\/D14-1162 Jeffrey Pennington Richard Socher and Christopher\u00a0D. Manning. 2014. GloVe: Global Vectors for Word Representation. In Empirical Methods in Natural Language Processing (EMNLP). 1532\u20131543. http:\/\/www.aclweb.org\/anthology\/D14-1162","DOI":"10.3115\/v1\/D14-1162"},{"key":"e_1_3_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/E17-2025"},{"key":"e_1_3_2_2_47_1","unstructured":"A. Radford and Karthik Narasimhan. 2018. Improving Language Understanding by Generative Pre-Training. A. Radford and Karthik Narasimhan. 2018. Improving Language Understanding by Generative Pre-Training."},{"key":"e_1_3_2_2_48_1","volume-title":"Proceedings of the ACM WSDM Workshop on Web Tourism (WSDM WebTour\u201921)","author":"Schifferer Benedikt","year":"2021","unstructured":"Benedikt Schifferer , Chris Deotte , Jean-Fran\u0107ois Puget , Gabriel de Souza\u00a0Pereira Moreira , Gilberto Titericz , Jiwei Liu , and Ronay Ak . 2021 . Using Deep Learning to Win the Booking.com WSDMWebTour21 Challenge on Sequential Recommendations (to be published). https:\/\/www.bookingchallenge.com\/ . In Proceedings of the ACM WSDM Workshop on Web Tourism (WSDM WebTour\u201921) . Benedikt Schifferer, Chris Deotte, Jean-Fran\u0107ois Puget, Gabriel de Souza\u00a0Pereira Moreira, Gilberto Titericz, Jiwei Liu, and Ronay Ak. 2021. Using Deep Learning to Win the Booking.com WSDMWebTour21 Challenge on Sequential Recommendations (to be published). https:\/\/www.bookingchallenge.com\/. In Proceedings of the ACM WSDM Workshop on Web Tourism (WSDM WebTour\u201921)."},{"key":"e_1_3_2_2_49_1","volume-title":"Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research 15, 1","author":"Srivastava Nitish","year":"2014","unstructured":"Nitish Srivastava , Geoffrey Hinton , Alex Krizhevsky , Ilya Sutskever , and Ruslan Salakhutdinov . 2014. Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research 15, 1 ( 2014 ), 1929\u20131958. Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research 15, 1 (2014), 1929\u20131958."},{"key":"e_1_3_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/3357384.3357895"},{"key":"e_1_3_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2931945"},{"key":"e_1_3_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.308"},{"key":"e_1_3_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/2988450.2988452"},{"key":"e_1_3_2_2_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/2959100.2959160"},{"key":"e_1_3_2_2_55_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N Gomez Lukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. arXiv preprint arXiv:1706.03762(2017). Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N Gomez Lukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. arXiv preprint arXiv:1706.03762(2017)."},{"key":"e_1_3_2_2_56_1","unstructured":"Shoujin Wang Longbing Cao Yan Wang Quan\u00a0Z Sheng Mehmet Orgun and Defu Lian. 2019. A survey on session-based recommender systems. arXiv preprint arXiv:1902.04864(2019). Shoujin Wang Longbing Cao Yan Wang Quan\u00a0Z Sheng Mehmet Orgun and Defu Lian. 2019. A survey on session-based recommender systems. arXiv preprint arXiv:1902.04864(2019)."},{"key":"e_1_3_2_2_57_1","doi-asserted-by":"crossref","unstructured":"Thomas Wolf Lysandre Debut Victor Sanh Julien Chaumond Clement Delangue Anthony Moi Pierric Cistac Tim Rault R\u00e9mi Louf Morgan Funtowicz 2019. HuggingFace\u2019s Transformers: State-of-the-art natural language processing. arXiv preprint arXiv:1910.03771(2019). Thomas Wolf Lysandre Debut Victor Sanh Julien Chaumond Clement Delangue Anthony Moi Pierric Cistac Tim Rault R\u00e9mi Louf Morgan Funtowicz 2019. HuggingFace\u2019s Transformers: State-of-the-art natural language processing. arXiv preprint arXiv:1910.03771(2019).","DOI":"10.18653\/v1\/2020.emnlp-demos.6"},{"key":"e_1_3_2_2_58_1","volume-title":"Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers. In Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019","author":"Wu Liwei","year":"2019","unstructured":"Liwei Wu , Shuqing Li , Cho-Jui Hsieh , and James\u00a0 L. Sharpnack . 2019 . Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers. In Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019 , NeurIPS 2019, December 8-14, 2019, Vancouver, BC, Canada, Hanna\u00a0M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d\u2019Alch\u00e9-Buc, Emily\u00a0B. Fox, and Roman Garnett(Eds.). 24\u201334. https:\/\/proceedings.neurips.cc\/paper\/2019\/hash\/37693cfc748049e45d87b8c7d8b9aacd-Abstract.html Liwei Wu, Shuqing Li, Cho-Jui Hsieh, and James\u00a0L. Sharpnack. 2019. Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers. In Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8-14, 2019, Vancouver, BC, Canada, Hanna\u00a0M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d\u2019Alch\u00e9-Buc, Emily\u00a0B. Fox, and Roman Garnett(Eds.). 24\u201334. https:\/\/proceedings.neurips.cc\/paper\/2019\/hash\/37693cfc748049e45d87b8c7d8b9aacd-Abstract.html"},{"key":"e_1_3_2_2_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/3383313.3412258"},{"key":"e_1_3_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.3301346"},{"key":"e_1_3_2_2_61_1","doi-asserted-by":"crossref","unstructured":"Jun Xiao Hao Ye Xiangnan He Hanwang Zhang Fei Wu and Tat-Seng Chua. 2017. Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks. arxiv:1708.04617\u00a0[cs.LG] Jun Xiao Hao Ye Xiangnan He Hanwang Zhang Fei Wu and Tat-Seng Chua. 2017. Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks. arxiv:1708.04617\u00a0[cs.LG]","DOI":"10.24963\/ijcai.2017\/435"},{"key":"e_1_3_2_2_62_1","doi-asserted-by":"crossref","unstructured":"Chengfeng Xu Pengpeng Zhao Yanchi Liu Victor\u00a0S Sheng Jiajie Xu Fuzhen Zhuang Junhua Fang and Xiaofang Zhou. 2019. Graph Contextualized Self-Attention Network for Session-based Recommendation.. In IJCAI Vol.\u00a019. 3940\u20133946. Chengfeng Xu Pengpeng Zhao Yanchi Liu Victor\u00a0S Sheng Jiajie Xu Fuzhen Zhuang Junhua Fang and Xiaofang Zhou. 2019. Graph Contextualized Self-Attention Network for Session-based Recommendation.. In IJCAI Vol.\u00a019. 3940\u20133946.","DOI":"10.24963\/ijcai.2019\/547"},{"key":"e_1_3_2_2_63_1","volume-title":"Ruslan Salakhutdinov, and Quoc\u00a0V. Le.","author":"Yang Zhilin","year":"2019","unstructured":"Zhilin Yang , Zihang Dai , Yiming Yang Jaime\u00a0G. Carbonell , Ruslan Salakhutdinov, and Quoc\u00a0V. Le. 2019 . XLNet: Generalized Autoregressive Pretraining for Language Understanding. In Advances in Neural Information Processing Systems . Zhilin Yang, Zihang Dai, Yiming Yang Jaime\u00a0G. Carbonell, Ruslan Salakhutdinov, and Quoc\u00a0V. Le. 2019. XLNet: Generalized Autoregressive Pretraining for Language Understanding. In Advances in Neural Information Processing Systems."},{"key":"e_1_3_2_2_64_1","volume-title":"Thirty-Third AAAI Conference on Artificial Intelligence, Vol.\u00a09.","author":"Zhang Shuai","year":"2019","unstructured":"Shuai Zhang , Yi Tay , Lina Yao , Aixin Sun , and Jake An . 2019 . Next item recommendation with self-attentive metric learning . In Thirty-Third AAAI Conference on Artificial Intelligence, Vol.\u00a09. Shuai Zhang, Yi Tay, Lina Yao, Aixin Sun, and Jake An. 2019. Next item recommendation with self-attentive metric learning. In Thirty-Third AAAI Conference on Artificial Intelligence, Vol.\u00a09."},{"key":"e_1_3_2_2_65_1","volume-title":"Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 1479\u20131488","author":"Zhang Yang","year":"2020","unstructured":"Yang Zhang , Fuli Feng , Chenxu Wang , Xiangnan He , Meng Wang , Yan Li , and Yongdong Zhang . 2020 . How to retrain recommender system? A sequential meta-learning method . In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 1479\u20131488 . Yang Zhang, Fuli Feng, Chenxu Wang, Xiangnan He, Meng Wang, Yan Li, and Yongdong Zhang. 2020. How to retrain recommender system? A sequential meta-learning method. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 1479\u20131488."},{"key":"e_1_3_2_2_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP40776.2020.9054639"},{"key":"e_1_3_2_2_67_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.11618"}],"event":{"name":"RecSys '21: Fifteenth ACM Conference on Recommender Systems","location":"Amsterdam Netherlands","acronym":"RecSys '21","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SIGAI ACM Special Interest Group on Artificial Intelligence","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data","SIGIR ACM Special Interest Group on Information Retrieval","SIGCHI ACM Special Interest Group on Computer-Human Interaction","SIGecom Special Interest Group on Economics and Computation"]},"container-title":["Fifteenth ACM Conference on Recommender Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3460231.3474255","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3460231.3474255","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:12:17Z","timestamp":1750191137000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3460231.3474255"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,9,13]]},"references-count":67,"alternative-id":["10.1145\/3460231.3474255","10.1145\/3460231"],"URL":"https:\/\/doi.org\/10.1145\/3460231.3474255","relation":{},"subject":[],"published":{"date-parts":[[2021,9,13]]}}}