{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,21]],"date-time":"2026-02-21T19:38:47Z","timestamp":1771702727611,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":24,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,9,13]],"date-time":"2021-09-13T00:00:00Z","timestamp":1631491200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,9,13]]},"DOI":"10.1145\/3460231.3478864","type":"proceedings-article","created":{"date-parts":[[2021,9,13]],"date-time":"2021-09-13T21:45:04Z","timestamp":1631569504000},"page":"714-718","source":"Crossref","is-referenced-by-count":7,"title":["Sequence Adaptation via Reinforcement Learning in Recommender Systems"],"prefix":"10.1145","author":[{"given":"Stefanos","family":"Antaris","sequence":"first","affiliation":[{"name":"KTH Royal Institute of Technology, Sweden"}]},{"given":"Dimitrios","family":"Rafailidis","sequence":"additional","affiliation":[{"name":"University of Thessaly, Greece"}]}],"member":"320","published-online":{"date-parts":[[2021,9,13]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"POG: Personalized Outfit Generation for Fashion Recommendation at Alibaba IFashion. In KDD. 2662\u20132670.","author":"Chen Wen","year":"2019","unstructured":"Wen Chen , Pipei Huang , Jiaming Xu , Xin Guo , Cheng Guo , Fei Sun , Chao Li , Andreas Pfadler , Huan Zhao , and Binqiang Zhao . 2019 . POG: Personalized Outfit Generation for Fashion Recommendation at Alibaba IFashion. In KDD. 2662\u20132670. Wen Chen, Pipei Huang, Jiaming Xu, Xin Guo, Cheng Guo, Fei Sun, Chao Li, Andreas Pfadler, Huan Zhao, and Binqiang Zhao. 2019. POG: Personalized Outfit Generation for Fashion Recommendation at Alibaba IFashion. In KDD. 2662\u20132670."},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"crossref","unstructured":"Andres Ferraro Dietmar Jannach and Xavier Serra. 2020. Exploring Longitudinal Effects of Session-based Recommendations. In RecSys. 474\u2013479. Andres Ferraro Dietmar Jannach and Xavier Serra. 2020. Exploring Longitudinal Effects of Session-based Recommendations. In RecSys. 474\u2013479.","DOI":"10.1145\/3383313.3412213"},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"crossref","unstructured":"Casper Hansen Christian Hansen Lucas Maystre Rishabh Mehrotra Brian Brost Federico Tomasi and Mounia Lalmas. 2020. Contextual and Sequential User Embeddings for Large-Scale Music Recommendation. In RecSys. 53\u201362. Casper Hansen Christian Hansen Lucas Maystre Rishabh Mehrotra Brian Brost Federico Tomasi and Mounia Lalmas. 2020. Contextual and Sequential User Embeddings for Large-Scale Music Recommendation. In RecSys. 53\u201362.","DOI":"10.1145\/3383313.3412248"},{"key":"e_1_3_2_2_4_1","volume":"201","author":"He Ruining","unstructured":"Ruining He and Julian\u00a0 J. McAuley. 201 6. Ups and Downs: Modeling the Visual Evolution of Fashion Trends with One-Class Collaborative Filtering. In WWW. 507\u2013517. Ruining He and Julian\u00a0J. McAuley. 2016. Ups and Downs: Modeling the Visual Evolution of Fashion Trends with One-Class Collaborative Filtering. In WWW. 507\u2013517.","journal-title":"J. McAuley."},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"crossref","unstructured":"Bal\u00e1zs Hidasi and Alexandros Karatzoglou. 2018. Recurrent Neural Networks with Top-k Gains for Session-based Recommendations. In CIKM. 843\u2013852. Bal\u00e1zs Hidasi and Alexandros Karatzoglou. 2018. Recurrent Neural Networks with Top-k Gains for Session-based Recommendations. In CIKM. 843\u2013852.","DOI":"10.1145\/3269206.3271761"},{"key":"e_1_3_2_2_6_1","unstructured":"Bal\u00e1zs Hidasi Alexandros Karatzoglou Linas Baltrunas and Domonkos Tikk. 2016. Session-based Recommendations with Recurrent Neural Networks. arxiv:1511.06939 Bal\u00e1zs Hidasi Alexandros Karatzoglou Linas Baltrunas and Domonkos Tikk. 2016. Session-based Recommendations with Recurrent Neural Networks. arxiv:1511.06939"},{"key":"e_1_3_2_2_7_1","unstructured":"Yujing Hu Qing Da Anxiang Zeng Yang Yu and Yinghui Xu. 2018. Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization Analysis and Application. In KDD. 368\u2013377. Yujing Hu Qing Da Anxiang Zeng Yang Yu and Yinghui Xu. 2018. Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization Analysis and Application. In KDD. 368\u2013377."},{"key":"e_1_3_2_2_8_1","unstructured":"Wendi Ji Keqiang Wang Xiaoling Wang Tingwei Chen and Alexandra Cristea. 2020. Sequential Recommender via Time-Aware Attentive Memory Network. In CIKM. 565\u2013574. Wendi Ji Keqiang Wang Xiaoling Wang Tingwei Chen and Alexandra Cristea. 2020. Sequential Recommender via Time-Aware Attentive Memory Network. In CIKM. 565\u2013574."},{"key":"e_1_3_2_2_9_1","unstructured":"Christos Kaplanis Claudia Clopath and Murray Shanahan. 2020. Continual Reinforcement Learning with Multi-Timescale Replay. arXiv preprint arXiv:2004.07530(2020). Christos Kaplanis Claudia Clopath and Murray Shanahan. 2020. Continual Reinforcement Learning with Multi-Timescale Replay. arXiv preprint arXiv:2004.07530(2020)."},{"key":"e_1_3_2_2_10_1","unstructured":"Christos Kaplanis Murray Shanahan and Claudia Clopath. 2019. Policy Consolidation for Continual Reinforcement Learning. In ICML. 3242\u20133251. Christos Kaplanis Murray Shanahan and Claudia Clopath. 2019. Policy Consolidation for Continual Reinforcement Learning. In ICML. 3242\u20133251."},{"key":"e_1_3_2_2_11_1","volume-title":"Kingma and Jimmy Ba","author":"P.","year":"2017","unstructured":"Diederik\u00a0 P. Kingma and Jimmy Ba . 2017 . Adam : A Method for Stochastic Optimization . arxiv:1412.6980 Diederik\u00a0P. Kingma and Jimmy Ba. 2017. Adam: A Method for Stochastic Optimization. arxiv:1412.6980"},{"key":"e_1_3_2_2_12_1","unstructured":"Vijay\u00a0R Konda and John\u00a0N Tsitsiklis. 2000. Actor-critic algorithms. In NeurIPS. 1008\u20131014. Vijay\u00a0R Konda and John\u00a0N Tsitsiklis. 2000. Actor-critic algorithms. In NeurIPS. 1008\u20131014."},{"key":"e_1_3_2_2_13_1","volume-title":"Social Attentive Deep Q-networks for Recommender Systems. TKDE","author":"Lei Yu","year":"2020","unstructured":"Yu Lei , Zhitao Wang , Wenjie Li , Hongbin Pei , and Quanyu Dai . 2020. Social Attentive Deep Q-networks for Recommender Systems. TKDE ( 2020 ), 1\u20131. Yu Lei, Zhitao Wang, Wenjie Li, Hongbin Pei, and Quanyu Dai. 2020. Social Attentive Deep Q-networks for Recommender Systems. TKDE (2020), 1\u20131."},{"key":"e_1_3_2_2_14_1","unstructured":"Jiacheng Li Yujie Wang and Julian McAuley. 2020. Time Interval Aware Self-Attention for Sequential Recommendation. In WSDM. 322\u2013330. Jiacheng Li Yujie Wang and Julian McAuley. 2020. Time Interval Aware Self-Attention for Sequential Recommendation. In WSDM. 322\u2013330."},{"key":"e_1_3_2_2_15_1","unstructured":"Nicholas Lim Bryan Hooi See-Kiong Ng Xueou Wang Yong\u00a0Liang Goh Renrong Weng and Jagannadan Varadarajan. 2020. STP-UDGAT: Spatial-Temporal-Preference User Dimensional Graph Attention Network for Next POI Recommendation. In CIKM. 845\u2013854. Nicholas Lim Bryan Hooi See-Kiong Ng Xueou Wang Yong\u00a0Liang Goh Renrong Weng and Jagannadan Varadarajan. 2020. STP-UDGAT: Spatial-Temporal-Preference User Dimensional Graph Attention Network for Next POI Recommendation. In CIKM. 845\u2013854."},{"key":"e_1_3_2_2_16_1","unstructured":"Marko Mitrovic Ehsan Kazemi Moran Feldman Andreas Krause and Amin Karbasi. 2019. Adaptive Sequence Submodularity. In NeurIPS Vol.\u00a032. Marko Mitrovic Ehsan Kazemi Moran Feldman Andreas Krause and Amin Karbasi. 2019. Adaptive Sequence Submodularity. In NeurIPS Vol.\u00a032."},{"key":"e_1_3_2_2_17_1","unstructured":"Emilio Parisotto Francis Song Jack Rae Razvan Pascanu Caglar Gulcehre Siddhant Jayakumar Max Jaderberg Rapha\u00ebl\u00a0Lopez Kaufman Aidan Clark Seb Noury Matthew Botvinick Nicolas Heess and Raia Hadsell. 2020. Stabilizing Transformers for Reinforcement Learning. In ICML. 7487\u20137498. Emilio Parisotto Francis Song Jack Rae Razvan Pascanu Caglar Gulcehre Siddhant Jayakumar Max Jaderberg Rapha\u00ebl\u00a0Lopez Kaufman Aidan Clark Seb Noury Matthew Botvinick Nicolas Heess and Raia Hadsell. 2020. Stabilizing Transformers for Reinforcement Learning. In ICML. 7487\u20137498."},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"crossref","unstructured":"Jiarui Qin Kan Ren Yuchen Fang Weinan Zhang and Yong Yu. 2020. Sequential Recommendation with Dual Side Neighbor-Based Collaborative Relation Modeling. In WSDM. 465\u2013473. Jiarui Qin Kan Ren Yuchen Fang Weinan Zhang and Yong Yu. 2020. Sequential Recommendation with Dual Side Neighbor-Based Collaborative Relation Modeling. In WSDM. 465\u2013473.","DOI":"10.1145\/3336191.3371842"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"crossref","unstructured":"Weiping Song Zhiping Xiao Yifan Wang Laurent Charlin Ming Zhang and Jian Tang. 2019. Session-Based Social Recommendation via Dynamic Graph Attention Networks. In WSDM. 555\u2013563. Weiping Song Zhiping Xiao Yifan Wang Laurent Charlin Ming Zhang and Jian Tang. 2019. Session-Based Social Recommendation via Dynamic Graph Attention Networks. In WSDM. 555\u2013563.","DOI":"10.1145\/3289600.3290989"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"crossref","unstructured":"Sebastian Tschiatschek Adish Singla and Andreas Krause. 2017. Selecting Sequences of Items via Submodular Maximization. In AAAI. 2667\u20132673. Sebastian Tschiatschek Adish Singla and Andreas Krause. 2017. Selecting Sequences of Items via Submodular Maximization. In AAAI. 2667\u20132673.","DOI":"10.1609\/aaai.v31i1.10923"},{"key":"e_1_3_2_2_21_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N Gomez \u0141\u00a0ukasz Kaiser and Illia Polosukhin. 2017. Attention is All you Need. In NeurIPs Vol.\u00a030. Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N Gomez \u0141\u00a0ukasz Kaiser and Illia Polosukhin. 2017. Attention is All you Need. In NeurIPs Vol.\u00a030."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"crossref","unstructured":"Shoujin Wang Liang Hu Yan Wang Longbing Cao Quan\u00a0Z. Sheng and Mehmet Orgun. 2019. Sequential Recommender Systems: Challenges Progress and Prospects. In IJCAI-19. 6332\u20136338. Shoujin Wang Liang Hu Yan Wang Longbing Cao Quan\u00a0Z. Sheng and Mehmet Orgun. 2019. Sequential Recommender Systems: Challenges Progress and Prospects. In IJCAI-19. 6332\u20136338.","DOI":"10.24963\/ijcai.2019\/883"},{"key":"e_1_3_2_2_23_1","unstructured":"Liwei Wu Shuqing Li Cho-Jui Hsieh and James Sharpnack. 2020. SSE-PT: Sequential Recommendation Via Personalized Transformer. In RecSys. 328\u2013337. Liwei Wu Shuqing Li Cho-Jui Hsieh and James Sharpnack. 2020. SSE-PT: Sequential Recommendation Via Personalized Transformer. In RecSys. 328\u2013337."},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"crossref","unstructured":"Xin Xin Alexandros Karatzoglou Ioannis Arapakis and Joemon\u00a0M. Jose. 2020. Self-Supervised Reinforcement Learning for Recommender Systems. In SIGIR. 931\u2013940. Xin Xin Alexandros Karatzoglou Ioannis Arapakis and Joemon\u00a0M. Jose. 2020. Self-Supervised Reinforcement Learning for Recommender Systems. In SIGIR. 931\u2013940.","DOI":"10.1145\/3397271.3401147"}],"event":{"name":"RecSys '21: Fifteenth ACM Conference on Recommender Systems","location":"Amsterdam Netherlands","acronym":"RecSys '21","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SIGAI ACM Special Interest Group on Artificial Intelligence","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data","SIGIR ACM Special Interest Group on Information Retrieval","SIGCHI ACM Special Interest Group on Computer-Human Interaction","SIGecom Special Interest Group on Economics and Computation"]},"container-title":["Fifteenth ACM Conference on Recommender Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3460231.3478864","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3460231.3478864","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:48:30Z","timestamp":1750193310000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3460231.3478864"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,9,13]]},"references-count":24,"alternative-id":["10.1145\/3460231.3478864","10.1145\/3460231"],"URL":"https:\/\/doi.org\/10.1145\/3460231.3478864","relation":{},"subject":[],"published":{"date-parts":[[2021,9,13]]}}}