{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,4]],"date-time":"2026-07-04T02:33:16Z","timestamp":1783132396597,"version":"3.54.6"},"reference-count":103,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2023,8,18]],"date-time":"2023-08-18T00:00:00Z","timestamp":1692316800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"National Key Research and Development Program of China","award":["2021ZD0111802"],"award-info":[{"award-number":["2021ZD0111802"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["61972372, U19A2079, 62121002"],"award-info":[{"award-number":["61972372, U19A2079, 62121002"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"CCCD Key Lab of Ministry of Culture and Tourism"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Inf. Syst."],"published-print":{"date-parts":[[2024,1,31]]},"abstract":"<jats:p>While personalization increases the utility of recommender systems, it also brings the issue of<jats:italic>filter bubbles<\/jats:italic>. e.g., if the system keeps exposing and recommending the items that the user is interested in, it may also make the user feel bored and less satisfied. Existing work studies filter bubbles in static recommendation, where the effect of overexposure is hard to capture. In contrast, we believe it is more meaningful to study the issue in interactive recommendation and optimize long-term user satisfaction. Nevertheless, it is unrealistic to train the model online due to the high cost. As such, we have to leverage offline training data and disentangle the causal effect on user satisfaction.<\/jats:p><jats:p>To achieve this goal, we propose a counterfactual interactive recommender system (CIRS) that augments offline reinforcement learning (offline RL) with causal inference. The basic idea is to first learn a causal user model on historical data to capture the overexposure effect of items on user satisfaction. It then uses the learned causal user model to help the planning of the RL policy. To conduct evaluation offline, we innovatively create an authentic RL environment (KuaiEnv) based on a real-world fully observed user rating dataset. The experiments show the effectiveness of CIRS in bursting filter bubbles and achieving long-term success in interactive recommendation. The implementation of CIRS is available via https:\/\/github.com\/chongminggao\/ CIRS-codes.<\/jats:p>","DOI":"10.1145\/3594871","type":"journal-article","created":{"date-parts":[[2023,4,28]],"date-time":"2023-04-28T11:58:05Z","timestamp":1682683085000},"page":"1-27","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":67,"title":["CIRS: Bursting Filter Bubbles by Counterfactual Interactive Recommender System"],"prefix":"10.1145","volume":"42","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5187-9196","authenticated-orcid":false,"given":"Chongming","family":"Gao","sequence":"first","affiliation":[{"name":"University of Science and Technology of China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5369-884X","authenticated-orcid":false,"given":"Shiqi","family":"Wang","sequence":"additional","affiliation":[{"name":"Chongqing University"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4495-0732","authenticated-orcid":false,"given":"Shijun","family":"Li","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4752-2629","authenticated-orcid":false,"given":"Jiawei","family":"Chen","sequence":"additional","affiliation":[{"name":"Zhejiang University"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8472-7992","authenticated-orcid":false,"given":"Xiangnan","family":"He","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6540-0601","authenticated-orcid":false,"given":"Wenqiang","family":"Lei","sequence":"additional","affiliation":[{"name":"Sichuan University"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5667-5347","authenticated-orcid":false,"given":"Biao","family":"Li","sequence":"additional","affiliation":[{"name":"Kuaishou Technology Co., Ltd."}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7849-208X","authenticated-orcid":false,"given":"Yuan","family":"Zhang","sequence":"additional","affiliation":[{"name":"Kuaishou Technology Co., Ltd."}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9266-0780","authenticated-orcid":false,"given":"Peng","family":"Jiang","sequence":"additional","affiliation":[{"name":"Kuaishou Technology Co., Ltd."}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2023,8,18]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1145\/3383313.3412246"},{"key":"e_1_3_2_3_2","unstructured":"Xueying Bai Jian Guan and Hongning Wang. 2019. A Model-based reinforcement learning with adversarial training for online recommendation. In Proceedings of the 33rd International Conference on Neural Information Processing Systems . Curran Associates Inc. Red Hook NY 10735\u201310746."},{"key":"e_1_3_2_4_2","volume-title":"Proceedings of the ICML 2020 Tutorial","author":"Bareinboim Elias","year":"2020","unstructured":"Elias Bareinboim. 2020. Causal reinforcement learning. In Proceedings of the ICML 2020 Tutorial."},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1145\/3240323.3240360"},{"key":"e_1_3_2_6_2","volume-title":"Are Filter Bubbles Real?","author":"Bruns Axel","year":"2019","unstructured":"Axel Bruns. 2019. Are Filter Bubbles Real?John Wiley & Sons."},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1145\/3240323.3240370"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33013312"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1145\/3564284"},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1145\/3289600.3290999"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1145\/3437963.3441764"},{"key":"e_1_3_2_12_2","first-page":"1052","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Chen Xinshi","year":"2019","unstructured":"Xinshi Chen, Shuang Li, Hui Li, Shaohua Jiang, Yuan Qi, and Le Song. 2019. Generative adversarial user model for reinforcement learning based recommendation system. In Proceedings of the International Conference on Machine Learning. 1052\u20131061."},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2013.218"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1145\/3460231.3474261"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00511"},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1093\/poq\/nfw006"},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1145\/3539618.3591636"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.aiopen.2021.06.002"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1145\/3511808.3557220"},{"key":"e_1_3_2_20_2","first-page":"487","volume-title":"Proceedings of the DASFAA\u201919","author":"Gao Chongming","year":"2019","unstructured":"Chongming Gao, Shuai Yuan, Zhong Zhang, Hongzhi Yin, and Junming Shao. 2019. BLOMA: Explain collaborative filtering via boosted local rank-one matrix approximation. In Proceedings of the DASFAA\u201919. Springer, 487\u2013490."},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1145\/3172944.3172970"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1145\/3437963.3441824"},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1145\/3159652.3159687"},{"key":"e_1_3_2_24_2","first-page":"1725","volume-title":"Proceedings of the IJCAI\u201917","author":"Guo Huifeng","year":"2017","unstructured":"Huifeng Guo, Ruiming Tang, Yunming Ye, Zhenguo Li, and Xiuqiang He. 2017. DeepFM: A factorization-machine based neural network for CTR prediction. In Proceedings of the IJCAI\u201917. 1725\u20131731."},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401063"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1145\/963770.963772"},{"key":"e_1_3_2_27_2","volume-title":"Causal Inference: What If.","author":"Hern\u00e1n M. A.","year":"2020","unstructured":"M. A. Hern\u00e1n and J. M. Robins. 2020. Causal Inference: What If.Boca Raton: Chapman & Hall\/CRC."},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1145\/3477495.3531716"},{"key":"e_1_3_2_29_2","doi-asserted-by":"crossref","first-page":"190","DOI":"10.1145\/3383313.3412252","volume-title":"Proceedings of the 14th ACM Conference on Recommender Systems","author":"Huang Jin","year":"2020","unstructured":"Jin Huang, Harrie Oosterhuis, Maarten de Rijke, and Herke van Hoof. 2020. Keeping dataset biases out of the simulation: A debiased simulator for reinforcement learning based recommender systems. In Proceedings of the 14th ACM Conference on Recommender Systems. 190\u2013199."},{"issue":"1","key":"e_1_3_2_30_2","article-title":"Measuring misinformation in video search platforms: An audit study on youtube","volume":"4","author":"Hussein Eslam","year":"2020","unstructured":"Eslam Hussein, Prerna Juneja, and Tanushree Mitra. 2020. Measuring misinformation in video search platforms: An audit study on youtube. Proceedings of the ACM on Human-Computer Interaction 4, CSCW1(2020), 1\u201327.","journal-title":"Proceedings of the ACM on Human-Computer Interaction"},{"key":"e_1_3_2_31_2","doi-asserted-by":"crossref","unstructured":"Eugene Ie Vihan Jain Jing Wang Sanmit Narvekar Ritesh Agarwal Rui Wu Heng-Tze Cheng Tushar Chandra and Craig Boutilier. 2019. SLATEQ: A Tractable Decomposition for Reinforcement Learning with Recommendation Sets. In Proceedings of the 28th International Joint Conference on Artificial Intelligence (Macao China) . AAAI Press 2592\u20132599.","DOI":"10.24963\/ijcai.2019\/360"},{"key":"e_1_3_2_32_2","first-page":"447","volume-title":"Proceedings of the 12th ACM International Conference on Web Search and Data Mining","author":"Jagerman Rolf","year":"2019","unstructured":"Rolf Jagerman, Ilya Markov, and Maarten de Rijke. 2019. When people change their mind: Off-policy evaluation in non-stationary recommendation environments. In Proceedings of the 12th ACM International Conference on Web Search and Data Mining. 447\u2013455."},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1145\/3460231.3474247"},{"key":"e_1_3_2_34_2","unstructured":"Diederick P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In International Conference on Learning Representations ."},{"key":"e_1_3_2_35_2","unstructured":"Haruka Kiyohara Kosuke Kawakami and Yuta Saito. 2021. Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation. SimuRec workshop at RecSys 2021 (2021)."},{"key":"e_1_3_2_36_2","unstructured":"Vijay R. Konda and John N. Tsitsiklis. 1999. Actor-critic Algorithms. In Advances in Neural Information Processing Systems Vol. 12."},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.1145\/1401890.1401944"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1145\/3460231.3473325"},{"key":"e_1_3_2_39_2","unstructured":"Sergey Levine Aviral Kumar George Tucker and Justin Fu. 2020. Offline Reinforcement Learning: Tutorial Review and Perspectives on Open Problems. CoRR abs\/2005.01643 (2020). arXiv:2005.01643 https:\/\/arxiv.org\/abs\/2005.01643."},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1145\/3533725"},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1145\/2872427.2883090"},{"key":"e_1_3_2_42_2","unstructured":"Timothy P. Lillicrap Jonathan J. Hunt Alexander Pritzel Nicolas Heess Tom Erez Yuval Tassa David Silver and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. In International Conference on Learning Representations ."},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401083"},{"key":"e_1_3_2_44_2","doi-asserted-by":"publisher","DOI":"10.1145\/3336191.3371858"},{"key":"e_1_3_2_45_2","unstructured":"Feng Liu Ruiming Tang Xutao Li Weinan Zhang Yunming Ye Haokun Chen Huifeng Guo and Yuzhou Zhang. 2018. Deep Reinforcement Learning based Recommendation with Explicit User-item Interactions Modeling. arXiv preprint arXiv:1810.12027 (2018)."},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1145\/3442381.3450113"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i04.5931"},{"key":"e_1_3_2_48_2","first-page":"6979","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Lopez-Paz David","year":"2017","unstructured":"David Lopez-Paz, Robert Nishihara, Soumith Chintala, Bernhard Scholkopf, and L\u00e9on Bottou. 2017. Discovering causal signals in images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 6979\u20136987."},{"key":"e_1_3_2_49_2","first-page":"463","volume-title":"Proceedings of the Web Conference","author":"Ma Jiaqi","year":"2020","unstructured":"Jiaqi Ma, Zhe Zhao, Xinyang Yi, Ji Yang, Minmin Chen, Jiaxi Tang, Lichan Hong, and Ed H. Chi. 2020. Off-policy learning in two-stage recommender systems. In Proceedings of the Web Conference. 463\u2013473."},{"key":"e_1_3_2_50_2","first-page":"2493","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","author":"Madumal Prashan","year":"2020","unstructured":"Prashan Madumal, Tim Miller, Liz Sonenberg, and Frank Vetere. 2020. Explainable reinforcement learning through a causal lens. In Proceedings of the AAAI Conference on Artificial Intelligence. 2493\u20132500."},{"key":"e_1_3_2_51_2","first-page":"841","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","author":"Masrour Farzan","year":"2020","unstructured":"Farzan Masrour, Tyler Wilson, Heng Yan, Pang-Ning Tan, and Abdol Esfahanian. 2020. Bursting the filter bubble: Fairness-aware network link prediction. In Proceedings of the AAAI Conference on Artificial Intelligence. 841\u2013848."},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1145\/3394486.3403229"},{"key":"e_1_3_2_53_2","doi-asserted-by":"crossref","unstructured":"Dana McKay Kaipin Owyong Stephann Makri and Marisela Gutierrez Lopez. 2022. Turn and face the strange: Investigating filter bubble bursting information interactions.ACM SIGIR Conference on Human Information Interaction and Retrieval. 233\u2013242.","DOI":"10.1145\/3498366.3505822"},{"key":"e_1_3_2_54_2","unstructured":"Volodymyr Mnih Koray Kavukcuoglu David Silver Alex Graves Ioannis Antonoglou Daan Wierstra and Martin Riedmiller. 2013. Playing atari with deep reinforcement learning. In NIPS Deep Learning Workshop ."},{"key":"e_1_3_2_55_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8463189"},{"key":"e_1_3_2_56_2","doi-asserted-by":"publisher","DOI":"10.1145\/2566486.2568012"},{"key":"e_1_3_2_57_2","doi-asserted-by":"crossref","first-page":"350","DOI":"10.1145\/3375462.3375524","volume-title":"Proceedings of the 10th International Conference on Learning Analytics Knowledge","author":"Pardos Zachary A.","year":"2020","unstructured":"Zachary A. Pardos and Weijie Jiang. 2020. Designing for serendipity in a university course recommendation system. In Proceedings of the 10th International Conference on Learning Analytics Knowledge. 350\u2013359."},{"key":"e_1_3_2_58_2","volume-title":"The Filter Bubble: How the New Personalized Web is Changing what we Read and How We Think","author":"Pariser Eli","year":"2011","unstructured":"Eli Pariser. 2011. The Filter Bubble: How the New Personalized Web is Changing what we Read and How We Think. Penguin."},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.5555\/1642718"},{"key":"e_1_3_2_60_2","unstructured":"Doina Precup Richard S. Sutton and Satinder P. Singh. 2000. Eligibility Traces for Off-Policy Policy Evaluation. In Proceedings of the Seventeenth International Conference on Machine Learning . Morgan Kaufmann Publishers Inc. San Francisco CA 759\u2013766."},{"key":"e_1_3_2_61_2","doi-asserted-by":"publisher","DOI":"10.1145\/3351095.3372879"},{"key":"e_1_3_2_62_2","doi-asserted-by":"publisher","DOI":"10.1145\/3336191.3371783"},{"key":"e_1_3_2_63_2","doi-asserted-by":"publisher","DOI":"10.1145\/3383313.3412261"},{"key":"e_1_3_2_64_2","first-page":"513","volume-title":"Proceedings of the 11th ACM International Conference on Web Search and Data Mining","author":"Schnabel Tobias","year":"2018","unstructured":"Tobias Schnabel, Paul N. Bennett, Susan T. Dumais, and Thorsten Joachims. 2018. Short-term satisfaction and long-term coverage: Understanding how users tolerate algorithmic exploration. In Proceedings of the 11th ACM International Conference on Web Search and Data Mining. 513\u2013521."},{"key":"e_1_3_2_65_2","first-page":"1670","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Schnabel Tobias","year":"2016","unstructured":"Tobias Schnabel, Adith Swaminathan, Ashudeep Singh, Navin Chandak, and Thorsten Joachims. 2016. Recommendations as treatments: Debiasing learning and evaluation. In Proceedings of the International Conference on Machine Learning. 1670\u20131679."},{"key":"e_1_3_2_66_2","first-page":"1889","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Schulman John","year":"2015","unstructured":"John Schulman, Sergey Levine, Pieter Abbeel, Michael Jordan, and Philipp Moritz. 2015. Trust region policy optimization. In Proceedings of the International Conference on Machine Learning. 1889\u20131897."},{"key":"e_1_3_2_67_2","unstructured":"John Schulman Philipp Moritz Sergey Levine Michael Jordan and Pieter Abbeel. 2016. High-dimensional continuous control using generalized advantage estimation. In International Conference on Learning Representations ."},{"key":"e_1_3_2_68_2","unstructured":"John Schulman Filip Wolski Prafulla Dhariwal Alec Radford and Oleg Klimov. 2017. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 (2017)."},{"key":"e_1_3_2_69_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33014902"},{"key":"e_1_3_2_70_2","doi-asserted-by":"publisher","DOI":"10.1145\/3386392.3399566"},{"key":"e_1_3_2_71_2","doi-asserted-by":"publisher","DOI":"10.1145\/3488560.3498471"},{"key":"e_1_3_2_72_2","volume-title":"Reinforcement Learning: An Introduction","author":"Sutton Richard S.","year":"2018","unstructured":"Richard S. Sutton and Andrew G. Barto. 2018. Reinforcement Learning: An Introduction. MIT Press."},{"key":"e_1_3_2_73_2","first-page":"814","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Swaminathan Adith","year":"2015","unstructured":"Adith Swaminathan and Thorsten Joachims. 2015. Counterfactual risk minimization: Learning from logged bandit feedback. In Proceedings of the International Conference on Machine Learning. 814\u2013823."},{"key":"e_1_3_2_74_2","doi-asserted-by":"publisher","DOI":"10.5555\/2590069.2590071"},{"key":"e_1_3_2_75_2","doi-asserted-by":"publisher","DOI":"10.5555\/3045390.3045616"},{"key":"e_1_3_2_76_2","doi-asserted-by":"publisher","DOI":"10.1145\/3460231.3474241"},{"key":"e_1_3_2_77_2","doi-asserted-by":"publisher","DOI":"10.1145\/3460231.3474270"},{"key":"e_1_3_2_78_2","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Proceedings of the Advances in Neural Information Processing Systems."},{"key":"e_1_3_2_79_2","doi-asserted-by":"publisher","DOI":"10.1109\/TBDATA.2022.3205334"},{"key":"e_1_3_2_80_2","first-page":"10760","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Wang Tan","year":"2020","unstructured":"Tan Wang, Jianqiang Huang, Hanwang Zhang, and Qianru Sun. 2020. Visual commonsense r-cnn. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 10760\u201310770."},{"key":"e_1_3_2_81_2","doi-asserted-by":"publisher","DOI":"10.1145\/3447548.3467249"},{"key":"e_1_3_2_82_2","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3462962"},{"key":"e_1_3_2_83_2","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","author":"Wang Wenlin","year":"2021","unstructured":"Wenlin Wang, Hongteng Xu, Ruiyi Zhang, Wenqi Wang, Piyush Rai, and Lawrence Carin. 2021. Learning to recommend from sparse data via generative user feedback. In Proceedings of the AAAI Conference on Artificial Intelligence."},{"key":"e_1_3_2_84_2","first-page":"6638","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Wang Xiaojie","year":"2019","unstructured":"Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2019. Doubly robust joint learning for recommendation on data missing not at random. In Proceedings of the International Conference on Machine Learning. PMLR, 6638\u20136647."},{"key":"e_1_3_2_85_2","unstructured":"Zifeng Wang Xi Chen Rui Wen Shao-Lun Huang Ercan E. Kuruoglu and Yefeng Zheng. 2020. Information Theoretic Counterfactual Learning from Missing-Not-at-Random Feedback. In Proceedings of the 34th International Conference on Neural Information Processing Systems (Vancouver BC Canada) . Curran Associates Inc. Red Hook NY 1854\u20131864."},{"key":"e_1_3_2_86_2","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3462855"},{"key":"e_1_3_2_87_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i5.16579"},{"key":"e_1_3_2_88_2","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401147"},{"key":"e_1_3_2_89_2","doi-asserted-by":"publisher","DOI":"10.1145\/3511808.3557300"},{"key":"e_1_3_2_90_2","doi-asserted-by":"crossref","first-page":"2227","DOI":"10.1145\/2783258.2788602","volume-title":"Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining","author":"Xu Ya","year":"2015","unstructured":"Ya Xu, Nanyu Chen, Addrian Fernandez, Omar Sinno, and Anmol Bhasin. 2015. From infrastructure to culture: A\/B testing challenges in large scale social networks. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2227\u20132236."},{"issue":"4","key":"e_1_3_2_91_2","article-title":"Neural serendipity recommendation: Exploring the balance between accuracy and novelty with sparse explicit feedback","volume":"14","author":"Xu Yuanbo","year":"2020","unstructured":"Yuanbo Xu, Yongjian Yang, En Wang, Jiayu Han, Fuzhen Zhuang, Zhiwen Yu, and Hui Xiong. 2020. Neural serendipity recommendation: Exploring the balance between accuracy and novelty with sparse explicit feedback. ACM Transactions on Knowledge Discovery from Data 14, 4(2020), 1\u201312.","journal-title":"ACM Transactions on Knowledge Discovery from Data"},{"key":"e_1_3_2_92_2","doi-asserted-by":"publisher","DOI":"10.1145\/3459637.3482305"},{"key":"e_1_3_2_93_2","first-page":"768","volume-title":"Proceedings of the 2019 IEEE International Conference on Data Mining","author":"Yu Junliang","year":"2019","unstructured":"Junliang Yu, Min Gao, Hongzhi Yin, Jundong Li, Chongming Gao, and Qinyong Wang. 2019. Generating reliable friends via adversarial training to improve social recommendation. In Proceedings of the 2019 IEEE International Conference on Data Mining. IEEE, 768\u2013777."},{"key":"e_1_3_2_94_2","volume-title":"Proceedings of the NeurIPS\u201919","author":"Zhang Ruiyi","year":"2019","unstructured":"Ruiyi Zhang, Tong Yu, Yilin Shen, Hongxia Jin, Changyou Chen, and Lawrence Carin. 2019. Reward constrained interactive recommendation with natural language feedback. In Proceedings of the NeurIPS\u201919."},{"key":"e_1_3_2_95_2","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3462875"},{"key":"e_1_3_2_96_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2021.108075"},{"key":"e_1_3_2_97_2","doi-asserted-by":"publisher","DOI":"10.1145\/3340531.3412044"},{"key":"e_1_3_2_98_2","first-page":"3582","volume-title":"Proceedings of the Web Conference","author":"Zhao Xiangyu","year":"2021","unstructured":"Xiangyu Zhao, Long Xia, Lixin Zou, Hui Liu, Dawei Yin, and Jiliang Tang. 2021. UserSim: User simulation via supervised generative adversarial network. In Proceedings of the Web Conference. 3582\u20133589."},{"key":"e_1_3_2_99_2","first-page":"2980","volume-title":"Proceedings of the Web Conference","author":"Zheng Yu","year":"2021","unstructured":"Yu Zheng, Chen Gao, Xiang Li, Xiangnan He, Yong Li, and Depeng Jin. 2021. Disentangling user interest and conformity for recommendation with causal embedding. In Proceedings of the Web Conference. 2980\u20132991."},{"key":"e_1_3_2_100_2","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401174"},{"key":"e_1_3_2_101_2","doi-asserted-by":"publisher","DOI":"10.1145\/3447548.3467376"},{"key":"e_1_3_2_102_2","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330668"},{"key":"e_1_3_2_103_2","doi-asserted-by":"publisher","DOI":"10.1145\/3336191.3371801"},{"key":"e_1_3_2_104_2","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401181"}],"container-title":["ACM Transactions on Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3594871","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3594871","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:49:08Z","timestamp":1750182548000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3594871"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,18]]},"references-count":103,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2024,1,31]]}},"alternative-id":["10.1145\/3594871"],"URL":"https:\/\/doi.org\/10.1145\/3594871","relation":{},"ISSN":["1046-8188","1558-2868"],"issn-type":[{"value":"1046-8188","type":"print"},{"value":"1558-2868","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,8,18]]},"assertion":[{"value":"2022-05-04","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-04-09","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-08-18","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}