{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T16:35:54Z","timestamp":1759941354574,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":52,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,4,19]],"date-time":"2021-04-19T00:00:00Z","timestamp":1618790400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,4,19]]},"DOI":"10.1145\/3442381.3450125","type":"proceedings-article","created":{"date-parts":[[2021,6,3]],"date-time":"2021-06-03T19:24:51Z","timestamp":1622748291000},"page":"3582-3589","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":16,"title":["UserSim: User Simulation via Supervised GenerativeAdversarial Network"],"prefix":"10.1145","author":[{"given":"Xiangyu","family":"Zhao","sequence":"first","affiliation":[{"name":"Michigan State University and City University of Hong Kong, USA"}]},{"given":"Long","family":"Xia","sequence":"additional","affiliation":[{"name":"York University, China"}]},{"given":"Lixin","family":"Zou","sequence":"additional","affiliation":[{"name":"Baidu, China"}]},{"given":"Hui","family":"Liu","sequence":"additional","affiliation":[{"name":"Michigan State University, USA"}]},{"given":"Dawei","family":"Yin","sequence":"additional","affiliation":[{"name":"Baidu, USA"}]},{"given":"Jiliang","family":"Tang","sequence":"additional","affiliation":[{"name":"Michigan State University, USA"}]}],"member":"320","published-online":{"date-parts":[[2021,6,3]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Xueying Bai Jian Guan and Hongning Wang. 2019. A Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation. In Advances in Neural Information Processing Systems. 10735\u201310746. Xueying Bai Jian Guan and Hongning Wang. 2019. A Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation. In Advances in Neural Information Processing Systems. 10735\u201310746."},{"key":"e_1_3_2_1_2_1","unstructured":"Mariusz Bojarski Davide Del\u00a0Testa Daniel Dworakowski Bernhard Firner Beat Flepp Prasoon Goyal Lawrence\u00a0D Jackel Mathew Monfort Urs Muller Jiakai Zhang 2016. End to end learning for self-driving cars. arXiv preprint arXiv:1604.07316(2016). Mariusz Bojarski Davide Del\u00a0Testa Daniel Dworakowski Bernhard Firner Beat Flepp Prasoon Goyal Lawrence\u00a0D Jackel Mathew Monfort Urs Muller Jiakai Zhang 2016. End to end learning for self-driving cars. arXiv preprint arXiv:1604.07316(2016)."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3178876.3186039"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/MRA.2010.936947"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3289600.3290999"},{"key":"e_1_3_2_1_6_1","volume-title":"Generative Adversarial User Model for Reinforcement Learning Based Recommendation System. In International Conference on Machine Learning. 1052\u20131061","author":"Chen Xinshi","year":"2019","unstructured":"Xinshi Chen , Shuang Li , Hui Li , Shaohua Jiang , Yuan Qi , and Le Song . 2019 . Generative Adversarial User Model for Reinforcement Learning Based Recommendation System. In International Conference on Machine Learning. 1052\u20131061 . Xinshi Chen, Shuang Li, Hui Li, Shaohua Jiang, Yuan Qi, and Le Song. 2019. Generative Adversarial User Model for Reinforcement Learning Based Recommendation System. In International Conference on Machine Learning. 1052\u20131061."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2988450.2988454"},{"key":"e_1_3_2_1_8_1","unstructured":"Corinna Cortes and Mehryar Mohri. 2004. AUC optimization vs. error rate minimization. In Advances in neural information processing systems. 313\u2013320. Corinna Cortes and Mehryar Mohri. 2004. AUC optimization vs. error rate minimization. In Advances in neural information processing systems. 313\u2013320."},{"key":"e_1_3_2_1_9_1","unstructured":"Gabriel Dulac-Arnold Richard Evans Hado van Hasselt Peter Sunehag Timothy Lillicrap Jonathan Hunt Timothy Mann Theophane Weber Thomas Degris and Ben Coppin. 2015. Deep reinforcement learning in large discrete action spaces. arXiv preprint arXiv:1512.07679(2015). Gabriel Dulac-Arnold Richard Evans Hado van Hasselt Peter Sunehag Timothy Lillicrap Jonathan Hunt Timothy Mann Theophane Weber Thomas Degris and Ben Coppin. 2015. Deep reinforcement learning in large discrete action spaces. arXiv preprint arXiv:1512.07679(2015)."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Wenqi Fan Tyler Derr Xiangyu Zhao Yao Ma Hui Liu Jianping Wang Jiliang Tang and Qing Li. 2020. Attacking Black-box Recommendations via Copying Cross-domain User Profiles. arXiv preprint arXiv:2005.08147(2020). Wenqi Fan Tyler Derr Xiangyu Zhao Yao Ma Hui Liu Jianping Wang Jiliang Tang and Qing Li. 2020. Attacking Black-box Recommendations via Copying Cross-domain User Profiles. arXiv preprint arXiv:2005.08147(2020).","DOI":"10.1109\/ICDE51399.2021.00140"},{"key":"e_1_3_2_1_11_1","unstructured":"Mamdouh Farouk. 2019. Measuring sentences similarity: a survey. arXiv preprint arXiv:1910.03940(2019). Mamdouh Farouk. 2019. Measuring sentences similarity: a survey. arXiv preprint arXiv:1910.03940(2019)."},{"key":"e_1_3_2_1_12_1","unstructured":"Jim Gao. 2014. Machine learning applications for data center optimization. (2014). Jim Gao. 2014. Machine learning applications for data center optimization. (2014)."},{"key":"e_1_3_2_1_13_1","unstructured":"Yingqiang Ge Shuchang Liu Ruoyuan Gao Yikun Xian Yunqi Li Xiangyu Zhao Changhua Pei Fei Sun Junfeng Ge Wenwu Ou 2021. Towards Long-term Fairness in Recommendation. arXiv preprint arXiv:2101.03584(2021). Yingqiang Ge Shuchang Liu Ruoyuan Gao Yikun Xian Yunqi Li Xiangyu Zhao Changhua Pei Fei Sun Junfeng Ge Wenwu Ou 2021. Towards Long-term Fairness in Recommendation. arXiv preprint arXiv:2101.03584(2021)."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3159652.3159687"},{"key":"e_1_3_2_1_15_1","unstructured":"Ian Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in neural information processing systems. 2672\u20132680. Ian Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in neural information processing systems. 2672\u20132680."},{"key":"e_1_3_2_1_16_1","unstructured":"Jie Gui Zhenan Sun Yonggang Wen Dacheng Tao and Jieping Ye. 2020. A review on generative adversarial networks: Algorithms theory and applications. arXiv preprint arXiv:2001.06937(2020). Jie Gui Zhenan Sun Yonggang Wen Dacheng Tao and Jieping Ye. 2020. A review on generative adversarial networks: Algorithms theory and applications. arXiv preprint arXiv:2001.06937(2020)."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330839"},{"key":"e_1_3_2_1_18_1","unstructured":"Bal\u00e1zs Hidasi Alexandros Karatzoglou Linas Baltrunas and Domonkos Tikk. 2015. Session-based recommendations with recurrent neural networks. arXiv preprint arXiv:1511.06939(2015). Bal\u00e1zs Hidasi Alexandros Karatzoglou Linas Baltrunas and Domonkos Tikk. 2015. Session-based recommendations with recurrent neural networks. arXiv preprint arXiv:1511.06939(2015)."},{"key":"e_1_3_2_1_19_1","unstructured":"Eugene Ie Chih-wei Hsu Martin Mladenov Vihan Jain Sanmit Narvekar Jing Wang Rui Wu and Craig Boutilier. 2019. RecSim: A Configurable Simulation Platform for Recommender Systems. arXiv preprint arXiv:1909.04847(2019). Eugene Ie Chih-wei Hsu Martin Mladenov Vihan Jain Sanmit Narvekar Jing Wang Rui Wu and Craig Boutilier. 2019. RecSim: A Configurable Simulation Platform for Recommender Systems. arXiv preprint arXiv:1909.04847(2019)."},{"key":"e_1_3_2_1_20_1","volume-title":"Online Controlled Experiments and A\/B Testing.Encyclopedia of machine learning and data mining 7, 8","author":"Kohavi Ron","year":"2017","unstructured":"Ron Kohavi and Roger Longbotham . 2017. Online Controlled Experiments and A\/B Testing.Encyclopedia of machine learning and data mining 7, 8 ( 2017 ), 922\u2013929. Ron Kohavi and Roger Longbotham. 2017. Online Controlled Experiments and A\/B Testing.Encyclopedia of machine learning and data mining 7, 8 (2017), 922\u2013929."},{"key":"e_1_3_2_1_21_1","volume-title":"Proceedings of the Workshop on On-line Trading of Exploration and Exploitation 2. 19\u201336","author":"Li Lihong","year":"2012","unstructured":"Lihong Li , Wei Chu , John Langford , Taesup Moon , and Xuanhui Wang . 2012 . An unbiased offline evaluation of contextual bandit algorithms with generalized linear models . In Proceedings of the Workshop on On-line Trading of Exploration and Exploitation 2. 19\u201336 . Lihong Li, Wei Chu, John Langford, Taesup Moon, and Xuanhui Wang. 2012. An unbiased offline evaluation of contextual bandit algorithms with generalized linear models. In Proceedings of the Workshop on On-line Trading of Exploration and Exploitation 2. 19\u201336."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/2684822.2685311"},{"key":"e_1_3_2_1_23_1","unstructured":"Pauline Luc Camille Couprie Soumith Chintala and Jakob Verbeek. 2016. Semantic segmentation using adversarial networks. arXiv preprint arXiv:1611.08408(2016). Pauline Luc Camille Couprie Soumith Chintala and Jakob Verbeek. 2016. Semantic segmentation using adversarial networks. arXiv preprint arXiv:1611.08408(2016)."},{"volume-title":"Applied logistic regression analysis. Vol.\u00a0106","author":"Menard Scott","key":"e_1_3_2_1_24_1","unstructured":"Scott Menard . 2002. Applied logistic regression analysis. Vol.\u00a0106 . Sage . Scott Menard. 2002. Applied logistic regression analysis. Vol.\u00a0106. Sage."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.5555\/1703435.1703645"},{"key":"e_1_3_2_1_26_1","unstructured":"David\u00a0Martin Powers. 2011. Evaluation: from precision recall and F-measure to ROC informedness markedness and correlation. (2011). David\u00a0Martin Powers. 2011. Evaluation: from precision recall and F-measure to ROC informedness markedness and correlation. (2011)."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2010.127"},{"key":"e_1_3_2_1_28_1","unstructured":"David Rohde Stephen Bonner Travis Dunlop Flavian Vasile and Alexandros Karatzoglou. 2018. RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising. arXiv preprint arXiv:1808.00720(2018). David Rohde Stephen Bonner Travis Dunlop Flavian Vasile and Alexandros Karatzoglou. 2018. RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising. arXiv preprint arXiv:1808.00720(2018)."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2013.6630809"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2740908.2742726"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33014902"},{"key":"e_1_3_2_1_32_1","unstructured":"Hyungseok Song Hyeryung Jang Hai\u00a0Tran Hong Seeun Yun Donggyu Yun Hyoju Chung and Yung Yi. 2019. Solving Continual Combinatorial Selection via Deep Reinforcement Learning. (2019). Hyungseok Song Hyeryung Jang Hai\u00a0Tran Hong Seeun Yun Donggyu Yun Hyoju Chung and Yung Yi. 2019. Solving Continual Combinatorial Selection via Deep Reinforcement Learning. (2019)."},{"key":"e_1_3_2_1_33_1","volume-title":"Robobarista: Object part based transfer of manipulation trajectories from crowd-sourcing in 3d pointclouds. In Robotics Research","author":"Sung Jaeyong","year":"2018","unstructured":"Jaeyong Sung , Seok\u00a0Hyun Jin , and Ashutosh Saxena . 2018 . Robobarista: Object part based transfer of manipulation trajectories from crowd-sourcing in 3d pointclouds. In Robotics Research . Springer , 701\u2013720. Jaeyong Sung, Seok\u00a0Hyun Jin, and Ashutosh Saxena. 2018. Robobarista: Object part based transfer of manipulation trajectories from crowd-sourcing in 3d pointclouds. In Robotics Research. Springer, 701\u2013720."},{"volume-title":"Reinforcement learning: An introduction","author":"Sutton S","key":"e_1_3_2_1_34_1","unstructured":"Richard\u00a0 S Sutton and Andrew\u00a0 G Barto . 2018. Reinforcement learning: An introduction . MIT press . Richard\u00a0S Sutton and Andrew\u00a0G Barto. 2018. Reinforcement learning: An introduction. MIT press."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/3077136.3080786"},{"key":"e_1_3_2_1_36_1","volume-title":"STMARL: A Spatio-Temporal Multi-Agent Reinforcement Learning Approach for Cooperative Traffic Light Control","author":"Wang Yanan","year":"2020","unstructured":"Yanan Wang , Tong Xu , Xin Niu , Chang Tan , Enhong Chen , and Hui Xiong . 2020 . STMARL: A Spatio-Temporal Multi-Agent Reinforcement Learning Approach for Cooperative Traffic Light Control . IEEE Transactions on Mobile Computing( 2020). Yanan Wang, Tong Xu, Xin Niu, Chang Tan, Enhong Chen, and Hui Xiong. 2020. STMARL: A Spatio-Temporal Multi-Agent Reinforcement Learning Approach for Cooperative Traffic Light Control. IEEE Transactions on Mobile Computing(2020)."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3018661.3018689"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240323.3240355"},{"key":"e_1_3_2_1_39_1","article-title":"Improving library user experience with A\/B testing: Principles and process. Weave","volume":"1","author":"Young WH","year":"2014","unstructured":"Scott\u00a0 WH Young . 2014 . Improving library user experience with A\/B testing: Principles and process. Weave : Journal of Library User Experience 1 , 1 (2014). Scott\u00a0WH Young. 2014. Improving library user experience with A\/B testing: Principles and process. Weave: Journal of Library User Experience 1, 1 (2014).","journal-title":"Journal of Library User Experience"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401467"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"crossref","unstructured":"Xiangyu Zhao Changsheng Gu Haoshenglun Zhang Xiaobing Liu Xiwang Yang and Jiliang Tang. 2019. Deep Reinforcement Learning for Online Advertising in Recommender Systems. arXiv preprint arXiv:1909.03602(2019). Xiangyu Zhao Changsheng Gu Haoshenglun Zhang Xiaobing Liu Xiwang Yang and Jiliang Tang. 2019. Deep Reinforcement Learning for Online Advertising in Recommender Systems. arXiv preprint arXiv:1909.03602(2019).","DOI":"10.1145\/3320496.3320500"},{"key":"e_1_3_2_1_42_1","volume-title":"Long Xia, Jiliang Tang, and Dawei Yin with Martin Vesely as coordinator. ACM SIGWEB NewsletterSpring","author":"Zhao Xiangyu","year":"2019","unstructured":"Xiangyu Zhao , Long Xia , Jiliang Tang , and Dawei Yin . 2019. Deep reinforcement learning for search, recommendation, and online advertising: a survey by Xiangyu Zhao , Long Xia, Jiliang Tang, and Dawei Yin with Martin Vesely as coordinator. ACM SIGWEB NewsletterSpring ( 2019 ), 4. Xiangyu Zhao, Long Xia, Jiliang Tang, and Dawei Yin. 2019. Deep reinforcement learning for search, recommendation, and online advertising: a survey by Xiangyu Zhao, Long Xia, Jiliang Tang, and Dawei Yin with Martin Vesely as coordinator. ACM SIGWEB NewsletterSpring (2019), 4."},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240323.3240374"},{"key":"e_1_3_2_1_44_1","volume-title":"Whole-Chain Recommendations. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 1883\u20131891","author":"Zhao Xiangyu","year":"2020","unstructured":"Xiangyu Zhao , Long Xia , Lixin Zou , Hui Liu , Dawei Yin , and Jiliang Tang . 2020 . Whole-Chain Recommendations. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 1883\u20131891 . Xiangyu Zhao, Long Xia, Lixin Zou, Hui Liu, Dawei Yin, and Jiliang Tang. 2020. Whole-Chain Recommendations. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 1883\u20131891."},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/3219819.3219886"},{"key":"e_1_3_2_1_46_1","unstructured":"Xiangyu Zhao Liang Zhang Zhuoye Ding Dawei Yin Yihong Zhao and Jiliang Tang. 2017. Deep Reinforcement Learning for List-wise Recommendations. arXiv preprint arXiv:1801.00209(2017). Xiangyu Zhao Liang Zhang Zhuoye Ding Dawei Yin Yihong Zhao and Jiliang Tang. 2017. Deep Reinforcement Learning for List-wise Recommendations. arXiv preprint arXiv:1801.00209(2017)."},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394486.3403384"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/3178876.3185994"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330668"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-18579-4_7"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/3336191.3371801"},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401181"}],"event":{"name":"WWW '21: The Web Conference 2021","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web"],"location":"Ljubljana Slovenia","acronym":"WWW '21"},"container-title":["Proceedings of the Web Conference 2021"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3442381.3450125","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3442381.3450125","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:24:28Z","timestamp":1750195468000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3442381.3450125"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,19]]},"references-count":52,"alternative-id":["10.1145\/3442381.3450125","10.1145\/3442381"],"URL":"https:\/\/doi.org\/10.1145\/3442381.3450125","relation":{},"subject":[],"published":{"date-parts":[[2021,4,19]]},"assertion":[{"value":"2021-06-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}