{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,10]],"date-time":"2026-03-10T15:04:02Z","timestamp":1773155042779,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":44,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,4,20]],"date-time":"2020-04-20T00:00:00Z","timestamp":1587340800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,4,20]]},"DOI":"10.1145\/3366423.3380294","type":"proceedings-article","created":{"date-parts":[[2020,5,4]],"date-time":"2020-05-04T08:11:44Z","timestamp":1588579904000},"page":"2298-2308","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":27,"title":["RLPer: A Reinforcement Learning Model for Personalized Search"],"prefix":"10.1145","author":[{"given":"Jing","family":"Yao","sequence":"first","affiliation":[{"name":"School of Information Renmin University of China"}]},{"given":"Zhicheng","family":"Dou","sequence":"additional","affiliation":[{"name":"Gaoling School of Artificial Intelligence Renmin University of China"}]},{"given":"Jun","family":"Xu","sequence":"additional","affiliation":[{"name":"Gaoling School of Artificial Intelligence Renmin University of China"}]},{"given":"Ji-Rong","family":"Wen","sequence":"additional","affiliation":[{"name":"Beijing Key Laboratory of Big Data Management and Analysis Methods and Key Laboratory of Data Engineering and Knowledge Engineering, MOE"}]}],"member":"320","published-online":{"date-parts":[[2020,4,20]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Multi-Task Learning for Document Ranking and Query Suggestion. In 6th International Conference on Learning Representations, ICLR","author":"Ahmad Wasi\u00a0Uddin","year":"2018","unstructured":"Wasi\u00a0Uddin Ahmad , Kai-Wei Chang , and Hongning Wang . 2018 . Multi-Task Learning for Document Ranking and Query Suggestion. In 6th International Conference on Learning Representations, ICLR 2018,. Wasi\u00a0Uddin Ahmad, Kai-Wei Chang, and Hongning Wang. 2018. Multi-Task Learning for Document Ranking and Query Suggestion. In 6th International Conference on Learning Representations, ICLR 2018,."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3331184.3331246"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/2009916.2009938"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1772690.1772703"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/2348283.2348312"},{"key":"e_1_3_2_1_6_1","volume-title":"Proceedings of the Twenty-Second International Conference (ICML 2005","author":"Burges Christopher","year":"2005","unstructured":"Christopher J.\u00a0C. Burges , Tal Shaked , Erin Renshaw , Ari Lazier , Matt Deeds , Nicole Hamilton , and Gregory\u00a0 N. Hullender . 2005 . Learning to rank using gradient descent. In Machine Learning , Proceedings of the Twenty-Second International Conference (ICML 2005 ), Bonn, Germany , August 7-11, 2005. 89\u201396. Christopher J.\u00a0C. Burges, Tal Shaked, Erin Renshaw, Ari Lazier, Matt Deeds, Nicole Hamilton, and Gregory\u00a0N. Hullender. 2005. Learning to rank using gradient descent. In Machine Learning, Proceedings of the Twenty-Second International Conference (ICML 2005), Bonn, Germany, August 7-11, 2005. 89\u201396."},{"key":"e_1_3_2_1_7_1","unstructured":"Chris J.\u00a0C. Burges Krysta\u00a0M. Svore Qiang Wu and Jianfeng Gao. 2008. Ranking Boosting and Model Adaptation. Technical Report MSR-TR-2008-109. 18 pages.  Chris J.\u00a0C. Burges Krysta\u00a0M. Svore Qiang Wu and Jianfeng Gao. 2008. Ranking Boosting and Model Adaptation. Technical Report MSR-TR-2008-109. 18 pages."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2600428.2609453"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1871437.1871745"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Clarke L.\u00a0A Charles Kolla Maheedhar Cormack V Gordon Vechtomova Olga Ashkan and Azin. 2008. Novelty and diversity in information retrieval evaluation.  Clarke L.\u00a0A Charles Kolla Maheedhar Cormack V Gordon Vechtomova Olga Ashkan and Azin. 2008. Novelty and diversity in information retrieval evaluation.","DOI":"10.1145\/1390334.1390446"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2063576.2063639"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/1242572.1242651"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3269206.3271728"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2449396.2449413"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2505515.2505642"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1076034.1076063"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/1458082.1458176"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3331184.3331218"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1935826.1935840"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1146847.1146848"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1162"},{"key":"e_1_3_2_1_22_1","volume-title":"Proceedings of the Twenty-Fifth International Conference (ICML 2008","author":"Radlinski Filip","year":"2008","unstructured":"Filip Radlinski , Robert Kleinberg , and Thorsten Joachims . 2008 . Learning diverse rankings with multi-armed bandits. In Machine Learning , Proceedings of the Twenty-Fifth International Conference (ICML 2008 ), Helsinki, Finland , June 5-9, 2008. 784\u2013791. Filip Radlinski, Robert Kleinberg, and Thorsten Joachims. 2008. Learning diverse rankings with multi-armed bandits. In Machine Learning, Proceedings of the Twenty-Fifth International Conference (ICML 2008), Helsinki, Finland, June 5-9, 2008. 784\u2013791."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1561\/1500000019"},{"key":"e_1_3_2_1_24_1","volume-title":"Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2011","author":"Ross St\u00e9phane","year":"2011","unstructured":"St\u00e9phane Ross , Geoffrey\u00a0 J. Gordon , and Drew Bagnell . 2011 . A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning . In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2011 , Fort Lauderdale, USA , April 11-13, 2011. 627\u2013635. St\u00e9phane Ross, Geoffrey\u00a0J. Gordon, and Drew Bagnell. 2011. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning. In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2011, Fort Lauderdale, USA, April 11-13, 2011. 627\u2013635."},{"key":"e_1_3_2_1_25_1","unstructured":"Guy Shani Ronen\u00a0I. Brafman and David Heckerman. 2013. An MDP-based Recommender System. CoRR abs\/1301.0600(2013).  Guy Shani Ronen\u00a0I. Brafman and David Heckerman. 2013. An MDP-based Recommender System. CoRR abs\/1301.0600(2013)."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/1321440.1321515"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/2556195.2556234"},{"key":"e_1_3_2_1_28_1","volume-title":"Reinforcement learning - an introduction","author":"Sutton S.","unstructured":"Richard\u00a0 S. Sutton and Andrew\u00a0 G. Barto . 1998. Reinforcement learning - an introduction . MIT Press . http:\/\/www.worldcat.org\/oclc\/37293240 Richard\u00a0S. Sutton and Andrew\u00a0G. Barto. 1998. Reinforcement learning - an introduction. MIT Press. http:\/\/www.worldcat.org\/oclc\/37293240"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/1390334.1390364"},{"key":"e_1_3_2_1_30_1","first-page":"85","article-title":"Understanding and predicting personal navigation","volume":"2011","author":"Teevan Jaime","year":"2011","unstructured":"Jaime Teevan , Daniel\u00a0 J. Liebling , and Gayathri\u00a0Ravichandran Geetha . 2011 . Understanding and predicting personal navigation . In Proceedings of WSDM , 2011. 85 \u2013 94 . Jaime Teevan, Daniel\u00a0J. Liebling, and Gayathri\u00a0Ravichandran Geetha. 2011. Understanding and predicting personal navigation. In Proceedings of WSDM, 2011. 85\u201394.","journal-title":"Proceedings of WSDM"},{"key":"e_1_3_2_1_31_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N. Gomez Lukasz Kaiser and Illia Polosukhin. 2017. Attention is All you Need. In Advances in Neural Information Processing Systems 30.  Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N. Gomez Lukasz Kaiser and Illia Polosukhin. 2017. Attention is All you Need. In Advances in Neural Information Processing Systems 30."},{"key":"e_1_3_2_1_32_1","volume-title":"ECIR 2017, Aberdeen, UK, April 8-13, 2017, Proceedings. 598\u2013604","author":"Vu Thanh","year":"2017","unstructured":"Thanh Vu , Dat\u00a0Quoc Nguyen , Mark Johnson , Dawei Song , and Alistair Willis . 2017 . Search Personalization with Embeddings. In Advances in Information Retrieval - 39th European Conference on IR Research , ECIR 2017, Aberdeen, UK, April 8-13, 2017, Proceedings. 598\u2013604 . Thanh Vu, Dat\u00a0Quoc Nguyen, Mark Johnson, Dawei Song, and Alistair Willis. 2017. Search Personalization with Embeddings. In Advances in Information Retrieval - 39th European Conference on IR Research, ECIR 2017, Aberdeen, UK, April 8-13, 2017, Proceedings. 598\u2013604."},{"key":"e_1_3_2_1_33_1","volume-title":"ECIR 2015, Vienna, Austria, March 29 - April 2, 2015. Proceedings. 605\u2013616","author":"Vu Thanh\u00a0Tien","year":"2015","unstructured":"Thanh\u00a0Tien Vu , Alistair Willis , Son\u00a0Ngoc Tran , and Dawei Song . 2015 . Temporal Latent Topic User Profiles for Search Personalisation. In Advances in Information Retrieval - 37th European Conference on IR Research , ECIR 2015, Vienna, Austria, March 29 - April 2, 2015. Proceedings. 605\u2013616 . Thanh\u00a0Tien Vu, Alistair Willis, Son\u00a0Ngoc Tran, and Dawei Song. 2015. Temporal Latent Topic User Profiles for Search Personalisation. In Advances in Information Retrieval - 37th European Conference on IR Research, ECIR 2015, Vienna, Austria, March 29 - April 2, 2015. Proceedings. 605\u2013616."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/2484028.2484068"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/3219819.3219961"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/3077136.3080685"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2488388.2488511"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1007\/bf00992696"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3077136.3080775"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2747874"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3234944.3234977"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240323.3240374"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/3219819.3219886"},{"key":"e_1_3_2_1_44_1","unstructured":"Xiangyu Zhao Liang Zhang Zhuoye Ding Dawei Yin Yihong Zhao and Jiliang Tang. 2018. Deep Reinforcement Learning for List-wise Recommendations. CoRR abs\/1801.00209(2018).  Xiangyu Zhao Liang Zhang Zhuoye Ding Dawei Yin Yihong Zhao and Jiliang Tang. 2018. Deep Reinforcement Learning for List-wise Recommendations. CoRR abs\/1801.00209(2018)."}],"event":{"name":"WWW '20: The Web Conference 2020","location":"Taipei Taiwan","acronym":"WWW '20","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web"]},"container-title":["Proceedings of The Web Conference 2020"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366423.3380294","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3366423.3380294","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:33:11Z","timestamp":1750199591000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366423.3380294"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,4,20]]},"references-count":44,"alternative-id":["10.1145\/3366423.3380294","10.1145\/3366423"],"URL":"https:\/\/doi.org\/10.1145\/3366423.3380294","relation":{},"subject":[],"published":{"date-parts":[[2020,4,20]]},"assertion":[{"value":"2020-04-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}