{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,11]],"date-time":"2025-11-11T22:26:11Z","timestamp":1762899971571,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":20,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,7,25]],"date-time":"2020-07-25T00:00:00Z","timestamp":1595635200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"U.S. National Science Foundation","award":["IIS-145374"],"award-info":[{"award-number":["IIS-145374"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,7,25]]},"DOI":"10.1145\/3397271.3401200","type":"proceedings-article","created":{"date-parts":[[2020,7,25]],"date-time":"2020-07-25T07:50:08Z","timestamp":1595663408000},"page":"1525-1528","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":11,"title":["Balancing Reinforcement Learning Training Experiences in Interactive Information Retrieval"],"prefix":"10.1145","author":[{"given":"Limin","family":"Chen","sequence":"first","affiliation":[{"name":"Georgetown University, Washington, DC, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhiwen","family":"Tang","sequence":"additional","affiliation":[{"name":"Georgetown University, Washington, DC, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Grace Hui","family":"Yang","sequence":"additional","affiliation":[{"name":"Georgetown University, Washington, DC, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2020,7,25]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"ICLR '18","author":"Berseth Glen","year":"2018","unstructured":"Glen Berseth , Cheng Xie , Paul Cernek , and Michiel Van de Panne . 2018 . Progressive reinforcement learning with distillation for multi-skilled motion control . In ICLR '18 . Glen Berseth, Cheng Xie, Paul Cernek, and Michiel Van de Panne. 2018. Progressive reinforcement learning with distillation for multi-skilled motion control. In ICLR '18."},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2019.8793789"},{"key":"e_1_3_2_2_3_1","volume-title":"HotFlip: White-Box Adversarial Examples for Text Classification. In ACL '18","author":"Ebrahimi Javid","year":"2018","unstructured":"Javid Ebrahimi , Anyi Rao , Daniel Lowd , and Dejing Dou . 2018 . HotFlip: White-Box Adversarial Examples for Text Classification. In ACL '18 . Javid Ebrahimi, Anyi Rao, Daniel Lowd, and Dejing Dou. 2018. HotFlip: White-Box Adversarial Examples for Text Classification. In ACL '18."},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1611835114"},{"key":"e_1_3_2_2_5_1","volume-title":"Deep Reinforcement Learning for Dialogue Generation. In EMNLP '16","author":"Li Jiwei","year":"2016","unstructured":"Jiwei Li , Will Monroe , Alan Ritter , Dan Jurafsky , Michel Galley , and Jianfeng Gao . 2016 . Deep Reinforcement Learning for Dialogue Generation. In EMNLP '16 . Jiwei Li, Will Monroe, Alan Ritter, Dan Jurafsky, Michel Galley, and Jianfeng Gao. 2016. Deep Reinforcement Learning for Dialogue Generation. In EMNLP '16."},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/E17-2004"},{"key":"e_1_3_2_2_7_1","unstructured":"Feng Liu Ruiming Tang Xutao Li Weinan Zhang Yunming Ye Haokun Chen Huifeng Guo and Yuzhou Zhang. 2018. Deep reinforcement learning based recommendation with explicit user-item interactions modeling. arXiv preprintarXiv:1810.12027(2018).  Feng Liu Ruiming Tang Xutao Li Weinan Zhang Yunming Ye Haokun Chen Huifeng Guo and Yuzhou Zhang. 2018. Deep reinforcement learning based recommendation with explicit user-item interactions modeling. arXiv preprintarXiv:1810.12027(2018)."},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8460528"},{"key":"e_1_3_2_2_9_1","unstructured":"Andrei A Rusu Neil C Rabinowitz Guillaume Desjardins Hubert Soyer James Kirkpatrick Koray Kavukcuoglu Razvan Pascanu and Raia Hadsell. 2016. Progressive neural networks. arXiv preprint arXiv:1606.04671(2016).  Andrei A Rusu Neil C Rabinowitz Guillaume Desjardins Hubert Soyer James Kirkpatrick Koray Kavukcuoglu Razvan Pascanu and Raia Hadsell. 2016. Progressive neural networks. arXiv preprint arXiv:1606.04671(2016)."},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"crossref","unstructured":"Fereshteh Sadeghi and Sergey Levine. 2016. Cad2rl: Real single-image flight without a single real image.arXiv preprint arXiv:1611.04201(2016).  Fereshteh Sadeghi and Sergey Levine. 2016. Cad2rl: Real single-image flight without a single real image.arXiv preprint arXiv:1611.04201(2016).","DOI":"10.15607\/RSS.2017.XIII.034"},{"key":"e_1_3_2_2_11_1","first-page":"e26752","article-title":"The new york times annotated corpus","volume":"6","author":"Sandhaus Evan","year":"2008","unstructured":"Evan Sandhaus . 2008 . The new york times annotated corpus . Linguistic Data Consortium, Philadelphia 6 , 12 (2008), e26752 . Evan Sandhaus. 2008. The new york times annotated corpus. Linguistic Data Consortium, Philadelphia 6, 12 (2008), e26752.","journal-title":"Linguistic Data Consortium, Philadelphia"},{"key":"e_1_3_2_2_12_1","unstructured":"John Schulman Filip Wolski Prafulla Dhariwal Alec Radford and Oleg Klimov. 2017. Proximal Policy Optimization Algorithms. arXiv e-prints(2017). arXiv:1707.06347 [cs.LG]  John Schulman Filip Wolski Prafulla Dhariwal Alec Radford and Oleg Klimov. 2017. Proximal Policy Optimization Algorithms. arXiv e-prints(2017). arXiv:1707.06347 [cs.LG]"},{"key":"e_1_3_2_2_13_1","volume-title":"A survey on image data augmentation for deep learning.Journal of Big Data 6, 1","author":"Shorten Connor","year":"2019","unstructured":"Connor Shorten and Taghi M Khoshgoftaar . 2019. A survey on image data augmentation for deep learning.Journal of Big Data 6, 1 ( 2019 ), 60. Connor Shorten and Taghi M Khoshgoftaar. 2019. A survey on image data augmentation for deep learning.Journal of Big Data 6, 1 (2019), 60."},{"key":"e_1_3_2_2_14_1","unstructured":"Zhiwen Tang and Grace Hui Yang. 2019. Dynamic Search--Optimizing the Game of Information Seeking.arXiv preprint arXiv:1909.12425(2019).  Zhiwen Tang and Grace Hui Yang. 2019. Dynamic Search--Optimizing the Game of Information Seeking.arXiv preprint arXiv:1909.12425(2019)."},{"key":"e_1_3_2_2_15_1","volume-title":"Corpus-Level End-to-End Exploration for Interactive Systems. In AAAI '20","author":"Tang Zhiwen","year":"2020","unstructured":"Zhiwen Tang and Grace Hui Yang . 2020 . Corpus-Level End-to-End Exploration for Interactive Systems. In AAAI '20 . Zhiwen Tang and Grace Hui Yang. 2020. Corpus-Level End-to-End Exploration for Interactive Systems. In AAAI '20."},{"key":"e_1_3_2_2_16_1","unstructured":"Luke Taylor and Geoff Nitschke. 2017. Improving deep learning using generic data augmentation. arXiv preprint arXiv:1708.06020(2017).  Luke Taylor and Geoff Nitschke. 2017. Improving deep learning using generic data augmentation. arXiv preprint arXiv:1708.06020(2017)."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2017.8202133"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1670"},{"key":"e_1_3_2_2_19_1","volume-title":"TREC 2017 Dynamic Domain Track Overview. In TREC '17.","author":"Yang Grace Hui","year":"2017","unstructured":"Grace Hui Yang , Zhiwen Tang , and Ian Soboroff . 2017 . TREC 2017 Dynamic Domain Track Overview. In TREC '17. Grace Hui Yang, Zhiwen Tang, and Ian Soboroff. 2017. TREC 2017 Dynamic Domain Track Overview. In TREC '17."},{"key":"e_1_3_2_2_20_1","volume-title":"Long Xia, Jiliang Tang, and Dawei Yin with Martin Vesely as coordinator. ACM SIGWEB Newsletter Spring","author":"Zhao Xiangyu","year":"2019","unstructured":"Xiangyu Zhao , Long Xia , Jiliang Tang , and Dawei Yin . 2019. \" Deep reinforcement learning for search , recommendation, and online advertising: a survey\" by Xiangyu Zhao , Long Xia, Jiliang Tang, and Dawei Yin with Martin Vesely as coordinator. ACM SIGWEB Newsletter Spring ( 2019 ), 1--15. Xiangyu Zhao, Long Xia, Jiliang Tang, and Dawei Yin. 2019. \"Deep reinforcement learning for search, recommendation, and online advertising: a survey\" by Xiangyu Zhao, Long Xia, Jiliang Tang, and Dawei Yin with Martin Vesely as coordinator. ACM SIGWEB Newsletter Spring (2019), 1--15."}],"event":{"name":"SIGIR '20: The 43rd International ACM SIGIR conference on research and development in Information Retrieval","sponsor":["SIGIR ACM Special Interest Group on Information Retrieval"],"location":"Virtual Event China","acronym":"SIGIR '20"},"container-title":["Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3397271.3401200","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3397271.3401200","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:41:43Z","timestamp":1750200103000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3397271.3401200"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,7,25]]},"references-count":20,"alternative-id":["10.1145\/3397271.3401200","10.1145\/3397271"],"URL":"https:\/\/doi.org\/10.1145\/3397271.3401200","relation":{},"subject":[],"published":{"date-parts":[[2020,7,25]]},"assertion":[{"value":"2020-07-25","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}