{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T10:04:21Z","timestamp":1775815461802,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":44,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,8,4]],"date-time":"2023-08-04T00:00:00Z","timestamp":1691107200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,8,6]]},"DOI":"10.1145\/3580305.3599386","type":"proceedings-article","created":{"date-parts":[[2023,8,4]],"date-time":"2023-08-04T18:10:58Z","timestamp":1691172658000},"page":"1687-1697","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Impatient Bandits: Optimizing Recommendations for the Long-Term Without Delay"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7301-4399","authenticated-orcid":false,"given":"Thomas M.","family":"McDonald","sequence":"first","affiliation":[{"name":"University of Manchester, Manchester, United Kingdom"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8307-7673","authenticated-orcid":false,"given":"Lucas","family":"Maystre","sequence":"additional","affiliation":[{"name":"Spotify, London, United Kingdom"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3531-3096","authenticated-orcid":false,"given":"Mounia","family":"Lalmas","sequence":"additional","affiliation":[{"name":"Spotify, London, United Kingdom"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5926-8624","authenticated-orcid":false,"given":"Daniel","family":"Russo","sequence":"additional","affiliation":[{"name":"University of Columbia &amp; Spotify, New York, NY, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0238-9393","authenticated-orcid":false,"given":"Kamil","family":"Ciosek","sequence":"additional","affiliation":[{"name":"Spotify, London, United Kingdom"}]}],"member":"320","published-online":{"date-parts":[[2023,8,4]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"Conference on learning theory. JMLR Workshop and Conference Proceedings, 39--1.","author":"Agrawal Shipra","year":"2012","unstructured":"Shipra Agrawal and Navin Goyal . 2012 . Analysis of Thompson sampling for the multi-armed bandit problem . In Conference on learning theory. JMLR Workshop and Conference Proceedings, 39--1. Shipra Agrawal and Navin Goyal. 2012. Analysis of Thompson sampling for the multi-armed bandit problem. In Conference on learning theory. JMLR Workshop and Conference Proceedings, 39--1."},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3523227.3546766"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1287\/mnsc.2021.4071"},{"key":"e_1_3_2_2_5_1","first-page":"28029","article-title":"No regrets for learning the prior in bandits","volume":"34","author":"Basu Soumya","year":"2021","unstructured":"Soumya Basu , Branislav Kveton , Manzil Zaheer , and Csaba Szepesv\u00e1ri . 2021 . No regrets for learning the prior in bandits . Advances in Neural Information Processing Systems , Vol. 34 (2021), 28029 -- 28041 . Soumya Basu, Branislav Kveton, Manzil Zaheer, and Csaba Szepesv\u00e1ri. 2021. No regrets for learning the prior in bandits. Advances in Neural Information Processing Systems, Vol. 34 (2021), 28029--28041.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_2_6_1","volume-title":"Proceedings of KDDCup '07","author":"Bennett James","year":"2007","unstructured":"James Bennett and Stan Lanning . 2007 . The Netflix Prize . In Proceedings of KDDCup '07 . San Jose, CA, USA. James Bennett and Stan Lanning. 2007. The Netflix Prize. In Proceedings of KDDCup '07. San Jose, CA, USA."},{"key":"e_1_3_2_2_7_1","unstructured":"Veronika Bogina and Tsvi Kuflik. 2017. Incorporating Dwell Time in Session-Based Recommendations with Recurrent Neural Networks. In RecTemp@ RecSys. 57--59.  Veronika Bogina and Tsvi Kuflik. 2017. Incorporating Dwell Time in Session-Based Recommendations with Recurrent Neural Networks. In RecTemp@ RecSys. 57--59."},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"crossref","unstructured":"Stefano Caria Maximilian Kasy Simon Quinn Soha Shami Alex Teytelboym etal 2020. An adaptive targeted field experiment: Job search assistance for refugees in Jordan. (2020).  Stefano Caria Maximilian Kasy Simon Quinn Soha Shami Alex Teytelboym et al. 2020. An adaptive targeted field experiment: Job search assistance for refugees in Jordan. (2020).","DOI":"10.2139\/ssrn.3689456"},{"key":"e_1_3_2_2_9_1","volume-title":"Advances in Neural Information Processing Systems","volume":"24","author":"Chapelle Olivier","year":"2011","unstructured":"Olivier Chapelle and Lihong Li . 2011 . An empirical evaluation of Thompson sampling . Advances in Neural Information Processing Systems , Vol. 24 (2011). Olivier Chapelle and Lihong Li. 2011. An empirical evaluation of Thompson sampling. Advances in Neural Information Processing Systems, Vol. 24 (2011)."},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/3269206.3271700"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.5555\/2627435.2750368"},{"key":"e_1_3_2_2_12_1","volume-title":"Thomas Borchert, and Ralf Herbrich.","author":"Graepel Thore","year":"2010","unstructured":"Thore Graepel , Joaquin Quinonero Candela , Thomas Borchert, and Ralf Herbrich. 2010 . Web-scale Bayesian click-through rate prediction for sponsored search advertising in Microsoft's Bing search engine. Omnipress . Thore Graepel, Joaquin Quinonero Candela, Thomas Borchert, and Ralf Herbrich. 2010. Web-scale Bayesian click-through rate prediction for sponsored search advertising in Microsoft's Bing search engine. Omnipress."},{"key":"e_1_3_2_2_13_1","volume-title":"International Conference on Artificial Intelligence and Statistics. PMLR, 833--842","author":"Grover Aditya","year":"2018","unstructured":"Aditya Grover , Todor Markov , Peter Attia , Norman Jin , Nicolas Perkins , Bryan Cheong , Michael Chen , Zi Yang , Stephen Harris , William Chueh , 2018 . Best arm identification in multi-armed bandits with delayed feedback . In International Conference on Artificial Intelligence and Statistics. PMLR, 833--842 . Aditya Grover, Todor Markov, Peter Attia, Norman Jin, Nicolas Perkins, Bryan Cheong, Michael Chen, Zi Yang, Stephen Harris, William Chueh, et al. 2018. Best arm identification in multi-armed bandits with delayed feedback. In International Conference on Artificial Intelligence and Statistics. PMLR, 833--842."},{"key":"e_1_3_2_2_14_1","volume-title":"The Elements of Statistical Learning: Data Mining, Inference, and Prediction","author":"Hastie Trevor","unstructured":"Trevor Hastie , Robert Tibshirani , and Jerome Friedman . 2009. The Elements of Statistical Learning: Data Mining, Inference, and Prediction second ed.). Springer . Trevor Hastie, Robert Tibshirani, and Jerome Friedman. 2009. The Elements of Statistical Learning: Data Mining, Inference, and Prediction second ed.). Springer."},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2783258.2788583"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3308560.3320087"},{"key":"e_1_3_2_2_17_1","volume-title":"International Conference on Machine Learning. PMLR, 1453--1461","author":"Joulani Pooria","year":"2013","unstructured":"Pooria Joulani , Andras Gyorgy , and Csaba Szepesv\u00e1ri . 2013 . Online learning under delayed feedback . In International Conference on Machine Learning. PMLR, 1453--1461 . Pooria Joulani, Andras Gyorgy, and Csaba Szepesv\u00e1ri. 2013. Online learning under delayed feedback. In International Conference on Machine Learning. PMLR, 1453--1461."},{"key":"e_1_3_2_2_18_1","volume-title":"International Conference on Artificial Intelligence and Statistics. PMLR, 133--142","author":"Kandasamy Kirthevasan","year":"2018","unstructured":"Kirthevasan Kandasamy , Akshay Krishnamurthy , Jeff Schneider , and Barnab\u00e1s P\u00f3czos . 2018 . Parallelised Bayesian optimisation via Thompson sampling . In International Conference on Artificial Intelligence and Statistics. PMLR, 133--142 . Kirthevasan Kandasamy, Akshay Krishnamurthy, Jeff Schneider, and Barnab\u00e1s P\u00f3czos. 2018. Parallelised Bayesian optimisation via Thompson sampling. In International Conference on Artificial Intelligence and Statistics. PMLR, 133--142."},{"key":"e_1_3_2_2_19_1","volume-title":"Quantifying the carbon emissions of machine learning. arXiv preprint arXiv:1910.09700","author":"Lacoste Alexandre","year":"2019","unstructured":"Alexandre Lacoste , Alexandra Luccioni , Victor Schmidt , and Thomas Dandres . 2019. Quantifying the carbon emissions of machine learning. arXiv preprint arXiv:1910.09700 ( 2019 ). Alexandre Lacoste, Alexandra Luccioni, Victor Schmidt, and Thomas Dandres. 2019. Quantifying the carbon emissions of machine learning. arXiv preprint arXiv:1910.09700 (2019)."},{"key":"e_1_3_2_2_20_1","volume-title":"Measuring user engagement. Synthesis lectures on information concepts, retrieval, and services","author":"Lalmas Mounia","year":"2014","unstructured":"Mounia Lalmas , Heather O'Brien , and Elad Yom-Tov . 2014. Measuring user engagement. Synthesis lectures on information concepts, retrieval, and services , Vol. 6 , 4 ( 2014 ), 1--132. Mounia Lalmas, Heather O'Brien, and Elad Yom-Tov. 2014. Measuring user engagement. Synthesis lectures on information concepts, retrieval, and services, Vol. 6, 4 (2014), 1--132."},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1772690.1772758"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.5555\/3122009.3242042"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.infsof.2019.05.004"},{"key":"e_1_3_2_2_24_1","volume-title":"Optimizing Audio Recommendations for the Long-Term: A Reinforcement Learning Perspective. (Feb","author":"Maystre Lucas","year":"2023","unstructured":"Lucas Maystre , Dan Russo , and Yu Zhao . 2023. Optimizing Audio Recommendations for the Long-Term: A Reinforcement Learning Perspective. (Feb . 2023 ). Preprint , textttarXiv:2302.03561v2 [cs.LG] . Lucas Maystre, Dan Russo, and Yu Zhao. 2023. Optimizing Audio Recommendations for the Long-Term: A Reinforcement Learning Perspective. (Feb. 2023). Preprint, textttarXiv:2302.03561v2 [cs.LG] ."},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240323.3240354"},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1002\/sim.4780080407"},{"key":"e_1_3_2_2_27_1","volume-title":"Adaptivity and confounding in multi-armed bandit experiments. arXiv preprint arXiv:2202.09036","author":"Qin Chao","year":"2022","unstructured":"Chao Qin and Daniel Russo . 2022. Adaptivity and confounding in multi-armed bandit experiments. arXiv preprint arXiv:2202.09036 ( 2022 ). Chao Qin and Daniel Russo. 2022. Adaptivity and confounding in multi-armed bandit experiments. arXiv preprint arXiv:2202.09036 (2022)."},{"key":"e_1_3_2_2_28_1","volume-title":"Christopher KI Williams, et al","author":"Rasmussen Carl Edward","year":"2006","unstructured":"Carl Edward Rasmussen , Christopher KI Williams, et al . 2006 . Gaussian processes for machine learning. Vol. 1 . Springer . Carl Edward Rasmussen, Christopher KI Williams, et al. 2006. Gaussian processes for machine learning. Vol. 1. Springer."},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"crossref","unstructured":"F. Ricci L. Rokach and B. Shapira. 2015. Recommender Systems Handbook second ed.). Springer.  F. Ricci L. Rokach and B. Shapira. 2015. Recommender Systems Handbook second ed.). Springer.","DOI":"10.1007\/978-1-4899-7637-6"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.5555\/2946645.3007021"},{"key":"e_1_3_2_2_31_1","volume-title":"Abbas Kazerouni, Ian Osband, Zheng Wen, et al.","author":"Russo Daniel J","year":"2018","unstructured":"Daniel J Russo , Benjamin Van Roy , Abbas Kazerouni, Ian Osband, Zheng Wen, et al. 2018 . A tutorial on Thompson sampling. Foundations and Trends\u00ae in Machine Learning , Vol. 11 , 1 (2018), 1--96. Daniel J Russo, Benjamin Van Roy, Abbas Kazerouni, Ian Osband, Zheng Wen, et al. 2018. A tutorial on Thompson sampling. Foundations and Trends\u00ae in Machine Learning, Vol. 11, 1 (2018), 1--96."},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1002\/asmb.874"},{"key":"e_1_3_2_2_33_1","first-page":"26382","article-title":"Bayesian decision-making under misspecified priors with applications to meta-learning","volume":"34","author":"Simchowitz Max","year":"2021","unstructured":"Max Simchowitz , Christopher Tosh , Akshay Krishnamurthy , Daniel J Hsu , Thodoris Lykouris , Miro Dudik , and Robert E Schapire . 2021 . Bayesian decision-making under misspecified priors with applications to meta-learning . Advances in Neural Information Processing Systems , Vol. 34 (2021), 26382 -- 26394 . Max Simchowitz, Christopher Tosh, Akshay Krishnamurthy, Daniel J Hsu, Thodoris Lykouris, Miro Dudik, and Robert E Schapire. 2021. Bayesian decision-making under misspecified priors with applications to meta-learning. Advances in Neural Information Processing Systems, Vol. 34 (2021), 26382--26394.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_2_34_1","volume-title":"Foundations and Trends\u00ae in Machine Learning","volume":"12","author":"Aleksandrs","year":"2019","unstructured":"Aleksandrs Slivkins et al. 2019. Introduction to multi-armed bandits . Foundations and Trends\u00ae in Machine Learning , Vol. 12 , 1--2 ( 2019 ), 1--286. Aleksandrs Slivkins et al. 2019. Introduction to multi-armed bandits. Foundations and Trends\u00ae in Machine Learning, Vol. 12, 1--2 (2019), 1--286."},{"key":"e_1_3_2_2_35_1","volume-title":"Gaussian process optimization in the bandit setting: No regret and experimental design. arXiv preprint arXiv:0912.3995","author":"Srinivas Niranjan","year":"2009","unstructured":"Niranjan Srinivas , Andreas Krause , Sham M Kakade , and Matthias Seeger . 2009. Gaussian process optimization in the bandit setting: No regret and experimental design. arXiv preprint arXiv:0912.3995 ( 2009 ). Niranjan Srinivas, Andreas Krause, Sham M Kakade, and Matthias Seeger. 2009. Gaussian process optimization in the bandit setting: No regret and experimental design. arXiv preprint arXiv:0912.3995 (2009)."},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1093\/biomet\/25.3-4.285"},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10844-020-00633-6"},{"key":"e_1_3_2_2_38_1","first-page":"318","article-title":"Context-Aware Recommender Systems for Learning: A Survey and Future Challenges","volume":"5","author":"Verbert Katrien","year":"2012","unstructured":"Katrien Verbert , Nikos Manouselis , Xavier Ochoa , Martin Wolpers , Hendrik Drachsler , Ivana Bosnic , and Erik Duval . 2012 . Context-Aware Recommender Systems for Learning: A Survey and Future Challenges . Journal of Intelligent Information Systems , Vol. 5 , 4 (2012), 318 -- 335 . Katrien Verbert, Nikos Manouselis, Xavier Ochoa, Martin Wolpers, Hendrik Drachsler, Ivana Bosnic, and Erik Duval. 2012. Context-Aware Recommender Systems for Learning: A Survey and Future Challenges. Journal of Intelligent Information Systems, Vol. 5, 4 (2012), 318--335.","journal-title":"Journal of Intelligent Information Systems"},{"key":"e_1_3_2_2_39_1","volume-title":"Partial Likelihood Thompson Sampling. arXiv preprint arXiv:2203.00820","author":"Wu Han","year":"2022","unstructured":"Han Wu and Stefan Wager . 2022a. Partial Likelihood Thompson Sampling. arXiv preprint arXiv:2203.00820 ( 2022 ). Han Wu and Stefan Wager. 2022a. Partial Likelihood Thompson Sampling. arXiv preprint arXiv:2203.00820 (2022)."},{"key":"e_1_3_2_2_40_1","volume-title":"Thompson Sampling with Unrestricted Delays. arXiv preprint arXiv:2202.12431","author":"Wu Han","year":"2022","unstructured":"Han Wu and Stefan Wager . 2022b. Thompson Sampling with Unrestricted Delays. arXiv preprint arXiv:2202.12431 ( 2022 ). Han Wu and Stefan Wager. 2022b. Thompson Sampling with Unrestricted Delays. arXiv preprint arXiv:2202.12431 (2022)."},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3132847.3133025"},{"key":"e_1_3_2_2_42_1","volume-title":"Targeting for long-term outcomes. arXiv preprint arXiv:2010.15835","author":"Yang Jeremy","year":"2020","unstructured":"Jeremy Yang , Dean Eckles , Paramveer Dhillon , and Sinan Aral . 2020. Targeting for long-term outcomes. arXiv preprint arXiv:2010.15835 ( 2020 ). Jeremy Yang, Dean Eckles, Paramveer Dhillon, and Sinan Aral. 2020. Targeting for long-term outcomes. arXiv preprint arXiv:2010.15835 (2020)."},{"key":"e_1_3_2_2_43_1","volume-title":"Deep reinforcement learning for search, recommendation, and online advertising: a survey. ACM SIGWEB newsletter Spring","author":"Zhao Xiangyu","year":"2019","unstructured":"Xiangyu Zhao , Long Xia , Jiliang Tang , and Dawei Yin . 2019. Deep reinforcement learning for search, recommendation, and online advertising: a survey. ACM SIGWEB newsletter Spring ( 2019 ), 1--15. Xiangyu Zhao, Long Xia, Jiliang Tang, and Dawei Yin. 2019. Deep reinforcement learning for search, recommendation, and online advertising: a survey. ACM SIGWEB newsletter Spring (2019), 1--15."},{"key":"e_1_3_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/3178876.3185994"},{"key":"e_1_3_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330668"}],"event":{"name":"KDD '23: The 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining","location":"Long Beach CA USA","acronym":"KDD '23","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data"]},"container-title":["Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3580305.3599386","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3580305.3599386","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:37:48Z","timestamp":1750178268000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3580305.3599386"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,4]]},"references-count":44,"alternative-id":["10.1145\/3580305.3599386","10.1145\/3580305"],"URL":"https:\/\/doi.org\/10.1145\/3580305.3599386","relation":{},"subject":[],"published":{"date-parts":[[2023,8,4]]},"assertion":[{"value":"2023-08-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}