{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,11]],"date-time":"2025-09-11T19:09:22Z","timestamp":1757617762732,"version":"3.44.0"},"publisher-location":"New York, NY, USA","reference-count":24,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,9,22]]},"DOI":"10.1145\/3705328.3759305","type":"proceedings-article","created":{"date-parts":[[2025,9,6]],"date-time":"2025-09-06T10:46:13Z","timestamp":1757155573000},"page":"1181-1186","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Evaluating Podcast Recommendations with Profile-Aware LLM-as-a-Judge"],"prefix":"10.1145","author":[{"given":"Francesco","family":"Fabbri","sequence":"first","affiliation":[{"name":"Spotify, Spain"}]},{"given":"Gustavo","family":"Penha","sequence":"additional","affiliation":[{"name":"Spotify, Netherlands"}]},{"given":"Edoardo","family":"D'Amico","sequence":"additional","affiliation":[{"name":"Spotify, Spain"}]},{"given":"Alice","family":"Wang","sequence":"additional","affiliation":[{"name":"Spotify, USA"}]},{"given":"Marco","family":"De Nadai","sequence":"additional","affiliation":[{"name":"Spotify, Denmark"}]},{"given":"Jackie","family":"Doremus","sequence":"additional","affiliation":[{"name":"Spotify, USA"}]},{"given":"Paul","family":"Gigioli","sequence":"additional","affiliation":[{"name":"Spotify, USA"}]},{"given":"Andreas","family":"Damianou","sequence":"additional","affiliation":[{"name":"Spotify, United Kingdom"}]},{"given":"Oskar","family":"St\u00e5l","sequence":"additional","affiliation":[{"name":"Spotify, Sweden"}]},{"given":"Mounia","family":"Lalmas","sequence":"additional","affiliation":[{"name":"Spotify, United Kingdom"}]}],"member":"320","published-online":{"date-parts":[[2025,9,7]]},"reference":[{"key":"e_1_3_3_1_2_2","unstructured":"Josh Achiam Steven Adler Sandhini Agarwal Lama Ahmad Ilge Akkaya Florencia\u00a0Leoni Aleman Diogo Almeida Janko Altenschmidt Sam Altman Shyamal Anadkat et\u00a0al. 2023. Gpt-4 technical report. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2303.08774 (2023)."},{"key":"e_1_3_3_1_3_2","unstructured":"Tom Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared\u00a0D Kaplan Prafulla Dhariwal Arvind Neelakantan Pranav Shyam Girish Sastry Amanda Askell et\u00a0al. 2020. Language models are few-shot learners. Advances in neural information processing systems 33 (2020) 1877\u20131901."},{"key":"e_1_3_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.findings-emnlp.592"},{"key":"e_1_3_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.naacl-long.365"},{"key":"e_1_3_3_1_6_2","doi-asserted-by":"crossref","unstructured":"Isabel\u00a0O Gallegos Ryan\u00a0A Rossi Joe Barrow Md\u00a0Mehrab Tanjim Sungchul Kim Franck Dernoncourt Tong Yu Ruiyi Zhang and Nesreen\u00a0K Ahmed. 2024. Bias and fairness in large language models: A survey. Computational Linguistics 50 3 (2024) 1097\u20131179.","DOI":"10.1162\/coli_a_00524"},{"key":"e_1_3_3_1_7_2","unstructured":"Jiawei Gu Xuhui Jiang Zhichao Shi Hexiang Tan Xuehao Zhai Chengjin Xu Wei Li Yinghan Shen Shengjie Ma Honghao Liu Saizhuo Wang Kun Zhang Yuanzhuo Wang Wen Gao Lionel Ni and Jian Guo. 2025. A Survey on LLM-as-a-Judge. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2411.15594 (2025). https:\/\/arxiv.org\/abs\/2411.15594"},{"key":"e_1_3_3_1_8_2","unstructured":"Chengkai Huang Tong Yu Kaige Xie Shuai Zhang Lina Yao and Julian McAuley. 2024. Foundation models for recommender systems: A survey and new perspectives. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2402.11143 (2024)."},{"key":"e_1_3_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3462805"},{"key":"e_1_3_3_1_10_2","doi-asserted-by":"crossref","unstructured":"Jianghao Lin Xinyi Dai Yunjia Xi Weiwen Liu Bo Chen Hao Zhang Yong Liu Chuhan Wu Xiangyang Li Chenxu Zhu et\u00a0al. 2025. How can recommender systems benefit from large language models: A survey. ACM Transactions on Information Systems 43 2 (2025) 1\u201347.","DOI":"10.1145\/3678004"},{"key":"e_1_3_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.emnlp-main.153"},{"key":"e_1_3_3_1_12_2","volume-title":"Proceedings of the 42nd International Conference on Machine Learning (ICML 2025)","author":"Sahoo Aishwarya","year":"2025","unstructured":"Aishwarya Sahoo, Jeevana\u00a0Kruthi Karnuthala, Tushar\u00a0Parmanand Budhwani, Pranchal Agarwal, Sankaran Vaidyanathan, Alexa Siu, Franck Dernoncourt, Jennifer Healey, Nedim Lipka, Ryan Rossi, Uttaran Bhattacharya, and Branislav Kveton. 2025. Quantitative LLM Judges. In Proceedings of the 42nd International Conference on Machine Learning (ICML 2025). https:\/\/arxiv.org\/abs\/2506.02945 Spotlight, to appear."},{"key":"e_1_3_3_1_13_2","unstructured":"Kun Su Krishna Sayana Hubert Pham James Pine Yuri Vasilevski Raghavendra Vasudeva Marialena Kyriakidi Liam Hebert Ambarish Jash Anushya Subbiah et\u00a0al. 2025. REGEN: A Dataset and Benchmarks with Natural Language Critiques and Narratives. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2503.11924 (2025)."},{"key":"e_1_3_3_1_14_2","unstructured":"Aman\u00a0Singh Thakur Kartik Choudhary Venkat\u00a0Srinik Ramayapally Sankaran Vaidyanathan and Dieuwke Hupkes. 2025. Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2406.12624 (2025). https:\/\/arxiv.org\/abs\/2406.12624"},{"key":"e_1_3_3_1_15_2","doi-asserted-by":"publisher","DOI":"10.1145\/3626772.3657707"},{"key":"e_1_3_3_1_16_2","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 (2017)."},{"key":"e_1_3_3_1_17_2","unstructured":"Jianling Wang Yifan Liu Yinghao Sun Xuejian Ma Yueqi Wang He Ma Zhengyang Su Minmin Chen Mingyan Gao Onkar Dalal et\u00a0al. 2025. User Feedback Alignment for LLM-powered Exploration in Large-scale Recommendation Systems. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2504.05522 (2025)."},{"key":"e_1_3_3_1_18_2","doi-asserted-by":"publisher","DOI":"10.1145\/3589335.3651532"},{"key":"e_1_3_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1145\/3640457.3688161"},{"key":"e_1_3_3_1_20_2","unstructured":"Jason Wei Xuezhi Wang Dale Schuurmans Maarten Bosma Fei Xia Ed Chi Quoc\u00a0V Le Denny Zhou et\u00a0al. 2022. Chain-of-thought prompting elicits reasoning in large language models. Advances in neural information processing systems 35 (2022) 24824\u201324837."},{"key":"e_1_3_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2025.acl-long.470"},{"key":"e_1_3_3_1_22_2","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR 2025)","author":"Ye Jiayi","year":"2025","unstructured":"Jiayi Ye, Yanbo Wang, Yue Huang, Dongping Chen, Qihui Zhang, Nuno Moniz, Tian Gao, Werner Geyer, Chao Huang, Pin-Yu Chen, Nitesh\u00a0V. Chawla, and Xiangliang Zhang. 2025. Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge. In Proceedings of the International Conference on Learning Representations (ICLR 2025). https:\/\/openreview.net\/forum?id=3GTtZFiajM Poster."},{"key":"e_1_3_3_1_23_2","unstructured":"Weizhi Zhang Yuanchen Bei Liangwei Yang Henry\u00a0Peng Zou Peilin Zhou Aiwei Liu Yinghui Li Hao Chen Jianling Wang Yu Wang et\u00a0al. 2025. Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2501.01945 (2025)."},{"key":"e_1_3_3_1_24_2","volume-title":"Advances in Neural Information Processing Systems 36 (NeurIPS 2023), Datasets and Benchmarks Track","author":"Zheng Lianmin","year":"2023","unstructured":"Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zi Lin, Zhuohan Li, Dacheng Li, Eric\u00a0P. Xing, Hao Zhang, Joseph\u00a0E. Gonzalez, and Ion Stoica. 2023. Judging LLM-as-a-Judge with MT\u2010Bench and Chatbot Arena. In Advances in Neural Information Processing Systems 36 (NeurIPS 2023), Datasets and Benchmarks Track. https:\/\/proceedings.neurips.cc\/paper\/91f18a1287b398d378ef22505bf41832"},{"volume-title":"The Thirteenth International Conference on Learning Representations","author":"Zhu Lianghui","key":"e_1_3_3_1_25_2","unstructured":"Lianghui Zhu, Xinggang Wang, and Xinlong Wang. [n. d.]. JudgeLM: Fine-tuned Large Language Models are Scalable Judges. In The Thirteenth International Conference on Learning Representations."}],"event":{"name":"RecSys '25: Nineteenth ACM Conference on Recommender Systems","sponsor":["SIGCHI ACM Special Interest Group on Computer-Human Interaction","SIGAI ACM Special Interest Group on Artificial Intelligence","SIGIR ACM Special Interest Group on Information Retrieval","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data","SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web"],"location":"Prague Czech Republic","acronym":"RecSys '25"},"container-title":["Proceedings of the Nineteenth ACM Conference on Recommender Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3705328.3759305","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,6]],"date-time":"2025-09-06T11:42:13Z","timestamp":1757158933000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3705328.3759305"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,7]]},"references-count":24,"alternative-id":["10.1145\/3705328.3759305","10.1145\/3705328"],"URL":"https:\/\/doi.org\/10.1145\/3705328.3759305","relation":{},"subject":[],"published":{"date-parts":[[2025,9,7]]},"assertion":[{"value":"2025-09-07","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}