{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,28]],"date-time":"2026-04-28T02:14:49Z","timestamp":1777342489604,"version":"3.51.4"},"reference-count":85,"publisher":"Association for Computing Machinery (ACM)","issue":"1","funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62422215 and 62472427"],"award-info":[{"award-number":["62422215 and 62472427"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Beijing Outstanding Young Scientist Program","award":["BJJWZYJH012019100020098"],"award-info":[{"award-number":["BJJWZYJH012019100020098"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Inf. Syst."],"published-print":{"date-parts":[[2026,1,31]]},"abstract":"<jats:p>Recent advancements in explainable recommendation have greatly bolstered user experience by elucidating the decision-making rationale. However, the existing methods actually fail to provide effective feedback signals for potentially better or worse generated explanations due to their reliance on traditional supervised learning paradigms in sparse interaction data. To address these issues, we propose a novel human-like feedback-driven optimization framework. This framework employs a dynamic interactive optimization mechanism for achieving human-centered explainable requirements without incurring high labor costs. Specifically, we propose to utilize large language models (LLMs) as human simulators to predict human-like feedback for guiding the learning process. To enable the LLMs to deeply understand the task essence and meet user\u2019s diverse personalized requirements, we introduce a human-induced customized reward scoring method, which helps stimulate the language understanding and logical reasoning capabilities of LLMs. Furthermore, considering the potential conflicts between different perspectives of explanation quality, we introduce a principled Pareto optimization that transforms the multi-perspective quality enhancement task into a multi-objective optimization problem for improving explanation performance. At last, to achieve efficient model training, we design an off-policy optimization pipeline. By incorporating a replay buffer and addressing the data distribution biases, we can effectively improve data utilization and enhance model generality. Extensive experiments on four datasets demonstrate the superiority of our approach.<\/jats:p>","DOI":"10.1145\/3758091","type":"journal-article","created":{"date-parts":[[2025,8,5]],"date-time":"2025-08-05T15:24:04Z","timestamp":1754407444000},"page":"1-31","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Explainable Recommendation with Simulated Human Feedback"],"prefix":"10.1145","volume":"44","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9543-8889","authenticated-orcid":false,"given":"Jiakai","family":"Tang","sequence":"first","affiliation":[{"name":"Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2997-3386","authenticated-orcid":false,"given":"Jingsen","family":"Zhang","sequence":"additional","affiliation":[{"name":"Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-8292-5384","authenticated-orcid":false,"given":"Zihang","family":"Tian","sequence":"additional","affiliation":[{"name":"Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-5595-3775","authenticated-orcid":false,"given":"Xueyang","family":"Feng","sequence":"additional","affiliation":[{"name":"Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-7769-6918","authenticated-orcid":false,"given":"Lei","family":"Wang","sequence":"additional","affiliation":[{"name":"Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0144-1775","authenticated-orcid":false,"given":"Xu","family":"Chen","sequence":"additional","affiliation":[{"name":"Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China"}]}],"member":"320","published-online":{"date-parts":[[2025,10,14]]},"reference":[{"key":"e_1_3_2_2_2","first-page":"337","volume-title":"International Conference on Machine Learning","author":"Aher Gati V.","year":"2023","unstructured":"Gati V. Aher, Rosa I. Arriaga, and Adam Tauman Kalai. 2023. Using large language models to simulate multiple humans and replicate human subject studies. In International Conference on Machine Learning. PMLR, 337\u2013371."},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.3390\/a11090137"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1145\/3331184.3331211"},{"key":"e_1_3_2_5_2","first-page":"1877","article-title":"Language models are few-shot learners","volume":"33","author":"Brown Tom","year":"2020","unstructured":"Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D. Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al. 2020. Language models are few-shot learners. In Advances in Neural Information Processing Systems, Vol. 33, 1877\u20131901.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.5555\/3495724.3495883"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1007\/BF01442131"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1145\/3442381.3449973"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1145\/3442381.3449846"},{"key":"e_1_3_2_10_2","first-page":"51","volume-title":"61st Annual Meeting of the Association for Computational Linguistics (ACL \u201923)","author":"Cheng Hao","year":"2023","unstructured":"Hao Cheng, Shuo Wang, Wensheng Lu, Wei Zhang, Mingyang Zhou, Kezhong Lu, and Hao Liao. 2023. Explainable recommendation with personalized review retrieval and aspect learning. In 61st Annual Meeting of the Association for Computational Linguistics (ACL \u201923). Association for Computational Linguistics, 51\u201364."},{"key":"e_1_3_2_11_2","unstructured":"Zhixuan Chu Yan Wang Qing Cui Longfei Li Wenqing Chen Sheng Li Zhan Qin and Kui Ren. 2024. LLM-guided multi-view hypergraph learning for human-centric explainable recommendation. arXiv:2401.08217. Retrieved from https:\/\/arxiv.org\/abs\/2401.08217"},{"key":"e_1_3_2_12_2","volume-title":"NIPS 2014 Workshop on Deep Learning","author":"Chung Junyoung","year":"2014","unstructured":"Junyoung Chung, Caglar Gulcehre, Kyunghyun Cho, and Yoshua Bengio. 2014. Empirical evaluation of gated recurrent neural networks on sequence modeling. In NIPS 2014 Workshop on Deep Learning."},{"key":"e_1_3_2_13_2","first-page":"1","volume-title":"6th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP","author":"Colas Anthony","year":"2023","unstructured":"Anthony Colas, Jun Araki, Zhengyu Zhou, Bingqing Wang, and Zhe Feng. 2023. Knowledge-grounded natural language recommendation explanation. In 6th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP. Association for Computational Linguistics, Singapore, 1\u201315."},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.5555\/3042573.3042600"},{"key":"e_1_3_2_15_2","first-page":"4171","volume-title":"2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, Minneapolis, Minnesota, 4171\u20134186."},{"key":"e_1_3_2_16_2","first-page":"623","volume-title":"15th Conference of the European Chapter of the Association for Computational Linguistics","author":"Dong Li","year":"2017","unstructured":"Li Dong, Shaohan Huang, Furu Wei, Mirella Lapata, Ming Zhou, and Ke Xu. 2017. Learning to generate product reviews from attributes. In 15th Conference of the European Chapter of the Association for Computational Linguistics, 623\u2013632."},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401051"},{"key":"e_1_3_2_18_2","first-page":"3622","volume-title":"AAAI Conference on Artificial Intelligence","volume":"33","author":"Gao Jingyue","year":"2019","unstructured":"Jingyue Gao, Xiting Wang, Yasha Wang, and Xing Xie. 2019. Explainable recommendation through attentive multi-view learning. In AAAI Conference on Artificial Intelligence, Vol. 33, 3622\u20133629."},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1145\/3523227.3546767"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.2307\/2346830"},{"issue":"4","key":"e_1_3_2_21_2","first-page":"542","article-title":"Pareto optimal redistribution","volume":"59","author":"Hochman Harold M.","year":"1969","unstructured":"Harold M. Hochman and James D. Rodgers. 1969. Pareto optimal redistribution. The American Economic Review 59, 4 (1969), 542\u2013557.","journal-title":"The American Economic Review"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_2_23_2","unstructured":"Jie Huang and Kevin Chen-Chuan Chang. 2022. Towards reasoning in large language models: A survey. arXiv:2212.10403. Retrieved from https:\/\/arxiv.org\/abs\/2212.10403"},{"key":"e_1_3_2_24_2","unstructured":"Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv:1412.6980. Retrieved from https:\/\/arxiv.org\/abs\/1412.6980"},{"key":"e_1_3_2_25_2","article-title":"Actor-critic algorithms","volume":"12","author":"Konda Vijay","year":"1999","unstructured":"Vijay Konda and John Tsitsiklis. 1999. Actor-critic algorithms. In Advances in Neural Information Processing Systems, Vol. 12.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1145\/1644873.1644874"},{"key":"e_1_3_2_27_2","doi-asserted-by":"crossref","first-page":"7871","DOI":"10.18653\/v1\/2020.acl-main.703","volume-title":"58th Annual Meeting of the Association for Computational Linguistics","author":"Lewis Mike","year":"2020","unstructured":"Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Veselin Stoyanov, and Luke Zettlemoyer. 2020. BART: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. In 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Online, 7871\u20137880."},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1145\/3580305.3599535"},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.1145\/3580305.3599519"},{"key":"e_1_3_2_30_2","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1007\/s10844-020-00631-8","article-title":"CAESAR: Context-aware explanation based on supervised attention for service recommendations","author":"Li Lei","year":"2021","unstructured":"Lei Li, Li Chen, and Ruihai Dong. 2021. CAESAR: Context-aware explanation based on supervised attention for service recommendations. Journal of Intelligent Information Systems 57 (Aug. 2021), 147\u2013170.","journal-title":"Journal of Intelligent Information Systems"},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.1145\/3340531.3411992"},{"key":"e_1_3_2_32_2","first-page":"4947","volume-title":"59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing","author":"Li Lei","year":"2021","unstructured":"Lei Li, Yongfeng Zhang, and Li Chen. 2021. Personalized transformer for explainable recommendation. In 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing. Association for Computational Linguistics, 4947\u20134957."},{"issue":"2","key":"e_1_3_2_33_2","article-title":"On the relationship between explanation and recommendation: Learning to rank explanations for improved performance","volume":"14","author":"Li Lei","year":"2023","unstructured":"Lei Li, Yongfeng Zhang, and Li Chen. 2023. On the relationship between explanation and recommendation: Learning to rank explanations for improved performance. ACM Transactiosn on Intelligent Systems and Technology 14, 2, Article 21(Feb. 2023), 24 pages.","journal-title":"ACM Transactiosn on Intelligent Systems and Technology"},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.1145\/3580488"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.1145\/3583780.3615017"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1145\/3077136.3080822"},{"key":"e_1_3_2_37_2","volume-title":"4th International Conference on Learning Representations (ICLR \u201916)","author":"Lillicrap Timothy P.","year":"2016","unstructured":"Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2016. Continuous control with deep reinforcement learning. In 4th International Conference on Learning Representations (ICLR \u201916)."},{"key":"e_1_3_2_38_2","first-page":"74","volume-title":"Text Summarization Branches out","author":"Lin Chin-Yew","year":"2004","unstructured":"Chin-Yew Lin. 2004. Rouge: A package for automatic evaluation of summaries. In Text Summarization Branches out, 74\u201381."},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.1145\/3298689.3346998"},{"key":"e_1_3_2_40_2","first-page":"481","article-title":"Pareto optimality","author":"Luc Dinh The","year":"2008","unstructured":"Dinh The Luc. 2008. Pareto optimality. Pareto Optimality, Game Theory and Equilibria, 481\u2013515.","journal-title":"Pareto Optimality, Game Theory and Equilibria"},{"key":"e_1_3_2_41_2","article-title":"Probabilistic matrix factorization","volume":"20","author":"Mnih Andriy","year":"2007","unstructured":"Andriy Mnih and Russ R. Salakhutdinov. 2007. Probabilistic matrix factorization. In Advances in Neural Information Processing Systems, Vol. 20.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_42_2","first-page":"1928","volume-title":"International Conference on Machine Learning","author":"Mnih Volodymyr","year":"2016","unstructured":"Volodymyr Mnih, Adria Puigdomenech Badia, Mehdi Mirza, Alex Graves, Timothy Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. 2016. Asynchronous methods for deep reinforcement learning. In International Conference on Machine Learning. PMLR, 1928\u20131937."},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.1038\/nature14236"},{"key":"e_1_3_2_44_2","first-page":"1597","volume-title":"15th ACM International Conference on Web Search and Data Mining","author":"Ovaisi Zohreh","year":"2022","unstructured":"Zohreh Ovaisi, Shelby Heinecke, Jia Li, Yongfeng Zhang, Elena Zheleva, and Caiming Xiong. 2022. RGRecSys: A toolkit for robustness evaluation of recommender systems. In 15th ACM International Conference on Web Search and Data Mining, 1597\u20131600."},{"key":"e_1_3_2_45_2","volume-title":"40th Annual Meeting on Association for Computational Linguistics (ACL \u201902)","author":"Papineni Kishore","year":"2001","unstructured":"Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2001. BLEU. In 40th Annual Meeting on Association for Computational Linguistics (ACL \u201902)."},{"key":"e_1_3_2_46_2","article-title":"Pytorch: An imperative style, high-performance deep learning library","volume":"32","author":"Paszke Adam","year":"2019","unstructured":"Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et al. 2019. Pytorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems, Vol. 32.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_47_2","doi-asserted-by":"crossref","first-page":"2060","DOI":"10.1145\/3219819.3220072","volume-title":"24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","author":"Peake Georgina","year":"2018","unstructured":"Georgina Peake and Jun Wang. 2018. Explanation mining: Post hoc interpretability of latent factor models for recommendation systems. In 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2060\u20132069."},{"issue":"8","key":"e_1_3_2_48_2","first-page":"9","article-title":"Language models are unsupervised multitask learners","volume":"1","author":"Radford Alec","year":"2019","unstructured":"Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, and Ilya Sutskever. 2019. Language models are unsupervised multitask learners. OpenAI Blog 1, 8 (2019), 9.","journal-title":"OpenAI Blog"},{"key":"e_1_3_2_49_2","first-page":"71095","article-title":"Rewarded soups: Towards pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards","volume":"36","author":"Rame Alexandre","year":"2023","unstructured":"Alexandre Rame, Guillaume Couairon, Corentin Dancette, Jean-Baptiste Gaya, Mustafa Shukor, Laure Soulier, and Matthieu Cord. 2023. Rewarded soups: Towards pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards. In Advances in Neural Information Processing Systems, Vol. 36, 71095\u201371134.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_50_2","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1145\/2365952.2365962","volume-title":"6th ACM Conference on Recommender Systems","author":"Ribeiro Marco Tulio","year":"2012","unstructured":"Marco Tulio Ribeiro, Anisio Lacerda, Adriano Veloso, and Nivio Ziviani. 2012. Pareto-efficient hybridization for multi-objective recommender systems. In 6th ACM Conference on Recommender Systems, 19\u201326."},{"key":"e_1_3_2_51_2","first-page":"1889","volume-title":"International Conference on Machine Learning","author":"Schulman John","year":"2015","unstructured":"John Schulman, Sergey Levine, Pieter Abbeel, Michael Jordan, and Philipp Moritz. 2015. Trust region policy optimization. In International Conference on Machine Learning. PMLR, 1889\u20131897."},{"key":"e_1_3_2_52_2","unstructured":"John Schulman Filip Wolski Prafulla Dhariwal Alec Radford and Oleg Klimov. 2017. Proximal policy optimization algorithms. arXiv:1707.06347. Retrieved from https:\/\/arxiv.org\/abs\/1707.06347"},{"key":"e_1_3_2_53_2","article-title":"Multi-task learning as multi-objective optimization","volume":"31","author":"Sener Ozan","year":"2018","unstructured":"Ozan Sener and Vladlen Koltun. 2018. Multi-task learning as multi-objective optimization. In Advances in Neural Information Processing Systems, Vol. 31.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_54_2","doi-asserted-by":"publisher","DOI":"10.1145\/3340531.3411949"},{"key":"e_1_3_2_55_2","doi-asserted-by":"publisher","DOI":"10.1145\/3459637.3482420"},{"key":"e_1_3_2_56_2","doi-asserted-by":"publisher","DOI":"10.1145\/3604915.3608770"},{"key":"e_1_3_2_57_2","unstructured":"Hugo Touvron Thibaut Lavril Gautier Izacard Xavier Martinet Marie-Anne Lachaux Timoth\u00e9e Lacroix Baptiste Rozi\u00e8re Naman Goyal Eric Hambro Faisal Azhar et al. 2023. Llama: Open and efficient foundation language models. arXiv:2302.13971. Retrieved from https:\/\/arxiv.org\/abs\/2302.13971"},{"key":"e_1_3_2_58_2","volume-title":"AAAI Conference on Artificial Intelligence","volume":"30","author":"Hasselt Hado Van","year":"2016","unstructured":"Hado Van Hasselt, Arthur Guez, and David Silver. 2016. Deep reinforcement learning with double q-learning. In AAAI Conference on Artificial Intelligence, Vol. 30."},{"key":"e_1_3_2_59_2","article-title":"Attention is all you need","volume":"30","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems, Vol. 30.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_60_2","unstructured":"Lei Wang Jingsen Zhang Xu Chen Yankai Lin Ruihua Song Wayne Xin Zhao and Ji-Rong Wen. 2023. Recagent: A novel simulation paradigm for recommender systems. arXiv:2306.02552. Retrieved from https:\/\/arxiv.org\/abs\/2306.02552"},{"key":"e_1_3_2_61_2","unstructured":"Lei Wang Jingsen Zhang Hao Yang Zhiyuan Chen Jiakai Tang Zeyu Zhang Xu Chen Yankai Lin Ruihua Song Wayne Xin Zhao et al. 2023. User behavior simulation with large language model based agents. arXiv:2306.02552. Retrieved from https:\/\/arxiv.org\/abs\/2306.02552"},{"key":"e_1_3_2_62_2","doi-asserted-by":"publisher","DOI":"10.1145\/3209978.3210010"},{"key":"e_1_3_2_63_2","article-title":"Reinforced path reasoning for counterfactual explainable recommendation","author":"Wang Xiangmeng","year":"2024","unstructured":"Xiangmeng Wang, Qian Li, Dianer Yu, Qing Li, and Guandong Xu. 2024. Reinforced path reasoning for counterfactual explainable recommendation. IEEE Transactions on Knowledge and Data Engineering 36, 7 (2024), 3443\u20133459.","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"e_1_3_2_64_2","first-page":"5329","volume-title":"AAAI Conference on Artificial Intelligence","volume":"33","author":"Wang Xiang","year":"2019","unstructured":"Xiang Wang, Dingxian Wang, Canran Xu, Xiangnan He, Yixin Cao, and Tat-Seng Chua. 2019. Explainable reasoning over knowledge graphs for recommendation. In AAAI Conference on Artificial Intelligence, Vol. 33, 5329\u20135336."},{"key":"e_1_3_2_65_2","doi-asserted-by":"publisher","DOI":"10.1007\/BF00992698"},{"key":"e_1_3_2_66_2","unstructured":"Jason Wei Yi Tay Rishi Bommasani Colin Raffel Barret Zoph Sebastian Borgeaud Dani Yogatama Maarten Bosma Denny Zhou Donald Metzler et al. 2022. Emergent abilities of large language models. arxiv:2206.07682. Retrieved from https:\/\/arxiv.org\/abs\/2206.07682."},{"key":"e_1_3_2_67_2","first-page":"24824","article-title":"Chain-of-thought prompting elicits reasoning in large language models","volume":"35","author":"Wei Jason","year":"2022","unstructured":"Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Fei Xia, Ed Chi, Quoc V. Le, Denny Zhou, et al. 2022. Chain-of-thought prompting elicits reasoning in large language models. In Advances in Neural Information Processing Systems, Vol. 35, 24824\u201324837.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_68_2","doi-asserted-by":"publisher","DOI":"10.1145\/3331184.3331203"},{"key":"e_1_3_2_69_2","first-page":"1645","volume-title":"29th ACM International Conference on Information & Knowledge Management","author":"Xian Yikun","year":"2020","unstructured":"Yikun Xian, Zuohui Fu, Handong Zhao, Yingqiang Ge, Xu Chen, Qiaoying Huang, Shijie Geng, Zhou Qin, Gerard De Melo, Shan Muthukrishnan, et al. 2020. CAFE: Coarse-to-fine neural symbolic reasoning for explainable recommendation. In 29th ACM International Conference on Information & Knowledge Management, 1645\u20131654."},{"key":"e_1_3_2_70_2","doi-asserted-by":"publisher","DOI":"10.1145\/3442381.3450039"},{"key":"e_1_3_2_71_2","first-page":"3839","volume-title":"Web Conference 2021 (WWW \u201921)","author":"Xie Ruobing","year":"2021","unstructured":"Ruobing Xie, Yanlei Liu, Shaoliang Zhang, Rui Wang, Feng Xia, and Leyu Lin. 2021. Personalized approximate pareto-efficient recommendation. In Web Conference 2021 (WWW \u201921). ACM, New York, NY, 3839\u20133849."},{"key":"e_1_3_2_72_2","first-page":"13816","volume-title":"AAAI Conference on Artificial Intelligence","volume":"37","author":"Xie Zhouhang","year":"2023","unstructured":"Zhouhang Xie, Sameer Singh, Julian McAuley, and Bodhisattwa Prasad Majumder. 2023. Factual and informative review generation for explainable recommendation. In AAAI Conference on Artificial Intelligence, Vol. 37, 13816\u201313824."},{"key":"e_1_3_2_73_2","first-page":"9250","volume-title":"AAAI Conference on Artificial Intelligence","volume":"38","author":"Yang Mengyuan","year":"2024","unstructured":"Mengyuan Yang, Mengying Zhu, Yan Wang, Linxun Chen, Yilei Zhao, Xiuyuan Wang, Bing Han, Xiaolin Zheng, and Jianwei Yin. 2024. Fine-tuning large language model based explainable recommendation with explainable quality reward. In AAAI Conference on Artificial Intelligence, Vol. 38, 9250\u20139259."},{"key":"e_1_3_2_74_2","doi-asserted-by":"crossref","unstructured":"Se-Eun Yoon Zhankui He Jessica Maria Echterhoff and Julian McAuley. 2024. Evaluating large language models as generative user simulators for conversational recommendation. arXiv:2403.09738. Retrieved from https:\/\/arxiv.org\/abs\/2403.09738","DOI":"10.18653\/v1\/2024.naacl-long.83"},{"key":"e_1_3_2_75_2","doi-asserted-by":"publisher","DOI":"10.1002\/0470011815.b2a15150"},{"key":"e_1_3_2_76_2","volume-title":"11th International Conference on Learning Representations (ICLR \u201923)","author":"Zeng Aohan","year":"2023","unstructured":"Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, et al. 2023. GLM-130B: An open bilingual pre-trained model. In 11th International Conference on Learning Representations (ICLR \u201923)."},{"key":"e_1_3_2_77_2","doi-asserted-by":"publisher","DOI":"10.1145\/3543507.3583260"},{"key":"e_1_3_2_78_2","doi-asserted-by":"crossref","first-page":"3679","DOI":"10.1145\/3589334.3645537","volume-title":"Proceedings of the ACM on Web Conference 2024 (WWW \u201924)","author":"Zhang Junjie","year":"2024","unstructured":"Junjie Zhang, Yupeng Hou, Ruobing Xie, Wenqi Sun, Julian McAuley, Wayne Xin Zhao, Leyu Lin, and Ji-Rong Wen. 2024. AgentCF: Collaborative learning with autonomous language agents for recommender systems. In Proceedings of the ACM on Web Conference 2024 (WWW \u201924). ACM, New York, NY, 3679\u20133689."},{"key":"e_1_3_2_79_2","volume-title":"International Conference on Learning Representations","author":"Zhang Tianyi","year":"2020","unstructured":"Tianyi Zhang, Varsha Kishore, Felix Wu, Kilian Q. Weinberger, and Yoav Artzi. 2020. BERTScore: Evaluating text generation with BERT. In International Conference on Learning Representations."},{"key":"e_1_3_2_80_2","doi-asserted-by":"publisher","DOI":"10.1145\/2600428.2609579"},{"key":"e_1_3_2_81_2","doi-asserted-by":"publisher","DOI":"10.1145\/3459637.3482016"},{"key":"e_1_3_2_82_2","unstructured":"Wayne Xin Zhao Kun Zhou Junyi Li Tianyi Tang Xiaolei Wang Yupeng Hou Yingqian Min Beichen Zhang Junjie Zhang Zican Dong et al. 2023. A survey of large language models. arXiv:2303.18223. Retrieved from https:\/\/arxiv.org\/abs\/2303.18223"},{"key":"e_1_3_2_83_2","article-title":"Dags with no tears: Continuous optimization for structure learning","volume":"31","author":"Zheng Xun","year":"2018","unstructured":"Xun Zheng, Bryon Aragam, Pradeep K. Ravikumar, and Eric P. Xing. 2018. Dags with no tears: Continuous optimization for structure learning. In Advances in Neural Information Processing Systems, Vol. 31.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_84_2","volume-title":"11th International Conference on Learning Representations (ICLR \u201923)","author":"Zhou Denny","year":"2023","unstructured":"Denny Zhou, Nathanael Sch\u00e4rli, Le Hou, Jason Wei, Nathan Scales, Xuezhi Wang, Dale Schuurmans, Claire Cui, Olivier Bousquet, Quoc V. Le et al. 2023. Least-to-most prompting enables complex reasoning in large language models. In 11th International Conference on Learning Representations (ICLR \u201923)."},{"key":"e_1_3_2_85_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jcss.2014.11.016"},{"key":"e_1_3_2_86_2","volume-title":"2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Zhu Yaxin","year":"2021","unstructured":"Yaxin Zhu, Yikun Xian, Zuohui Fu, Gerard de Melo, and Yongfeng Zhang. 2021. Faithfully explainable recommendation via neural logic reasoning. In 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics."}],"container-title":["ACM Transactions on Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3758091","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,14]],"date-time":"2025-10-14T18:31:22Z","timestamp":1760466682000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3758091"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,14]]},"references-count":85,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2026,1,31]]}},"alternative-id":["10.1145\/3758091"],"URL":"https:\/\/doi.org\/10.1145\/3758091","relation":{},"ISSN":["1046-8188","1558-2868"],"issn-type":[{"value":"1046-8188","type":"print"},{"value":"1558-2868","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,10,14]]},"assertion":[{"value":"2024-05-14","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-07-26","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-10-14","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}