{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,21]],"date-time":"2025-08-21T16:49:35Z","timestamp":1755794975949,"version":"3.44.0"},"publisher-location":"New York, NY, USA","reference-count":37,"publisher":"ACM","funder":[{"name":"DARPA FoundSci Grant","award":["HR00112490370"],"award-info":[{"award-number":["HR00112490370"]}]},{"name":"NSF of the United States Grant","award":["ITE 2333736"],"award-info":[{"award-number":["ITE 2333736"]}]},{"name":"DARPA award","award":["HR00112220046"],"award-info":[{"award-number":["HR00112220046"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,8,3]]},"DOI":"10.1145\/3711896.3737384","type":"proceedings-article","created":{"date-parts":[[2025,8,3]],"date-time":"2025-08-03T21:03:27Z","timestamp":1754255007000},"page":"5493-5504","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["FoodPuzzle: Toward Developing Large Language Model Agents as Autonomous Flavor Scientists"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0007-5498-477X","authenticated-orcid":false,"given":"Tenghao","family":"Huang","sequence":"first","affiliation":[{"name":"University of Southern California, Los Angeles, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-5851-8595","authenticated-orcid":false,"given":"Dong Hee","family":"Lee","sequence":"additional","affiliation":[{"name":"University of California, Davis, Davis, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-4897-7374","authenticated-orcid":false,"given":"John","family":"Sweeney","sequence":"additional","affiliation":[{"name":"Independent, Seattle, WA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-0403-8028","authenticated-orcid":false,"given":"Jiatong","family":"Shi","sequence":"additional","affiliation":[{"name":"Independent, New York City, NY, 
USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-0501-880X","authenticated-orcid":false,"given":"Emily","family":"Steliotes","sequence":"additional","affiliation":[{"name":"University of California, Davis, Davis, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6148-7962","authenticated-orcid":false,"given":"Matthew","family":"Lange","sequence":"additional","affiliation":[{"name":"IC-FOODS, Davis, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5284-477X","authenticated-orcid":false,"given":"Jonathan","family":"May","sequence":"additional","affiliation":[{"name":"University of Southern California, LOS ANGELES, CA, USA and Information Sciences Institute, Marina del Rey, California, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0118-3147","authenticated-orcid":false,"given":"Muhao","family":"Chen","sequence":"additional","affiliation":[{"name":"University of California, Davis, Davis, CA, USA"}]}],"member":"320","published-online":{"date-parts":[[2025,8,3]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1371"},{"key":"e_1_3_2_2_2_1","volume-title":"Proceedings of the 39th International Conference on Machine Learning(Proceedings of Machine Learning Research","volume":"2240","author":"Borgeaud Sebastian","year":"2022","unstructured":"Sebastian Borgeaud, Arthur Mensch, Jordan Hoffmann, Trevor Cai, Eliza Rutherford, Katie Millican, George Bm Van Den Driessche, Jean-Baptiste Lespiau, Bogdan Damoc, Aidan Clark, Diego De Las Casas, Aurelia Guy, Jacob Menick, Roman Ring, Tom Hennigan, Saffron Huang, Loren Maggiore, Chris Jones, Albin Cassirer, Andy Brock, Michela Paganini, Geoffrey Irving, Oriol Vinyals, Simon Osindero, Karen Simonyan, Jack Rae, Erich Elsen, and Laurent Sifre. 2022. Improving Language Models by Retrieving from Trillions of Tokens. In Proceedings of the 39th International Conference on Machine Learning(Proceedings of Machine Learning Research, Vol. 
162), Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvari, Gang Niu, and Sivan Sabato(Eds.). PMLR, 2206-2240. https:\/\/proceedings.mlr.press\/v162\/borgeaud22a.html"},{"key":"e_1_3_2_2_3_1","volume-title":"NeurIPS 2023 AI for Science Workshop. https:\/\/openreview.net\/forum?id=wdGIL6lx3l","author":"Bran Andres M","year":"2023","unstructured":"Andres M Bran, Sam Cox, Oliver Schilter, Carlo Baldassari, Andrew White, and Philippe Schwaller. 2023. Augmenting large language models with chemistry tools. In NeurIPS 2023 AI for Science Workshop. https:\/\/openreview.net\/forum?id=wdGIL6lx3l"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1171"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-1423"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.emnlp-main.26"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cofs.2018.07.002"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"crossref","unstructured":"Neelansh Garg Apuroop Sethupathy Rudraksh Tuwani Rakhi Nk Shubham Dokania Arvind Iyer Ayushi Gupta Shubhra Agrawal Navjot Singh Shubham Shukla et al. 2018. FlavorDB: a database of flavor molecules. Nucleic acids research Vol. 46 D1 (2018) D1210-D1216.","DOI":"10.1093\/nar\/gkx957"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3458754"},{"key":"e_1_3_2_2_10_1","first-page":"3929","volume-title":"Proceedings of the 37th International Conference on Machine Learning(Proceedings of Machine Learning Research","author":"Guu Kelvin","year":"2020","unstructured":"Kelvin Guu, Kenton Lee, Zora Tung, Panupong Pasupat, and Mingwei Chang. 2020. Retrieval Augmented Language Model Pre-Training. In Proceedings of the 37th International Conference on Machine Learning(Proceedings of Machine Learning Research, Vol. 119), Hal Daum\u00e9 III and Aarti Singh(Eds.). PMLR, 3929-3938. 
https:\/\/proceedings.mlr.press\/v119\/guu20a.html"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.foodqual.2020.103926"},{"key":"e_1_3_2_2_12_1","first-page":"2197","volume-title":"Current Status and Future Perspectives in Flavor Research: Highlights of the 11th Wartburg Symposium on Flavor Chemistry & Biology. J. Agric. Food Chem.","volume":"66","author":"Hofmann Thomas","year":"2018","unstructured":"Thomas Hofmann, Dietmar Krautwurst, and Peter Schieberle. 2018. Current Status and Future Perspectives in Flavor Research: Highlights of the 11th Wartburg Symposium on Flavor Chemistry & Biology. J. Agric. Food Chem., Vol. 66, 10 (March 2018), 2197-2203."},{"key":"e_1_3_2_2_13_1","volume-title":"Data Interpreter: An LLM Agent For Data Science. ArXiv","author":"Hong Sirui","year":"2024","unstructured":"Sirui Hong, Yizhang Lin, Bangbang Liu, Binhao Wu, Danyang Li, Jiaqi Chen, Jiayi Zhang, Jinlin Wang, Lingyao Zhang, Mingchen Zhuge, Taicheng Guo, Tuo Zhou, Wei Tao, Wenyi Wang, Xiangru Tang, Xiangtao Lu, Xinbing Liang, Yaying Fei, Yuheng Cheng, Zongze Xu, Chenglin Wu, Li Zhang, Min Yang, and Xiawu Zheng. 2024. Data Interpreter: An LLM Agent For Data Science. ArXiv, Vol. abs\/2402.18679 (2024). https:\/\/api.semanticscholar.org\/CorpusID:268063292"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.bigscience-1.12"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"crossref","unstructured":"Tenghao Huang Kinjal Basu Ibrahim Abdelaziz Pavan Kapanipathi Jonathan May and Muhao Chen. 2025. R2D2: Remembering Reflecting and Dynamic Decision Making for Web Agents. 
arXiv preprint arXiv:2501.12485(2025).","DOI":"10.18653\/v1\/2025.acl-long.1464"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.findings-naacl.61"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.550"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btz682"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1612"},{"key":"e_1_3_2_2_20_1","first-page":"9459","volume-title":"Lin(Eds.)","volume":"33","author":"Lewis Patrick","year":"2020","unstructured":"Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich K\u00fcttler, Mike Lewis, Wen-tau Yih, Tim Rockt\u00e4schel, Sebastian Riedel, and Douwe Kiela. 2020. Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. In Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin(Eds.), Vol. 33. Curran Associates, Inc., 9459-9474. https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2020\/file\/6b493230205f780e1bc26945df7481e5-Paper.pdf"},{"key":"e_1_3_2_2_21_1","unstructured":"Jieyu Lu and Yingkai Zhang. 2022. Unified Deep Learning Model for Multitask Reaction Predictions with Explanation. Journal of Chemical Information and Modeling(2022)."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.emnlp-main.162"},{"volume-title":"Flavor Development for Functional Foods and Nutraceuticals","author":"Patel Jayvadan","key":"e_1_3_2_2_23_1","unstructured":"Jayvadan Patel and Anita Patel. 2019. Flavor Manufacturing and Selection Criteria for Functional Food and Nutraceuticals Industries. In Flavor Development for Functional Foods and Nutraceuticals. 
CRC Press, 39-72."},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.emnlp-main.70"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1410"},{"key":"e_1_3_2_2_26_1","first-page":"232","volume-title":"Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval","author":"Robertson S. E.","unstructured":"S. E. Robertson and S. Walker. 1994. Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval. In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval(Dublin, Ireland) (SIGIR '94). Springer-Verlag, Berlin, Heidelberg, 232-241."},{"key":"e_1_3_2_2_27_1","unstructured":"Tatsuya Sagawa and Ryosuke Kojima. 2023. ReactionT5: a large-scale pre-trained model towards application of limited reaction data. arxiv:2311.06708 [physics.chem-ph]"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.3390\/molecules21010001"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1038\/s41586-023-06291-2"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.acl-long.201"},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2025.naacl-tutorial.1"},{"key":"e_1_3_2_2_32_1","volume-title":"Galactica: A Large Language Model for Science. arxiv:2211.09085 [cs.CL]","author":"Taylor Ross","year":"2022","unstructured":"Ross Taylor, Marcin Kardas, Guillem Cucurull, Thomas Scialom, Anthony Hartshorn, Elvis Saravia, Andrew Poulton, Viktor Kerkez, and Robert Stojnic. 2022. Galactica: A Large Language Model for Science. 
arxiv:2211.09085 [cs.CL]"},{"key":"e_1_3_2_2_33_1","unstructured":"Hugo Touvron Louis Martin Kevin Stone Peter Albert Amjad Almahairi Yasmine Babaei Nikolay Bashlykov Soumya Batra Prajjwal Bhargava Shruti Bhosale Dan Bikel Lukas Blecher Cristian Canton Ferrer Moya Chen Guillem Cucurull David Esiobu Jude Fernandes Jeremy Fu Wenyin Fu Brian Fuller Cynthia Gao Vedanuj Goswami Naman Goyal Anthony Hartshorn Saghar Hosseini Rui Hou Hakan Inan Marcin Kardas Viktor Kerkez Madian Khabsa Isabel Kloumann Artem Korenev Punit Singh Koura Marie-Anne Lachaux Thibaut Lavril Jenya Lee Diana Liskovich Yinghai Lu Yuning Mao Xavier Martinet Todor Mihaylov Pushkar Mishra Igor Molybog Yixin Nie Andrew Poulton Jeremy Reizenstein Rashi Rungta Kalyan Saladi Alan Schelten Ruan Silva Eric Michael Smith Ranjan Subramanian Xiaoqing Ellen Tan Binh Tang Ross Taylor Adina Williams Jian Xiang Kuan Puxin Xu Zheng Yan Iliyan Zarov Yuchen Zhang Angela Fan Melanie Kambadur Sharan Narang Aurelien Rodriguez Robert Stojnic Sergey Edunov and Thomas Scialom. 2023. Llama 2: Open Foundation and Fine-Tuned Chat Models. arxiv:2307.09288 [cs.CL]"},{"key":"e_1_3_2_2_34_1","volume-title":"International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=YWtLZvLmud7","author":"Vig Jesse","year":"2021","unstructured":"Jesse Vig, Ali Madani, Lav R. Varshney, Caiming Xiong, Richard Socher, and Nazneen Rajani. 2021. BERTology Meets Biology: Interpreting Attention in Protein Language Models. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=YWtLZvLmud7"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.naacl-main.371"},{"key":"e_1_3_2_2_36_1","first-page":"24824","volume-title":"Oh(Eds.)","volume":"35","author":"Wei Jason","year":"2022","unstructured":"Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc V Le, and Denny Zhou. 2022. 
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. In Advances in Neural Information Processing Systems, S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, and A. Oh(Eds.), Vol. 35. Curran Associates, Inc., 24824-24837. https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2022\/file\/9d5609613524ecf4f15af0f7b31abca4-Paper-Conference.pdf"},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.findings-emnlp.231"}],"event":{"name":"KDD '25: The 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data"],"location":"Toronto ON Canada","acronym":"KDD '25"},"container-title":["Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3711896.3737384","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,16]],"date-time":"2025-08-16T14:37:45Z","timestamp":1755355065000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3711896.3737384"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,3]]},"references-count":37,"alternative-id":["10.1145\/3711896.3737384","10.1145\/3711896"],"URL":"https:\/\/doi.org\/10.1145\/3711896.3737384","relation":{},"subject":[],"published":{"date-parts":[[2025,8,3]]},"assertion":[{"value":"2025-08-03","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}