{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,22]],"date-time":"2026-05-22T04:06:17Z","timestamp":1779422777719,"version":"3.53.1"},"publisher-location":"New York, NY, USA","reference-count":94,"publisher":"ACM","license":[{"start":{"date-parts":[[2026,5,26]],"date-time":"2026-05-26T00:00:00Z","timestamp":1779753600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/legalcode"}],"funder":[{"name":"MIT Generative AI Impact Consortium (MGAIC)","award":["0"],"award-info":[{"award-number":["0"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2026,5,26]]},"DOI":"10.1145\/3786335.3813125","type":"proceedings-article","created":{"date-parts":[[2026,5,22]],"date-time":"2026-05-22T03:16:22Z","timestamp":1779419782000},"page":"61-84","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Glia: A Human-Inspired AI for Automated Systems Design and Optimization"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6364-4108","authenticated-orcid":false,"given":"Pouya","family":"Hamadanian","sequence":"first","affiliation":[{"name":"MIT CSAIL, Cambridge, MA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-1243-8483","authenticated-orcid":false,"given":"Pantea","family":"Karimi","sequence":"additional","affiliation":[{"name":"MIT CSAIL, Cambridge, MA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4844-6875","authenticated-orcid":false,"given":"Arash","family":"Nasr-Esfahany","sequence":"additional","affiliation":[{"name":"MIT CSAIL, Cambridge, MA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3069-9820","authenticated-orcid":false,"given":"Kimia","family":"Noorbakhsh","sequence":"additional","affiliation":[{"name":"MIT CSAIL, Cambridge, MA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-9583-2661","authenticated-orcid":false,"given":"Joseph","family":"Chandler","sequence":"additional","affiliation":[{"name":"MIT CSAIL, Cambridge, MA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-8404-2664","authenticated-orcid":false,"given":"Ali","family":"ParandehGheibi","sequence":"additional","affiliation":[{"name":"Fidian, San Francisco, CA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0014-6742","authenticated-orcid":false,"given":"Mohammad","family":"Alizadeh","sequence":"additional","affiliation":[{"name":"MIT CSAIL, Cambridge, MA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1455-9652","authenticated-orcid":false,"given":"Hari","family":"Balakrishnan","sequence":"additional","affiliation":[{"name":"MIT CSAIL, Cambridge, MA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2026,5,26]]},"reference":[{"key":"e_1_3_3_1_2_2","unstructured":"Amey Agrawal Nitin Kedia Jayashree Mohan Ashish Panwar Nipun Kwatra Bhargav\u00a0S Gulavani Ramachandran Ramjee and Alexey Tumanov. 2024. Vidur: A large-scale simulation framework for LLM inference. Proceedings of Machine Learning and Systems 6 (2024) 351\u2013366."},{"key":"e_1_3_3_1_3_2","volume-title":"OSDI","author":"Agrawal Amey","year":"2024","unstructured":"Amey Agrawal, Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav\u00a0S. Gulavani, Alexey Tumanov, and Ramachandran Ramjee. 2024. Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve. In OSDI. Article 7, 18\u00a0pages."},{"key":"e_1_3_3_1_4_2","unstructured":"Lakshya\u00a0A Agrawal Shangyin Tan Dilara Soylu Noah Ziems Rishi Khare Krista Opsahl-Ong Arnav Singhvi Herumb Shandilya Michael\u00a0J Ryan and Meng Jiang. 2025. Gepa: Reflective prompt evolution can outperform reinforcement learning. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2507.19457 (2025)."},{"key":"e_1_3_3_1_5_2","unstructured":"Jayvant Anantpur Nagendra\u00a0Gulur Dwarakanath Shivaram Kalyanakrishnan Shalabh Bhatnagar and R. Govindarajan. 2017. RLWS: A Reinforcement Learning based GPU Warp Scheduler. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/1712.04303 (2017)."},{"key":"e_1_3_3_1_6_2","unstructured":"Martin Andrews and Sam Witteveen. 2025. GPU Kernel Scientist: An LLM-Driven Framework for Iterative Kernel Optimization. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2506.20807 (2025)."},{"key":"e_1_3_3_1_7_2","unstructured":"Anthropic. 2024. Building Effective Agents. https:\/\/www.anthropic.com\/research\/building-effective-agents Accessed: 2026-04-27."},{"key":"e_1_3_3_1_8_2","unstructured":"Eser Ayg\u00fcn Anastasiya Belyaeva Gheorghe Comanici Marc Coram Hao Cui Jake Garrison Renee Johnston\u00a0Anton Kast Cory\u00a0Y McLean Peter Norgaard Zahra Shamsi et\u00a0al. 2025. An AI system to help scientists write expert-level empirical software. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2509.06503 (2025)."},{"key":"e_1_3_3_1_9_2","unstructured":"Michelle Brachman Amina El-Ashry Casey Dugan and Werner Geyer. 2025. Current and Future Use of Large Language Models for Knowledge Work. arxiv:https:\/\/arXiv.org\/abs\/2503.16774\u00a0[cs.HC] https:\/\/arxiv.org\/abs\/2503.16774"},{"key":"e_1_3_3_1_10_2","unstructured":"Mert Cemri Shubham Agrawal Akshat Gupta Shu Liu Audrey Cheng Qiuyang Mang Ashwin Naren Lutfi\u00a0Eren Erdogan Koushik Sen Matei Zaharia et\u00a0al. 2026. Adaevolve: Adaptive llm driven zeroth-order optimization. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2602.20133 (2026)."},{"key":"e_1_3_3_1_11_2","volume-title":"ACM SIGCOMM Workshop on APNet","author":"Chen Jie","year":"2018","unstructured":"Jie Chen, Kang\u00a0G. Shin, Jiaqi Zheng, Xin Jin, Xia Zhou, Ben\u00a0Y. Zhao, and Haitao Zheng. 2018. AuTO: Scaling Deep Reinforcement Learning for Datacenter-Scale Traffic Optimization. In ACM SIGCOMM Workshop on APNet."},{"key":"e_1_3_3_1_12_2","unstructured":"Audrey Cheng Shu Liu Melissa Pan Zhifei Li Bowen Wang Alex Krentsel Tian Xia Mert Cemri Jongseok Park Shuo Yang Jeff Chen Lakshya Agrawal Aditya Desai Jiarong Xing Koushik Sen Matei Zaharia and Ion Stoica. 2025. Barbarians at the Gate: How AI is Upending Systems Research. arxiv:https:\/\/arXiv.org\/abs\/2510.06189\u00a0[cs.AI] https:\/\/arxiv.org\/abs\/2510.06189"},{"key":"e_1_3_3_1_13_2","unstructured":"Karl Cobbe Vineet Kosaraju Mohammad Bavarian Mark Chen Heewoo Jun Lukasz Kaiser Matthias Plappert Jerry Tworek Jacob Hilton Reiichiro Nakano et\u00a0al. 2021. Training verifiers to solve math word problems. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2110.14168 (2021)."},{"key":"e_1_3_3_1_14_2","unstructured":"Jaber Daneshamooz Jessica Nguyen William Chen Sanjay Chandrasekaran Satyandra Guthula Ankit Gupta Arpit Gupta and Walter Willinger. 2025. Addressing the ML Domain Adaptation Problem for Networking: Realistic and Controllable Training Data Generation with NetReplica. arxiv:https:\/\/arXiv.org\/abs\/2507.13476\u00a0[cs.NI] https:\/\/arxiv.org\/abs\/2507.13476"},{"key":"e_1_3_3_1_15_2","unstructured":"DeepMind. 2024. Advanced version of Gemini with DeepThink officially achieves gold-medal standard at the International Mathematical Olympiad. https:\/\/deepmind.google\/discover\/blog\/advanced-version-of-gemini-with-deep-think-officially-achieves-gold-medal-standard-at-the-international-mathematical-olympiad\/. Accessed: 2025-10-17."},{"key":"e_1_3_3_1_16_2","first-page":"343","volume-title":"15th USENIX Symposium on Networked Systems Design and Implementation (NSDI 18)","author":"Dong Mo","year":"2018","unstructured":"Mo Dong, Tong Meng, Doron Zarchy, Engin Arslan, Yossi Gilad, Brighten Godfrey, and Michael Schapira. 2018. PCC Vivace: Online-Learning Congestion Control. In 15th USENIX Symposium on Networked Systems Design and Implementation (NSDI 18). USENIX Association, Renton, WA, 343\u2013356. https:\/\/www.usenix.org\/conference\/nsdi18\/presentation\/dong"},{"key":"e_1_3_3_1_17_2","first-page":"343","volume-title":"NSDI","author":"Dong Mo","year":"2018","unstructured":"Mo Dong, Tong Meng, Doron Zarchy, Engin Arslan, Yossi Gilad, P.\u00a0Brighten Godfrey, and Michael Schapira. 2018. PCC Vivace: Online-Learning Congestion Control. In NSDI. 343\u2013356."},{"key":"e_1_3_3_1_18_2","doi-asserted-by":"crossref","unstructured":"Rohit Dwivedula Divyanshu Saxena Aditya Akella Swarat Chaudhuri and Daehyeok Kim. 2025. Man-Made Heuristics Are Dead. Long Live Code Generators! arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2510.08803 (2025).","DOI":"10.1145\/3772356.3772413"},{"key":"e_1_3_3_1_19_2","unstructured":"Ryan Ehrlich Bradley Brown Jordan Juravsky Ronald Clark Christopher R\u00e9 and Azalia Mirhoseini. 2025. CodeMonkeys: Scaling Test-Time Compute for Software Engineering. arxiv:https:\/\/arXiv.org\/abs\/2501.14723\u00a0[cs.LG] https:\/\/arxiv.org\/abs\/2501.14723"},{"key":"e_1_3_3_1_20_2","unstructured":"Juraj Gottweis Wei-Hung Weng Alexander Daryin Tao Tu Anil Palepu Petar Sirkovic Artiom Myaskovsky Felix Weissenberger Keran Rong Ryutaro Tanno et\u00a0al. 2025. Towards an AI co-scientist. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2502.18864 (2025)."},{"key":"e_1_3_3_1_21_2","unstructured":"Harvard Extension School. [n. d.]. Principles of Good Design. https:\/\/cscie2x.dce.harvard.edu\/hw\/ch01s06.html. Accessed: 2025-10-17."},{"key":"e_1_3_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1145\/3696348.3696868"},{"key":"e_1_3_3_1_23_2","unstructured":"Zhiyuan He Aashish Gottipati Lili Qiu Yuqing Yang and Francis\u00a0Y. Yan. 2025. Congestion Control System Optimization with Large Language Models. arxiv:https:\/\/arXiv.org\/abs\/2508.16074\u00a0[cs.NI] https:\/\/arxiv.org\/abs\/2508.16074"},{"key":"e_1_3_3_1_24_2","unstructured":"Ziyao Huang Weiwei Wu Kui Wu Jianping Wang and Wei-Bin Lee. 2025. Calm: Co-evolution of algorithms and language model for automatic heuristic design. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2505.12285 (2025)."},{"key":"e_1_3_3_1_25_2","unstructured":"Nathan Jay Noga\u00a0H. Rotman P.\u00a0Brighten Godfrey Michael Schapira and Aviv Tamar. 2019. Internet Congestion Control via Deep Reinforcement Learning. arxiv:https:\/\/arXiv.org\/abs\/1810.03259\u00a0[cs.NI] https:\/\/arxiv.org\/abs\/1810.03259"},{"key":"e_1_3_3_1_26_2","volume-title":"ICML","author":"Jay Nathan","year":"2019","unstructured":"Nathan Jay, Yair Rotman, P.\u00a0Brighten Godfrey, and Michael Schapira. 2019. An End-to-End Deep Reinforcement Learning Framework for Internet Congestion Control. In ICML."},{"key":"e_1_3_3_1_27_2","volume-title":"NSDI","author":"Karimi Pantea","year":"2026","unstructured":"Pantea Karimi, Siva Kesava\u00a0Reddy Kakarla, Pooria Namyar, Santiago Segarra, Ryan Beckett, Mohammad Alizadeh, and Behnaz Arzani. 2026. Heuristic Analysis from Source Code via Symbolic-Guided Optimization. In NSDI. USENIX Association."},{"key":"e_1_3_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1145\/3696348.3696884"},{"key":"e_1_3_3_1_29_2","unstructured":"Pantea Karimi Dany Rouhana Pooria Namyar Siva Kesava\u00a0Reddy Kakarla Venkat Arun and Behnaz Arzani. 2025. Robust Heuristic Algorithm Design with LLMs. arxiv:https:\/\/arXiv.org\/abs\/2510.08755\u00a0[cs.AI] https:\/\/arxiv.org\/abs\/2510.08755"},{"key":"e_1_3_3_1_30_2","doi-asserted-by":"crossref","unstructured":"Mehrdad Khani Mohammad Alizadeh Jakob Hoydis and Phil Fleming. 2020. Adaptive neural signal detection for massive MIMO. IEEE Transactions on Wireless Communications 19 8 (2020) 5635\u20135648.","DOI":"10.1109\/TWC.2020.2996144"},{"key":"e_1_3_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1145\/3600006.3613165"},{"key":"e_1_3_3_1_32_2","unstructured":"Robert\u00a0Tjarko Lange Yuki Imajuku and Edoardo Cetin. 2025. ShinkaEvolve: Towards Open-Ended And Sample-Efficient Program Evolution. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2509.19349 (2025)."},{"key":"e_1_3_3_1_33_2","volume-title":"Advances in Neural Information Processing Systems (NeurIPS) Workshop","author":"Lazic Nikolay","year":"2018","unstructured":"Nikolay Lazic, Craig Boutilier, Thomas Lu, Eric Wong, Binz Roy, Marcin Minka, Ben\u00a0J. Heller, David Schuurmans, Geoffrey\u00a0J. Gordon, Olivier Duchesnay, Marc\u00a0L. Bellemare, Albin Cassirer, et\u00a0al. 2018. Data Center Cooling Using Model-Predictive Control. In Advances in Neural Information Processing Systems (NeurIPS) Workshop. Describes learning-assisted control for DC cooling."},{"key":"e_1_3_3_1_34_2","unstructured":"Baolin Li Yankai Jiang Vijay Gadepally and Devesh Tiwari. 2024. LLM Inference Serving: Survey of Recent Advances and Opportunities. arxiv:https:\/\/arXiv.org\/abs\/2407.12391\u00a0[cs.DC] https:\/\/arxiv.org\/abs\/2407.12391"},{"key":"e_1_3_3_1_35_2","unstructured":"Tianhong Li Vibhaalakshmi Sivaraman Pantea Karimi Lijie Fan Mohammad Alizadeh and Dina Katabi. 2023. Reparo: Loss-resilient generative codec for video conferencing. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2305.14135 (2023)."},{"key":"e_1_3_3_1_36_2","doi-asserted-by":"publisher","unstructured":"Yujia Li David Choi Junyoung Chung Nate Kushman Julian Schrittwieser R\u00e9mi Leblond Tom Eccles James Keeling Felix Gimeno Agustin Dal\u00a0Lago Thomas Hubert Peter Choy Cyprien de Masson\u00a0d\u2019Autume Igor Babuschkin Xinyun Chen Po-Sen Huang Johannes Welbl Sven Gowal Alexey Cherepanov James Molloy Daniel\u00a0J. Mankowitz Esme Sutherland\u00a0Robson Pushmeet Kohli Nando de Freitas Koray Kavukcuoglu and Oriol Vinyals. 2022. Competition-level code generation with AlphaCode. Science 378 6624 (Dec. 2022) 1092\u20131097. 10.1126\/science.abq1158","DOI":"10.1126\/science.abq1158"},{"key":"e_1_3_3_1_37_2","first-page":"1","volume-title":"SIGCOMM","author":"Liang Eric","year":"2019","unstructured":"Eric Liang, Hang Zhu, Xin Jin, and Ion Stoica. 2019. NeuroCuts: Neural Decision Trees for Packet Classification. In SIGCOMM. 1\u201315."},{"key":"e_1_3_3_1_38_2","series-title":"(ICML\u201924)","volume-title":"ICML","author":"Liu Fei","year":"2024","unstructured":"Fei Liu, Xialiang Tong, Mingxuan Yuan, Xi Lin, Fu Luo, Zhenkun Wang, Zhichao Lu, and Qingfu Zhang. 2024. Evolution of heuristics: Towards efficient automatic algorithm design using large language model. In ICML (Vienna, Austria) (ICML\u201924). JMLR.org, Article 1304, 23\u00a0pages."},{"key":"e_1_3_3_1_39_2","unstructured":"Fei Liu Qingfu Zhang Jialong Shi Xialiang Tong Kun Mao and Mingxuan Yuan. 2025. Fitness landscape of large language model-assisted automated algorithm search. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2504.19636 (2025)."},{"key":"e_1_3_3_1_40_2","unstructured":"Fei Liu Rui Zhang Xi Lin Zhichao Lu and Qingfu Zhang. 2025. Fine-tuning large language model for automated algorithm design. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2507.10614 (2025)."},{"key":"e_1_3_3_1_41_2","unstructured":"Fei Liu Rui Zhang Zhuoliang Xie Rui Sun Kai Li Xi Lin Zhenkun Wang Zhichao Lu and Qingfu Zhang. 2024. Llm4ad: A platform for algorithm design with large language model. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2412.17287 (2024)."},{"key":"e_1_3_3_1_42_2","unstructured":"Gang Liu Yihan Zhu Jie Chen and Meng Jiang. 2025. Scientific Algorithm Discovery by Augmenting AlphaEvolve with Deep Research. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2510.06056 (2025)."},{"key":"e_1_3_3_1_43_2","doi-asserted-by":"publisher","unstructured":"Nelson\u00a0F. Liu Kevin Lin John Hewitt Ashwin Paranjape Michele Bevilacqua Fabio Petroni and Percy Liang. 2024. Lost in the Middle: How Language Models Use Long Contexts. Transactions of the Association for Computational Linguistics 12 (2024) 157\u2013173. 10.1162\/tacl_a_00638","DOI":"10.1162\/tacl_a_00638"},{"key":"e_1_3_3_1_44_2","unstructured":"Shu Liu Shubham Agarwal Monishwaran Maheswaran Mert Cemri Zhifei Li Qiuyang Mang Ashwin Naren Ethan Boneh Audrey Cheng Melissa\u00a0Z Pan et\u00a0al. 2026. Evox: Meta-evolution for automated discovery. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2602.23413 (2026)."},{"key":"e_1_3_3_1_45_2","unstructured":"Yixiu Liu Yang Nan Weixian Xu Xiangkun Hu Lyumanshan Ye Zhen Qin and Pengfei Liu. 2025. Alphago moment for model architecture discovery. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2507.18074 (2025)."},{"key":"e_1_3_3_1_46_2","unstructured":"llm-d Community. 2025. GitHub - llm-d\/llm-d: llm-d enables high-performance distributed LLM inference on Kubernetes. https:\/\/github.com\/llm-d\/llm-d. [Accessed 10-10-2025]."},{"key":"e_1_3_3_1_47_2","unstructured":"Ruiying Ma Chieh-Jan\u00a0Mike Liang Yanjie Gao and Francis\u00a0Y Yan. 2025. Algorithm Generation via Creative Ideation. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2510.03851 (2025)."},{"key":"e_1_3_3_1_48_2","doi-asserted-by":"publisher","DOI":"10.1145\/3005745.3005750"},{"key":"e_1_3_3_1_49_2","unstructured":"Hongzi Mao Shannon Chen Drew Dimmery Shaun Singh Drew Blaisdell Yuandong Tian Mohammad Alizadeh and Eytan Bakshy. 2020. Real-world Video Adaptation with Reinforcement Learning. arxiv:https:\/\/arXiv.org\/abs\/2008.12858\u00a0[cs.NI] https:\/\/arxiv.org\/abs\/2008.12858"},{"key":"e_1_3_3_1_50_2","doi-asserted-by":"publisher","DOI":"10.1145\/3098822.3098843"},{"key":"e_1_3_3_1_51_2","doi-asserted-by":"publisher","DOI":"10.1145\/3341302.3342080"},{"key":"e_1_3_3_1_52_2","doi-asserted-by":"publisher","DOI":"10.1145\/3341302.3342080"},{"key":"e_1_3_3_1_53_2","doi-asserted-by":"publisher","DOI":"10.1145\/3448016.3452838"},{"key":"e_1_3_3_1_54_2","doi-asserted-by":"publisher","unstructured":"Ryan Marcus Parimarjan Negi Hongzi Mao Chi Zhang Mohammad Alizadeh Tim Kraska Olga Papaemmanouil and Nesime Tatbul. 2019. Neo: A Learned Query Optimizer. Proc. VLDB Endow. 12 11 (July 2019) 1705\u20131718. 10.14778\/3342263.3342644","DOI":"10.14778\/3342263.3342644"},{"key":"e_1_3_3_1_55_2","doi-asserted-by":"publisher","DOI":"10.1145\/3387514.3405859"},{"key":"e_1_3_3_1_56_2","unstructured":"MIT News Office. 2023. Study finds ChatGPT boosts worker productivity in writing tasks. MIT News (2023). https:\/\/news.mit.edu\/2023\/study-finds-chatgpt-boosts-worker-productivity-writing-0714 Accessed: 2025-10-17."},{"key":"e_1_3_3_1_57_2","unstructured":"Ansh Nagda Prabhakar Raghavan and Abhradeep Thakurta. 2025. Reinforced Generation of Combinatorial Structures: Applications to Complexity Theory. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2509.18057 (2025)."},{"key":"e_1_3_3_1_58_2","unstructured":"Alexander Novikov Ng\u00e2n V\u0169 Marvin Eisenberger Emilien Dupont Po-Sen Huang Adam\u00a0Zsolt Wagner Sergey Shirobokov Borislav Kozlovskii Francisco\u00a0JR Ruiz Abbas Mehrabian et\u00a0al. 2025. AlphaEvolve: A coding agent for scientific and algorithmic discovery. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2506.13131 (2025)."},{"key":"e_1_3_3_1_59_2","unstructured":"NVIDIA. 2025. GitHub - ai-dynamo\/dynamo: A Datacenter Scale Distributed Inference Serving Framework. https:\/\/github.com\/ai-dynamo\/dynamo. [Accessed 10-10-2025]."},{"key":"e_1_3_3_1_60_2","volume-title":"OpenAI o3 and o4-mini System Card","year":"2025","unstructured":"OpenAI. 2025. OpenAI o3 and o4-mini System Card. Technical Report. OpenAI. https:\/\/openai.com\/index\/o3-o4-mini-system-card\/"},{"key":"e_1_3_3_1_61_2","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA59077.2024.00019"},{"key":"e_1_3_3_1_62_2","unstructured":"Ori Press Brandon Amos Haoyu Zhao Yikai Wu Samuel\u00a0K Ainsworth Dominik Krupke Patrick Kidger Touqir Sajed Bartolomeo Stellato Jisun Park et\u00a0al. 2025. AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs? arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2507.15887 (2025)."},{"key":"e_1_3_3_1_63_2","unstructured":"Prithvi Rajasekaran Ethan Dixon Carly Ryan and Jeremy Hadfield. 2025. Effective context engineering for AI agents. https:\/\/www.anthropic.com\/engineering\/effective-context-engineering-for-ai-agents With contributions from Rafi Ayub Hannah Moran Cal Rueb and Connor Jennings. Published online September 29 2025."},{"key":"e_1_3_3_1_64_2","doi-asserted-by":"crossref","unstructured":"Bernardino Romera-Paredes Mohammadamin Barekatain Alexander Novikov Matej Balog M\u00a0Pawan Kumar Emilien Dupont Francisco\u00a0JR Ruiz Jordan\u00a0S Ellenberg Pengming Wang Omar Fawzi et\u00a0al. 2024. Mathematical discoveries from program search with large language models. Nature 625 7995 (2024) 468\u2013475.","DOI":"10.1038\/s41586-023-06924-6"},{"key":"e_1_3_3_1_65_2","unstructured":"Fabian Ruffy Michael Przystupa and Ivan Beschastnikh. 2018. Iroko: A Framework to Prototype Reinforcement Learning for Data Center Traffic Control. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/1812.09975 (2018)."},{"key":"e_1_3_3_1_66_2","doi-asserted-by":"crossref","unstructured":"Saim Salman Christopher Streiffer Huan Chen Theophilus Benson and Asim Kadav. 2018. DeepConf: Automating Data Center Network Topologies and Routing with Deep Reinforcement Learning. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/1712.03890 (2018).","DOI":"10.1145\/3229543.3229554"},{"key":"e_1_3_3_1_67_2","first-page":"785","volume-title":"18th USENIX Symposium on Networked Systems Design and Implementation (NSDI 21)","author":"Sapio Amedeo","year":"2021","unstructured":"Amedeo Sapio, Marco Canini, Chen-Yu Ho, Jacob Nelson, Panos Kalnis, Changhoon Kim, Arvind Krishnamurthy, Masoud Moshref, Dan Ports, and Peter Richtarik. 2021. Scaling Distributed Machine Learning with In-Network Aggregation. In 18th USENIX Symposium on Networked Systems Design and Implementation (NSDI 21). USENIX Association, 785\u2013808. https:\/\/www.usenix.org\/conference\/nsdi21\/presentation\/sapio"},{"key":"e_1_3_3_1_68_2","unstructured":"sharegpt 2025. ShareGPT Datasets at Hugging Face. https:\/\/huggingface.co\/datasets\/anon8231489123\/ShareGPT_Vicuna_unfiltered. [Accessed 10-10-2025]."},{"key":"e_1_3_3_1_69_2","unstructured":"Asankhaya Sharma. 2025. OpenEvolve: an open-source evolutionary coding agent. https:\/\/github.com\/codelion\/openevolve"},{"key":"e_1_3_3_1_70_2","doi-asserted-by":"crossref","unstructured":"Alexander Shypula Aman Madaan Yimeng Zeng Uri Alon Jacob Gardner Milad Hashemi Graham Neubig Parthasarathy Ranganathan Osbert Bastani and Amir Yazdanbakhsh. 2025. Automated High-Level Code Optimization for Warehouse Performance. IEEE Micro (2025).","DOI":"10.1109\/MM.2025.3590033"},{"key":"e_1_3_3_1_71_2","volume-title":"Operating System Concepts (10th ed.)","author":"Silberschatz Abraham","year":"2018","unstructured":"Abraham Silberschatz, Peter\u00a0B. Galvin, and Greg Gagne. 2018. Operating System Concepts (10th ed.). Wiley Publishing."},{"key":"e_1_3_3_1_72_2","volume-title":"NSDI","author":"Sivaraman Vibhaalakshmi","year":"2024","unstructured":"Vibhaalakshmi Sivaraman, Pantea Karimi, Vedantha Venkatapathy, Mehrdad Khani, Sadjad Fouladi, Mohammad Alizadeh, Fr\u00e9do Durand, and Vivienne Sze. 2024. Gemino: Practical and robust neural compression for video conferencing. In NSDI."},{"key":"e_1_3_3_1_73_2","unstructured":"Charlie Snell Jaehoon Lee Kelvin Xu and Aviral Kumar. 2024. Scaling llm test-time compute optimally can be more effective than scaling model parameters. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2408.03314 (2024)."},{"key":"e_1_3_3_1_74_2","unstructured":"Yiwen Sun Furong Ye Zhihan Chen Ke Wei and Shaowei Cai. 2025. Automatically discovering heuristics in a complex SAT solver with large language models. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2507.22876 (2025)."},{"key":"e_1_3_3_1_75_2","unstructured":"Yiheng Tao Yihe Zhang Matthew\u00a0T. Dearing Xin Wang Yuping Fan and Zhiling Lan. 2025. Prompt-Aware Scheduling for Low-Latency LLM Serving. arxiv:https:\/\/arXiv.org\/abs\/2510.03243\u00a0[cs.LG] https:\/\/arxiv.org\/abs\/2510.03243"},{"key":"e_1_3_3_1_76_2","unstructured":"The\u00a0AIBrix Team Jiaxin Shan Varun Gupta Le Xu Haiyang Shi Jingyuan Zhang Ning Wang Linhui Xu Rong Kang Tongping Liu Yifei Zhang Yiqing Zhu Shuowei Jin Gangmuk Lim Binbin Chen Zuzhi Chen Xiao Liu Xin Chen Kante Yin Chak-Pong Chung Chenyu Jiang Yicheng Lu Jianjun Chen Caixue Lin Wu Xiang Rui Shi and Liguang Xie. 2025. AIBrix: Towards Scalable Cost-Effective Large Language Model Inference Infrastructure. arxiv:https:\/\/arXiv.org\/abs\/2504.03648\u00a0[cs.DC] https:\/\/arxiv.org\/abs\/2504.03648"},{"key":"e_1_3_3_1_77_2","volume-title":"USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage)","author":"Vietri Giuseppe","year":"2018","unstructured":"Giuseppe Vietri, Liana\u00a0V. Rodriguez, Wendy\u00a0A. Martinez, Steven Lyons, Jason Liu, Raju Rangaswami, Ming Zhao, and Giri Narasimhan. 2018. Driving Cache Replacement with ML-based LeCaR. In USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage)."},{"key":"e_1_3_3_1_78_2","unstructured":"vllm-project. 2025. vLLM Production Stack: reference stack for production vLLM deployment. https:\/\/github.com\/vllm-project\/production-stack."},{"key":"e_1_3_3_1_79_2","unstructured":"Anjiang Wei Allen Nie Thiago\u00a0SFX Teixeira Rohan Yadav Wonchan Lee Ke Wang and Alex Aiken. 2024. Improving Parallel Program Performance with LLM Optimizers via Agent-System Interfaces. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2410.15625 (2024)."},{"key":"e_1_3_3_1_80_2","unstructured":"Anjiang Wei Tianran Sun Yogesh Seenichamy Hang Song Anne Ouyang Azalia Mirhoseini Ke Wang and Alex Aiken. 2025. Astra: A multi-agent system for gpu kernel performance optimization. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2509.07506 (2025)."},{"key":"e_1_3_3_1_81_2","unstructured":"David Wheeler. [n. d.]. Problems in the Design of Systems. https:\/\/www.doc.ic.ac.uk\/\u00a0dcw\/PSD\/article13\/. Accessed: 2025-10-17."},{"key":"e_1_3_3_1_82_2","unstructured":"Wikiquote contributors. 2025. Edsger W. Dijkstra \u2013 Wikiquote. https:\/\/en.wikiquote.org\/wiki\/Edsger_W._Dijkstra. Accessed: 2025-10-17."},{"key":"e_1_3_3_1_83_2","doi-asserted-by":"publisher","DOI":"10.1145\/2486001.2486011"},{"key":"e_1_3_3_1_84_2","unstructured":"Shijie Xia Yuhan Sun and Pengfei Liu. 2025. SR-Scientist: Scientific Equation Discovery With Agentic AI. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2510.11661 (2025)."},{"key":"e_1_3_3_1_85_2","unstructured":"Qiujie Xie Yixuan Weng Minjun Zhu Fuchen Shen Shulin Huang Zhen Lin Jiahui Zhou Zilan Mao Zijie Yang Linyi Yang et\u00a0al. 2025. How Far Are AI Scientists from Changing the World? arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2507.23276 (2025)."},{"key":"e_1_3_3_1_86_2","first-page":"495","volume-title":"NSDI","author":"Yan Francis\u00a0Y","year":"2020","unstructured":"Francis\u00a0Y Yan, Hudson Ayers, Chenzhi Zhu, Sadjad Fouladi, James Hong, Keyi Zhang, Philip Levis, and Keith Winstein. 2020. Learning in situ: A randomized experiment in video streaming. In NSDI. 495\u2013511."},{"key":"e_1_3_3_1_87_2","unstructured":"Minghao Yan Bo Peng Benjamin Coleman Ziqi Chen Zhouhang Xie Shuo Chen Zhankui He Noveen Sachdeva Isabella Ye Weili Wang et\u00a0al. 2026. Pacevolve: Enabling long-horizon progress-aware consistent evolution. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2601.10657 (2026)."},{"key":"e_1_3_3_1_88_2","unstructured":"Yuqing Yang Yuedong Xu and Lei Jiao. 2024. A Queueing Theoretic Perspective on Low-Latency LLM Inference with Variable Token Length. arxiv:https:\/\/arXiv.org\/abs\/2407.05347\u00a0[cs.NI] https:\/\/arxiv.org\/abs\/2407.05347"},{"key":"e_1_3_3_1_89_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v39i25.34922"},{"key":"e_1_3_3_1_90_2","first-page":"521","volume-title":"OSDI","author":"Yu Gyeong-In","year":"2022","unstructured":"Gyeong-In Yu, Joo\u00a0Seong Jeong, Geon-Woo Kim, Soojeong Kim, and Byung-Gon Chun. 2022. Orca: A Distributed Serving System for Transformer-Based Generative Models. In OSDI. USENIX Association, Carlsbad, CA, 521\u2013538. https:\/\/www.usenix.org\/conference\/osdi22\/presentation\/yu"},{"key":"e_1_3_3_1_91_2","unstructured":"Yi Zhai Zhiqiang Wei Ruohan Li Keyu Pan Shuo Liu Lu Zhang Jianmin Ji Wuyang Zhang Yu Zhang and Yanyong Zhang. 2025. \\ (X\\)-evolve: Solution space evolution powered by large language models. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2508.07932 (2025)."},{"key":"e_1_3_3_1_92_2","doi-asserted-by":"publisher","DOI":"10.1145\/3299869.3314043"},{"key":"e_1_3_3_1_93_2","unstructured":"Lianmin Zheng Liangsheng Yin Zhiqiang Xie Chuyue Sun Jeff Huang Cody\u00a0Hao Yu Shiyi Cao Christos Kozyrakis Ion Stoica Joseph\u00a0E. Gonzalez Clark Barrett and Ying Sheng. 2024. SGLang: Efficient Execution of Structured Language Model Programs. arxiv:https:\/\/arXiv.org\/abs\/2312.07104\u00a0[cs.AI] https:\/\/arxiv.org\/abs\/2312.07104"},{"key":"e_1_3_3_1_94_2","unstructured":"Zhi Zheng Zhuoliang Xie Zhenkun Wang and Bryan Hooi. 2025. Monte carlo tree search for comprehensive exploration in llm-based automatic heuristic design. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2501.08603 (2025)."},{"key":"e_1_3_3_1_95_2","doi-asserted-by":"publisher","DOI":"10.1145\/3452296.3472902"}],"event":{"name":"CAIS '26: ACM Conference on AI and Agentic Systems","location":"San Jose CA USA","acronym":"CAIS '26"},"container-title":["Proceedings of the ACM Conference on AI and Agentic Systems"],"original-title":[],"deposited":{"date-parts":[[2026,5,22]],"date-time":"2026-05-22T03:17:10Z","timestamp":1779419830000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3786335.3813125"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,5,26]]},"references-count":94,"alternative-id":["10.1145\/3786335.3813125","10.1145\/3786335"],"URL":"https:\/\/doi.org\/10.1145\/3786335.3813125","relation":{},"subject":[],"published":{"date-parts":[[2026,5,26]]},"assertion":[{"value":"2026-05-26","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}