{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T12:08:05Z","timestamp":1763381285464,"version":"3.45.0"},"publisher-location":"New York, NY, USA","reference-count":37,"publisher":"ACM","funder":[{"DOI":"10.13039\/100000001","name":"NSF (National Science Foundation)","doi-asserted-by":"publisher","award":["2402695"],"award-info":[{"award-number":["2402695"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,11,17]]},"DOI":"10.1145\/3772356.3772391","type":"proceedings-article","created":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T12:02:48Z","timestamp":1763380968000},"page":"193-200","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Programmable and Adaptive Scheduling for Distributed Systems"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4164-3208","authenticated-orcid":false,"given":"Yuyao","family":"Wang","sequence":"first","affiliation":[{"name":"University of Washington, Seattle, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-7557-2090","authenticated-orcid":false,"given":"Xiangfeng","family":"Zhu","sequence":"additional","affiliation":[{"name":"University of Washington, Seattle, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-8005-6948","authenticated-orcid":false,"given":"Ratul","family":"Mahajan","sequence":"additional","affiliation":[{"name":"University of Washington, Seattle, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-8684-2357","authenticated-orcid":false,"given":"Stephanie","family":"Wang","sequence":"additional","affiliation":[{"name":"University of Washington, Seattle, USA"}]}],"member":"320","published-online":{"date-parts":[[2025,11,17]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"2021. On the In-Depth Cluster Scheduling and Management. https:\/\/www.alibabacloud.com\/blog\/on-the-in-depth-cluster-scheduling-and-management_598012."},{"key":"e_1_3_2_1_2_1","unstructured":"2025. Kubernetes Scheduler. https:\/\/kubernetes.io\/docs\/concepts\/scheduling-eviction\/kube-scheduler\/."},{"key":"e_1_3_2_1_3_1","unstructured":"2025. Kubernetes Scheduling Framework. https:\/\/kubernetes.io\/docs\/concepts\/scheduling-eviction\/scheduling-framework\/."},{"key":"e_1_3_2_1_4_1","unstructured":"2025. llm-d: a Kubernetes-native high-performance distributed LLM inference framework. https:\/\/llm-d.ai\/."},{"key":"e_1_3_2_1_5_1","unstructured":"2025. Ray Scheduling. https:\/\/docs.ray.io\/en\/latest\/ray-core\/scheduling\/index.html."},{"key":"e_1_3_2_1_6_1","unstructured":"2025. Uber's Journey to Ray on Kubernetes: Resource Management. https:\/\/www.uber.com\/blog\/ubers-journey-to-ray-on-kubernetes-resource-management\/."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/3542929.3563498"},{"key":"e_1_3_2_1_8_1","volume-title":"11th USENIX Symposium on Operating Systems Design and Implementation (OSDI 14)","author":"Boutin Eric","year":"2014","unstructured":"Eric Boutin, Jaliya Ekanayake, Wei Lin, Bing Shi, Jingren Zhou, Zhengping Qian, Ming Wu, and Lidong Zhou. 2014. Apollo: Scalable and coordinated scheduling for cloud-scale computing. In 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI 14). 285\u2013300."},{"key":"e_1_3_2_1_9_1","volume-title":"18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24)","author":"Choudhury Arnab","year":"2024","unstructured":"Arnab Choudhury, Yang Wang, Tuomas Pelkonen, Kutta Srinivasan, Abha Jain, Shenghao Lin, Delia David, Siavash Soleimanifard, Michael Chen, Abhishek Yadav, et al. 2024. MAST: Global scheduling of ML training across geo-distributed datacenters at hyperscale. In 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24). 563\u2013580."},{"key":"e_1_3_2_1_10_1","volume-title":"2015 USENIX Annual Technical Conference (USENIX ATC 15)","author":"Delgado Pamela","year":"2015","unstructured":"Pamela Delgado, Florin Dinu, Anne-Marie Kermarrec, and Willy Zwaenepoel. 2015. Hawk: Hybrid datacenter scheduling. In 2015 USENIX Annual Technical Conference (USENIX ATC 15). 499\u2013510."},{"key":"e_1_3_2_1_11_1","volume-title":"Manuel Mazzara, Fabrizio Montesi, Ruslan Mustafin, and Larisa Safina.","author":"Dragoni Nicola","year":"2017","unstructured":"Nicola Dragoni, Saverio Giallorenzo, Alberto Lluch Lafuente, Manuel Mazzara, Fabrizio Montesi, Ruslan Mustafin, and Larisa Safina. 2017. Microservices: yesterday, today, and tomorrow. Present and Ulterior Software Engineering (2017), 195\u2013216."},{"key":"e_1_3_2_1_12_1","volume-title":"8th USENIX Symposium on Networked Systems Design and Implementation (NSDI 11)","author":"Ghodsi Ali","year":"2011","unstructured":"Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, and Ion Stoica. 2011. Dominant resource fairness: Fair allocation of multiple resource types. In 8th USENIX Symposium on Networked Systems Design and Implementation (NSDI 11)."},{"key":"e_1_3_2_1_13_1","volume-title":"12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16)","author":"Gog Ionel","year":"2016","unstructured":"Ionel Gog, Malte Schwarzkopf, Adam Gleave, Robert NM Watson, and Steven Hand. 2016. Firmament: Fast, centralized cluster scheduling at scale. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16). 99\u2013115."},{"key":"e_1_3_2_1_14_1","volume-title":"8th USENIX Symposium on Networked Systems Design and Implementation (NSDI 11)","author":"Hindman Benjamin","year":"2011","unstructured":"Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D Joseph, Randy Katz, Scott Shenker, and Ion Stoica. 2011. Mesos: A platform for fine-grained resource sharing in the data center. In 8th USENIX Symposium on Networked Systems Design and Implementation (NSDI 11)."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3477132.3483542"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3519939.3523446"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/1629575.1629601"},{"key":"e_1_3_2_1_18_1","first-page":"2022","article-title":"A quantitative measure of fairness and discrimination. Eastern Research Laboratory, Digital Equipment Corporation, Hudson","volume":"21","author":"Jain Rajendra K","year":"1984","unstructured":"Rajendra K Jain, Dah-Ming W Chiu, William R Hawe, et al. 1984. A quantitative measure of fairness and discrimination. Eastern Research Laboratory, Digital Equipment Corporation, Hudson, MA 21, 1 (1984), 2022\u20132023.","journal-title":"MA"},{"key":"e_1_3_2_1_19_1","unstructured":"Eric Jonas Johann Schleier-Smith Vikram Sreekanti Chia-Che Tsai Anurag Khandelwal Qifan Pu Vaishaal Shankar Joao Carreira Karl Krauth Neeraja Yadwadkar et al. 2019. Cloud programming simplified: A berkeley view on serverless computing. arXiv preprint arXiv:1902.03383 (2019)."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3477132.3483548"},{"key":"e_1_3_2_1_21_1","volume-title":"2015 USENIX Annual Technical Conference (USENIX ATC 15)","author":"Karanasos Konstantinos","year":"2015","unstructured":"Konstantinos Karanasos, Sriram Rao, Carlo Curino, Chris Douglas, Kishore Chaliparambil, Giovanni Matteo Fumarola, Solom Heddaya, Raghu Ramakrishnan, and Sarvesh Sakalanaga. 2015. Mercury: Hybrid centralized and distributed scheduling in large shared clusters. In 2015 USENIX Annual Technical Conference (USENIX ATC 15). 485\u2013497."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3387514.3405874"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/3341302.3342080"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/71.963420"},{"key":"e_1_3_2_1_25_1","volume-title":"13th USENIX symposium on operating systems design and implementation (OSDI 18)","author":"Moritz Philipp","year":"2018","unstructured":"Philipp Moritz, Robert Nishihara, Stephanie Wang, Alexey Tumanov, Richard Liaw, Eric Liang, Melih Elibol, Zongheng Yang, William Paul, Michael I Jordan, et al. 2018. Ray: A distributed framework for emerging {AI} applications. In 13th USENIX symposium on operating systems design and implementation (OSDI 18). 561\u2013577."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3477132.3483588"},{"key":"e_1_3_2_1_27_1","volume-title":"2001 USENIX Annual Technical Conference (USENIX ATC 01)","author":"Nieh Jason","year":"2001","unstructured":"Jason Nieh, Christopher Vaill, and Hua Zhong. 2001. Virtual-time round-robin: an O (1) proportional share scheduler.. In 2001 USENIX Annual Technical Conference (USENIX ATC 01). 245\u2013259."},{"key":"e_1_3_2_1_28_1","volume-title":"14th Workshop on Hot Topics in Operating Systems (HotOS XIV).","author":"Ousterhout Kay","year":"2013","unstructured":"Kay Ousterhout, Aurojit Panda, Joshua Rosen, Shivaram Venkataraman, Reynold Xin, Sylvia Ratnasamy, Scott Shenker, and Ion Stoica. 2013. The case for tiny tasks in compute clusters. In 14th Workshop on Hot Topics in Operating Systems (HotOS XIV)."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2517349.2522716"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2465351.2465386"},{"key":"e_1_3_2_1_31_1","volume-title":"14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20)","author":"Suresh Lalith","year":"2020","unstructured":"Lalith Suresh, Jo\u00e3o Loff, Faria Kalim, Sangeetha Abdu Jyothi, Nina Narodytska, Leonid Ryzhyk, Sahan Gamage, Brian Oki, Pranshu Jain, and Michael Gasch. 2020. Building scalable and flexible cluster managers using declarative programming. In 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20). 827\u2013844."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2901318.2901355"},{"key":"e_1_3_2_1_33_1","volume-title":"Mapple: A Domain-Specific Language for Mapping Distributed Heterogeneous Parallel Programs. arXiv preprint arXiv:2507.17087","author":"Wei Anjiang","year":"2025","unstructured":"Anjiang Wei, Rohan Yadav, Hang Song, Wonchan Lee, Ke Wang, and Alex Aiken. 2025. Mapple: A Domain-Specific Language for Mapping Distributed Heterogeneous Parallel Programs. arXiv preprint arXiv:2507.17087 (2025)."},{"key":"e_1_3_2_1_34_1","volume-title":"19th USENIX Symposium on Operating Systems Design and Implementation (OSDI 25)","author":"Xu Zhiying","year":"2025","unstructured":"Zhiying Xu, Minlan Yu, and Francis Y Yan. 2025. Decouple and decompose: Scaling resource allocation with DeDe. In 19th USENIX Symposium on Operating Systems Design and Implementation (OSDI 25)."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2517349.2522737"},{"key":"e_1_3_2_1_36_1","volume-title":"14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17)","author":"Zhang Haoyu","year":"2017","unstructured":"Haoyu Zhang, Ganesh Ananthanarayanan, Peter Bodik, Matthai Philipose, Paramvir Bahl, and Michael J Freedman. 2017. Live video analytics at scale with approximation and delay-tolerance. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17). 377\u2013392."},{"key":"e_1_3_2_1_37_1","first-page":"62557","article-title":"SGlang: Efficient execution of structured language model programs","volume":"37","author":"Zheng Lianmin","year":"2024","unstructured":"Lianmin Zheng, Liangsheng Yin, Zhiqiang Xie, Chuyue Livia Sun, Jeff Huang, Cody Hao Yu, Shiyi Cao, Christos Kozyrakis, Ion Stoica, Joseph E Gonzalez, et al. 2024. SGlang: Efficient execution of structured language model programs. Advances in Neural Information Processing Systems 37 (2024), 62557\u201362583.","journal-title":"Advances in Neural Information Processing Systems"}],"event":{"name":"HotNets '25: 24th ACM Workshop on Hot Topics in Networks","location":"UMD Campus College Park MD USA","acronym":"HotNets '25","sponsor":["SIGCOMM ACM Special Interest Group on Data Communication"]},"container-title":["Proceedings of the 24th ACM Workshop on Hot Topics in Networks"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3772356.3772391","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T12:03:57Z","timestamp":1763381037000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3772356.3772391"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,17]]},"references-count":37,"alternative-id":["10.1145\/3772356.3772391","10.1145\/3772356"],"URL":"https:\/\/doi.org\/10.1145\/3772356.3772391","relation":{},"subject":[],"published":{"date-parts":[[2025,11,17]]},"assertion":[{"value":"2025-11-17","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}