{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,9,10]],"date-time":"2023-09-10T11:12:07Z","timestamp":1694344327618},"publisher-location":"New York, NY, USA","reference-count":81,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,8,6]]},"DOI":"10.1145\/3580305.3599241","type":"proceedings-article","created":{"date-parts":[[2023,8,4]],"date-time":"2023-08-04T18:10:58Z","timestamp":1691172658000},"update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["A Dual-Agent Scheduler for Distributed Deep Learning Jobs on Public Cloud via Reinforcement Learning"],"prefix":"10.1145","author":[{"ORCID":"http:\/\/orcid.org\/0000-0002-2065-9852","authenticated-orcid":false,"given":"Mingzhe","family":"Xing","sequence":"first","affiliation":[{"name":"Peking University, Beijing, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0002-4499-7581","authenticated-orcid":false,"given":"Hangyu","family":"Mao","sequence":"additional","affiliation":[{"name":"Sensetime Research, Beijing, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0002-5216-9946","authenticated-orcid":false,"given":"Shenglin","family":"Yin","sequence":"additional","affiliation":[{"name":"Peking University, Beijing, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0001-7451-0140","authenticated-orcid":false,"given":"Lichen","family":"Pan","sequence":"additional","affiliation":[{"name":"Peking University, Beijing, China"}]},{"ORCID":"http:\/\/orcid.org\/0009-0002-4782-9046","authenticated-orcid":false,"given":"Zhengchao","family":"Zhang","sequence":"additional","affiliation":[{"name":"ByteDance, Shanghai, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0002-6784-9709","authenticated-orcid":false,"given":"Zhen","family":"Xiao","sequence":"additional","affiliation":[{"name":"Peking University, Beijing, 
China"}]},{"ORCID":"http:\/\/orcid.org\/0009-0007-4646-7131","authenticated-orcid":false,"given":"Jieyi","family":"Long","sequence":"additional","affiliation":[{"name":"Theta Labs, Inc., San Jose, USA"}]}],"member":"320","published-online":{"date-parts":[[2023,8,4]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10586-020-03075-5"},{"key":"e_1_3_2_2_2_1","volume-title":"Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473","author":"Bahdanau Dzmitry","year":"2014","unstructured":"Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014)."},{"key":"e_1_3_2_2_3_1","volume-title":"Deep Learning-Based Job Placement in Distributed Machine Learning Clusters With Heterogeneous Workloads","author":"Bao Yixin","year":"2022","unstructured":"Yixin Bao, Yanghua Peng, and Chuan Wu. 2022. Deep Learning-Based Job Placement in Distributed Machine Learning Clusters With Heterogeneous Workloads. IEEE\/ACM Transactions on Networking (2022)."},{"key":"e_1_3_2_2_4_1","volume-title":"Social learning strategies modify the effect of network structure on group performance. Nature communications","author":"Barkoczi Daniel","year":"2016","unstructured":"Daniel Barkoczi and Mirta Galesic. 2016. 
Social learning strategies modify the effect of network structure on group performance. Nature communications, Vol. 7, 1 (2016), 1--8."},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1613\/jair.1497"},{"key":"e_1_3_2_2_6_1","volume-title":"The complexity of decentralized control of Markov decision processes. Mathematics of operations research","author":"Bernstein Daniel S","year":"2002","unstructured":"Daniel S Bernstein, Robert Givan, Neil Immerman, and Shlomo Zilberstein. 2002. The complexity of decentralized control of Markov decision processes. Mathematics of operations research, Vol. 27, 4 (2002), 819--840."},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/71.798317"},{"key":"e_1_3_2_2_8_1","volume-title":"Packet routing in dynamically changing networks: A reinforcement learning approach. Advances in neural information processing systems","author":"Boyan Justin","year":"1993","unstructured":"Justin Boyan and Michael Littman. 1993. Packet routing in dynamically changing networks: A reinforcement learning approach. Advances in neural information processing systems, Vol. 6 (1993)."},{"key":"e_1_3_2_2_9_1","unstructured":"
Tom Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared D Kaplan Prafulla Dhariwal Arvind Neelakantan Pranav Shyam Girish Sastry Amanda Askell et al. 2020. Language models are few-shot learners. Advances in neural information processing systems Vol. 33 (2020) 1877--1901."},{"key":"e_1_3_2_2_10_1","volume-title":"Advances in Neural Information Processing Systems","volume":"26","author":"Cao Yanshuai","year":"2013","unstructured":"Yanshuai Cao, Marcus A Brubaker, David J Fleet, and Aaron Hertzmann. 2013. Efficient optimization for sparse Gaussian process regression. Advances in Neural Information Processing Systems, Vol. 26 (2013)."},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSP.2016.2537261"},{"key":"e_1_3_2_2_12_1","first-page":"929","article-title":"A task scheduling algorithm for Hadoop platform","volume":"8","author":"Chen Jilan","year":"2013","unstructured":"Jilan Chen, Dan Wang, and Wenbing Zhao. 2013. A task scheduling algorithm for Hadoop platform. Journal of Computers, Vol. 8, 4 (2013), 929--936.","journal-title":"Journal of Computers"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1002\/spe.3066"},{"key":"e_1_3_2_2_14_1","first-page":"20","article-title":"MapReduce online","volume":"10","author":"Condie Tyson","year":"2010","unstructured":"Tyson Condie, Neil Conway, Peter Alvaro, Joseph M Hellerstein, Khaled Elmeleegy, and Russell Sears. 2010. MapReduce online.. In Nsdi, Vol. 10. 
20.","journal-title":"Nsdi"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1016\/0168-9002(94)90719-6"},{"key":"e_1_3_2_2_16_1","unstructured":"Alexey Dosovitskiy Lucas Beyer Alexander Kolesnikov Dirk Weissenborn Xiaohua Zhai Thomas Unterthiner Mostafa Dehghani Matthias Minderer Georg Heigold Sylvain Gelly et al. 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1137\/0402042"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.11794"},{"key":"e_1_3_2_2_19_1","volume-title":"Advances in Neural Information Processing Systems","volume":"25","author":"Fox Emily","year":"2012","unstructured":"Emily Fox and David Dunson. 2012. Multiresolution gaussian processes. Advances in Neural Information Processing Systems, Vol. 25 (2012)."},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3472883.3486978"},{"key":"e_1_3_2_2_21_1","volume-title":"Learning theory and kernel machines","author":"G\u00e4rtner Thomas","unstructured":"Thomas G\u00e4rtner, Peter Flach, and Stefan Wrobel. 2003. On graph kernels: Hardness results and efficient alternatives. In Learning theory and kernel machines. 
Springer, 129--143."},{"key":"e_1_3_2_2_22_1","first-page":"299","article-title":"Classes of kernels for machine learning: a statistics perspective","volume":"2","author":"Genton Marc G","year":"2001","unstructured":"Marc G Genton. 2001. Classes of kernels for machine learning: a statistics perspective. Journal of machine learning research, Vol. 2, Dec (2001), 299--312.","journal-title":"Journal of machine learning research"},{"key":"e_1_3_2_2_23_1","volume-title":"8th USENIX Symposium on Networked Systems Design and Implementation (NSDI 11)","author":"Ghodsi Ali","year":"2011","unstructured":"Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, and Ion Stoica. 2011. Dominant resource fairness: Fair allocation of multiple resource types. In 8th USENIX Symposium on Networked Systems Design and Implementation (NSDI 11)."},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/2740070.2626334"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_2_26_1","volume-title":"8th USENIX Symposium on Networked Systems Design and Implementation (NSDI 11)","author":"Hindman Benjamin","year":"2011","unstructured":"
Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D Joseph, Randy Katz, Scott Shenker, and Ion Stoica. 2011. Mesos: A Platform for {Fine-Grained} Resource Sharing in the Data Center. In 8th USENIX Symposium on Networked Systems Design and Implementation (NSDI 11)."},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3458817.3476223"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/1272996.1273005"},{"key":"e_1_3_2_2_29_1","first-page":"1","article-title":"Beyond Data and Model Parallelism for Deep Neural Networks","volume":"1","author":"Jia Zhihao","year":"2019","unstructured":"Zhihao Jia, Matei Zaharia, and Alex Aiken. 2019. Beyond Data and Model Parallelism for Deep Neural Networks. Proceedings of Machine Learning and Systems, Vol. 1 (2019), 1--13.","journal-title":"Proceedings of Machine Learning and Systems"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1214\/aoms\/1177730391"},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611976700.34"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611972825.71"},{"key":"e_1_3_2_2_33_1","volume-title":"Kingma and Jimmy Ba","author":"Diederik","year":"2015","unstructured":"Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. 
In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7--9, 2015, Conference Track Proceedings, Yoshua Bengio and Yann LeCun (Eds.)."},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMTT.2010.2049768"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i04.5934"},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1609\/aiide.v18i1.21959"},{"key":"e_1_3_2_2_37_1","volume-title":"Pareto optimality. Pareto optimality, game theory and equilibria","author":"Luc Dinh The","year":"2008","unstructured":"Dinh The Luc. 2008. Pareto optimality. Pareto optimality, game theory and equilibria (2008), 481--515."},{"key":"e_1_3_2_2_38_1","volume-title":"5th Berkeley Symp. Math. Statist. Probability. 281--297","author":"MacQueen J","year":"1967","unstructured":"J MacQueen. 1967. Classification and analysis of multivariate observations. In 5th Berkeley Symp. Math. Statist. Probability. 281--297."},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3005745.3005750"},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i04.5957"},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10458-020-09455-w"},{"key":"e_1_3_2_2_42_1","volume-title":"Iterative ranking from pair-wise comparisons. Advances in neural information processing systems","author":"Negahban Sahand","year":"2012","unstructured":"Sahand Negahban, Sewoong Oh, and Devavrat Shah. 2012. Iterative ranking from pair-wise comparisons. 
Advances in neural information processing systems, Vol. 25 (2012)."},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2021.3052895"},{"key":"e_1_3_2_2_44_1","volume-title":"International conference on machine learning. PMLR, 4257--4266","author":"Raileanu Roberta","year":"2018","unstructured":"Roberta Raileanu, Emily Denton, Arthur Szlam, and Rob Fergus. 2018. Modeling others using oneself in multi-agent reinforcement learning. In International conference on machine learning. PMLR, 4257--4266."},{"key":"e_1_3_2_2_45_1","volume-title":"International conference on machine learning. PMLR, 4295--4304","author":"Rashid Tabish","year":"2018","unstructured":"Tabish Rashid, Mikayel Samvelyan, Christian Schroeder, Gregory Farquhar, Jakob Foerster, and Shimon Whiteson. 2018. Qmix: Monotonic value function factorisation for deep multi-agent reinforcement learning. In International conference on machine learning. PMLR, 4295--4304."},{"key":"e_1_3_2_2_46_1","volume-title":"Expected value, reward outcome, and temporal difference error representations in a probabilistic decision task. Cerebral cortex","author":"Rolls Edmund T","year":"2008","unstructured":"Edmund T Rolls, Ciara McCabe, and Jerome Redoute. 2008. 
Expected value, reward outcome, and temporal difference error representations in a probabilistic decision task. Cerebral cortex, Vol. 18, 3 (2008), 652--663."},{"key":"e_1_3_2_2_47_1","volume-title":"Multivariate uncertainty in deep learning","author":"Russell Rebecca L","year":"2021","unstructured":"Rebecca L Russell and Christopher Reale. 2021. Multivariate uncertainty in deep learning. IEEE Transactions on Neural Networks and Learning Systems (2021)."},{"key":"e_1_3_2_2_48_1","volume-title":"High-dimensional continuous control using generalized advantage estimation. arXiv preprint arXiv:1506.02438","author":"Schulman John","year":"2015","unstructured":"John Schulman, Philipp Moritz, Sergey Levine, Michael Jordan, and Pieter Abbeel. 2015. High-dimensional continuous control using generalized advantage estimation. arXiv preprint arXiv:1506.02438 (2015)."},{"key":"e_1_3_2_2_49_1","volume-title":"Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347","author":"Schulman John","year":"2017","unstructured":"John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. 2017. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 (2017)."},{"key":"e_1_3_2_2_50_1","volume-title":"SODA","volume":"98","author":"Schwiegelshohn Uwe","year":"1998","unstructured":"
Uwe Schwiegelshohn and Ramin Yahyapour. 1998. Analysis of first-come-first-serve parallel job scheduling. In SODA, Vol. 98. Citeseer, 629--638."},{"key":"e_1_3_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10878-020-00607-y"},{"key":"e_1_3_2_2_52_1","volume-title":"Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)."},{"key":"e_1_3_2_2_53_1","unstructured":"Sainbayar Sukhbaatar Rob Fergus et al. 2016. Learning multiagent communication with backpropagation. Advances in neural information processing systems Vol. 29 (2016)."},{"key":"e_1_3_2_2_54_1","volume-title":"Vinicius Zambaldi, Max Jaderberg, Marc Lanctot, Nicolas Sonnerat, Joel Z Leibo, Karl Tuyls, et al.","author":"Sunehag Peter","year":"2017","unstructured":"Peter Sunehag, Guy Lever, Audrunas Gruslys, Wojciech Marian Czarnecki, Vinicius Zambaldi, Max Jaderberg, Marc Lanctot, Nicolas Sonnerat, Joel Z Leibo, Karl Tuyls, et al. 2017. Value-decomposition networks for cooperative multi-agent learning. arXiv preprint arXiv:1706.05296 (2017)."},{"key":"e_1_3_2_2_55_1","volume-title":"Sequence to sequence learning with neural networks. 
Advances in neural information processing systems","author":"Sutskever Ilya","year":"2014","unstructured":"Ilya Sutskever, Oriol Vinyals, and Quoc V Le. 2014. Sequence to sequence learning with neural networks. Advances in neural information processing systems, Vol. 27 (2014)."},{"key":"e_1_3_2_2_56_1","volume-title":"Reinforcement learning: An introduction","author":"Sutton Richard S","unstructured":"Richard S Sutton and Andrew G Barto. 2018. Reinforcement learning: An introduction. MIT press."},{"key":"e_1_3_2_2_57_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2015.05.387"},{"key":"e_1_3_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/CCIS.2011.6045081"},{"key":"e_1_3_2_2_59_1","volume-title":"Attention is all you need. Advances in neural information processing systems","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems, Vol. 30 (2017)."},{"key":"e_1_3_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/2523616.2523633"},{"key":"e_1_3_2_2_61_1","doi-asserted-by":"publisher","DOI":"10.5555\/1756006.1859891"},{"key":"e_1_3_2_2_62_1","volume-title":"Machine learning","author":"Watkins Christopher JCH","year":"1992","unstructured":"
Christopher JCH Watkins and Peter Dayan. 1992. Q-learning. Machine learning, Vol. 8, 3 (1992), 279--292."},{"key":"e_1_3_2_2_63_1","volume-title":"19th {USENIX} Symposium on Networked Systems Design and Implementation ({NSDI} 22).","author":"Weng Qizhen","unstructured":"Qizhen Weng, Wencong Xiao, Yinghao Yu, Wei Wang, Cheng Wang, Jian He, Yong Li, Liping Zhang, Wei Lin, and Yu Ding. 2022. MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters. In 19th {USENIX} Symposium on Networked Systems Design and Implementation ({NSDI} 22)."},{"key":"e_1_3_2_2_64_1","volume-title":"Machine Learning: Proceedings of the Seventeenth International Conference (ICML'2000)","author":"Marco","unstructured":"Marco A Wiering et al. 2000. Multi-agent reinforcement learning for traffic light control. In Machine Learning: Proceedings of the Seventeenth International Conference (ICML'2000). 1151--1158."},{"key":"e_1_3_2_2_65_1","volume-title":"Gaussian processes for machine learning","author":"Williams Christopher KI","unstructured":"Christopher KI Williams and Carl Edward Rasmussen. 2006. Gaussian processes for machine learning. Vol. 2. MIT press Cambridge, MA."},{"key":"e_1_3_2_2_66_1","volume-title":"Simple statistical gradient-following algorithms for connectionist reinforcement learning. 
Machine learning","author":"Williams Ronald J","year":"1992","unstructured":"Ronald J Williams. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning, Vol. 8, 3 (1992), 229--256."},{"key":"e_1_3_2_2_67_1","unstructured":"Andrew Gordon Wilson Zhiting Hu Ruslan Salakhutdinov and Eric P Xing. 2016a. Deep kernel learning. In Artificial intelligence and statistics. PMLR 370--378."},{"key":"e_1_3_2_2_68_1","volume-title":"Stochastic variational deep kernel learning. Advances in neural information processing systems","author":"Wilson Andrew G","year":"2016","unstructured":"Andrew G Wilson, Zhiting Hu, Russ R Salakhutdinov, and Eric P Xing. 2016b. Stochastic variational deep kernel learning. Advances in neural information processing systems, Vol. 29 (2016)."},{"key":"e_1_3_2_2_69_1","volume-title":"Wilcoxon signed-rank test","author":"Woolson Robert F","year":"2007","unstructured":"Robert F Woolson. 2007. Wilcoxon signed-rank test. 
Wiley encyclopedia of clinical trials (2007), 1--3."},{"key":"e_1_3_2_2_70_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICHI52183.2021.00022"},{"key":"e_1_3_2_2_71_1","volume-title":"14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20)","author":"Xiao Wencong","year":"2020","unstructured":"Wencong Xiao, Shiru Ren, Yong Li, Yang Zhang, Pengyang Hou, Zhi Li, Yihui Feng, Wei Lin, and Yangqing Jia. 2020. {AntMan}: Dynamic Scaling on {GPU} Clusters for Deep Learning. In 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20). 533--548."},{"key":"e_1_3_2_2_72_1","doi-asserted-by":"publisher","DOI":"10.1145\/3447548.3467079"},{"key":"e_1_3_2_2_73_1","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2022\/80"},{"key":"e_1_3_2_2_74_1","volume-title":"Analysis of Resource Management Methods Based on Reinforcement Learning. In 2021 International Conference on High Performance Big Data and Intelligent Systems (HPBD&IS). IEEE, 27--31","author":"Xing Mingzhe","year":"2021","unstructured":"Mingzhe Xing, Ziyun Wang, and Zhen Xiao. 2021b. Analysis of Resource Management Methods Based on Reinforcement Learning. In 2021 International Conference on High Performance Big Data and Intelligent Systems (HPBD&IS). IEEE, 27--31."},{"key":"e_1_3_2_2_75_1","volume-title":"Uncertainty-aware scheduling of real-time workflows under deadline constraints on multi-cloud systems. 
Concurrency and Computation: Practice and Experience","author":"Xu Jin","year":"2022","unstructured":"Jin Xu, Huiqun Yu, Guisheng Fan, and Jiayin Zhang. 2022. Uncertainty-aware scheduling of real-time workflows under deadline constraints on multi-cloud systems. Concurrency and Computation: Practice and Experience (2022), e7562."},{"key":"e_1_3_2_2_76_1","volume-title":"ASTRAEA: A Fair Deep Learning Scheduler for Multi-tenant GPU Clusters","author":"Ye Zhisheng","year":"2021","unstructured":"Zhisheng Ye, Peng Sun, Wei Gao, Tianwei Zhang, Xiaolin Wang, Shengen Yan, and Yingwei Luo. 2021. ASTRAEA: A Fair Deep Learning Scheduler for Multi-tenant GPU Clusters. IEEE Transactions on Parallel and Distributed Systems (2021)."},{"key":"e_1_3_2_2_77_1","volume-title":"The surprising effectiveness of ppo in cooperative, multi-agent games. arXiv preprint arXiv:2103.01955","author":"Yu Chao","year":"2021","unstructured":"Chao Yu, Akash Velu, Eugene Vinitsky, Yu Wang, Alexandre Bayen, and Yi Wu. 2021. The surprising effectiveness of ppo in cooperative, multi-agent games. 
arXiv preprint arXiv:2103.01955 (2021)."},{"key":"e_1_3_2_2_78_1","volume-title":"2nd USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 10)","author":"Zaharia Matei","year":"2010","unstructured":"Matei Zaharia, Mosharaf Chowdhury, Michael J Franklin, Scott Shenker, and Ion Stoica. 2010. Spark: Cluster computing with working sets. In 2nd USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 10)."},{"key":"e_1_3_2_2_79_1","volume-title":"18th USENIX Symposium on Networked Systems Design and Implementation (NSDI 21)","author":"Zhang Hong","year":"2021","unstructured":"Hong Zhang, Yupeng Tang, Anurag Khandelwal, Jingrong Chen, and Ion Stoica. 2021. Caerus:{NIMBLE} Task Scheduling for Serverless Analytics. In 18th USENIX Symposium on Networked Systems Design and Implementation (NSDI 21). 653--669."},{"key":"e_1_3_2_2_80_1","doi-asserted-by":"publisher","DOI":"10.1109\/SSCI47803.2020.9308468"},{"key":"e_1_3_2_2_81_1","volume-title":"Deep reinforcement learning-based methods for resource scheduling in cloud computing: A review and future directions. arXiv preprint arXiv:2105.04086","author":"Zhou Guangyao","year":"2021","unstructured":"Guangyao Zhou, Wenhong Tian, and Rajkumar Buyya. 2021. 
Deep reinforcement learning-based methods for resource scheduling in cloud computing: A review and future directions. arXiv preprint arXiv:2105.04086 (2021)."}],"event":{"name":"KDD '23: The 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining","location":"Long Beach CA USA","acronym":"KDD '23","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data"]},"container-title":["Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3580305.3599241","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,9]],"date-time":"2023-09-09T05:47:58Z","timestamp":1694238478000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3580305.3599241"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,4]]},"references-count":81,"alternative-id":["10.1145\/3580305.3599241","10.1145\/3580305"],"URL":"http:\/\/dx.doi.org\/10.1145\/3580305.3599241","relation":{},"published":{"date-parts":[[2023,8,4]]},"assertion":[{"value":"2023-08-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}