{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:16:08Z","timestamp":1750220168584,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":54,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,6,12]],"date-time":"2022-06-12T00:00:00Z","timestamp":1654992000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,6,12]]},"DOI":"10.1145\/3533028.3533309","type":"proceedings-article","created":{"date-parts":[[2022,5,23]],"date-time":"2022-05-23T22:19:46Z","timestamp":1653344386000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Accelerating container-based deep learning hyperparameter optimization workloads"],"prefix":"10.1145","author":[{"given":"Rui","family":"Liu","sequence":"first","affiliation":[{"name":"University of Chicago"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David","family":"Wong","sequence":"additional","affiliation":[{"name":"DocuSign, Inc."}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dave","family":"Lange","sequence":"additional","affiliation":[{"name":"DocuSign, Inc."}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Patrik","family":"Larsson","sequence":"additional","affiliation":[{"name":"DocuSign, Inc."}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Vinay","family":"Jethava","sequence":"additional","affiliation":[{"name":"DocuSign, Inc."}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qing","family":"Zheng","sequence":"additional","affiliation":[{"name":"DocuSign, Inc."}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,6,12]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"TensorFlow: A System for Large-Scale Machine Learning. In USENIX Symposium on Operating Systems Design and Implementation (OSDI). 265--283","author":"Abadi Mart\u00edn","year":"2016","unstructured":"Mart\u00edn Abadi , Paul Barham , Jianmin Chen , Zhifeng Chen , Andy Davis , Jeffrey Dean , Matthieu Devin , Sanjay Ghemawat , Geoffrey Irving , Michael Isard , Manjunath Kudlur , Josh Levenberg , Rajat Monga , Sherry Moore , Derek Gordon Murray , Benoit Steiner , Paul A. Tucker , Vijay Vasudevan , Pete Warden , Martin Wicke , Yuan Yu , and Xiaoqiang Zheng . 2016 . TensorFlow: A System for Large-Scale Machine Learning. In USENIX Symposium on Operating Systems Design and Implementation (OSDI). 265--283 . Mart\u00edn Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, Manjunath Kudlur, Josh Levenberg, Rajat Monga, Sherry Moore, Derek Gordon Murray, Benoit Steiner, Paul A. Tucker, Vijay Vasudevan, Pete Warden, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2016. TensorFlow: A System for Large-Scale Machine Learning. In USENIX Symposium on Operating Systems Design and Implementation (OSDI). 265--283."},{"key":"e_1_3_2_1_2_1","volume-title":"Ease.ML: A Lifecycle Management System for MLDev and MLOps. In Conference on Innovative Data Systems Research (CIDR).","author":"Aguilar Leonel","year":"2021","unstructured":"Leonel Aguilar , David Dao , Shaoduo Gan , Nezihe Merve Gurel , Nora Hollenstein , Jiawei Jiang , Bojan Karlas , Thomas Lemmin , Tian Li , Yang Li , Susie Rao , Johannes Rausch , Cedric Renggli , Luka Rimanic , Maurice Weber , Shuai Zhang , Zhikuan Zhao , Kevin Schawinski , Wentao Wu , and Ce Zhang . 2021 . Ease.ML: A Lifecycle Management System for MLDev and MLOps. In Conference on Innovative Data Systems Research (CIDR). Leonel Aguilar, David Dao, Shaoduo Gan, Nezihe Merve Gurel, Nora Hollenstein, Jiawei Jiang, Bojan Karlas, Thomas Lemmin, Tian Li, Yang Li, Susie Rao, Johannes Rausch, Cedric Renggli, Luka Rimanic, Maurice Weber, Shuai Zhang, Zhikuan Zhao, Kevin Schawinski, Wentao Wu, and Ce Zhang. 2021. Ease.ML: A Lifecycle Management System for MLDev and MLOps. In Conference on Innovative Data Systems Research (CIDR)."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330701"},{"key":"e_1_3_2_1_4_1","unstructured":"Amazon. 2022. SageMaker. https:\/\/aws.amazon.com\/sagemaker\/.  Amazon. 2022. SageMaker. https:\/\/aws.amazon.com\/sagemaker\/."},{"key":"e_1_3_2_1_5_1","volume-title":"International Conference on Machine Learning (ICML)","volume":"28","author":"Bergstra James","unstructured":"James Bergstra , Daniel Yamins , and David D. Cox . 2013. Making a Science of Model Search: Hyperparameter Optimization in Hundreds of Dimensions for Vision Architectures . In International Conference on Machine Learning (ICML) , Vol. 28 . 115--123. James Bergstra, Daniel Yamins, and David D. Cox. 2013. Making a Science of Model Search: Hyperparameter Optimization in Hundreds of Dimensions for Vision Architectures. In International Conference on Machine Learning (ICML), Vol. 28. 115--123."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3399579.3399867"},{"key":"e_1_3_2_1_7_1","volume-title":"TVM: An Automated End-to-End Optimizing Compiler for Deep Learning. In USENIX Symposium on Operating Systems Design and Implementation (OSDI). 578--594","author":"Chen Tianqi","year":"2018","unstructured":"Tianqi Chen , Thierry Moreau , Ziheng Jiang , Lianmin Zheng , Eddie Q. Yan , Haichen Shen , Meghan Cowan , Leyuan Wang , Yuwei Hu , Luis Ceze , Carlos Guestrin , and Arvind Krishnamurthy . 2018 . TVM: An Automated End-to-End Optimizing Compiler for Deep Learning. In USENIX Symposium on Operating Systems Design and Implementation (OSDI). 578--594 . Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Q. Yan, Haichen Shen, Meghan Cowan, Leyuan Wang, Yuwei Hu, Luis Ceze, Carlos Guestrin, and Arvind Krishnamurthy. 2018. TVM: An Automated End-to-End Optimizing Compiler for Deep Learning. In USENIX Symposium on Operating Systems Design and Implementation (OSDI). 578--594."},{"volume-title":"DLSpec: A Deep Learning Task Exchange Specification. In USENIX Conference on Operational Machine Learning (OpML).","author":"Dakkak Abdul","key":"e_1_3_2_1_8_1","unstructured":"Abdul Dakkak , Cheng Li , Jinjun Xiong , and Wen-mei W. Hwu . [n. d.] . DLSpec: A Deep Learning Task Exchange Specification. In USENIX Conference on Operational Machine Learning (OpML). Abdul Dakkak, Cheng Li, Jinjun Xiong, and Wen-mei W. Hwu. [n. d.]. DLSpec: A Deep Learning Task Exchange Specification. In USENIX Conference on Operational Machine Learning (OpML)."},{"key":"e_1_3_2_1_9_1","volume-title":"Spotlight: Optimizing Device Placement for Training Deep Neural Networks. In International Conference on Machine Learning (ICML)","volume":"80","author":"Gao Yuanxiang","year":"2018","unstructured":"Yuanxiang Gao , Li Chen , and Baochun Li . 2018 . Spotlight: Optimizing Device Placement for Training Deep Neural Networks. In International Conference on Machine Learning (ICML) , Vol. 80 . 1662--1670. Yuanxiang Gao, Li Chen, and Baochun Li. 2018. Spotlight: Optimizing Device Placement for Training Deep Neural Networks. In International Conference on Machine Learning (ICML), Vol. 80. 1662--1670."},{"volume-title":"Deep Learning","author":"Goodfellow Ian","key":"e_1_3_2_1_10_1","unstructured":"Ian Goodfellow , Yoshua Bengio , and Aaron Courville . 2016. Deep Learning . MIT Press . http:\/\/www.deeplearningbook.org. Ian Goodfellow, Yoshua Bengio, and Aaron Courville. 2016. Deep Learning. MIT Press. http:\/\/www.deeplearningbook.org."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2005.06.042"},{"key":"e_1_3_2_1_12_1","volume-title":"Tiresias: A GPU Cluster Manager for Distributed Deep Learning. In USENIX Symposium on Networked Systems Design and Implementation (NSDI). 485--500","author":"Gu Juncheng","year":"2019","unstructured":"Juncheng Gu , Mosharaf Chowdhury , Kang G. Shin , Yibo Zhu , Myeongjae Jeon , Junjie Qian , Hongqiang Harry Liu , and Chuanxiong Guo . 2019 . Tiresias: A GPU Cluster Manager for Distributed Deep Learning. In USENIX Symposium on Networked Systems Design and Implementation (NSDI). 485--500 . Juncheng Gu, Mosharaf Chowdhury, Kang G. Shin, Yibo Zhu, Myeongjae Jeon, Junjie Qian, Hongqiang Harry Liu, and Chuanxiong Guo. 2019. Tiresias: A GPU Cluster Manager for Distributed Deep Learning. In USENIX Symposium on Networked Systems Design and Implementation (NSDI). 485--500."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2018.00059"},{"key":"e_1_3_2_1_14_1","volume-title":"Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 770--778","author":"He Kaiming","year":"2016","unstructured":"Kaiming He , Xiangyu Zhang , Shaoqing Ren , and Jian Sun . 2016 . Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 770--778 . Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 770--778."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_2_1_16_1","volume-title":"MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. CoRR abs\/1704.04861","author":"Howard Andrew G.","year":"2017","unstructured":"Andrew G. Howard , Menglong Zhu , Bo Chen , Dmitry Kalenichenko , Weijun Wang , Tobias Weyand , Marco Andreetto , and Hartwig Adam . 2017. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. CoRR abs\/1704.04861 ( 2017 ). Andrew G. Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, and Hartwig Adam. 2017. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. CoRR abs\/1704.04861 (2017)."},{"volume-title":"Densely Connected Convolutional Networks. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2261--2269","author":"Huang Gao","key":"e_1_3_2_1_17_1","unstructured":"Gao Huang , Zhuang Liu , Laurens van der Maaten, and Kilian Q. Weinberger. 2017 . Densely Connected Convolutional Networks. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2261--2269 . Gao Huang, Zhuang Liu, Laurens van der Maaten, and Kilian Q. Weinberger. 2017. Densely Connected Convolutional Networks. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2261--2269."},{"key":"e_1_3_2_1_18_1","volume-title":"SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and &lt;1MB model size. CoRR abs\/1602.07360","author":"Iandola Forrest N.","year":"2016","unstructured":"Forrest N. Iandola , Matthew W. Moskewicz , Khalid Ashraf , Song Han , William J. Dally , and Kurt Keutzer . 2016. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and &lt;1MB model size. CoRR abs\/1602.07360 ( 2016 ). Forrest N. Iandola, Matthew W. Moskewicz, Khalid Ashraf, Song Han, William J. Dally, and Kurt Keutzer. 2016. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and &lt;1MB model size. CoRR abs\/1602.07360 (2016)."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3361525.3361538"},{"key":"e_1_3_2_1_20_1","volume-title":"Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training Workloads. In USENIX Annual Technical Conference (ATC). 947--960","author":"Jeon Myeongjae","year":"2019","unstructured":"Myeongjae Jeon , Shivaram Venkataraman , Amar Phanishayee , Junjie Qian , Wencong Xiao , and Fan Yang . 2019 . Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training Workloads. In USENIX Annual Technical Conference (ATC). 947--960 . Myeongjae Jeon, Shivaram Venkataraman, Amar Phanishayee, Junjie Qian, Wencong Xiao, and Fan Yang. 2019. Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training Workloads. In USENIX Annual Technical Conference (ATC). 947--960."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/3183713.3190656"},{"volume-title":"Fundamentals of statistical signal processing","author":"Kay Steven","key":"e_1_3_2_1_22_1","unstructured":"Steven Kay . 1993. Fundamentals of statistical signal processing . Prentice Hall PTR. Steven Kay. 1993. Fundamentals of statistical signal processing. Prentice Hall PTR."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.14778\/3342263.3342276"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3352020.3352022"},{"key":"e_1_3_2_1_25_1","article-title":"Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization","volume":"18","author":"Li Lisha","year":"2017","unstructured":"Lisha Li , Kevin G. Jamieson , Giulia DeSalvo , Afshin Rostamizadeh , and Ameet Talwalkar . 2017 . Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization . Journal of Machine Learning Research 18 (2017), 185:1--185:52. Lisha Li, Kevin G. Jamieson, Giulia DeSalvo, Afshin Rostamizadeh, and Ameet Talwalkar. 2017. Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization. Journal of Machine Learning Research 18 (2017), 185:1--185:52.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3187009.3177737"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3357223.3362719"},{"volume-title":"Fifth Workshop on Data Management for End-To-End Machine Learning, In conjunction with the ACM SIGMOD Conference (DEEM@SIGMOD). 3:1--3:11","author":"Liu Rui","key":"e_1_3_2_1_28_1","unstructured":"Rui Liu , Sanjay Krishnan , Aaron J. Elmore , and Michael J. Franklin . 2021. Understanding and optimizing packed neural network training for hyper-parameter tuning . In Fifth Workshop on Data Management for End-To-End Machine Learning, In conjunction with the ACM SIGMOD Conference (DEEM@SIGMOD). 3:1--3:11 . Rui Liu, Sanjay Krishnan, Aaron J. Elmore, and Michael J. Franklin. 2021. Understanding and optimizing packed neural network training for hyper-parameter tuning. In Fifth Workshop on Data Management for End-To-End Machine Learning, In conjunction with the ACM SIGMOD Conference (DEEM@SIGMOD). 3:1--3:11."},{"key":"e_1_3_2_1_29_1","article-title":"Docker: Lightweight Linux Containers for Consistent Development and Deployment","volume":"2014","author":"Merkel Dirk","year":"2014","unstructured":"Dirk Merkel . 2014 . Docker: Lightweight Linux Containers for Consistent Development and Deployment . Linux Journal 2014 , 239, Article 2 (2014). Dirk Merkel. 2014. Docker: Lightweight Linux Containers for Consistent Development and Deployment. Linux Journal 2014, 239, Article 2 (2014).","journal-title":"Linux Journal"},{"key":"e_1_3_2_1_30_1","volume-title":"International Conference on Learning Representations (ICLR).","author":"Mirhoseini Azalia","year":"2018","unstructured":"Azalia Mirhoseini , Anna Goldie , Hieu Pham , Benoit Steiner , Quoc V. Le , and Jeff Dean . 2018 . A Hierarchical Model for Device Placement . In International Conference on Learning Representations (ICLR). Azalia Mirhoseini, Anna Goldie, Hieu Pham, Benoit Steiner, Quoc V. Le, and Jeff Dean. 2018. A Hierarchical Model for Device Placement. In International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_1_31_1","volume-title":"Device Placement Optimization with Reinforcement Learning. In International Conference on Machine Learning (ICML)","volume":"70","author":"Mirhoseini Azalia","year":"2017","unstructured":"Azalia Mirhoseini , Hieu Pham , Quoc V. Le , Benoit Steiner , Rasmus Larsen , Yuefeng Zhou , Naveen Kumar , Mohammad Norouzi , Samy Bengio , and Jeff Dean . 2017 . Device Placement Optimization with Reinforcement Learning. In International Conference on Machine Learning (ICML) , Vol. 70 . 2430--2439. Azalia Mirhoseini, Hieu Pham, Quoc V. Le, Benoit Steiner, Rasmus Larsen, Yuefeng Zhou, Naveen Kumar, Mohammad Norouzi, Samy Bengio, and Jeff Dean. 2017. Device Placement Optimization with Reinforcement Learning. In International Conference on Machine Learning (ICML), Vol. 70. 2430--2439."},{"key":"e_1_3_2_1_32_1","volume-title":"Ray: A Distributed Framework for Emerging AI Applications. In USENIX Symposium on Operating Systems Design and Implementation (OSDI), Andrea C. Arpaci-Dusseau and Geoff Voelker (Eds.). 561--577","author":"Moritz Philipp","year":"2018","unstructured":"Philipp Moritz , Robert Nishihara , Stephanie Wang , Alexey Tumanov , Richard Liaw , Eric Liang , Melih Elibol , Zongheng Yang , William Paul , Michael I. Jordan , and Ion Stoica . 2018 . Ray: A Distributed Framework for Emerging AI Applications. In USENIX Symposium on Operating Systems Design and Implementation (OSDI), Andrea C. Arpaci-Dusseau and Geoff Voelker (Eds.). 561--577 . Philipp Moritz, Robert Nishihara, Stephanie Wang, Alexey Tumanov, Richard Liaw, Eric Liang, Melih Elibol, Zongheng Yang, William Paul, Michael I. Jordan, and Ion Stoica. 2018. Ray: A Distributed Framework for Emerging AI Applications. In USENIX Symposium on Operating Systems Design and Implementation (OSDI), Andrea C. Arpaci-Dusseau and Geoff Voelker (Eds.). 561--577."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/3341301.3359646"},{"key":"e_1_3_2_1_34_1","volume-title":"NeurIPS Workshop on Systems for Machine Learning. 20","author":"Narayanan Deepak","year":"2018","unstructured":"Deepak Narayanan , Keshav Santhanam , Amar Phanishayee , and Matei Zaharia . 2018 . Accelerating deep learning workloads through efficient multi-model execution . In NeurIPS Workshop on Systems for Machine Learning. 20 . Deepak Narayanan, Keshav Santhanam, Amar Phanishayee, and Matei Zaharia. 2018. Accelerating deep learning workloads through efficient multi-model execution. In NeurIPS Workshop on Systems for Machine Learning. 20."},{"key":"e_1_3_2_1_35_1","unstructured":"NVIDIA. 2021. CUDA release: 11.3.0. https:\/\/developer.nvidia.com\/cuda-toolkit  NVIDIA. 2021. CUDA release: 11.3.0. https:\/\/developer.nvidia.com\/cuda-toolkit"},{"key":"e_1_3_2_1_36_1","unstructured":"NVIDIA. 2021. Multi-Process Service. https:\/\/docs.nvidia.com\/deploy\/pdf\/CUDA_Multi_Process_Service_Overview.pdf. vR495.  NVIDIA. 2021. Multi-Process Service. https:\/\/docs.nvidia.com\/deploy\/pdf\/CUDA_Multi_Process_Service_Overview.pdf. vR495."},{"key":"e_1_3_2_1_37_1","unstructured":"NVIDIA. 2022. NVIDIA Management Library. https:\/\/developer.nvidia.com\/nvidia-management-library-nvml.  NVIDIA. 2022. NVIDIA Management Library. https:\/\/developer.nvidia.com\/nvidia-management-library-nvml."},{"key":"e_1_3_2_1_38_1","unstructured":"NVIDIA. 2022. NVIDIA Nsight Systems. https:\/\/developer.nvidia.com\/nsight-systems.  NVIDIA. 2022. NVIDIA Nsight Systems. https:\/\/developer.nvidia.com\/nsight-systems."},{"key":"e_1_3_2_1_39_1","volume-title":"High-Performance Deep Learning Library. In Conference on Neural Information Processing Systems (NeurIPS). 8024--8035","author":"Paszke Adam","year":"2019","unstructured":"Adam Paszke , SamGross, Francisco Massa , AdamLerer, James Bradbury , Gregory Chanan , Trevor Killeen , Zeming Lin , Natalia Gimelshein , Luca Antiga , Alban Desmaison , Andreas K\u00f6pf , Edward Yang , Zachary DeVito , Martin Raison , Alykhan Tejani , Sasank Chilamkurthy , Benoit Steiner , Lu Fang , Junjie Bai , and Soumith Chintala . 2019 . PyTorch: An Imperative Style , High-Performance Deep Learning Library. In Conference on Neural Information Processing Systems (NeurIPS). 8024--8035 . Adam Paszke, SamGross, Francisco Massa, AdamLerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas K\u00f6pf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Conference on Neural Information Processing Systems (NeurIPS). 8024--8035."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/3190508.3190517"},{"volume-title":"Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning. In USENIX Symposium on Operating Systems Design and Implementation (OSDI).","author":"Qiao Aurick","key":"e_1_3_2_1_41_1","unstructured":"Aurick Qiao , Sang Keun Choe , Suhas Jayaram Subramanya , Willie Neiswanger , Qirong Ho , Hao Zhang , Gregory R. Ganger , and Eric P. Xing . 2021 . Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning. In USENIX Symposium on Operating Systems Design and Implementation (OSDI). Aurick Qiao, Sang Keun Choe, Suhas Jayaram Subramanya, Willie Neiswanger, Qirong Ho, Hao Zhang, Gregory R. Ganger, and Eric P. Xing. 2021. Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning. In USENIX Symposium on Operating Systems Design and Implementation (OSDI)."},{"key":"e_1_3_2_1_42_1","volume-title":"a distilled version of BERT: smaller, faster, cheaper and lighter. CoRR abs\/1910.01108","author":"Sanh Victor","year":"2019","unstructured":"Victor Sanh , Lysandre Debut , Julien Chaumond , and Thomas Wolf . 2019. DistilBERT , a distilled version of BERT: smaller, faster, cheaper and lighter. CoRR abs\/1910.01108 ( 2019 ). Victor Sanh, Lysandre Debut, Julien Chaumond, and Thomas Wolf. 2019. DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. CoRR abs\/1910.01108 (2019)."},{"key":"e_1_3_2_1_43_1","volume-title":"Horovod: fast and easy distributed deep learning in TensorFlow. CoRR abs\/1802.05799","author":"Sergeev Alexander","year":"2018","unstructured":"Alexander Sergeev and Mike Del Balso . 2018. Horovod: fast and easy distributed deep learning in TensorFlow. CoRR abs\/1802.05799 ( 2018 ). Alexander Sergeev and Mike Del Balso. 2018. Horovod: fast and easy distributed deep learning in TensorFlow. CoRR abs\/1802.05799 (2018)."},{"key":"e_1_3_2_1_44_1","volume-title":"Very Deep Convolutional Networks for Large-Scale Image Recognition. In International Conference on Learning Representations (ICLR).","author":"Simonyan Karen","year":"2015","unstructured":"Karen Simonyan and Andrew Zisserman . 2015 . Very Deep Convolutional Networks for Large-Scale Image Recognition. In International Conference on Learning Representations (ICLR). Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_1_45_1","volume-title":"LSTM Neural Networks for Language Modeling. In Annual Conference of the International Speech Communication Association (INTERSPEECH). 194--197","author":"Sundermeyer Martin","year":"2012","unstructured":"Martin Sundermeyer , Ralf Schl\u00fcter , and Hermann Ney . 2012 . LSTM Neural Networks for Language Modeling. In Annual Conference of the International Speech Communication Association (INTERSPEECH). 194--197 . Martin Sundermeyer, Ralf Schl\u00fcter, and Hermann Ney. 2012. LSTM Neural Networks for Language Modeling. In Annual Conference of the International Speech Communication Association (INTERSPEECH). 194--197."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"e_1_3_2_1_47_1","volume-title":"EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. In International Conference on Machine Learning (ICML)","volume":"97","author":"Tan Mingxing","unstructured":"Mingxing Tan and Quoc V. Le . 2019 . EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. In International Conference on Machine Learning (ICML) , Vol. 97 . 6105--6114. Mingxing Tan and Quoc V. Le. 2019. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. In International Conference on Machine Learning (ICML), Vol. 97. 6105--6114."},{"key":"e_1_3_2_1_48_1","volume-title":"Well-Read Students Learn Better: The Impact of Student Initialization on Knowledge Distillation. CoRR abs\/1908.08962","author":"Turc Iulia","year":"2019","unstructured":"Iulia Turc , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2019. Well-Read Students Learn Better: The Impact of Student Initialization on Knowledge Distillation. CoRR abs\/1908.08962 ( 2019 ). Iulia Turc, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. Well-Read Students Learn Better: The Impact of Student Initialization on Knowledge Distillation. CoRR abs\/1908.08962 (2019)."},{"key":"e_1_3_2_1_49_1","volume-title":"Annual Conference on Neural Information Processing System (NIPS). 5998--6008","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani , Noam Shazeer , Niki Parmar , Jakob Uszkoreit , Llion Jones , Aidan N. Gomez , Lukasz Kaiser , and Illia Polosukhin . 2017 . Attention is All you Need . In Annual Conference on Neural Information Processing System (NIPS). 5998--6008 . Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is All you Need. In Annual Conference on Neural Information Processing System (NIPS). 5998--6008."},{"volume-title":"Transformers: State-of-the-Art Natural Language Processing. In Conference on Empirical Methods in Natural Language Processing (EMNLP): System Demonstrations. 38--45","author":"Wolf Thomas","key":"e_1_3_2_1_50_1","unstructured":"Thomas Wolf , Lysandre Debut , Victor Sanh , Julien Chaumond , Clement Delangue , Anthony Moi , Pierric Cistac , Tim Rault , R\u00e9mi Louf , Morgan Funtowicz , Joe Davison , Sam Shleifer , Patrick von Platen , Clara Ma , Yacine Jernite , Julien Plu , Canwen Xu , Teven Le Scao , Sylvain Gugger , Mariama Drame , Quentin Lhoest , and Alexander M. Rush . 2020 . Transformers: State-of-the-Art Natural Language Processing. In Conference on Empirical Methods in Natural Language Processing (EMNLP): System Demonstrations. 38--45 . Thomas Wolf, Lysandre Debut, Victor Sanh, Julien Chaumond, Clement Delangue, Anthony Moi, Pierric Cistac, Tim Rault, R\u00e9mi Louf, Morgan Funtowicz, Joe Davison, Sam Shleifer, Patrick von Platen, Clara Ma, Yacine Jernite, Julien Plu, Canwen Xu, Teven Le Scao, Sylvain Gugger, Mariama Drame, Quentin Lhoest, and Alexander M. Rush. 2020. Transformers: State-of-the-Art Natural Language Processing. In Conference on Empirical Methods in Natural Language Processing (EMNLP): System Demonstrations. 38--45."},{"key":"e_1_3_2_1_51_1","volume-title":"Gandiva: Introspective Cluster Scheduling for Deep Learning. In USENIX Symposium on Operating Systems Design and Implementation (OSDI). 595--610","author":"Xiao Wencong","year":"2018","unstructured":"Wencong Xiao , Romil Bhardwaj , Ramachandran Ramjee , Muthian Sivathanu , Nipun Kwatra , Zhenhua Han , Pratyush Patel , Xuan Peng , Hanyu Zhao , Quanlu Zhang , Fan Yang , and Lidong Zhou . 2018 . Gandiva: Introspective Cluster Scheduling for Deep Learning. In USENIX Symposium on Operating Systems Design and Implementation (OSDI). 595--610 . Wencong Xiao, Romil Bhardwaj, Ramachandran Ramjee, Muthian Sivathanu, Nipun Kwatra, Zhenhua Han, Pratyush Patel, Xuan Peng, Hanyu Zhao, Quanlu Zhang, Fan Yang, and Lidong Zhou. 2018. Gandiva: Introspective Cluster Scheduling for Deep Learning. In USENIX Symposium on Operating Systems Design and Implementation (OSDI). 595--610."},{"key":"e_1_3_2_1_52_1","volume-title":"SLO-Aware Machine Learning Inference Serving. In USENIX Annual Technical Conference (USENIX ATC). 1049--1062","author":"Zhang Chengliang","year":"2019","unstructured":"Chengliang Zhang , Minchen Yu , Wei Wang , and Feng Yan . 2019 . MArk: Exploiting Cloud Services for Cost-Effective , SLO-Aware Machine Learning Inference Serving. In USENIX Annual Technical Conference (USENIX ATC). 1049--1062 . Chengliang Zhang, Minchen Yu, Wei Wang, and Feng Yan. 2019. MArk: Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving. In USENIX Annual Technical Conference (USENIX ATC). 1049--1062."},{"key":"e_1_3_2_1_53_1","volume-title":"ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 6848--6856","author":"Zhang Xiangyu","year":"2018","unstructured":"Xiangyu Zhang , Xinyu Zhou , Mengxiao Lin , and Jian Sun . 2018 . ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 6848--6856 . Xiangyu Zhang, Xinyu Zhou, Mengxiao Lin, and Jian Sun. 2018. ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 6848--6856."},{"key":"e_1_3_2_1_54_1","volume-title":"USENIX Symposium on Operating Systems Design and Implementation OSDI. 515--532","author":"Zhao Hanyu","year":"2020","unstructured":"Hanyu Zhao , Zhenhua Han , Zhi Yang , Quanlu Zhang , Fan Yang , Lidong Zhou , Mao Yang , Francis C. M. Lau , Yuqi Wang , Yifan Xiong , and Bin Wang . 2020 . HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees . In USENIX Symposium on Operating Systems Design and Implementation OSDI. 515--532 . Hanyu Zhao, Zhenhua Han, Zhi Yang, Quanlu Zhang, Fan Yang, Lidong Zhou, Mao Yang, Francis C. M. Lau, Yuqi Wang, Yifan Xiong, and Bin Wang. 2020. HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees. In USENIX Symposium on Operating Systems Design and Implementation OSDI. 515--532."}],"event":{"name":"SIGMOD\/PODS '22: International Conference on Management of Data","sponsor":["SIGMOD ACM Special Interest Group on Management of Data"],"location":"Philadelphia Pennsylvania","acronym":"SIGMOD\/PODS '22"},"container-title":["Proceedings of the Sixth Workshop on Data Management for End-To-End Machine Learning"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3533028.3533309","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3533028.3533309","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:38Z","timestamp":1750186838000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3533028.3533309"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,6,12]]},"references-count":54,"alternative-id":["10.1145\/3533028.3533309","10.1145\/3533028"],"URL":"https:\/\/doi.org\/10.1145\/3533028.3533309","relation":{},"subject":[],"published":{"date-parts":[[2022,6,12]]},"assertion":[{"value":"2022-06-12","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}