{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T10:04:51Z","timestamp":1775815491786,"version":"3.50.1"},"reference-count":60,"publisher":"Association for Computing Machinery (ACM)","issue":"12","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2024,8]]},"abstract":"<jats:p>Autoscaling is a critical mechanism in cloud computing, enabling the autonomous adjustment of computing resources in response to dynamic workloads. This is particularly valuable for co-located, long-running applications with diverse workload patterns. The primary objective of autoscaling is to regulate resource utilization at a desired level, effectively balancing the need for resource optimization with the fulfillment of Service Level Objectives (SLOs). Many existing proactive autoscaling frameworks may encounter prediction deviations arising from the frequent fluctuations of cloud workloads. Reactive frameworks, on the other hand, rely on realtime system feedback, but their hysteretic nature could lead to violations of stringent SLOs. Hybrid frameworks, while prevalent, often feature independently functioning proactive and reactive modules, potentially leading to incompatibility and undermining the overall decision-making efficacy. In addressing these challenges, we propose OptScaler, a collaborative autoscaling framework that integrates proactive and reactive modules through an optimization module. The proactive module delivers reliable future workload predictions to the optimization module, while the reactive module offers a self-tuning estimator for real-time updates. By embedding a Model Predictive Control (MPC) mechanism and chance constraints into the optimization module, we further enhance its robustness. Numerical results have demonstrated the superiority of our workload prediction model and the collaborative framework, leading to over a 36% reduction in SLO violations compared to prevalent reactive, proactive, or hybrid autoscalers. Notably, OptScaler has been successfully deployed at Alipay, providing autoscaling support for the world-leading payment platform.<\/jats:p>","DOI":"10.14778\/3685800.3685829","type":"journal-article","created":{"date-parts":[[2024,11,8]],"date-time":"2024-11-08T17:25:21Z","timestamp":1731086721000},"page":"4090-4103","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":9,"title":["OptScaler: A Collaborative Framework for Robust Autoscaling in the Cloud"],"prefix":"10.14778","volume":"17","author":[{"given":"Ding","family":"Zou","sequence":"first","affiliation":[{"name":"Zhejiang University and Ant Group"}]},{"given":"Wei","family":"Lu","sequence":"additional","affiliation":[{"name":"Ant Group"}]},{"given":"Zhibo","family":"Zhu","sequence":"additional","affiliation":[{"name":"Ant Group"}]},{"given":"Xingyu","family":"Lu","sequence":"additional","affiliation":[{"name":"Ant Group"}]},{"given":"Jun","family":"Zhou","sequence":"additional","affiliation":[{"name":"Ant Group"}]},{"given":"Xiaojin","family":"Wang","sequence":"additional","affiliation":[{"name":"Ant Group"}]},{"given":"Kangyu","family":"Liu","sequence":"additional","affiliation":[{"name":"Ant Group"}]},{"given":"Kefan","family":"Wang","sequence":"additional","affiliation":[{"name":"Ant Group"}]},{"given":"Renen","family":"Sun","sequence":"additional","affiliation":[{"name":"Ant Group"}]},{"given":"Haiqing","family":"Wang","sequence":"additional","affiliation":[{"name":"Ant Group"}]}],"member":"320","published-online":{"date-parts":[[2024,11,8]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSC.2017.2711009"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/NOMS.2012.6211900"},{"key":"e_1_2_1_3_1","volume-title":"Retrieved","year":"2023","unstructured":"Amazon. 2023. Dynamic scaling for Amazon EC2 Auto Scaling. Retrieved June 28, 2024 from https:\/\/docs.aws.amazon.com\/autoscaling\/ec2\/userguide\/as-scale-based-on-demand.html"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","unstructured":"Sumit Bose Anjaneyulu Pasala Dheepak A Sridhar Murthy and Ganesan Malaiyandisamy. 2011. SLA Management in Cloud Computing: A Service Provider's Perspective. 413--436. 10.1002\/9780470940105.ch16","DOI":"10.1002\/9780470940105.ch16"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACSOS-C52956.2021.00073"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.14778\/3554821.3554840"},{"key":"e_1_2_1_7_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3190507","article-title":"A survey and taxonomy of self-aware and self-adaptive cloud autoscaling systems","volume":"51","author":"Chen Tao","year":"2018","unstructured":"Tao Chen, Rami Bahsoon, and Xin Yao. 2018. A survey and taxonomy of self-aware and self-adaptive cloud autoscaling systems. ACM Computing Surveys (CSUR) 51, 3 (2018), 1--40.","journal-title":"ACM Computing Surveys (CSUR)"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2882903.2903733"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11280-018-0588-8"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2017.12.004"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11227-018-2387-5"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICAC.2019.00032"},{"key":"e_1_2_1_13_1","volume-title":"DEPTS: Deep Expansion Learning for Periodic Time Series Forecasting. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=AJAR-JgNw__","author":"Fan Wei","year":"2022","unstructured":"Wei Fan, Shun Zheng, Xiaohan Yi, Wei Cao, Yanjie Fu, Jiang Bian, and Tie-Yan Liu. 2022. DEPTS: Deep Expansion Learning for Periodic Time Series Forecasting. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=AJAR-JgNw__"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2016.05.028"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASE.2018.2826723"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIC.2013.119"},{"key":"e_1_2_1_17_1","volume-title":"International Journal of control and automation 3, 4","author":"Holkar KS","year":"2010","unstructured":"KS Holkar and Laxman M Waghmare. 2010. An overview of model predictive control. International Journal of control and automation 3, 4 (2010), 47--63."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-96983-1_11"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCC.2016.66"},{"key":"e_1_2_1_20_1","volume-title":"Retrieved","author":"Jiang Xiaodong","year":"2022","unstructured":"Xiaodong Jiang, Sudeep Srivastava, Sourav Chatterjee, Yang Yu, Jeffrey Handler, Peiyi Zhang, Rohan Bopardikar, Dawei Li, Yanjun Lin, Uttam Thakore, Michael Brundage, Ginger Holt, Caner Komurlu, Rakshita Nagalla, Zhichao Wang, Hechao Sun, Peng Gao, Wei Cheung, Jun Gao, Qi Wang, Marius Guerard, Morteza Kazemi, Yulin Chen, Chong Zhou, Sean Lee, Nikolay Laptev, Tiham\u00e9r Levendovszky, Jake Taylor, Huijun Qian, Jian Zhang, Aida Shoydokova, Trisha Singh, Chengjun Zhu, Zeynep Baz, Christoph Bergmeir, Di Yu, Ahmet Koylan, Kun Jiang, Ploy Temiyasathit, and Emre Yurtbay. 2022. Kats. Retrieved June 28, 2024 from https:\/\/github.com\/facebookresearch\/Kats"},{"key":"e_1_2_1_21_1","volume-title":"International Conference on Learning Representations (ICLR).","author":"Jin Ming","year":"2024","unstructured":"Ming Jin, Shiyu Wang, Lintao Ma, Zhixuan Chu, James Y Zhang, Xiaoming Shi, Pin-Yu Chen, Yuxuan Liang, Yuan-Fang Li, Shirui Pan, and Qingsong Wen. 2024. Time-LLM: Time series forecasting by reprogramming large language models. In International Conference on Learning Representations (ICLR)."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2018.04.016"},{"key":"e_1_2_1_23_1","volume-title":"Retrieved","year":"2023","unstructured":"Kubernetes. 2023. Kubernetes Documentation \/ Concepts \/ Containers. Retrieved June 28, 2024 from https:\/\/kubernetes.io\/docs\/concepts\/containers\/"},{"key":"e_1_2_1_24_1","volume-title":"Retrieved","year":"2023","unstructured":"Kubernetes. 2023. Kubernetes Documentation \/ Reference \/ Glossary. Retrieved June 28, 2024 from https:\/\/kubernetes.io\/docs\/reference\/glossary\/?all=true#termcluster"},{"key":"e_1_2_1_25_1","volume-title":"itransformer: Inverted transformers are effective for time series forecasting. arXiv preprint arXiv:2310.06625","author":"Liu Yong","year":"2023","unstructured":"Yong Liu, Tengge Hu, Haoran Zhang, Haixu Wu, Shiyu Wang, Lintao Ma, and Mingsheng Long. 2023. itransformer: Inverted transformers are effective for time series forecasting. arXiv preprint arXiv:2310.06625 (2023)."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3542929.3563477"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1099-131X(199705)16:3<147::AID-FOR652>3.0.CO;2-X"},{"key":"e_1_2_1_28_1","volume-title":"Retrieved","year":"2023","unstructured":"Microsoft. 2023. Overview of autoscale in Azure. Retrieved June 28, 2024 from https:\/\/learn.microsoft.com\/en-us\/azure\/azure-monitor\/autoscale\/autoscale-overview"},{"key":"e_1_2_1_29_1","volume-title":"Foundations of machine learning","author":"Mohri Mehryar","unstructured":"Mehryar Mohri, Afshin Rostamizadeh, and Ameet Talwalkar. 2018. Foundations of machine learning. MIT press."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0022-2496(02)00028-7"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/UCC.2018.00010"},{"key":"e_1_2_1_32_1","volume-title":"NBEATS: Neural basis expansion analysis for interpretable time series forecasting. arXiv preprint arXiv:1905.10437","author":"Oreshkin Boris N","year":"2019","unstructured":"Boris N Oreshkin, Dmitri Carpov, Nicolas Chapados, and Yoshua Bengio. 2019. NBEATS: Neural basis expansion analysis for interpretable time series forecasting. arXiv preprint arXiv:1905.10437 (2019)."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.14778\/3611540.3611566"},{"key":"e_1_2_1_34_1","volume-title":"International Conference on Artificial Intelligence and Statistics. PMLR, 8127--8150","author":"Park Youngsuk","year":"2022","unstructured":"Youngsuk Park, Danielle Maddix, Fran\u00e7ois-Xavier Aubet, Kelvin Kan, Jan Gasthaus, and Yuyang Wang. 2022. Learning quantile functions without quantile crossing for distribution-free time series forecasting. In International Conference on Artificial Intelligence and Statistics. PMLR, 8127--8150."},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2017.2651810"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLOUD.2018.00144"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3631504.3631516"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE53745.2022.00252"},{"key":"e_1_2_1_39_1","volume-title":"14th USENIX symposium on operating systems design and implementation (OSDI 20)","author":"Qiu Haoran","year":"2020","unstructured":"Haoran Qiu, Subho S Banerjee, Saurabh Jha, Zbigniew T Kalbarczyk, and Ravishankar K Iyer. 2020. {FIRM}: An intelligent fine-grained resource management framework for {SLO-Oriented} microservices. In 14th USENIX symposium on operating systems design and implementation (OSDI 20). 805--825."},{"key":"e_1_2_1_40_1","volume-title":"2023 USENIX Annual Technical Conference (USENIX ATC 23)","author":"Qiu Haoran","year":"2023","unstructured":"Haoran Qiu, Weichao Mao, Chen Wang, Hubertus Franke, Alaa Youssef, Zbigniew T Kalbarczyk, Tamer Ba\u015far, and Ravishankar K Iyer. 2023. {AWARE}: Automate workload autoscaling with reinforcement learning in production cloud systems. In 2023 USENIX Annual Technical Conference (USENIX ATC 23). 387--402."},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2020.12.025"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLOUD.2011.42"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/3342195.3387524"},{"key":"e_1_2_1_44_1","volume-title":"2020 USENIX Annual Technical Conference (USENIX ATC 20)","author":"Shahrad Mohammad","year":"2020","unstructured":"Mohammad Shahrad, Rodrigo Fonseca, Inigo Goiri, Gohar Chaudhry, Paul Batum, Jason Cooke, Eduardo Laureano, Colby Tresness, Mark Russinovich, and Ricardo Bianchini. 2020. Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud Provider. In 2020 USENIX Annual Technical Conference (USENIX ATC 20). USENIX Association, 205--218. https:\/\/www.usenix.org\/conference\/atc20\/presentation\/shahrad"},{"key":"e_1_2_1_45_1","doi-asserted-by":"crossref","unstructured":"Alexander Shapiro Darinka Dentcheva and Andrzej Ruszczynski. 2021. Lectures on stochastic programming: modeling and theory. SIAM.","DOI":"10.1137\/1.9781611976595"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10586-020-03148-5"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/3489525.3511680"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/1342171.1342172"},{"key":"e_1_2_1_49_1","volume-title":"On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming. Mathematical programming 106","author":"W\u00e4chter Andreas","year":"2006","unstructured":"Andreas W\u00e4chter and Lorenz T Biegler. 2006. On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming. Mathematical programming 106 (2006), 25--57."},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/3616855.3635806"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2023\/695"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-30637-2_11"},{"key":"e_1_2_1_53_1","volume-title":"TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting. In International Conference on Learning Representations (ICLR).","author":"Wang Shiyu","year":"2024","unstructured":"Shiyu Wang, Haixu Wu, Xiaoming Shi, Tengge Hu, Huakun Luo, Lintao Ma, James Y Zhang, and JUN ZHOU. 2024. TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting. In International Conference on Learning Representations (ICLR)."},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDMW58026.2022.00141"},{"key":"e_1_2_1_55_1","volume-title":"Proceedings of the 39 th International Conference on Machine Learning 162","author":"Wu Haixu","year":"2022","unstructured":"Haixu Wu, Jialong Wu, Jiehui Xu, Jianmin Wang, and Mingsheng Long. 2022. Flowformer: Linearizing transformers with conservation flows. Proceedings of the 39 th International Conference on Machine Learning 162 (2022)."},{"key":"e_1_2_1_56_1","first-page":"22419","article-title":"Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting","volume":"34","author":"Wu Haixu","year":"2021","unstructured":"Haixu Wu, Jiehui Xu, Jianmin Wang, and Mingsheng Long. 2021. Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting. Advances in Neural Information Processing Systems 34 (2021), 22419--22430.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/3534678.3539063"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/JSAC.2013.SUP2.1213008"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v37i13.26852"},{"key":"e_1_2_1_60_1","first-page":"12904","article-title":"MixSeq: connecting macroscopic time series forecasting with microscopic time series data","volume":"34","author":"Zhu Zhibo","year":"2021","unstructured":"Zhibo Zhu, Ziqi Liu, Ge Jin, Zhiqiang Zhang, Lei Chen, Jun Zhou, and Jianyong Zhou. 2021. MixSeq: connecting macroscopic time series forecasting with microscopic time series data. Advances in Neural Information Processing Systems 34 (2021), 12904--12916.","journal-title":"Advances in Neural Information Processing Systems"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3685800.3685829","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,31]],"date-time":"2024-12-31T05:30:06Z","timestamp":1735623006000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3685800.3685829"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,8]]},"references-count":60,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2024,8]]}},"alternative-id":["10.14778\/3685800.3685829"],"URL":"https:\/\/doi.org\/10.14778\/3685800.3685829","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2024,8]]},"assertion":[{"value":"2024-11-08","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}