{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,8]],"date-time":"2026-03-08T21:50:18Z","timestamp":1773006618683,"version":"3.50.1"},"reference-count":46,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2026,3,8]],"date-time":"2026-03-08T00:00:00Z","timestamp":1772928000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2026,3,8]],"date-time":"2026-03-08T00:00:00Z","timestamp":1772928000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100004769","name":"Universit\u00e0 degli Studi di Pavia","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100004769","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Cluster Comput"],"published-print":{"date-parts":[[2026,6]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Cloud has become an increasingly popular computing paradigm because of its benefits in terms of scalability, flexibility and cost. To make cloud solutions competitive, it is important to exploit their elasticity and dynamically adjust cloud resources in a timely manner to cope with the incoming workloads. This paper proposes a load-aware predictive auto-scaling framework that tries to anticipate workload changes and ensure at the same time the desired utilization level of the cloud resources. To this aim, we forecast the future workloads and the cloud resources necessary to cope with these workload demands. The framework has been extensively tested in a simulation environment based on the CloudSim toolkit. Synthetic and real workloads characterized by different arrival patterns have been considered. The results showcase the effectiveness of the proposed policy in scaling cloud resources according to the predicted arrival patterns.<\/jats:p>","DOI":"10.1007\/s10586-026-05944-x","type":"journal-article","created":{"date-parts":[[2026,3,8]],"date-time":"2026-03-08T19:36:11Z","timestamp":1772998571000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Load-aware predictive auto-scaling framework for cloud environments"],"prefix":"10.1007","volume":"29","author":[{"given":"L.","family":"Zanussi","sequence":"first","affiliation":[]},{"given":"D.","family":"Tessera","sequence":"additional","affiliation":[]},{"given":"L.","family":"Massari","sequence":"additional","affiliation":[]},{"given":"B.","family":"Bermejo","sequence":"additional","affiliation":[]},{"given":"C.","family":"Juiz","sequence":"additional","affiliation":[]},{"given":"M.","family":"Calzarossa","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2026,3,8]]},"reference":[{"key":"5944_CR1","volume-title":"Cloud Computing in 2028: From Technology to Business Necessity","author":"D Smith","year":"2023","unstructured":"Smith, D.: Cloud Computing in 2028: From Technology to Business Necessity. Technical report, Gartner Research (2023)"},{"issue":"4","key":"5944_CR2","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1007\/s10723-014-9314-7","volume":"12","author":"T Lorido-Botran","year":"2014","unstructured":"Lorido-Botran, T., Miguel-Alonso, J., Lozano, J.A.: A Review of Auto-scaling Techniques for Elastic Applications in Cloud Environments. J. Grid Comput. 12(4), 559\u2013592 (2014)","journal-title":"J. Grid Comput."},{"issue":"4","key":"5944_CR3","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3148149","volume":"51","author":"C Qu","year":"2018","unstructured":"Qu, C., Calheiros, R.N., Buyya, R.: Auto-scaling Web Applications in Clouds: A Taxonomy and Survey. ACM Comput. Surv. 51(4), 1\u201333 (2018)","journal-title":"ACM Comput. Surv."},{"key":"5944_CR4","unstructured":"Jacob, B., Lanyon-Hogg, R., Nadgir, D.K., Yassin, A.F.: A practical guide to the IBM Autonomic Computing toolkit. Technical report, IBM Redbooks (2004)"},{"issue":"5","key":"5944_CR5","doi-asserted-by":"publisher","first-page":"668","DOI":"10.1016\/j.comnet.2008.10.026","volume":"53","author":"K Gilly","year":"2009","unstructured":"Gilly, K., Alcaraz, S., Juiz, C., Puigjaner, R.: Analysis of burstiness monitoring and detection in an adaptive Web system. Comput. Netw. 53(5), 668\u2013679 (2009)","journal-title":"Comput. Netw."},{"key":"5944_CR6","doi-asserted-by":"publisher","first-page":"58","DOI":"10.1016\/j.ins.2012.02.018","volume":"199","author":"K Gilly","year":"2012","unstructured":"Gilly, K., Juiz, C., Thomas, N., Puigjaner, R.: Adaptive admission control algorithm in a QoS-aware Web system. Inf. Sci. 199, 58\u201377 (2012)","journal-title":"Inf. Sci."},{"issue":"3","key":"5944_CR7","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3190507","volume":"51","author":"T Chen","year":"2018","unstructured":"Chen, T., Bahsoon, R., Yao, X.: A Survey and Taxonomy of Self-Aware and Self-Adaptive Cloud Autoscaling Systems. ACM Computing Surveys 51(3), 1\u201340 (2018)","journal-title":"ACM Computing Surveys"},{"key":"5944_CR8","doi-asserted-by":"publisher","first-page":"2425","DOI":"10.1007\/s10586-021-03265-9","volume":"24","author":"S Verma","year":"2021","unstructured":"Verma, S., Bala, A.: Auto-scaling techniques for IoT-based cloud applications: a review. Clust. Comput. 24, 2425\u20132459 (2021)","journal-title":"Clust. Comput."},{"key":"5944_CR9","first-page":"2793","volume":"45","author":"EG Radhika","year":"2021","unstructured":"Radhika, E.G., Sudha Sadasivam, G.: A review on prediction based autoscaling techniques for heterogeneous applications in cloud environment. Materials Today: Proceedings 45, 2793\u20132800 (2021)","journal-title":"Materials Today: Proceedings"},{"issue":"1","key":"5944_CR10","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1504\/IJGUC.2023.129702","volume":"14","author":"M Catillo","year":"2023","unstructured":"Catillo, M., Villano, U., Rak, M.: A survey on auto-scaling: how to exploit cloud elasticity. Int. J. Grid Util. Comput. 14(1), 37\u201350 (2023)","journal-title":"Int. J. Grid Util. Comput."},{"key":"5944_CR11","doi-asserted-by":"crossref","unstructured":"Iqbal, W., Dailey, M.N., Carrera, D.: Low Cost Quality Aware Multi-tier Application Hosting on the Amazon Cloud. In: Proc. of the Int. Conf. on Future Internet of Things and Cloud, pp. 202\u2013209 (2014)","DOI":"10.1109\/FiCloud.2014.40"},{"key":"5944_CR12","doi-asserted-by":"crossref","unstructured":"Augustyn, D.A., Warchal, L.: Metrics-based auto scaling module for Amazon Web Services cloud platform. In: Kozielski, S., Mrozek, D., Kasprowski, P., Malysiak-Mrozek, B., Kostrzewa, D. (eds.) Beyond Databases, Architectures, and Structures. Communications in Computer and Information Science, vol. 716, pp. 42\u201352 (2017)","DOI":"10.1007\/978-3-319-58274-0_4"},{"issue":"17","key":"5944_CR13","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.5667","volume":"32","author":"MC Calzarossa","year":"2020","unstructured":"Calzarossa, M.C., Massari, L., Tessera, D.: Evaluation of cloud autoscaling strategies under different incoming workload patterns. Concurrency and Computation: Practice and Experience 32(17), e5667 (2020)","journal-title":"Concurrency and Computation: Practice and Experience"},{"key":"5944_CR14","doi-asserted-by":"crossref","unstructured":"Liu, B., Buyya, R., Toosi, A.: A Fuzzy-Based Auto-scaler for Web Applications in Cloud Computing Environments. In: Pahl, C., Vukovic, M., Yin, J., Yu, Q. (eds.) Service-Oriented Computing - ICSOC 2018. Lecture Notes in Computer Science, vol. 11236, pp. 797\u2013811 (2018)","DOI":"10.1007\/978-3-030-03596-9_57"},{"key":"5944_CR15","doi-asserted-by":"crossref","unstructured":"Netto, M.A.S., Cardonha, C.H., Cunha, R.L.F., de Assun\u00e7\u00e3o, M.D.: Evaluating Auto-scaling Strategies for Cloud Computing Environments. In: Proc. of the 22nd Int. Symp. on Modeling, Analysis & Simulation of Computer and Telecommunications Systems - MASCOTS, pp. 187\u2013196 (2014)","DOI":"10.1109\/MASCOTS.2014.32"},{"issue":"4","key":"5944_CR16","doi-asserted-by":"publisher","first-page":"14","DOI":"10.1145\/2382553.2382556","volume":"30","author":"A Gandhi","year":"2012","unstructured":"Gandhi, A., Harchol-Balter, M., Raghunathan, R., Kozuch, A.: AutoScale: Dynamic, Robust Capacity Management for Multi-Tier Data Centers. ACM Transactions on Computer Systems 30(4), 14 (2012)","journal-title":"ACM Transactions on Computer Systems"},{"key":"5944_CR17","unstructured":"Gandhi, A., Dube, P., Karve, A., Kochut, A., Zhang, L.: Adaptive, Model-driven Autoscaling for Cloud Applications. In: Proc. of the 11th Int. Conf. on Autonomic Computing - ICAC\u201914, pp. 57\u201364 (2014)"},{"key":"5944_CR18","unstructured":"Gong, Z., Gu, X., Wilkes, J.: PRESS: PRedictive Elastic ReSource Scaling for cloud systems. In: Proc. of the Int. Conf. on Network and Service Management, pp. 9\u201316 (2010)"},{"key":"5944_CR19","doi-asserted-by":"crossref","unstructured":"Roy, N., Dubey, A., Gokhale, A.: Efficient autoscaling in the cloud using predictive models for workload forecasting. In: Proc. of the IEEE 4th Int. Conf. on Cloud Computing - CLOUD\u201911, pp. 500\u2013507 (2011)","DOI":"10.1109\/CLOUD.2011.42"},{"key":"5944_CR20","doi-asserted-by":"crossref","unstructured":"Nikravesh, A.Y., Ajila, S.A., Lung, C.-H.: An autonomic prediction suite for cloud resource provisioning. Journal of Cloud Computing: Advances, Systems and Applications 6(3), (2017)","DOI":"10.1186\/s13677-017-0073-4"},{"key":"5944_CR21","doi-asserted-by":"publisher","first-page":"509","DOI":"10.1007\/s10270-017-0584-y","volume":"17","author":"A Gandhi","year":"2018","unstructured":"Gandhi, A., Dube, P., Karve, A., Kochut, A., Zhang, L.: Model-driven optimal resource scaling in cloud. Software & Systems Modeling 17, 509\u2013526 (2018)","journal-title":"Software & Systems Modeling"},{"key":"5944_CR22","doi-asserted-by":"crossref","unstructured":"Qian, H., Wen, Q., Sun, L., Gu, J., Niu, Q., Tang, Z.: Robustscaler: QoS-aware autoscaling for complex workloads. In: Proc. of the IEEE 38th Int. Conf. on Data Engineering - ICDE, pp. 2762\u20132775 (2022)","DOI":"10.1109\/ICDE53745.2022.00252"},{"key":"5944_CR23","doi-asserted-by":"publisher","first-page":"173","DOI":"10.1016\/j.future.2023.05.017","volume":"148","author":"S Chouliaras","year":"2023","unstructured":"Chouliaras, S., Sotiriadis, S.: An adaptive auto-scaling framework for cloud resource provisioning. Futur. Gener. Comput. Syst. 148, 173\u2013183 (2023)","journal-title":"Futur. Gener. Comput. Syst."},{"key":"5944_CR24","doi-asserted-by":"crossref","unstructured":"Hang, H., Tang, X., Sun, J., Bao, L., Lo, D., Wang, H.: Robust auto-scaling with probabilistic workload forecasting for cloud databases. In: Proc. of the IEEE 40th Int. Conf. on Data Engineering - ICDE, pp. 4016\u20134029 (2024)","DOI":"10.1109\/ICDE60146.2024.00308"},{"key":"5944_CR25","doi-asserted-by":"crossref","unstructured":"Calzarossa, M.C., Della\u00a0Vedova, M.L., Massari, L., Petcu, D., Tabash, M.I.M., Tessera, D.: Workloads in the Clouds. In: Fiondella, L., Puliafito, A. (eds.) Principles of Performance and Reliability Modeling and Evaluation. Springer Series in Reliability Engineering, pp. 525\u2013550 (2016)","DOI":"10.1007\/978-3-319-30599-8_20"},{"issue":"3","key":"5944_CR26","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/2856127","volume":"48","author":"MC Calzarossa","year":"2016","unstructured":"Calzarossa, M.C., Massari, L., Tessera, D.: Workload characterization: A survey revisited. ACM Computing Surveys 48(3), 1\u201343 (2016)","journal-title":"ACM Computing Surveys"},{"key":"5944_CR27","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1016\/j.future.2017.10.047","volume":"81","author":"J Kumar","year":"2018","unstructured":"Kumar, J., Singh, A.K.: Workload prediction in cloud using artificial neural network and adaptive differential evolution. Futur. Gener. Comput. Syst. 81, 41\u201352 (2018)","journal-title":"Futur. Gener. Comput. Syst."},{"key":"5944_CR28","doi-asserted-by":"crossref","unstructured":"Calzarossa, M.C., Della Vedova, M.L., Massari, L., Nebbione, G., Tessera, D.: Modeling and predicting dynamics of heterogeneous workloads for cloud environments. In: Proc. of the IEEE Symposium on Computers and Communications - ISCC (2019)","DOI":"10.1109\/ISCC47284.2019.8969761"},{"issue":"4","key":"5944_CR29","doi-asserted-by":"publisher","first-page":"2399","DOI":"10.1007\/s10586-019-03010-3","volume":"23","author":"M Masdari","year":"2020","unstructured":"Masdari, M., Khoshnevis, A.: A survey and classification of the workload forecasting methods in cloud computing. Clust. Comput 23(4), 2399\u20132424 (2020)","journal-title":"Clust. Comput"},{"key":"5944_CR30","doi-asserted-by":"crossref","unstructured":"Fernandez, H., Pierre, G., Kielmann, T.: Autoscaling Web Applications in Heterogeneous Cloud Infrastructures. In: Proc. of the IEEE Int. Conf. on Cloud Engineering - IC2E, pp. 195\u2013204 (2014)","DOI":"10.1109\/IC2E.2014.25"},{"issue":"6","key":"5944_CR31","doi-asserted-by":"publisher","first-page":"871","DOI":"10.1016\/j.future.2010.10.016","volume":"27","author":"W Iqbal","year":"2011","unstructured":"Iqbal, W., Dailey, M.N., Carrera, D., Janecek, P.: Adaptive resource provisioning for read intensive multi-tier applications in the cloud. Futur. Gener. Comput. Syst. 27(6), 871\u2013879 (2011)","journal-title":"Futur. Gener. Comput. Syst."},{"key":"5944_CR32","doi-asserted-by":"crossref","unstructured":"Ali-Eldin, A., Tordsson, J., Elmroth, E.: An adaptive hybrid elasticity controller for cloud infrastructures. In: Proc. of the IEEE Network Operations and Management Symposium, pp. 204\u2013212 (2012)","DOI":"10.1109\/NOMS.2012.6211900"},{"key":"5944_CR33","doi-asserted-by":"publisher","first-page":"56","DOI":"10.1016\/j.future.2020.12.025","volume":"118","author":"V Ramp\u00e9rez","year":"2021","unstructured":"Ramp\u00e9rez, V., Soriano, J., Lizcano, D., Lara, J.A.: FLAS: A combination of proactive and reactive auto-scaling architecture for distributed services. Futur. Gener. Comput. Syst. 118, 56\u201372 (2021)","journal-title":"Futur. Gener. Comput. Syst."},{"issue":"1","key":"5944_CR34","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/1342171.1342172","volume":"3","author":"B Urgaonkar","year":"2008","unstructured":"Urgaonkar, B., Shenoy, P., Chandra, A., Goyal, P., Wood, T.: Agile dynamic provisioning of multi-tier internet applications. ACM Trans. Auton. Adapt. Syst. 3(1), 1\u201339 (2008)","journal-title":"ACM Trans. Auton. Adapt. Syst."},{"key":"5944_CR35","doi-asserted-by":"crossref","unstructured":"Bouabdallah, R., Lajmi, S., Ghedira, K.: Use of Reactive and Proactive Elasticity to Adjust Resources Provisioning in the Cloud Provider. Proc. of the IEEE 18th Int. Conf. on High Performance Computing and Communications; IEEE 14th Int. Conf. on Smart City; IEEE 2nd Int. Conf. on Data Science and Systems - HPCC\/SmartCity\/DSS, pp. 1155\u20131162 (2016)","DOI":"10.1109\/HPCC-SmartCity-DSS.2016.0162"},{"key":"5944_CR36","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.jpdc.2018.04.016","volume":"120","author":"BB J.V.","year":"2018","unstructured":"J.V., B.B., Dharma, D.: HAS: Hybrid auto-scaler for resource scaling in cloud environment. Journal of Parallel and Distributed Computing 120, 1\u201315 (2018)","journal-title":"Journal of Parallel and Distributed Computing"},{"key":"5944_CR37","doi-asserted-by":"crossref","unstructured":"Abdullah, M., Iqbal, W., Erradi, A., Bukhari, F.: Learning Predictive Autoscaling Policies for Cloud-Hosted Microservices Using Trace-Driven Modeling. In: Proc. of the IEEE Int. Conf. on Cloud Computing Technology and Science - CloudCom, pp. 119\u2013126 (2019)","DOI":"10.1109\/CloudCom.2019.00028"},{"issue":"2","key":"5944_CR38","doi-asserted-by":"publisher","first-page":"717","DOI":"10.1007\/s10586-020-03148-5","volume":"24","author":"P Singh","year":"2021","unstructured":"Singh, P., Kaur, A., Gupta, P., Gill, S.S., Jyoti, K.: RHAS: robust hybrid auto-scaling for web applications in cloud computing. Clust. Comput 24(2), 717\u2013737 (2021)","journal-title":"Clust. Comput"},{"issue":"9","key":"5944_CR39","doi-asserted-by":"publisher","first-page":"26369","DOI":"10.1007\/s11042-023-16587-0","volume":"83","author":"NS Joshi","year":"2024","unstructured":"Joshi, N.S., Raghuwanshi, R., Agarwal, Y.M., Annappa, B., Sachin, D.N.: ARIMA-PID: container auto scaling based on predictive analysis and control theory. Multimedia Tools and Applications 83(9), 26369\u201326386 (2024)","journal-title":"Multimedia Tools and Applications"},{"issue":"12","key":"5944_CR40","doi-asserted-by":"publisher","first-page":"4090","DOI":"10.14778\/3685800.3685829","volume":"17","author":"D Zou","year":"2024","unstructured":"Zou, D., Lu, W., Zhu, Z., Lu, X., Zhou, J., Wang, X., Liu, K., Wang, K., Sun, R., Wang, H.: Optscaler: A collaborative framework for robust autoscaling in the cloud. Proceedings of the VLDB Endowment 17(12), 4090\u20134103 (2024)","journal-title":"Proceedings of the VLDB Endowment"},{"key":"5944_CR41","unstructured":"Amazon Web Services, Autoscaling Documentation. https:\/\/docs.aws.amazon.com\/autoscaling\/ [Accessed: April 30th, 2025]"},{"key":"5944_CR42","unstructured":"Microsoft Azure, Autoscale. https:\/\/learn.microsoft.com\/en-us\/azure\/azure-monitor\/autoscale\/autoscale-overview [Accessed: April 30th, 2025]"},{"key":"5944_CR43","unstructured":"Google Cloud Platform, Autoscale to maintain a metric at a target value. https:\/\/cloud.google.com\/compute\/docs\/autoscaler\/ [Accessed: April 30th, 2025]"},{"key":"5944_CR44","volume-title":"Quantitative System Performance: Computer System Analysis Using Queueing Network Models","author":"ED Lazowska","year":"1984","unstructured":"Lazowska, E.D., Zahorjan, J., Graham, G.S., Sevcik, K.C.: Quantitative System Performance: Computer System Analysis Using Queueing Network Models. Prentice-Hall, Englewood Cliffs, New Jersey (1984)"},{"issue":"1","key":"5944_CR45","first-page":"23","volume":"41","author":"RN Calheiros","year":"2011","unstructured":"Calheiros, R.N., Ranjan, R., Beloglazov, A., De Rose, C.A.F., Buyya, R.: CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms. Software: Practice and Experience 41(1), 23\u201350 (2011)","journal-title":"Software: Practice and Experience"},{"key":"5944_CR46","doi-asserted-by":"publisher","first-page":"81600","DOI":"10.1109\/ACCESS.2023.3300956","volume":"11","author":"C Juiz","year":"2023","unstructured":"Juiz, C., Capo, B., Bermejo, B., Fern\u00e1ndez-Montes, A., Fern\u00e1ndez-Cerero, D.: A Case Study of Transactional Workload Running in Virtual Machines: The Performance Evaluation of a Flight Seats Availability Service. IEEE Access 11, 81600\u201381612 (2023)","journal-title":"IEEE Access"}],"container-title":["Cluster Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10586-026-05944-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10586-026-05944-x","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10586-026-05944-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,8]],"date-time":"2026-03-08T19:36:16Z","timestamp":1772998576000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10586-026-05944-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,3,8]]},"references-count":46,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2026,6]]}},"alternative-id":["5944"],"URL":"https:\/\/doi.org\/10.1007\/s10586-026-05944-x","relation":{},"ISSN":["1386-7857","1573-7543"],"issn-type":[{"value":"1386-7857","type":"print"},{"value":"1573-7543","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,3,8]]},"assertion":[{"value":"8 May 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 September 2025","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 January 2026","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 March 2026","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"197"}}