{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T09:02:51Z","timestamp":1775638971971,"version":"3.50.1"},"reference-count":40,"publisher":"Association for Computing Machinery (ACM)","issue":"12","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2024,8]]},"abstract":"<jats:p>Amazon Aurora Serverless is an on-demand, autoscaling configuration for Amazon Aurora with full MySQL and PostgreSQL compatibility. It automatically offers capacity scale-up\/down (i.e., vertical scaling) based on a customer database application's needs. For customers with time-varying workloads, it offers cost savings compared to provisioned Aurora or other alternatives due to its agile and granular scaling and its usage-based charging model. This paper describes the key ideas underlying Aurora Serverless's resource management. To help meet its goals, Aurora Serverless adapts and fine tunes well-established ideas related to resource over-subscription; reactive control informed by recent measurements; distributed &amp; hierarchical decision-making; and innovations in the DB engine, OS, and hypervisor for efficiency. Perhaps the most challenging goal is to offer a consistent resource elasticity experience while operating hosts at high degrees of utilization. Aurora Serverless implements several novel ideas for striking a balance between these opposing needs. Its technique for mapping workloads to hosts ensures that, in the common case, there is adequate spare capacity within a host to support fast scale-up for a workload. In the rare event this is not so, it live migrates workloads to ensure seamless scale-up. Its load distribution strategy is characterized by \"unbalancing\" of load across hosts to enable agile live migrations. 
Finally, it employs a token bucket-based rate regulation mechanism to prevent a growing workload from saturating its host faster than live migration-based remedial actions.<\/jats:p>","DOI":"10.14778\/3685800.3685825","type":"journal-article","created":{"date-parts":[[2024,11,8]],"date-time":"2024-11-08T17:25:21Z","timestamp":1731086721000},"page":"4038-4050","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Resource Management in Aurora Serverless"],"prefix":"10.14778","volume":"17","author":[{"given":"Bradley","family":"Barnhart","sequence":"first","affiliation":[{"name":"Amazon Web Services"}]},{"given":"Marc","family":"Brooker","sequence":"additional","affiliation":[{"name":"Amazon Web Services"}]},{"given":"Daniil","family":"Chinenkov","sequence":"additional","affiliation":[{"name":"Amazon Web Services"}]},{"given":"Tony","family":"Hooper","sequence":"additional","affiliation":[{"name":"Amazon Web Services"}]},{"given":"Jihoun","family":"Im","sequence":"additional","affiliation":[{"name":"Amazon Web Services"}]},{"given":"Prakash Chandra","family":"Jha","sequence":"additional","affiliation":[{"name":"Amazon Web Services"}]},{"given":"Tim","family":"Kraska","sequence":"additional","affiliation":[{"name":"Amazon Web Services and MIT"}]},{"given":"Ashok","family":"Kurakula","sequence":"additional","affiliation":[{"name":"Amazon Web Services"}]},{"given":"Alexey","family":"Kuznetsov","sequence":"additional","affiliation":[{"name":"Amazon Web Services"}]},{"given":"Grant","family":"McAlister","sequence":"additional","affiliation":[{"name":"Amazon Web Services"}]},{"given":"Arjun","family":"Muthukrishnan","sequence":"additional","affiliation":[{"name":"Amazon Web Services"}]},{"given":"Aravinthan","family":"Narayanan","sequence":"additional","affiliation":[{"name":"Amazon Web Services"}]},{"given":"Douglas","family":"Terry","sequence":"additional","affiliation":[{"name":"Amazon Web 
Services"}]},{"given":"Bhuvan","family":"Urgaonkar","sequence":"additional","affiliation":[{"name":"Amazon Web Services and Penn State"}]},{"given":"Jiaming","family":"Yan","sequence":"additional","affiliation":[{"name":"Amazon Web Services"}]}],"member":"320","published-online":{"date-parts":[[2024,11,8]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"2021. DAMON-based Reclamation. https:\/\/lore.kernel.org\/20211019150731.16699-16-sj@kernel.org\/."},{"key":"e_1_2_1_2_1","unstructured":"2024. Amazon Aurora. https:\/\/aws.amazon.com\/rds\/aurora\/."},{"key":"e_1_2_1_3_1","unstructured":"2024. Amazon Aurora Pricing. https:\/\/aws.amazon.com\/rds\/aurora\/pricing\/."},{"key":"e_1_2_1_4_1","unstructured":"2024. Amazon Aurora Serverless. https:\/\/aws.amazon.com\/rds\/aurora\/serverless\/."},{"key":"e_1_2_1_5_1","unstructured":"2024. Aurora Serverless v2 Capacity. https:\/\/docs.aws.amazon.com\/AmazonRDS\/latest\/AuroraUserGuide\/aurora-serverless-v2.how-it-works.html#aurora-serverless-v2.how-it-works.capacity."},{"key":"e_1_2_1_6_1","unstructured":"2024. AWS EC2 Regions and Zones. https:\/\/docs.aws.amazon.com\/AWSEC2\/latest\/UserGuide\/using-regions-availability-zones.html."},{"key":"e_1_2_1_7_1","unstructured":"2024. AWS Nitro System: A combination of dedicated hardware and lightweight hypervisor enabling faster innovation and enhanced security. https:\/\/aws.amazon.com\/ec2\/nitro\/."},{"key":"e_1_2_1_8_1","unstructured":"2024. Control groups. https:\/\/docs.kernel.org\/admin-guide\/cgroup-v1\/cgroups.html."},{"key":"e_1_2_1_9_1","unstructured":"2024. Scriptable database and system performance benchmark. https:\/\/github.com\/akopytov\/sysbench."},{"key":"e_1_2_1_10_1","volume-title":"Firecracker: Lightweight Virtualization for Serverless Applications. 
In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI 20)","author":"Agache Alexandru","year":"2020","unstructured":"Alexandru Agache, Marc Brooker, Alexandra Iordache, Anthony Liguori, Rolf Neugebauer, Phil Piwonka, and Diana-Maria Popa. 2020. Firecracker: Lightweight Virtualization for Serverless Applications. In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI 20). USENIX Association, Santa Clara, CA, 419--434. https:\/\/www.usenix.org\/conference\/nsdi20\/presentation\/agache"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/65.844498"},{"key":"e_1_2_1_12_1","volume-title":"Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems","author":"Martin","year":"1996","unstructured":"Martin F. Arlitt and Carey L. Williamson. 1996. Web Server Workload Characterization: The Search for Invariants. In Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, Philadelphia, Pennsylvania, USA, May 23--26, 1996, Daniel A. Reed and Blaine D. Gaither (Eds.). ACM, 126--137."},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLOUD.2014.78"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2180861.2180865"},{"key":"e_1_2_1_15_1","doi-asserted-by":"crossref","unstructured":"J. Heinanen and R. Guerin. 1999. RFC 2697: A Single Rate Three Color Marker. https:\/\/datatracker.ietf.org\/doc\/html\/rfc2697.","DOI":"10.17487\/rfc2697"},{"key":"e_1_2_1_16_1","volume-title":"Proceedings of IEEE International Conference on Communications, ICC 2012","author":"David","year":"2012","unstructured":"David A. Hoeflin and Paul Reeser. 2012. Quantifying the performance impact of overbooking virtualized resources. In Proceedings of IEEE International Conference on Communications, ICC 2012, Ottawa, ON, Canada, June 10--15, 2012. 
IEEE, 5523--5527."},{"key":"e_1_2_1_17_1","volume-title":"A Robust AFPTAS for Online Bin Packing with Polynomial Migration. CoRR abs\/1302.4213","author":"Jansen Klaus","year":"2013","unstructured":"Klaus Jansen and Kim-Manuel Klein. 2013. A Robust AFPTAS for Online Bin Packing with Polynomial Migration. CoRR abs\/1302.4213 (2013). arXiv:1302.4213 http:\/\/arxiv.org\/abs\/1302.4213"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3514221.3526047"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDEW.2007.4401022"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2882783"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.14778\/3551793.3551855"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3341302.3342080"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1007\/s002240000122"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10586-006-9742-7"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3342195.3387524"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:CLUS.0000018564.05723.a2"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNET.2018.2823642"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCC.2017.2737628"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2022.3172493"},{"key":"e_1_2_1_30_1","volume-title":"ACM Symposium on Cloud Computing in conjunction with SOSP 2011, SOCC '11","author":"Shen Zhiming","year":"2011","unstructured":"Zhiming Shen, Sethuraman Subbiah, Xiaohui Gu, and John Wilkes. 2011. Cloud-Scale: elastic resource scaling for multi-tenant cloud systems. In ACM Symposium on Cloud Computing in conjunction with SOSP 2011, SOCC '11, Cascais, Portugal, October 26--28, 2011. 
ACM, 5."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3366623.3368136"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/IWQoS.2017.7969135"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/1064212.1064252"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2004.1264781"},{"key":"e_1_2_1_35_1","volume-title":"Dynamic Provisioning of Multi-tier Internet Applications. In Second International Conference on Autonomic Computing (ICAC 2005","author":"Urgaonkar Bhuvan","year":"2005","unstructured":"Bhuvan Urgaonkar, Prashant J. Shenoy, Abhishek Chandra, and Pawan Goyal. 2005. Dynamic Provisioning of Multi-tier Internet Applications. In Second International Conference on Autonomic Computing (ICAC 2005), 13--16 June 2005, Seattle, WA, USA. IEEE Computer Society, 217--228."},{"key":"e_1_2_1_36_1","volume-title":"Resource Overbooking and Application Profiling in Shared Hosting Platforms. In 5th Symposium on Operating System Design and Implementation (OSDI 2002","author":"Urgaonkar Bhuvan","year":"2002","unstructured":"Bhuvan Urgaonkar, Prashant J. Shenoy, and Timothy Roscoe. 2002. Resource Overbooking and Application Profiling in Shared Hosting Platforms. In 5th Symposium on Operating System Design and Implementation (OSDI 2002), Boston, Massachusetts, USA, December 9--11, 2002."},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3035918.3056101"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.comnet.2009.04.014"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3274808.3274820"},{"key":"e_1_2_1_40_1","volume-title":"Live Migration of Multiple Virtual Machines with Resource Reservation in Cloud Computing Environments. In IEEE International Conference on Cloud Computing, CLOUD 2011","author":"Ye Kejiang","year":"2011","unstructured":"Kejiang Ye, Xiaohong Jiang, Dawei Huang, Jianhai Chen, and Bei Wang. 2011. 
Live Migration of Multiple Virtual Machines with Resource Reservation in Cloud Computing Environments. In IEEE International Conference on Cloud Computing, CLOUD 2011, Washington, DC, USA, 4--9 July, 2011, Ling Liu and Manish Parashar (Eds.). IEEE Computer Society, 267--274."}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3685800.3685825","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,31]],"date-time":"2024-12-31T05:33:07Z","timestamp":1735623187000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3685800.3685825"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,8]]},"references-count":40,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2024,8]]}},"alternative-id":["10.14778\/3685800.3685825"],"URL":"https:\/\/doi.org\/10.14778\/3685800.3685825","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2024,8]]},"assertion":[{"value":"2024-11-08","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}