{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,22]],"date-time":"2026-04-22T06:40:24Z","timestamp":1776840024500,"version":"3.51.2"},"reference-count":49,"publisher":"Association for Computing Machinery (ACM)","issue":"1-2","license":[{"start":{"date-parts":[[2024,2,13]],"date-time":"2024-02-13T00:00:00Z","timestamp":1707782400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Guangdong Key-Area Research and Development Program","award":["2020B010164003"],"award-info":[{"award-number":["2020B010164003"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62072451, 92267105"],"award-info":[{"award-number":["62072451, 92267105"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Science and Technology Development Fund of Macau","award":["0024\/2022\/A1"],"award-info":[{"award-number":["0024\/2022\/A1"]}]},{"DOI":"10.13039\/501100018601","name":"Guangdong Special Support Plan","doi-asserted-by":"crossref","award":["2021TQ06X990"],"award-info":[{"award-number":["2021TQ06X990"]}],"id":[{"id":"10.13039\/501100018601","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Alibaba Innovative Research Program"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Comput. Syst."],"published-print":{"date-parts":[[2024,5,31]]},"abstract":"<jats:p>A common approach to improving resource utilization in data centers is to adaptively provision resources based on the actual workload. One fundamental challenge of doing this in microservice management frameworks, however, is that different components of a service can exhibit significant differences in their impact on end-to-end performance. To make resource management more challenging, a single microservice can be shared by multiple online services that have diverse workload patterns and SLA requirements.<\/jats:p>\n          <jats:p>We present an efficient resource management system, namely Erms, for guaranteeing SLAs with high probability in shared microservice environments. Erms profiles microservice latency as a piece-wise linear function of the workload, resource usage, and interference. Based on this profiling, Erms builds resource scaling models to optimally determine latency targets for microservices with complex dependencies. Erms also designs new scheduling policies at shared microservices to further enhance resource efficiency. Experiments across microservice benchmarks as well as trace-driven simulations demonstrate that Erms can reduce SLA violation probability by 5\u00d7 and more importantly, lead to a reduction in resource usage by 1.6\u00d7, compared to state-of-the-art approaches.<\/jats:p>","DOI":"10.1145\/3631607","type":"journal-article","created":{"date-parts":[[2023,11,6]],"date-time":"2023-11-06T11:53:30Z","timestamp":1699271610000},"page":"1-28","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":21,"title":["Optimizing Resource Management for Shared Microservices: A Scalable System Design"],"prefix":"10.1145","volume":"42","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3064-5841","authenticated-orcid":false,"given":"Shutian","family":"Luo","sequence":"first","affiliation":[{"name":"University of Macau, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-9637-2847","authenticated-orcid":false,"given":"Chenyu","family":"Lin","sequence":"additional","affiliation":[{"name":"University of Macau, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6133-407X","authenticated-orcid":false,"given":"Kejiang","family":"Ye","sequence":"additional","affiliation":[{"name":"Shenzhen Institute of Advanced Technology, CAS, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1136-2678","authenticated-orcid":false,"given":"Guoyao","family":"Xu","sequence":"additional","affiliation":[{"name":"Alibaba Group, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2334-3471","authenticated-orcid":false,"given":"Liping","family":"Zhang","sequence":"additional","affiliation":[{"name":"Alibaba Group, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1908-071X","authenticated-orcid":false,"given":"Guodong","family":"Yang","sequence":"additional","affiliation":[{"name":"Alibaba Group, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6657-1154","authenticated-orcid":false,"given":"Huanle","family":"Xu","sequence":"additional","affiliation":[{"name":"University of Macau, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9480-0356","authenticated-orcid":false,"given":"Chengzhong","family":"Xu","sequence":"additional","affiliation":[{"name":"University of Macau, China"}]}],"member":"320","published-online":{"date-parts":[[2024,2,13]]},"reference":[{"key":"e_1_3_1_2_2","unstructured":"2021. Alibaba Microservices Cluster Traces.https:\/\/github.com\/alibaba\/clusterdata\/tree\/master\/cluster-trace-microservices-v2021 (2021)."},{"key":"e_1_3_1_3_2","unstructured":"2022. Jaeger. https:\/\/jaegertracing.io\/ (2022)."},{"key":"e_1_3_1_4_2","unstructured":"2022. Prometheus. https:\/\/prometheus.io\/ (2022)."},{"key":"e_1_3_1_5_2","unstructured":"Azure Cloud Container Apps. 2022. https:\/\/azure.microsoft.com\/en-us\/services\/container-apps\/ (2022)."},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1145\/3472883.3486992"},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511804441"},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","DOI":"10.1145\/3297858.3304005"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939785"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1145\/3492321.3519564"},{"key":"e_1_3_1_11_2","unstructured":"Docker containers. 2022. https:\/\/www.docker.com\/ (2022)."},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2013.6704667"},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1145\/2451116.2451125"},{"key":"e_1_3_1_14_2","volume-title":"Lecture Notes in Computer Science","author":"Dragoni Nicola","year":"2018","unstructured":"Nicola Dragoni, Ivan Lanese, Stephan Thordal Larsen, and Manuel Mazzara et al.2018. Microservices: How to make your application scale. In Lecture Notes in Computer Science."},{"key":"e_1_3_1_15_2","unstructured":"Alibaba Cloud Microservices Engine. 2022. https:\/\/www.alibabacloud.com\/product\/microservices-engine (2022)."},{"key":"e_1_3_1_16_2","unstructured":"Google Kubernetes Engine. 2022. https:\/\/cloud.google.com\/kubernetes-engine (2022)."},{"key":"e_1_3_1_17_2","volume-title":"Production-ready Microservices: Building Standardized Systems across an Engineering Organization.","author":"Fowler Susan","year":"2016","unstructured":"Susan Fowler. 2016. Production-ready Microservices: Building Standardized Systems across an Engineering Organization.O\u2019Reilly Media."},{"key":"e_1_3_1_18_2","doi-asserted-by":"publisher","DOI":"10.1145\/3445814.3446700"},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1145\/3297858.3304013"},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.1145\/3297858.3304004"},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.1145\/3366625.3368853"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDCS.2019.00197"},{"key":"e_1_3_1_23_2","article-title":"On the linear convergence of the alternating direction method of multipliers","volume":"162","author":"Hong Mingyi","year":"2016","unstructured":"Mingyi Hong and Zhi Quan Luo. 2016. On the linear convergence of the alternating direction method of multipliers. Mathematical Programming 162 (2016).","journal-title":"Mathematical Programming"},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.1145\/3302424.3303958"},{"key":"e_1_3_1_25_2","unstructured":"Kubernetes. 2022. https:\/\/kubernetes.io (2022)."},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1145\/3575693.3575751"},{"key":"e_1_3_1_27_2","doi-asserted-by":"publisher","DOI":"10.1145\/3267809.3267830"},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1145\/2749469.2749475"},{"key":"e_1_3_1_29_2","doi-asserted-by":"publisher","DOI":"10.1145\/3472883.3487003"},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2022.3174631"},{"key":"e_1_3_1_31_2","volume-title":"Proceedings of ASPLOS","author":"Luo Shutian","year":"2023","unstructured":"Shutian Luo, Huanle Xu, Kejiang Ye, Guoyao Xu, Liping Zhang, Jian He, Guodong Yang, and Chengzhong Xu. 2023. Erms: Efficient resource management for shared microservices with SLA guarantees. In Proceedings of ASPLOS."},{"key":"e_1_3_1_32_2","volume-title":"Proceedings of OSDI","author":"Mahgoub Ashraf","year":"2022","unstructured":"Ashraf Mahgoub, Edgardo Barsallo Yi, Karthick Shankar, Sameh Elnikety, Somali Chaterji, and Saurabh Bagchi. 2022. ORION and the three rights: Sizing, bundling, and prewarming for serverless DAGs. In Proceedings of OSDI."},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","DOI":"10.1145\/3472883.3486985"},{"key":"e_1_3_1_34_2","volume-title":"Proceedings of ICS","author":"Mirhosseini Amirhossein","year":"2021","unstructured":"Amirhossein Mirhosseini and Thomas F. Wenisch. 2021. \\(\\mu\\) steal: A theory-backed framework for preemptive work and resource stealing in mixed-criticality microservices. In Proceedings of ICS."},{"key":"e_1_3_1_35_2","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA47549.2020.00026"},{"key":"e_1_3_1_36_2","volume-title":"Proceedings of ATC","author":"Ortiz Jennifer","year":"2018","unstructured":"Jennifer Ortiz, Brendan Lee, Magdalena Balazinska, Johannes Gehrke, and Joseph L. Hellerstein. 2018. SLAOrchestrator: Reducing the cost of performance SLAs for cloud data analytics. In Proceedings of ATC."},{"key":"e_1_3_1_37_2","doi-asserted-by":"publisher","DOI":"10.1145\/3485983.3494866"},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA47549.2020.00025"},{"key":"e_1_3_1_39_2","volume-title":"Proceedings of OSDI","author":"Qiu Haoran","year":"2020","unstructured":"Haoran Qiu, Subho S. Banerjee, Saurabh Jha, Zbigniew T. Kalbarczyk, and Ravishankar K. Iyer. 2020. FIRM: An intelligent fine-grained resource management framework for SLO-oriented microservices. In Proceedings of OSDI."},{"key":"e_1_3_1_40_2","doi-asserted-by":"publisher","DOI":"10.1007\/BF00116251"},{"key":"e_1_3_1_41_2","article-title":"Llama: A heterogeneous & serverless framework for auto-tuning video analytics pipelines","author":"Romero Francisco","year":"2021","unstructured":"Francisco Romero, Mark Zhao, Neeraja J. Yadwadkar, and Christos Kozyrakis. 2021. Llama: A heterogeneous & serverless framework for auto-tuning video analytics pipelines. Proceedings of ACM SoCC.","journal-title":"Proceedings of ACM SoCC"},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1145\/3342195.3387524"},{"key":"e_1_3_1_43_2","volume-title":"Proceedings of OSDI","author":"Sriraman Akshitha","year":"2018","unstructured":"Akshitha Sriraman and Thomas F. Wenisch. 2018. \\(\\mu\\) Tune: Auto-tuned threading for OLDI microservices. In Proceedings of OSDI."},{"key":"e_1_3_1_44_2","unstructured":"Microservices workshop. 2022. http:\/\/www.slideshare.net\/adriancockcroft\/microservices-workshop-craft-conference\/ (2022)."},{"key":"e_1_3_1_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDCS.2019.00021"},{"key":"e_1_3_1_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICWS.2019.00023"},{"key":"e_1_3_1_47_2","doi-asserted-by":"publisher","DOI":"10.1145\/3445814.3446693"},{"key":"e_1_3_1_48_2","doi-asserted-by":"publisher","DOI":"10.1145\/3342195.3387534"},{"key":"e_1_3_1_49_2","doi-asserted-by":"publisher","DOI":"10.1145\/3267809.3267823"},{"key":"e_1_3_1_50_2","volume-title":"Proceedings of ICSE","author":"Zhou Xiang","year":"2018","unstructured":"Xiang Zhou, Xin Peng, Tao Xie, Jun Sun, Chenjie Xu, Chao Ji, and Wenyun Zhao. 2018. Poster: Benchmarking microservice systems for software engineering research. In Proceedings of ICSE."}],"container-title":["ACM Transactions on Computer Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3631607","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3631607","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:03:53Z","timestamp":1750291433000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3631607"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,2,13]]},"references-count":49,"journal-issue":{"issue":"1-2","published-print":{"date-parts":[[2024,5,31]]}},"alternative-id":["10.1145\/3631607"],"URL":"https:\/\/doi.org\/10.1145\/3631607","relation":{},"ISSN":["0734-2071","1557-7333"],"issn-type":[{"value":"0734-2071","type":"print"},{"value":"1557-7333","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,2,13]]},"assertion":[{"value":"2023-03-24","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-10-06","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-02-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}