{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,21]],"date-time":"2026-03-21T19:23:22Z","timestamp":1774121002736,"version":"3.50.1"},"reference-count":41,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2023,7,19]],"date-time":"2023-07-19T00:00:00Z","timestamp":1689724800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Key-Area Research and Development Program of Guangdong Province","award":["2021B0101310002"],"award-info":[{"award-number":["2021B0101310002"]}]},{"DOI":"10.13039\/501100001809","name":"National Science Foundation of China","doi-asserted-by":"crossref","award":["62293510\/62293513, 62272252, 62272253, 62141412"],"award-info":[{"award-number":["62293510\/62293513, 62272252, 62272253, 62141412"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"NSF of Tianjin","award":["17JCYBJC15300"],"award-info":[{"award-number":["17JCYBJC15300"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Archit. Code Optim."],"published-print":{"date-parts":[[2023,9,30]]},"abstract":"<jats:p>\n            Colocating multiple jobs on the same server has been widely applied for improving resource utilization in cloud datacenters. However, the colocated jobs would contend for the shared resources, which could lead to significant performance degradation. An efficient approach for eliminating performance interference is to partition the shared resources among the colocated jobs. However, this makes the resource management in datacenters very challenging. In this paper, we propose JointOPT, the first resource management framework that optimizes job assignment and resource partitioning jointly for improving the throughput of cloud datacenters. JointOPT uses a local search based algorithm to find the near optimal job assignment configuration, and uses a\n            <jats:bold>deep reinforcement learning (DRL)<\/jats:bold>\n            based approach to dynamically partition the shared resources among the colocated jobs. In order to reduce the interaction overhead with real systems, it leverages deep learning to estimate job performance without running them on real servers. We conduct extensive experiments to evaluate JointOPT and the results show that JointOPT significantly outperforms the state-of-the-art baselines, with an advantage from 13.3% to 47.7%.\n          <\/jats:p>","DOI":"10.1145\/3593055","type":"journal-article","created":{"date-parts":[[2023,4,18]],"date-time":"2023-04-18T12:37:50Z","timestamp":1681821470000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Jointly Optimizing Job Assignment and Resource Partitioning for Improving System Throughput in Cloud Datacenters"],"prefix":"10.1145","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4660-0883","authenticated-orcid":false,"given":"Ruobing","family":"Chen","sequence":"first","affiliation":[{"name":"SysNet of Nankai University, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6454-4088","authenticated-orcid":false,"given":"Haosen","family":"Shi","sequence":"additional","affiliation":[{"name":"SysNet of Nankai University, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-6567-4774","authenticated-orcid":false,"given":"Jinping","family":"Wu","sequence":"additional","affiliation":[{"name":"SysNet of Nankai University, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6623-350X","authenticated-orcid":false,"given":"Yusen","family":"Li","sequence":"additional","affiliation":[{"name":"SysNet of Nankai University, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9010-3278","authenticated-orcid":false,"given":"Xiaoguang","family":"Liu","sequence":"additional","affiliation":[{"name":"SysNet of Nankai University, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0387-2501","authenticated-orcid":false,"given":"Gang","family":"Wang","sequence":"additional","affiliation":[{"name":"SysNet of Nankai University, China"}]}],"member":"320","published-online":{"date-parts":[[2023,7,19]]},"reference":[{"key":"e_1_3_1_2_2","unstructured":"2006. The Python Performance Benchmark Suite. https:\/\/pyperformance.readthedocs.io\/."},{"key":"e_1_3_1_3_2","unstructured":"2006. The SPEC CPU\u00ae 2006 Benchmark Suite. https:\/\/www.spec.org\/cpu2006\/."},{"key":"e_1_3_1_4_2","unstructured":"2017. The SPEC CPU\u00ae 2017 Benchmark Suite. https:\/\/www.spec.org\/cpu2017\/."},{"key":"e_1_3_1_5_2","unstructured":"2020. perf: Linux Profiling with Performance Counters. https:\/\/perf.wiki.kernel.org\/index.php\/."},{"key":"e_1_3_1_6_2","unstructured":"2020. Random Forests. https:\/\/www.stat.berkeley.edu\/breiman\/RandomForests\/cc_home.htm."},{"key":"e_1_3_1_7_2","unstructured":"Andrew J. Herdrich Khawar M. Abbasi and Marcel D. Cornu. 2019. Introduction to Memory Bandwidth Allocation. https:\/\/software.intel.com\/en-us\/articles\/introduction-to-memory-bandwidth-allocation."},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","DOI":"10.1109\/INFOCOM.2019.8737460"},{"key":"e_1_3_1_9_2","first-page":"175","volume-title":"Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing","author":"Chen Ruobing","year":"2020","unstructured":"Ruobing Chen, Jinping Wu, Haosen Shi, Yusen Li, Xiaoguang Liu, and Gang Wang. 2020. DRLPart: A deep reinforcement learning framework for optimally efficient and robust resource partitioning on commodity servers. In Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing. 175\u2013188."},{"key":"e_1_3_1_10_2","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1145\/3297858.3304005","article-title":"PARTIES: QoS-aware resource partitioning for multiple interactive services","author":"Chen Shuang","year":"2019","unstructured":"Shuang Chen, Christina Delimitrou, and F. Jos\u00e9 Mart\u00ednez. 2019. PARTIES: QoS-aware resource partitioning for multiple interactive services. Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS) (2019), 107\u2013120.","journal-title":"Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)"},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.sysarc.2016.06.006"},{"key":"e_1_3_1_12_2","first-page":"77","volume-title":"Proceedings of the 18th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS\u201913)","volume":"48","author":"Delimitrou Christina","year":"2013","unstructured":"Christina Delimitrou and Christos Kozyrakis. 2013. Paragon: QoS-aware scheduling for heterogeneous datacenters. In Proceedings of the 18th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS\u201913), Vol. 48. ACM, 77\u201388."},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1145\/2644865.2541941"},{"key":"e_1_3_1_14_2","doi-asserted-by":"crossref","first-page":"492","DOI":"10.1109\/HPCA.2019.00061","article-title":"Poise: Balancing thread-level parallelism and memory system performance in GPUs using machine learning","author":"Dublish Saumay","year":"2019","unstructured":"Saumay Dublish, Vijay Nagarajan, and Nigel Topham. 2019. Poise: Balancing thread-level parallelism and memory system performance in GPUs using machine learning. 2019 IEEE International Symposium on High Performance Computer Architecture (HPCA) (2019), 492\u2013505.","journal-title":"2019 IEEE International Symposium on High Performance Computer Architecture (HPCA)"},{"key":"e_1_3_1_15_2","doi-asserted-by":"crossref","first-page":"104","DOI":"10.1109\/HPCA.2018.00019","volume-title":"2018 IEEE International Symposium on High Performance Computer Architecture (HPCA\u201918)","author":"El-Sayed Nosayba","year":"2018","unstructured":"Nosayba El-Sayed, Anurag Mukkara, Po-An Tsai, Harshad Kasture, Xiaosong Ma, and Daniel Sanchez. 2018. KPart: A hybrid cache partitioning-sharing technique for commodity multicores. In 2018 IEEE International Symposium on High Performance Computer Architecture (HPCA\u201918). IEEE, 104\u2013117."},{"key":"e_1_3_1_16_2","first-page":"1662","volume-title":"International Conference on Machine Learning","author":"Gao Yuanxiang","year":"2018","unstructured":"Yuanxiang Gao, Li Chen, and Baochun Li. 2018. Spotlight: Optimizing device placement for training deep neural networks. In International Conference on Machine Learning. 1662\u20131670."},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.iot.2022.100514"},{"key":"e_1_3_1_18_2","first-page":"800","volume-title":"Design, Automation & Test in Europe Conference & Exhibition (DATE\u201917)","author":"Jain Rahul","year":"2017","unstructured":"Rahul Jain, Preeti Ranjan Panda, and Sreenivas Subramoney. 2017. A coordinated multi-agent reinforcement learning approach to multi-level cache co-partitioning. In Design, Automation & Test in Europe Conference & Exhibition (DATE\u201917). IEEE, 800\u2013805."},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCC.2019.2920914"},{"key":"e_1_3_1_20_2","first-page":"1842","volume-title":"IEEE INFOCOM 2014","author":"Li Xin","year":"2014","unstructured":"Xin Li, Jie Wu, Shaojie Tang, and Sanglu Lu. 2014. Let\u2019s stay together: Towards traffic aware virtual machine placement in data centers. In IEEE INFOCOM 2014. 1842\u20131850."},{"key":"e_1_3_1_21_2","first-page":"101","article-title":"Random forests and adaptive nearest neighbors","author":"Lin Yi","year":"2002","unstructured":"Yi Lin and Yongho Jeon. 2002. Random forests and adaptive nearest neighbors. Journal of the American Statistical Association (2002), 101\u2013474.","journal-title":"Journal of the American Statistical Association"},{"key":"e_1_3_1_22_2","first-page":"301","volume-title":"2014 ACM\/IEEE 41st International Symposium on Computer Architecture (ISCA\u201914)","author":"Lo David","year":"2014","unstructured":"David Lo, Liqun Cheng, Rama Govindaraju, Luiz Andr\u00e9 Barroso, and Christos Kozyrakis. 2014. Towards energy proportionality for large-scale latency-critical workloads. In 2014 ACM\/IEEE 41st International Symposium on Computer Architecture (ISCA\u201914). 301\u2013312."},{"key":"e_1_3_1_23_2","first-page":"450","volume-title":"International Symposium on Computer Architecture (ISCA\u201915)","author":"Lo David","year":"2015","unstructured":"David Lo, Liqun Cheng, Rama Govindaraju, Parthasarathy Ranganathan, and Christos Kozyrakis. 2015. Heracles: Improving resource efficiency at scale. In International Symposium on Computer Architecture (ISCA\u201915), Vol. 43. ACM, 450\u2013462."},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.1145\/2155620.2155650"},{"key":"e_1_3_1_25_2","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1109\/ICAC.2017.29","volume-title":"2017 IEEE International Conference on Autonomic Computing (ICAC\u201917)","author":"Mishra Nikita","year":"2017","unstructured":"Nikita Mishra, John D. Lafferty, and Henry Hoffmann. 2017. ESP: A machine learning approach to predicting application interference. In 2017 IEEE International Conference on Autonomic Computing (ICAC\u201917). 125\u2013134."},{"key":"e_1_3_1_26_2","volume-title":"ICML","author":"Nair Vinod","year":"2010","unstructured":"Vinod Nair and Geoffrey E. Hinton. 2010. Rectified linear units improve restricted Boltzmann machines. In ICML."},{"key":"e_1_3_1_27_2","unstructured":"Khang T. Nguyen. 2019. Introduction to Cache Allocation Technology in the Intel \u00ae Xeon \u00ae Processor E5 v4 Family. https:\/\/software.intel.com\/en-us\/articles\/introduction-to-cache-allocation-technology\/."},{"key":"e_1_3_1_28_2","first-page":"15","volume-title":"Proceedings of the 48th International Conference on Parallel Processing","author":"Nikas Konstantinos","year":"2019","unstructured":"Konstantinos Nikas, Nikela Papadopoulou, Dimitra Giantsidi, Vasileios Karakostas, Georgios Goumas, and Nectarios Koziris. 2019. DICER: Diligent cache partitioning for efficient workload consolidation. In Proceedings of the 48th International Conference on Parallel Processing. 15."},{"key":"e_1_3_1_29_2","first-page":"1","volume-title":"Proceedings of the Fourteenth EuroSys Conference 2019","author":"Park Jinsu","year":"2019","unstructured":"Jinsu Park, Seongbeom Park, and Woongki Baek. 2019. CoPart: Coordinated partitioning of last-level cache and memory bandwidth for fairness-aware workload consolidation on commodity servers. In Proceedings of the Fourteenth EuroSys Conference 2019. 1\u201310."},{"key":"e_1_3_1_30_2","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1109\/HPCA47549.2020.00025","volume-title":"2020 IEEE International Symposium on High Performance Computer Architecture (HPCA\u201920)","author":"Patel T.","year":"2020","unstructured":"T. Patel and D. Tiwari. 2020. CLITE: Efficient and QoS-aware co-location of multiple latency-critical jobs for warehouse scale computers. In 2020 IEEE International Symposium on High Performance Computer Architecture (HPCA\u201920). 193\u2013206."},{"key":"e_1_3_1_31_2","volume-title":"Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques (PACT\u201918)","author":"Romero Francisco","year":"2018","unstructured":"Francisco Romero and Christina Delimitrou. 2018. Mage: Online and interference-aware scheduling for multi-scale heterogeneous systems. In Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques (PACT\u201918). Article 19, 13 pages."},{"key":"e_1_3_1_32_2","doi-asserted-by":"crossref","first-page":"351","DOI":"10.1145\/2465351.2465386","volume-title":"Proceedings of the 8th ACM European Conference on Computer Systems (EuroSys\u201913)","author":"Schwarzkopf Malte","year":"2013","unstructured":"Malte Schwarzkopf, Andy Konwinski, Michael Abd-El-Malek, and John Wilkes. 2013. Omega: Flexible, scalable schedulers for large compute clusters. In Proceedings of the 8th ACM European Conference on Computer Systems (EuroSys\u201913). 351\u2013364."},{"key":"e_1_3_1_33_2","first-page":"5998","volume-title":"Advances in Neural Information Processing Systems","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems. 5998\u20136008."},{"key":"e_1_3_1_34_2","first-page":"986","volume-title":"2021 Design, Automation and Test in Europe Conference and Exhibition (DATE\u201921)","author":"Wu Zhaorui","year":"2021","unstructured":"Zhaorui Wu, Yuhui Deng, Hao Feng, Yi Zhou, and Geyong Min. 2021. Blender: A traffic-aware container placement for containerized data centers. In 2021 Design, Automation and Test in Europe Conference and Exhibition (DATE\u201921). 986\u2013989."},{"key":"e_1_3_1_35_2","first-page":"13","volume-title":"Proceedings of the Thirteenth EuroSys Conference 2018","author":"Xiang Yaocheng","year":"2018","unstructured":"Yaocheng Xiang, Xiaolin Wang, Zihui Huang, Zeyu Wang, Yingwei Luo, and Zhenlin Wang. 2018. DCAPS: Dynamic cache allocation with partial sharing. In Proceedings of the Thirteenth EuroSys Conference 2018. 13."},{"key":"e_1_3_1_36_2","first-page":"16","volume-title":"Proceedings of the 48th International Conference on Parallel Processing","author":"Xiang Yaocheng","year":"2019","unstructured":"Yaocheng Xiang, Chencheng Ye, Xiaolin Wang, Yingwei Luo, and Zhenlin Wang. 2019. EMBA: Efficient memory bandwidth allocation to improve performance on Intel commodity processor. In Proceedings of the 48th International Conference on Parallel Processing. 16."},{"key":"e_1_3_1_37_2","first-page":"14","volume-title":"Proceedings of the Thirteenth EuroSys Conference 2018","author":"Xu Cong","year":"2018","unstructured":"Cong Xu, Karthick Rajamani, Alexandre Ferreira, Wesley Felter, Juan Rubio, and Yang Li. 2018. dCat: Dynamic cache management for efficient, performance-sensitive infrastructure-as-a-service. In Proceedings of the Thirteenth EuroSys Conference 2018. 14."},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSC.2022.3181375"},{"key":"e_1_3_1_39_2","unstructured":"Xiao Zhang Eric Tune Robert Hagmann Rohit Jnagal Vrigo Gokhale and John Wilkes. 2013. CPI2: CPU performance isolation for shared compute clusters(EuroSys\u201913). 379\u2013391."},{"key":"e_1_3_1_40_2","doi-asserted-by":"crossref","first-page":"406","DOI":"10.1109\/MICRO.2014.53","volume-title":"2014 47th Annual IEEE\/ACM International Symposium on Microarchitecture","author":"Zhang Yunqi","year":"2014","unstructured":"Yunqi Zhang, Michael A. Laurenzano, Jason Mars, and Lingjia Tang. 2014. SMiTe: Precise QoS prediction on real-system SMT processors to improve utilization in warehouse scale computers. In 2014 47th Annual IEEE\/ACM International Symposium on Microarchitecture. IEEE, 406\u2013418."},{"key":"e_1_3_1_41_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2015.2442983"},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1145\/2872362.2872394"}],"container-title":["ACM Transactions on Architecture and Code Optimization"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3593055","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3593055","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:37:19Z","timestamp":1750178239000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3593055"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,19]]},"references-count":41,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2023,9,30]]}},"alternative-id":["10.1145\/3593055"],"URL":"https:\/\/doi.org\/10.1145\/3593055","relation":{},"ISSN":["1544-3566","1544-3973"],"issn-type":[{"value":"1544-3566","type":"print"},{"value":"1544-3973","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,7,19]]},"assertion":[{"value":"2022-10-18","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-04-11","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-07-19","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}