{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,24]],"date-time":"2025-10-24T16:44:34Z","timestamp":1761324274495,"version":"3.41.0"},"reference-count":48,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2020,3,4]],"date-time":"2020-03-04T00:00:00Z","timestamp":1583280000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000143","name":"Division of Computing and Communication Foundations","doi-asserted-by":"publisher","award":["CCF-1149252 CCF-1337215 SHF-1527610 AF-1408784"],"award-info":[{"award-number":["CCF-1149252 CCF-1337215 SHF-1527610 AF-1408784"]}],"id":[{"id":"10.13039\/100000143","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000144","name":"Division of Computer and Network Systems","doi-asserted-by":"publisher","award":["CNS-1822085"],"award-info":[{"award-number":["CNS-1822085"]}],"id":[{"id":"10.13039\/100000144","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Archit. Code Optim."],"published-print":{"date-parts":[[2020,3,31]]},"abstract":"<jats:p>We draw on reinforcement learning frameworks to design and implement an adaptive controller for managing resource contention. During runtime, the controller observes the dynamic system conditions and optimizes control policies that satisfy latency targets yet improve server utilization. We evaluate a physical prototype that guarantees 95th percentile latencies for a search engine and improves server utilization by up to 70%, compared to exclusively reserving servers for interactive services, for varied batch workloads in machine learning.<\/jats:p>","DOI":"10.1145\/3375714","type":"journal-article","created":{"date-parts":[[2020,3,4]],"date-time":"2020-03-04T12:50:12Z","timestamp":1583326212000},"page":"1-25","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":12,"title":["Dynamic Colocation Policies with Reinforcement Learning"],"prefix":"10.1145","volume":"17","author":[{"given":"Yuhao","family":"Li","sequence":"first","affiliation":[{"name":"Duke University, Durham, NC"}]},{"given":"Dan","family":"Sun","sequence":"additional","affiliation":[{"name":"Duke University, Durham, NC"}]},{"given":"Benjamin C.","family":"Lee","sequence":"additional","affiliation":[{"name":"Duke University, Durham, NC"}]}],"member":"320","published-online":{"date-parts":[[2020,3,4]]},"reference":[{"volume-title":"Retrieved","year":"2017","key":"e_1_2_1_1_1","unstructured":"Intel. (n.d.). Cache Monitoring Technology and Cache Allocation Technology\u2014Intel . Retrieved November 17, 2017 from https:\/\/www.intel.com\/content\/www\/us\/en\/communications\/cache-monitoring-cache-allocation-technologies.html. Intel. (n.d.). Cache Monitoring Technology and Cache Allocation Technology\u2014Intel. Retrieved November 17, 2017 from https:\/\/www.intel.com\/content\/www\/us\/en\/communications\/cache-monitoring-cache-allocation-technologies.html."},{"key":"e_1_2_1_2_1","volume-title":"Retrieved","author":"Manage Paul","year":"2004","unstructured":"Paul Manage ( 2004 ). Cgroups\u2014The Linux Kernel Archives . Retrieved November 17, 2017 from https:\/\/www.kernel.org\/doc\/Documentation\/cgroup-v1\/cgroups.txt. Paul Manage (2004). Cgroups\u2014The Linux Kernel Archives. Retrieved November 17, 2017 from https:\/\/www.kernel.org\/doc\/Documentation\/cgroup-v1\/cgroups.txt."},{"key":"e_1_2_1_3_1","volume-title":"Retrieved","author":"Brodowski Dominik","year":"2020","unstructured":"Dominik Brodowski . (n.d.). CPUFreq Governor\u2014The Linux Kernel Archives . Retrieved January 29, 2020 from https:\/\/www.kernel.org\/doc\/Documentation\/cpu-freq\/governors.txt. Dominik Brodowski. (n.d.). CPUFreq Governor\u2014The Linux Kernel Archives. Retrieved January 29, 2020 from https:\/\/www.kernel.org\/doc\/Documentation\/cpu-freq\/governors.txt."},{"key":"e_1_2_1_4_1","volume-title":"Retrieved","author":"Marshall David","year":"1999","unstructured":"David Marshall . ( 1999 ). IPC:Shared Memory . Retrieved November 20, 2017 from https:\/\/users.cs.cf.ac.uk\/dave\/C\/node27.html. David Marshall. (1999). IPC:Shared Memory. Retrieved November 20, 2017 from https:\/\/users.cs.cf.ac.uk\/dave\/C\/node27.html."},{"key":"e_1_2_1_5_1","volume-title":"Retrieved","author":"National Taiwan University. (n.d.). LIBSVM Data","year":"2020","unstructured":"National Taiwan University. (n.d.). LIBSVM Data : Classification (Binary Class) . Retrieved January 29, 2020 from https:\/\/www.csie.ntu.edu.tw\/ cjlin\/libsvmtools\/datasets\/binary.html. National Taiwan University. (n.d.). LIBSVM Data: Classification (Binary Class). Retrieved January 29, 2020 from https:\/\/www.csie.ntu.edu.tw\/ cjlin\/libsvmtools\/datasets\/binary.html."},{"key":"e_1_2_1_6_1","volume-title":"Retrieved","author":"Spark Apache","year":"2018","unstructured":"Apache Spark . ( 2018 ). MLlib . Retrieved November 17, 2017 from https:\/\/spark.apache.org\/mllib\/. Apache Spark. (2018). MLlib. Retrieved November 17, 2017 from https:\/\/spark.apache.org\/mllib\/."},{"key":"e_1_2_1_7_1","volume-title":"Retrieved","author":"Restelli Marcello","year":"2015","unstructured":"Marcello Restelli . ( 2015 ). Reinforcement Learning: Exploration vs Exploitation . Retrieved November 17, 2017 from http:\/\/home.deib.polimi.it\/restelli\/MyWebSite\/pdf\/rl5.pdf. Marcello Restelli. (2015). Reinforcement Learning: Exploration vs Exploitation. Retrieved November 17, 2017 from http:\/\/home.deib.polimi.it\/restelli\/MyWebSite\/pdf\/rl5.pdf."},{"key":"e_1_2_1_8_1","volume-title":"GigaOm. Retrieved","author":"Benik Alex","year":"2013","unstructured":"Alex Benik . ( 2013 ). The sorry state of server utilization and the impending post-hypervisor era . GigaOm. Retrieved January 29, 2020 from https:\/\/gigaom.com\/2013\/11\/30\/the-sorry-state-of-server-utilization-and-the-impending-post-hypervisor-era\/. Alex Benik. (2013). The sorry state of server utilization and the impending post-hypervisor era. GigaOm. Retrieved January 29, 2020 from https:\/\/gigaom.com\/2013\/11\/30\/the-sorry-state-of-server-utilization-and-the-impending-post-hypervisor-era\/."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSMCC.2011.2106494"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/CEC.2011.5949796"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2007.443"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2006.31"},{"key":"e_1_2_1_13_1","volume-title":"Pegasus: Mapping scientific workflows onto the grid. In Grid Computing","author":"Deelman Ewa","year":"2004","unstructured":"Ewa Deelman , James Blythe , Yolanda Gil , Carl Kesselman , Gaurang Mehta , Sonal Patil , Mei-Hui Su , Karan Vahi , and Miron Livny . 2004 . Pegasus: Mapping scientific workflows onto the grid. In Grid Computing . Springer , 131--140. Ewa Deelman, James Blythe, Yolanda Gil, Carl Kesselman, Gaurang Mehta, Sonal Patil, Mei-Hui Su, Karan Vahi, and Miron Livny. 2004. Pegasus: Mapping scientific workflows onto the grid. In Grid Computing. Springer, 131--140."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2451116.2451125"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2644865.2541941"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2018.00059"},{"key":"e_1_2_1_17_1","volume-title":"Tilbury","author":"Hellerstein Joseph L.","year":"2004","unstructured":"Joseph L. Hellerstein , Yixin Diao , Sujay Parekh , and Dawn M . Tilbury . 2004 . Feedback Control of Computing Systems. John Wiley 8 Sons. Joseph L. Hellerstein, Yixin Diao, Sujay Parekh, and Dawn M. Tilbury. 2004. Feedback Control of Computing Systems. John Wiley 8 Sons."},{"volume-title":"Proceedings of the IEEE 21st International Symposium on High Performance Computer Architecture (HPCA\u201915)","author":"Hsu Chang-Hong","key":"e_1_2_1_18_1","unstructured":"Chang-Hong Hsu , Yunqi Zhang , Michael A. Laurenzano , David Meisner , Thomas Wenisch , Jason Mars , Lingjia Tang , and Ronald G. Dreslinski . 2015. Adrenaline: Pinpointing and reining in tail queries with quick voltage boosting . In Proceedings of the IEEE 21st International Symposium on High Performance Computer Architecture (HPCA\u201915) . IEEE, Los Alamitos, CA, 271--282. Chang-Hong Hsu, Yunqi Zhang, Michael A. Laurenzano, David Meisner, Thomas Wenisch, Jason Mars, Lingjia Tang, and Ronald G. Dreslinski. 2015. Adrenaline: Pinpointing and reining in tail queries with quick voltage boosting. In Proceedings of the IEEE 21st International Symposium on High Performance Computer Architecture (HPCA\u201915). IEEE, Los Alamitos, CA, 271--282."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1380584.1380585"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2008.21"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2830772.2830797"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2016.7581261"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.5555\/1870926.1871068"},{"volume-title":"Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA\u201917)","author":"Llull Qiuyun","key":"e_1_2_1_24_1","unstructured":"Qiuyun Llull , Songchun Fan , Seyed Majid Zahedi , and Benjamin C. Lee . 2017. Cooper: Task colocation with cooperative games . In Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA\u201917) . IEEE, Los Alamitos, CA, 421--432. Qiuyun Llull, Songchun Fan, Seyed Majid Zahedi, and Benjamin C. Lee. 2017. Cooper: Task colocation with cooperative games. In Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA\u201917). IEEE, Los Alamitos, CA, 421--432."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2678373.2665718"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2872887.2749475"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10723-014-9314-7"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3005745.3005750"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2155620.2155650"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.5555\/2600239.2600241"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1755913.1755938"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2017.13"},{"key":"e_1_2_1_33_1","volume-title":"Proceedings of the 2013 USENIX Annual Technical Conference.","author":"Novakovic Dejan","year":"2013","unstructured":"Dejan Novakovic , Nedeljko Vasic , Stanko Novakovic , Dejan Kostic , and Ricardo Bianchini . 2013 . DeepDive: Transparently identifying and managing performance interference in virtualized environments . In Proceedings of the 2013 USENIX Annual Technical Conference. Dejan Novakovic, Nedeljko Vasic, Stanko Novakovic, Dejan Kostic, and Ricardo Bianchini. 2013. DeepDive: Transparently identifying and managing performance interference in virtualized environments. In Proceedings of the 2013 USENIX Annual Technical Conference."},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/2517349.2522716"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2015.7056037"},{"volume-title":"Proceedings of the 39th Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO-39)","author":"Moinuddin","key":"e_1_2_1_36_1","unstructured":"Moinuddin K. Qureshi and Yale N. Patt. 2006. Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches . In Proceedings of the 39th Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO-39) . IEEE, Los Alamitos, CA, 423--432. Moinuddin K. Qureshi and Yale N. Patt. 2006. Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches. In Proceedings of the 39th Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO-39). IEEE, Los Alamitos, CA, 423--432."},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2024723.2000073"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIC.2007.21"},{"key":"e_1_2_1_39_1","volume-title":"Proceedings of the 20th National Conference on Artificial Intelligence (AAAI\u201905)","volume":"2","author":"Tesauro Gerald","year":"2005","unstructured":"Gerald Tesauro . 2005 . Online resource allocation using decompositional reinforcement learning . In Proceedings of the 20th National Conference on Artificial Intelligence (AAAI\u201905) , Vol. 2 . 886--891. Gerald Tesauro. 2005. Online resource allocation using decompositional reinforcement learning. In Proceedings of the 20th National Conference on Artificial Intelligence (AAAI\u201905), Vol. 2. 886--891."},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-24455-1_33"},{"volume-title":"Proceedings of the 48th International Symposium on Microarchitecture. ACM","author":"Vamanan Balajee","key":"e_1_2_1_41_1","unstructured":"Balajee Vamanan , Hamza Bin Sohail , Jahangir Hasan , and T. N. Vijaykumar . 2015. TimeTrader: Exploiting latency tail to save datacenter energy for online search . In Proceedings of the 48th International Symposium on Microarchitecture. ACM , New York, NY, 585--597. Balajee Vamanan, Hamza Bin Sohail, Jahangir Hasan, and T. N. Vijaykumar. 2015. TimeTrader: Exploiting latency tail to save datacenter energy for online search. In Proceedings of the 48th International Symposium on Microarchitecture. ACM, New York, NY, 585--597."},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/2508148.2485974"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICICTA.2012.115"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/2465351.2465388"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2014.53"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/2954679.2872394"},{"volume-title":"Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA'19)","author":"Mirhosseini Amirhossein","key":"e_1_2_1_47_1","unstructured":"Amirhossein Mirhosseini , Akshitha Sriraman , and Thomas F. Wenisch . 2019. Enhancing server efficiency in the face of killer microseconds . In Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA'19) . IEEE, 185--198. Amirhossein Mirhosseini, Akshitha Sriraman, and Thomas F. Wenisch. 2019. Enhancing server efficiency in the face of killer microseconds. In Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA'19). IEEE, 185--198."},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2019.00032"}],"container-title":["ACM Transactions on Architecture and Code Optimization"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3375714","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3375714","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:38:15Z","timestamp":1750199895000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3375714"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,3,4]]},"references-count":48,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,3,31]]}},"alternative-id":["10.1145\/3375714"],"URL":"https:\/\/doi.org\/10.1145\/3375714","relation":{},"ISSN":["1544-3566","1544-3973"],"issn-type":[{"type":"print","value":"1544-3566"},{"type":"electronic","value":"1544-3973"}],"subject":[],"published":{"date-parts":[[2020,3,4]]},"assertion":[{"value":"2019-05-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-12-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-03-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}