{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,24]],"date-time":"2025-04-24T05:28:35Z","timestamp":1745472515003,"version":"3.37.3"},"reference-count":46,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,3,6]],"date-time":"2021-03-06T00:00:00Z","timestamp":1614988800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,3,6]],"date-time":"2021-03-06T00:00:00Z","timestamp":1614988800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100010663","name":"H2020 European Research Council","doi-asserted-by":"publisher","award":["639595"],"award-info":[{"award-number":["639595"]}],"id":[{"id":"10.13039\/100010663","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100013690","name":"Minist?re de l'?conomie, de la Science et de l'Innovation - Qu?bec","doi-asserted-by":"publisher","award":["2014SGR1051"],"award-info":[{"award-number":["2014SGR1051"]}],"id":[{"id":"10.13039\/100013690","id-type":"DOI","asserted-by":"publisher"}]},{"name":"BSC-CNS Severo Ochoa program","award":["SEV-2015-0493"],"award-info":[{"award-number":["SEV-2015-0493"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cloud Comp"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Modern applications demand resources at an unprecedented level. In this sense, data-centers are required to scale efficiently to cope with such demand. Resource disaggregation has the potential to improve resource-efficiency by allowing the deployment of workloads in more flexible ways. Therefore, the industry is shifting towards disaggregated architectures, which enables new ways to structure hardware resources in data centers. However, determining the best performing resource provisioning is a complicated task. The optimality of resource allocation in a disaggregated data center depends on its topology and the workload collocation. This paper presents <jats:italic>DRMaestro<\/jats:italic>, a framework to orchestrate disaggregated resources transparently from the applications. <jats:italic>DRMaestro<\/jats:italic> uses a novel flow-network model to determine the optimal placement in multiple phases while employing best-efforts on preventing workload performance interference. We first evaluate the impact of disaggregation regarding the additional network requirements under higher network load. The results show that for some applications the impact is minimal, but other ones can suffer up to 80% slowdown in the data transfer part. After that, we evaluate <jats:italic>DRMaestro<\/jats:italic> via a real prototype on Kubernetes and a trace-driven simulation. The results show that <jats:italic>DRMaestro<\/jats:italic> can reduce the total job makespan with a speedup of up to \u22481.20x and decrease the QoS violation up to \u22482.64x comparing with another orchestrator that does not support resource disaggregation.<\/jats:p>","DOI":"10.1186\/s13677-021-00238-6","type":"journal-article","created":{"date-parts":[[2021,3,6]],"date-time":"2021-03-06T12:03:08Z","timestamp":1615032188000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["DRMaestro: orchestrating disaggregated resources on virtualized data-centers"],"prefix":"10.1186","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3212-2312","authenticated-orcid":false,"given":"Marcelo","family":"Amaral","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jord\u00e0","family":"Polo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David","family":"Carrera","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nelson","family":"Gonzalez","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chih-Chieh","family":"Yang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alessandro","family":"Morari","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bruce","family":"D\u2019Amora","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alaa","family":"Youssef","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Malgorzata","family":"Steinder","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2021,3,6]]},"reference":[{"key":"238_CR1","first-page":"249","volume-title":"Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation OSDI\u201916","author":"PX Gao","year":"2016","unstructured":"Gao PX, Narayan A, Karandikar S, Carreira J, Han S, Agarwal R, Ratnasamy S, Shenker S (2016) Network requirements for resource disaggregation In: Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation OSDI\u201916, 249\u2013264.. USENIX Association, Berkeley, CA, USA. http:\/\/dl.acm.org\/citation.cfm?id=3026877.3026897."},{"key":"238_CR2","doi-asserted-by":"crossref","unstructured":"Katrinis K, Syrivelis D, Pnevmatikatos D, Zervas G, Theodoropoulos D, Koutsopoulos I, Hasharoni K, Raho D, Pinto C, Espina F, Lopez-Buedo S, Chen Q, Nemirovsky M, Roca D, Klos H, Berends T (2016) Rack-scale disaggregated cloud data centers: The dReDBox project vision In: 2016 Design, Automation Test in Europe Conference Exhibition (DATE), 690\u2013695.","DOI":"10.3850\/9783981537079_1014"},{"key":"238_CR3","doi-asserted-by":"publisher","first-page":"60","DOI":"10.1016\/j.comcom.2017.03.009","volume":"107","author":"A Pag\u00e9s","year":"2017","unstructured":"Pag\u00e9s A, Serrano R, Perell J, Spadaro S (2017) On the benefits of resource disaggregation for virtual data centre provisioning in optical data centres. Comput Commun 107:60\u201374. https:\/\/doi.org\/10.1016\/j.comcom.2017.03.009.","journal-title":"Comput Commun"},{"key":"238_CR4","doi-asserted-by":"crossref","unstructured":"Taylor J (2015) Facebook\u2019s data center infrastructure: Open compute, disaggregated rack, and beyond In: 2015 Optical Fiber Communications Conference and Exhibition (OFC), 1.","DOI":"10.1364\/OFC.2015.W1D.5"},{"key":"238_CR5","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/2768405.2768406","volume-title":"Proceedings of the 5th International Workshop on Runtime and Operating Systems for Supercomputers ROSS \u201915","author":"K Keeton","year":"2015","unstructured":"Keeton K (2015) The machine: An architecture for memory-centric computing In: Proceedings of the 5th International Workshop on Runtime and Operating Systems for Supercomputers ROSS \u201915, 1.. Association for Computing Machinery, New York, NY, USA. https:\/\/doi.org\/10.1145\/2768405.2768406."},{"key":"238_CR6","volume-title":"Disaggregated Servers Drive Data Center Efficiency and Innovation","author":"S Krishnapura","year":"2017","unstructured":"Krishnapura S, Achuthan S, Lal V, Tang T (2017) Disaggregated Servers Drive Data Center Efficiency and Innovation. Intel, USA. https:\/\/www.intel.com\/content\/dam\/www\/public\/us\/en\/documents\/best-practices\/disaggregated-server-architecture-drives-data-center-efficiency-paper.pdf."},{"key":"238_CR7","doi-asserted-by":"publisher","unstructured":"Reano C, Silla F (2017) A comparative performance analysis of remote GPU virtualization over three generations of GPUs In: 46th International Conference on Parallel Processing WGorkshops (ICPPW), 121\u2013128. https:\/\/doi.org\/10.1109\/ICPPW.2017.29.","DOI":"10.1109\/ICPPW.2017.29"},{"issue":"2","key":"238_CR8","doi-asserted-by":"publisher","first-page":"324","DOI":"10.1109\/TPDS.2017.2757479","volume":"29","author":"AD Lin","year":"2018","unstructured":"Lin AD, Li CS, Liao W, Franke H (2018) Capacity optimization for resource pooling in virtualized data centers with composable systems. IEEE Trans Parallel Distrib Syst 29(2):324\u2013337. https:\/\/doi.org\/10.1109\/TPDS.2017.2757479.","journal-title":"IEEE Trans Parallel Distrib Syst"},{"key":"238_CR9","first-page":"295","volume-title":"Proceedings of the 8th USENIX Conference on Networked Systems Design and Implementation NSDI\u201911","author":"B Hindman","year":"2011","unstructured":"Hindman B, Konwinski A, Zaharia M, Ghodsi A, Joseph AD, Katz R, Shenker S, Stoica I (2011) Mesos: A platform for fine-grained resource sharing in the data center In: Proceedings of the 8th USENIX Conference on Networked Systems Design and Implementation NSDI\u201911, 295\u2013308.. USENIX Association, Berkeley, CA, USA. http:\/\/dl.acm.org\/citation.cfm?id=1972457.1972488."},{"key":"238_CR10","volume-title":"Kubernetes - Scheduling the Future at Cloud Scale","author":"DK Rensin","year":"2015","unstructured":"Rensin DK (2015) Kubernetes - Scheduling the Future at Cloud Scale. Oreilly, 1005 Gravenstein Highway North Sebastopol, CA 95472. http:\/\/www.oreilly.com\/webops-perf\/free\/kubernetes.csp."},{"key":"238_CR11","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/2523616.2523633","volume-title":"Proceedings of the Symposium on Cloud Computing SOCC \u201913","author":"VK Vavilapalli","year":"2013","unstructured":"Vavilapalli VK, Murthy AC, Douglas C, Agarwal S, Konar M, Evans R, Graves T, Lowe J, Shah H, Seth S, Saha B, Curino C, O\u2019Malley O, Radia S, Reed B, Baldeschwieler E (2013) Apache hadoop yarn: Yet another resource negotiator In: Proceedings of the Symposium on Cloud Computing SOCC \u201913, 1\u201316.. ACM, New York. https:\/\/doi.org\/10.1145\/2523616.2523633."},{"key":"238_CR12","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/2741948.2741964","volume-title":"Proceedings of the Tenth European Conference on Computer Systems EuroSys \u201915","author":"A Verma","year":"2015","unstructured":"Verma A, Pedrosa L, Korupolu M, Oppenheimer D, Tune E, Wilkes J (2015) Large-scale cluster management at Google with Borg In: Proceedings of the Tenth European Conference on Computer Systems EuroSys \u201915, 1\u201317.. ACM, New York, NY, USA. https:\/\/doi.org\/10.1145\/2741948.2741964."},{"key":"238_CR13","first-page":"99","volume-title":"12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16)","author":"I Gog","year":"2016","unstructured":"Gog I, Schwarzkopf M, Gleave A, Watson RNM, Hand S (2016) Firmament: Fast, centralized cluster scheduling at scale In: 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), 99\u2013115.. USENIX Association, Savannah, GA."},{"key":"238_CR14","volume-title":"Matching theory. North-Holland Mathematics Studies","author":"L Lov\u00e1sz","year":"1986","unstructured":"Lov\u00e1sz L, Plummer MD (1986) Matching theory. North-Holland Mathematics Studies, Vol. 121. North-Holland, Amsterdam, Netherlands. https:\/\/doi.org\/10.1016\/S0304-0208(08)73637-5. http:\/\/www.sciencedirect.com\/science\/article\/pii\/S0304020808736375."},{"key":"238_CR15","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4615-4401-2_10","volume-title":"Graph Matching and Similarity","author":"H Bunke","year":"2000","unstructured":"Bunke H, Jiang X (2000) Graph Matching and Similarity. Springer, Boston, MA."},{"key":"238_CR16","volume-title":"Network Flows: Theory, Algorithms, and Applications","author":"RK Ahuja","year":"1993","unstructured":"Ahuja RK, Magnanti TL, Orlin JB (1993) Network Flows: Theory, Algorithms, and Applications. Prentice-Hall, Inc., Upper Saddle River, NJ, USA."},{"issue":"1","key":"238_CR17","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1006\/jagm.1995.0805","volume":"22","author":"AV Goldberg","year":"1997","unstructured":"Goldberg AV (1997) An efficient implementation of a scaling minimum-cost flow algorithm. J Algorithm 22(1):1\u201329. https:\/\/doi.org\/10.1006\/jagm.1995.0805.","journal-title":"J Algorithm"},{"key":"238_CR18","first-page":"203","volume-title":"Proceedings of the 9th ACM\/IFIP\/USENIX International Conference on Middleware \u201908","author":"D Carrera","year":"2008","unstructured":"Carrera D, Steinder M, Whalley I, Torres J, Ayguad\u00e9 E (2008) Enabling resource sharing between transactional and batch workloads using dynamic application placement In: Proceedings of the 9th ACM\/IFIP\/USENIX International Conference on Middleware \u201908, 203\u2013222.. Springer, New York. http:\/\/dl.acm.org\/citation.cfm?id=1496950.1496964."},{"issue":"3","key":"238_CR19","doi-asserted-by":"publisher","first-page":"205","DOI":"10.1287\/mnsc.14.3.205","volume":"14","author":"M Klein","year":"1967","unstructured":"Klein M (1967) A primal method for minimal cost flows with applications to the assignment and transportation problems. Manag Sci 14(3):205\u2013220. https:\/\/doi.org\/10.1287\/mnsc.14.3.205.","journal-title":"Manag Sci"},{"issue":"3","key":"238_CR20","doi-asserted-by":"publisher","first-page":"430","DOI":"10.1287\/moor.15.3.430","volume":"15","author":"AV Goldberg","year":"1990","unstructured":"Goldberg AV, Tarjan RE (1990) Finding minimum-cost circulations by successive approximation. Math Oper Res 15(3):430\u2013466. https:\/\/doi.org\/10.1287\/moor.15.3.430.","journal-title":"Math Oper Res"},{"key":"238_CR21","doi-asserted-by":"publisher","first-page":"467","DOI":"10.1145\/3037697.3037713","volume-title":"Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems","author":"Y Hu","year":"2017","unstructured":"Hu Y, Song M, Li T (2017) Towards \u201cfull containerization\u201d in containerized network function virtualization In: Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, 467\u2013481.. ACM, New York, NY, USA. https:\/\/doi.org\/10.1145\/3037697.3037713."},{"key":"238_CR22","doi-asserted-by":"publisher","first-page":"261","DOI":"10.1145\/1629575.1629601","volume-title":"Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles","author":"M Isard","year":"2009","unstructured":"Isard M, Prabhakaran V, Currey J, Wieder U, Talwar K, Goldberg A (2009) Quincy: Fair scheduling for distributed computing clusters In: Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles, 261\u2013276.. ACM, New York. https:\/\/doi.org\/10.1145\/1629575.1629601."},{"key":"238_CR23","volume-title":"Native Docker Clustering with Swarm","author":"F Soppelsa","year":"2017","unstructured":"Soppelsa F, Kaewkasi C (2017) Native Docker Clustering with Swarm. Packt Publishing, USA. https:\/\/dl.acm.org\/doi\/book\/10.5555\/3153103."},{"key":"238_CR24","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1145\/2517349.2522716","volume-title":"Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles","author":"K Ousterhout","year":"2013","unstructured":"Ousterhout K, Wendell P, Zaharia M, Stoica I (2013) Sparrow: Distributed, low latency scheduling In: Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, 69\u201384.. ACM, New York. https:\/\/doi.org\/10.1145\/2517349.2522716."},{"key":"238_CR25","first-page":"1154","volume-title":"Proceedings of the 29th Conference on Information Communications","author":"X Meng","year":"2010","unstructured":"Meng X, Pappas V, Zhang L (2010) Improving the scalability of data center networks with traffic-aware virtual machine placement In: Proceedings of the 29th Conference on Information Communications, 1154\u20131162.. IEEE Press, San Diego."},{"key":"238_CR26","doi-asserted-by":"publisher","first-page":"428","DOI":"10.1016\/j.future.2016.03.009","volume":"76","author":"W Zhang","year":"2017","unstructured":"Zhang W, Han S, He H, Chen H (2017) Network-aware virtual machine migration in an overcommitted cloud. Futur Gener Comput Syst 76:428\u2013442. https:\/\/doi.org\/10.1016\/j.future.2016.03.009.","journal-title":"Futur Gener Comput Syst"},{"key":"238_CR27","doi-asserted-by":"publisher","first-page":"318","DOI":"10.1109\/SBAC-PAD.2014.49","volume-title":"Proceedings of the 2014 IEEE 26th International Symposium on Computer Architecture and High Performance Computing","author":"S Iserte","year":"2014","unstructured":"Iserte S, Castell\u00f3 A, Mayo R, Quintana-Ort\u00ed ES, Silla F, Duato J, Rea\u00f1o C, Prades J (2014) Slurm support for remote GPU virtualization: Implementation and performance study In: Proceedings of the 2014 IEEE 26th International Symposium on Computer Architecture and High Performance Computing, 318\u2013325.. IEEE Computer Society, Washington, DC, USA. https:\/\/doi.org\/10.1109\/SBAC-PAD.2014.49."},{"key":"238_CR28","first-page":"44","volume-title":"Workshop on Job Scheduling Strategies for Parallel Processing","author":"MA Jette","year":"2003","unstructured":"Jette MA, Yoo AB, Grondona M (2003) Slurm: Simple linux utility for resource management In: Workshop on Job Scheduling Strategies for Parallel Processing, 44\u201360.. Springer, Berlin, Heidelberg. https:\/\/doi.org\/10.1007\/10968987_3. http:\/\/citeseerx.ist.psu.edu\/viewdoc\/summary?cid=184264."},{"key":"238_CR29","doi-asserted-by":"publisher","first-page":"249","DOI":"10.5220\/0005780502490256","volume-title":"Proceedings of the 6th International Conference on Cloud Computing and Services Science","author":"S Iserte","year":"2016","unstructured":"Iserte S, Clemente-Castell\u00f3 FJ, Castell\u00f3 A, Mayo R, Quintana-Ort\u00ed ES (2016) Enabling GPU virtualization in cloud environments In: Proceedings of the 6th International Conference on Cloud Computing and Services Science, 249\u2013256.. SCITEPRESS - Science and Technology Publications, Lda, Portugal. https:\/\/doi.org\/10.5220\/0005780502490256."},{"issue":"3","key":"238_CR30","doi-asserted-by":"publisher","first-page":"38","DOI":"10.5120\/8738-2991","volume":"55","author":"O Sefraoui","year":"2012","unstructured":"Sefraoui O, Aissaoui M, Eleuldj M (2012) Openstack: Toward an open-source solution for cloud computing. Int J Comput Appl 55(3):38\u201342. https:\/\/doi.org\/10.5120\/8738-2991.","journal-title":"Int J Comput Appl"},{"key":"238_CR31","doi-asserted-by":"publisher","unstructured":"Lama P, Li Y, Aji AM, Balaji P, Dinan J, Xiao S, Zhang Y, Feng W, Thakur R, Zhou X (2013) pVOCL: Power-aware dynamic placement and migration in virtualized GPU environments In: 2013 IEEE 33rd International Conference on Distributed Computing Systems, 145\u2013154. https:\/\/doi.org\/10.1109\/ICDCS.2013.51.","DOI":"10.1109\/ICDCS.2013.51"},{"key":"238_CR32","doi-asserted-by":"publisher","unstructured":"Xiao S, Balaji P, Zhu Q, Thakur R, Coghlan S, Lin H, Wen G, Hong J, Feng W (2012) VOCL: An optimized environment for transparent virtualization of graphics processing units In: 2012 Innovative Parallel Computing (InPar), 1\u201312. https:\/\/doi.org\/10.1109\/InPar.2012.6339609.","DOI":"10.1109\/InPar.2012.6339609"},{"key":"238_CR33","doi-asserted-by":"publisher","unstructured":"Oikawa M, Kawai A, Nomura K, Yasuoka K, Yoshikawa K, Narumi T (2012) DS-CUDA: a middleware to use many GPUs in the cloud environment In: 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, 1207\u20131214. https:\/\/doi.org\/10.1109\/SC.Companion.2012.146.","DOI":"10.1109\/SC.Companion.2012.146"},{"key":"238_CR34","doi-asserted-by":"publisher","unstructured":"Liang TY, Chang YW (2011) GridCuda: a grid-enabled CUDA programming toolkit In: 2011 IEEE Workshops of International Conference on Advanced Information Networking and Applications, 141\u2013146. https:\/\/doi.org\/10.1109\/WAINA.2011.82.","DOI":"10.1109\/WAINA.2011.82"},{"key":"238_CR35","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1145\/1996121.1996124","volume-title":"Proceedings of the 5th International Workshop on Virtualization Technologies in Distributed Computing","author":"AM Merritt","year":"2011","unstructured":"Merritt AM, Gupta V, Verma A, Gavrilovska A, Schwan K (2011) Shadowfax: scaling in heterogeneous cluster systems via GPGPU assemblies In: Proceedings of the 5th International Workshop on Virtualization Technologies in Distributed Computing, 3\u201310.. ACM, New York, NY, USA. https:\/\/doi.org\/10.1145\/1996121.1996124."},{"issue":"1","key":"238_CR36","doi-asserted-by":"publisher","first-page":"81","DOI":"10.1023\/A:1022643204877","volume":"1","author":"JR Quinlan","year":"1986","unstructured":"Quinlan JR (1986) Induction of decision trees. Mach Learn 1(1):81\u2013106. https:\/\/doi.org\/10.1023\/A:1022643204877.","journal-title":"Mach Learn"},{"key":"238_CR37","first-page":"1","volume-title":"Proceedings of the 13th International Workshop on Software; Compilers for Embedded Systems","author":"D Fenacci","year":"2010","unstructured":"Fenacci D, Franke B, Thomson J (2010) Workload characterization supporting the development of domain-specific compiler optimizations using decision trees for data mining In: Proceedings of the 13th International Workshop on Software; Compilers for Embedded Systems, 1\u201310.. ACM, New York, NY, USA. http:\/\/doi.acm.org\/10.1145\/1811212.1811219."},{"key":"238_CR38","doi-asserted-by":"publisher","first-page":"127","DOI":"10.1145\/2541940.2541941","volume-title":"Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems","author":"C Delimitrou","year":"2014","unstructured":"Delimitrou C, Kozyrakis C (2014) Quasar: Resource-efficient and qos-aware cluster management In: Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems, 127\u2013144.. ACM, New York, NY, USA. https:\/\/doi.org\/10.1145\/2541940.2541941."},{"key":"238_CR39","doi-asserted-by":"publisher","first-page":"450","DOI":"10.1145\/2749469.2749475","volume-title":"Proceedings of the 42Nd Annual International Symposium on Computer Architecture","author":"D Lo","year":"2015","unstructured":"Lo D, Cheng L, Govindaraju R, Ranganathan P, Kozyrakis C (2015) Heracles: Improving resource efficiency at scale In: Proceedings of the 42Nd Annual International Symposium on Computer Architecture, 450\u2013462.. ACM, New York, NY, USA. https:\/\/doi.org\/10.1145\/2749469.2749475."},{"key":"238_CR40","doi-asserted-by":"publisher","first-page":"114","DOI":"10.1145\/1152154.1152174","volume-title":"Proceedings of the 15th International Conference on Parallel Architectures and Compilation Techniques","author":"K Hoste","year":"2006","unstructured":"Hoste K, Phansalkar A, Eeckhout L, Georges A, John LK, De Bosschere K (2006) Performance prediction based on inherent program similarity In: Proceedings of the 15th International Conference on Parallel Architectures and Compilation Techniques, 114\u2013122.. ACM, New York, NY, USA. https:\/\/doi.org\/10.1145\/1152154.1152174."},{"key":"238_CR41","unstructured":"Srivastava S (2018) The Poseidon an add-on Kubernetes scheduler for Firmament scheduler framework. https:\/\/github.com\/kubernetes-sigs\/poseidon."},{"key":"238_CR42","doi-asserted-by":"publisher","first-page":"180","DOI":"10.1016\/j.future.2016.07.014","volume":"67","author":"C-S Li","year":"2017","unstructured":"Li C-S, Franke H, Parris C, Abali B, Kesavan M, Chang V (2017) Composable architecture for rack scale big data computing. Futur Gener Comput Syst 67:180\u2013193. https:\/\/doi.org\/10.1016\/j.future.2016.07.014.","journal-title":"Futur Gener Comput Syst"},{"key":"238_CR43","unstructured":"NVIDIA (2019) Multi-process service (MPS) In: Multi-Process Service (MPS), 1. https:\/\/docs.nvidia.com\/deploy\/mps\/index.html. Accessed in 29 January 2019."},{"key":"238_CR44","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3126908.3126933","volume-title":"Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis","author":"M Amaral","year":"2017","unstructured":"Amaral M, Polo J, Carrera D, Seelam S, Steinder M (2017) Topology-aware GPU scheduling for learning workloads in cloud environments In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 1\u201312.. ACM, Denver, CO, USA. https:\/\/doi.org\/10.1145\/3126908.3126933."},{"key":"238_CR45","doi-asserted-by":"publisher","first-page":"44","DOI":"10.1109\/IISWC.2009.5306797","volume-title":"Proceedings of the 2009 IEEE International Symposium on Workload Characterization (IISWC)","author":"S Che","year":"2009","unstructured":"Che S, Boyer M, Meng J, Tarjan D, Sheaffer JW, Lee S-H, Skadron K (2009) Rodinia: A benchmark suite for heterogeneous computing In: Proceedings of the 2009 IEEE International Symposium on Workload Characterization (IISWC), 44\u201354.. IEEE Computer Society, Washington. https:\/\/doi.org\/10.1109\/IISWC.2009.5306797."},{"key":"238_CR46","unstructured":"Dugan J, Elliott S, Mah BA, Poskanzer J, Prabhu K (2014) iperf3: A TCP, UDP, and SCTP network bandwidth measurement tool. University of California Lawrence Berkeley National Laboratory and U.S. Department of Energy. https:\/\/github.com\/esnet\/iperf. Accessed in 21 Jan 2015."}],"container-title":["Journal of Cloud Computing"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13677-021-00238-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s13677-021-00238-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13677-021-00238-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,3,7]],"date-time":"2021-03-07T00:04:38Z","timestamp":1615075478000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofcloudcomputing.springeropen.com\/articles\/10.1186\/s13677-021-00238-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,6]]},"references-count":46,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["238"],"URL":"https:\/\/doi.org\/10.1186\/s13677-021-00238-6","relation":{},"ISSN":["2192-113X"],"issn-type":[{"type":"electronic","value":"2192-113X"}],"subject":[],"published":{"date-parts":[[2021,3,6]]},"assertion":[{"value":"26 September 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 February 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 March 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare that they have no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"22"}}