{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,1]],"date-time":"2025-11-01T21:45:26Z","timestamp":1762033526331,"version":"build-2065373602"},"reference-count":33,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2018,4,25]],"date-time":"2018-04-25T00:00:00Z","timestamp":1524614400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Key Research & Development (R&D) Plan","award":["2017YFC0803300"],"award-info":[{"award-number":["2017YFC0803300"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["71673292","61503402","61673388"],"award-info":[{"award-number":["71673292","61503402","61673388"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National Social Science Foundation of China","award":["17CGL047"],"award-info":[{"award-number":["17CGL047"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Symmetry"],"abstract":"<jats:p>Cloud computing has emerged as a powerful and promising way to run high-performance computing (HPC) jobs. Most HPC jobs are designed under a multi-process paradigm and involve frequent communication and synchronization among parallel processes. However, because the underlying resources of cloud data centers are always shared among multiple tenants, competition among jobs for limited bandwidth leads to unpredictable job completion times, which may cause QoS violations and inefficient resource utilization when scheduling parallel jobs in the cloud. To tackle this issue, it is essential to provide bandwidth guarantees for parallel jobs running in the cloud. Offering a dedicated virtual cluster (VC) for running applications in the cloud is a popular way to guarantee bandwidth demands. 
Motivated by these problems, in this paper we first design a time-aware virtual cluster (TVC) request model for parallel jobs and consider how to embed the requested TVCs of jobs into the cloud efficiently under a parallel job scheduling framework. An adaptive bandwidth-aware heuristic algorithm, denoted AdaBa, is proposed to improve the job accept rate by adaptively adjusting the priorities of servers to accommodate the VMs of a TVC according to the relative size of the requested bandwidth demand. Then, a bandwidth-guaranteed migration and backfilling scheduling algorithm, denoted BgMBF, is designed to schedule parallel jobs, with the bandwidth demands guaranteed by AdaBa. To obtain high job responsiveness, a bandwidth-reserved job backfilling strategy is applied when the requested TVC for the currently scheduled job cannot be allocated in the cloud. The migration cost of BgMBF is also considered, and an enhanced version, BgMBFSDF, is proposed to minimize the number of migrations when the execution times of jobs are known. Through extensive simulation experiments on popular parallel workloads, our proposed TVC embedding algorithm AdaBa achieves up to 15 percent improvement in accept rate compared with existing algorithms such as Oktopus and the greedy algorithm. 
Our proposed BgMBF and BgMBFSDF also significantly outperform other popular scheduling algorithms integrated with AdaBa on average response time and average bounded slow down.<\/jats:p>","DOI":"10.3390\/sym10050134","type":"journal-article","created":{"date-parts":[[2018,4,25]],"date-time":"2018-04-25T11:15:39Z","timestamp":1524654939000},"page":"134","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Bandwidth-Guaranteed Resource Allocation and Scheduling for Parallel Jobs in Cloud Data Center"],"prefix":"10.3390","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2750-3957","authenticated-orcid":false,"given":"Zhen","family":"Li","sequence":"first","affiliation":[{"name":"College of System Engineering, National University of Defense Technology, Changsha 410073, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2962-9254","authenticated-orcid":false,"given":"Bin","family":"Chen","sequence":"additional","affiliation":[{"name":"College of System Engineering, National University of Defense Technology, Changsha 410073, China"}]},{"given":"Xiaocheng","family":"Liu","sequence":"additional","affiliation":[{"name":"College of System Engineering, National University of Defense Technology, Changsha 410073, China"}]},{"given":"Dandan","family":"Ning","sequence":"additional","affiliation":[{"name":"College of System Engineering, National University of Defense Technology, Changsha 410073, China"}]},{"given":"Qihang","family":"Wei","sequence":"additional","affiliation":[{"name":"The 66029th Troop of PLA, Inner Mongolia Autonomous Region 011216, China"}]},{"given":"Yiping","family":"Wang","sequence":"additional","affiliation":[{"name":"The Naval 902 Factory, Shanghai 200083, China"}]},{"given":"Xiaogang","family":"Qiu","sequence":"additional","affiliation":[{"name":"College of System Engineering, National University of Defense Technology, Changsha 410073, 
China"}]}],"member":"1968","published-online":{"date-parts":[[2018,4,25]]},"reference":[{"key":"ref_1","unstructured":"Pellerin, D., and Ballantyne, D.B.A. (2015). An Introduction to High Performance Computing on AWS: Scalable, Cost-Effective Solutions for Engineering, Business, and Science, Amazon Web Service."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Gupta, A., Sarood, O., Kale, L.V., and Milojicic, D. (2013, January 13\u201316). Improving HPC Application Performance in Cloud through Dynamic Load Balancing. Proceedings of the IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing, Delft, The Netherlands.","DOI":"10.1109\/CCGrid.2013.65"},{"key":"ref_3","unstructured":"Walker, E. (2008). Benchmarking amazon EC2 for high-performance scientific computing. Usenix, 18\u201323."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Gupta, A., and Milojicic, D. (2011, January 12\u201313). Evaluation of HPC Applications on Cloud. Proceedings of the Sixth Open Cirrus Summit, Atlanta, GA, USA.","DOI":"10.1109\/OCS.2011.10"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1016\/j.future.2013.12.024","article-title":"CLOUDRB: A framework for scheduling and managing High-Performance Computing (HPC) applications in science cloud","volume":"34","author":"Somasundaram","year":"2014","journal-title":"Future Gener. Comput. Syst."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1016\/j.procs.2015.09.220","article-title":"Developing and Operating Time Critical Applications in Clouds: The State of the Art and the SWITCH Approach","volume":"68","author":"Zhao","year":"2015","journal-title":"Procedia Comput. Sci."},{"key":"ref_7","unstructured":"Mann, Z.\u00c1., and Stolz, V. (2018). Developing, Provisioning and Controlling Time Critical Applications in Cloud. 
Advances in Service-Oriented and Cloud Computing, Springer International Publishing."},{"key":"ref_8","unstructured":"Guo, C., Lu, G., Wang, H.J., Kong, C., Sun, P., Wu, W., and Zhang, Y. (December, January 30). SecondNet: A data center network virtualization architecture with bandwidth guarantees. Proceedings of the 6th International Conference, Philadelphia, PA, USA."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"242","DOI":"10.1145\/2043164.2018465","article-title":"Towards predictable datacenter networks","volume":"41","author":"Ballani","year":"2011","journal-title":"Acm Sigcomm Comput. Commun. Rev."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"450","DOI":"10.1109\/TPDS.2017.2754366","article-title":"Towards Bandwidth Guarantee for Virtual Clusters under Demand Uncertainty in Multi-tenant Clouds","volume":"29","author":"Yu","year":"2018","journal-title":"IEEE Trans. Parallel Distrib. Syst."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Biran, O., Corradi, A., Fanelli, M., Foschini, L., Nus, A., Raz, D., and Silvera, E. (2012, January 13\u201316). A Stable Network-Aware VM Placement for Cloud Systems. Proceedings of the IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing, Ottawa, ON, Canada.","DOI":"10.1109\/CCGrid.2012.119"},{"key":"ref_12","first-page":"1","article-title":"Congestion-Aware Embedding of Heterogeneous Bandwidth Virtual Data Centers With Hose Model Abstraction","volume":"PP","author":"Yan","year":"2017","journal-title":"IEEE\/ACM Trans. Netw."},{"key":"ref_13","unstructured":"Li, X., Wu, J., Tang, S., and Lu, S. (May, January 27). Let\u2019s stay together: Towards traffic aware virtual machine placement in data centers. 
Proceedings of the IEEE INFOCOM, Toronto, ON, Canada."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"349","DOI":"10.1109\/TNSM.2015.2443838","article-title":"Time-Aware VMFlow Placement, Routing, and Migration for Power Efficiency in Data Centers","volume":"12","author":"Dalvandi","year":"2015","journal-title":"IEEE Trans. Netw. Serv. Manag."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Lifka, D.A. (1995, January 25). The ANL\/IBM SP Scheduling System. Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing, Santa Barbara, CA, USA.","DOI":"10.1007\/3-540-60153-8_35"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Feitelson, D.G., and Jette, M.A. (1997, January 5). Improved Utilization and Responsiveness with Gang Scheduling. Proceedings of the Job Scheduling Strategies for Parallel Processing, Geneva, Switzerland.","DOI":"10.2172\/490476"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"1874","DOI":"10.1109\/TPDS.2012.262","article-title":"Priority-Based Consolidation of Parallel Workloads in the Cloud","volume":"24","author":"Liu","year":"2013","journal-title":"IEEE Trans. Parallel Distrib. Syst."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"351","DOI":"10.1145\/2534169.2486027","article-title":"ElasticSwitch: Practical work-conserving bandwidth guarantees for cloud computing","volume":"43","author":"Popa","year":"2013","journal-title":"Comput. Commun. Rev."},{"key":"ref_19","unstructured":"Rodrigues, H., Santos, J.R., Turner, Y., Soares, P., and Guedes, D. (2011, January 14). Gatekeeper: Supporting bandwidth guarantees for multi-tenant datacenter networks. Proceedings of the Conference on I\/o Virtualization, Berkeley, CA, USA."},{"key":"ref_20","unstructured":"Shieh, A., and Saha, B. (April, January 30). Sharing the data center network. 
Proceedings of the Usenix Conference on Networked Systems Design and Implementation, Boston, MA, USA."},{"key":"ref_21","first-page":"23","article-title":"Towards performance-centric fairness in datacenter networks","volume":"10","author":"Chen","year":"2013","journal-title":"Int. J. Hybrid Intell. Syst."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"2248","DOI":"10.1109\/TPDS.2015.2489646","article-title":"DCloud: Deadline-Aware Resource Allocation for Cloud Computing Jobs","volume":"27","author":"Li","year":"2016","journal-title":"IEEE Trans. Parallel Distrib. Syst."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Yu, R., Xue, G., Zhang, X., and Li, D. (2017, January 1\u20134). Survivable and bandwidth-guaranteed embedding of virtual clusters in cloud data centers. Proceedings of the INFOCOM 2017\u2014IEEE Conference on Computer Communications, Atlanta, GA, USA.","DOI":"10.1109\/INFOCOM.2017.8056945"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Feitelson, D.G., Rudolph, L., and Schwiegelshohn, U. (2004, January 13). Parallel Job Scheduling\u2014A Status Report. Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing, New York, NY, USA.","DOI":"10.1007\/10968987"},{"key":"ref_25","unstructured":"Schwiegelshohn, U., and Yahyapour, R. (1998, January 25\u201327). Analysis of First-Come-First-Serve Parallel Job Scheduling. Proceedings of the Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, San Francisco, CA, USA."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1016\/j.jpdc.2016.06.013","article-title":"Fattened backfilling: An improved strategy for job scheduling in parallel systems","volume":"97","year":"2016","journal-title":"J. Parallel Distrib. 
Comput."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"529","DOI":"10.1109\/71.932708","article-title":"Utilization, predictability, workloads, and user runtime estimates in scheduling the IBM SP2 with backfilling","volume":"12","author":"Feitelson","year":"2001","journal-title":"Parallel Distrib. Syst. IEEE Trans."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1145\/1402946.1402967","article-title":"A scalable, commodity data center network architecture","volume":"38","author":"Loukissas","year":"2008","journal-title":"Acm Sigcomm Comput. Commun. Rev."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"679","DOI":"10.1109\/TNET.2002.803918","article-title":"Resource management with hoses: Point-to-cloud services for virtual private networks","volume":"10","author":"Duffield","year":"2002","journal-title":"IEEE\/ACM Trans. Netw."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"473","DOI":"10.1137\/0402042","article-title":"Complexity of Scheduling Parallel Task Systems","volume":"2","author":"Du","year":"1989","journal-title":"Siam J. Discret. Math."},{"key":"ref_31","unstructured":"Kenyon, C. (1996, January 28\u201330). Best-fit bin-packing with random order. Proceedings of the Acm-Siam Symposium on Discrete Algorithms, Atlanta, GA, USA."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Feitelson, D.G., and Nitzberg, B. (1970). Job characteristics of a production parallel scientific workload on the NASA Ames iPSC\/860. Lect. Notes Comput. Sci., 337\u2013360.","DOI":"10.1007\/3-540-60153-8_38"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Ye, K., Jiang, X., Ma, R., and Yan, F. (2012, January 20\u201323). VC-Migration: Live Migration of Virtual Clusters in the Cloud. 
Proceedings of the ACM\/IEEE International Conference on Grid Computing, Beijing, China.","DOI":"10.1109\/Grid.2012.27"}],"container-title":["Symmetry"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-8994\/10\/5\/134\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T15:02:09Z","timestamp":1760194929000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-8994\/10\/5\/134"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,4,25]]},"references-count":33,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2018,5]]}},"alternative-id":["sym10050134"],"URL":"https:\/\/doi.org\/10.3390\/sym10050134","relation":{},"ISSN":["2073-8994"],"issn-type":[{"type":"electronic","value":"2073-8994"}],"subject":[],"published":{"date-parts":[[2018,4,25]]}}}