{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T17:41:59Z","timestamp":1777657319981,"version":"3.51.4"},"reference-count":45,"publisher":"MDPI AG","issue":"12","license":[{"start":{"date-parts":[[2022,12,3]],"date-time":"2022-12-03T00:00:00Z","timestamp":1670025600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Natural Science Foundation of China","award":["61902112"],"award-info":[{"award-number":["61902112"]}]},{"name":"National Natural Science Foundation of China","award":["23A520036"],"award-info":[{"award-number":["23A520036"]}]},{"name":"National Natural Science Foundation of China","award":["GCIS202115"],"award-info":[{"award-number":["GCIS202115"]}]},{"name":"Henan University","award":["61902112"],"award-info":[{"award-number":["61902112"]}]},{"name":"Henan University","award":["23A520036"],"award-info":[{"award-number":["23A520036"]}]},{"name":"Henan University","award":["GCIS202115"],"award-info":[{"award-number":["GCIS202115"]}]},{"name":"Guangxi Key Laboratory of Cryptography and Information Security","award":["61902112"],"award-info":[{"award-number":["61902112"]}]},{"name":"Guangxi Key Laboratory of Cryptography and Information Security","award":["23A520036"],"award-info":[{"award-number":["23A520036"]}]},{"name":"Guangxi Key Laboratory of Cryptography and Information Security","award":["GCIS202115"],"award-info":[{"award-number":["GCIS202115"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>Accurate workload prediction plays a key role in intelligent scheduling decisions on cloud platforms. There are massive amounts of short-workload sequences in the cloud platform, and the small amount of data and the presence of outliers make accurate workload sequence prediction a challenge. For the above issues, this paper proposes an ensemble learning method based on sample weight transfer and long short-term memory (LSTM), termed as Tr-Predictor. Specifically, a selection method of similar sequences combining time warp edit distance (TWED) and transfer entropy (TE) is proposed to select a source domain dataset with higher similarity for the target workload sequence. Then, we upgrade the basic learner of the ensemble model two-stage TrAdaBoost.R2 to LSTM in the deep model and enhance the ability of the ensemble model to extract sequence features. To optimize the weight adjustment strategy, we adopt a two-stage weight adjustment strategy and select the best weight for the learner according to the sample error and model error. Finally, the above process determines the parameters of the target model and uses the target model to predict the short-task sequences. In the experimental validation, we arbitrarily select nine sets of short-workload data from the Google dataset and three sets of short-workload data from the Alibaba cluster to verify the prediction effectiveness of the proposed algorithm. The experimental results show that compared with the commonly used cloud workload prediction methods Tr-Predictor has higher prediction accuracy on the small-sample workload. The prediction indicators of the ablation experiments show the performance gain of each part in the proposed method.<\/jats:p>","DOI":"10.3390\/e24121770","type":"journal-article","created":{"date-parts":[[2022,12,5]],"date-time":"2022-12-05T02:56:26Z","timestamp":1670208986000},"page":"1770","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":10,"title":["Tr-Predictior: An Ensemble Transfer Learning Model for Small-Sample Cloud Workload Prediction"],"prefix":"10.3390","volume":"24","author":[{"given":"Chunhong","family":"Liu","sequence":"first","affiliation":[{"name":"College of Computer and Information Engineering, Henan Normal University, Xinxiang 453007, China"},{"name":"Engineering Lab of Intelligence Business, Xinxiang 453007, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jie","family":"Jiao","sequence":"additional","affiliation":[{"name":"College of Computer and Information Engineering, Henan Normal University, Xinxiang 453007, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Weili","family":"Li","sequence":"additional","affiliation":[{"name":"College of Computer and Information Engineering, Henan Normal University, Xinxiang 453007, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jingxiong","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Computer and Information Engineering, Henan Normal University, Xinxiang 453007, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Junna","family":"Zhang","sequence":"additional","affiliation":[{"name":"College of Computer and Information Engineering, Henan Normal University, Xinxiang 453007, China"},{"name":"Engineering Lab of Intelligence Business, Xinxiang 453007, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2022,12,3]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1016\/j.compeleceng.2018.06.006","article-title":"Recent security challenges in cloud computing","volume":"71","author":"Subramanian","year":"2018","journal-title":"Comput. Electr. Eng."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"430","DOI":"10.1109\/TSC.2017.2711009","article-title":"Elasticity in cloud computing: State of the art and research challenges","volume":"11","author":"AlDhuraibi","year":"2017","journal-title":"IEEE Trans. Serv. Comput."},{"key":"ref_3","unstructured":"Zeng, F.S. (2017, January 25). Large-scale cluster management at Google with Borg. Proceedings of the Tenth European Conference on Computer Systems, Singapore."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"248","DOI":"10.1016\/j.neucom.2020.08.076","article-title":"A proactive autoscaling and energy-efficient VM allocation framework using online multi-resource neural network for cloud data center","volume":"426","author":"Saxena","year":"2021","journal-title":"Neurocomputing"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Roy, N., Dubey, A., and Gokhale, A. (2011, January 4\u20139). Efficient autoscaling in the cloud using predictive models for workload forecasting. Proceedings of the 2011 IEEE 4th International Conference on Cloud Computing, Washington, DC, USA.","DOI":"10.1109\/CLOUD.2011.42"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1016\/j.neucom.2020.11.011","article-title":"Integrated deep learning method for workload and resource prediction in cloud systems","volume":"424","author":"Bi","year":"2021","journal-title":"Neurocomputing"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1825","DOI":"10.1109\/TPDS.2021.3128037","article-title":"Adaptive resource efficient microservice deployment in cloud-edge continuum","volume":"33","author":"Fu","year":"2021","journal-title":"IEEE Trans. Parallel Distrib. Syst."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Reiss, C., Tumanov, A., Ganger, G., Katz, R., and Kozuch, M.A. (2013, January 14). Heterogeneity and dynamicity of clouds at scale: Google trace analysis. Proceedings of the Third ACM Symposium on Cloud Computing, San Jose, CA, USA.","DOI":"10.1145\/2391229.2391236"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"64","DOI":"10.1016\/j.neucom.2016.04.021","article-title":"Neural networks for pattern-based short-term load forecasting: A comparative study","volume":"205","author":"Dudek","year":"2016","journal-title":"Neurocomputing"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Gao, J., Wang, H., and Shen, H. (2020, January 3\u20136). Machine learning based workload prediction in cloud computing. Proceedings of the 2020 29th international conference on computer communications and networks (ICCCN), Honolulu, HI, USA.","DOI":"10.1109\/ICCCN49398.2020.9209730"},{"key":"ref_11","unstructured":"Weng, Q., Xiao, W., Yu, Y., Wang, W., Wang, C., He, J., and Ding, Y. (2022, January 4\u20136). MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters. Proceedings of the 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22), Renton, WA, USA."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Keshari, R., Ghosh, S., Chhabra, S., Vatsa, M., and Singh, R. (2020, January 24\u201326). Unravelling small sample size problems in the deep learning world. Proceedings of the 2020 IEEE Sixth International Conference on Multimedia Big Data (BigMM), New Delhi, India.","DOI":"10.1109\/BigMM50055.2020.00028"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"2399","DOI":"10.1007\/s10586-019-03010-3","article-title":"A survey and classification of the workload forecasting methods in cloud computing","volume":"23","author":"Masdari","year":"2020","journal-title":"Clust. Comput."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Daraghmeh, M., Agarwal, A., Manzano, R., and Zaman, M. (2021, January 14\u201323). Time Series Forecasting using Facebook Prophet for Cloud Resource Management. Proceedings of the 2021 IEEE International Conference on Communications Workshops, Xiamen, China.","DOI":"10.1109\/ICCWorkshops50388.2021.9473607"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"105340","DOI":"10.1016\/j.dib.2020.105340","article-title":"Application of the ARIMA model on the COVID-2019 epidemic dataset","volume":"29","author":"Benvenuto","year":"2020","journal-title":"Data Brief"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Yu, Y., Jindal, V., Bastani, F., Li, F., and Yen, I. (2018, January 23\u201327). Improving the Smartness of Cloud Management via Machine Learning Based Workload Prediction. Proceedings of the 2018 IEEE 42nd Annual Computer Software and Applications Conference (COMPSAC), Tokyo, Japan.","DOI":"10.1109\/COMPSAC.2018.10200"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1016\/j.jnca.2016.12.017","article-title":"An adaptive prediction approach based on workload pattern discrimination in the cloud","volume":"80","author":"Liu","year":"2017","journal-title":"J. Netw. Comput. Appl."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"10043","DOI":"10.1007\/s00521-021-05770-9","article-title":"An efficient forecasting approach for resource utilization in cloud data center using CNN-LSTM model","volume":"33","author":"Ouhame","year":"2021","journal-title":"Neural Comput. Appl."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Mathelin, A., Richard, G., Deheeger, F., Mougeot, M., and Vayatis, N. (2021, January 1\u20133). Adversarial weighting for domain adaptation in regression. Proceedings of the 2021 IEEE 33rd International Conference on Tools with Artificial Intelligence (ICTAI), Beijing, China.","DOI":"10.1109\/ICTAI52525.2021.00015"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"1371","DOI":"10.1007\/s00202-020-00930-x","article-title":"A hybrid transfer learning model for short-term electric load forecasting","volume":"102","author":"Xu","year":"2020","journal-title":"Electr. Eng."},{"key":"ref_21","first-page":"1","article-title":"Sample Augmentation for Intelligent Milling Tool Wear Condition Monitoring Using Numerical Simulation and Generative Adversarial Network","volume":"70","author":"Zhu","year":"2021","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"110622","DOI":"10.1016\/j.measurement.2021.110622","article-title":"A new tool wear condition monitoring method based on deep learning under small samples","volume":"189","author":"Zhou","year":"2022","journal-title":"Measurement"},{"key":"ref_23","first-page":"5","article-title":"Wasserstein GAN-Based Small-Sample Augmentation for New-Generation Artificial Intelligence: A Case Study of Cancer-Staging Data in Biology","volume":"1","author":"Liu","year":"2019","journal-title":"Engineering"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"119322","DOI":"10.1016\/j.energy.2020.119322","article-title":"A general transfer learning-based framework for thermal load prediction in regional energy system","volume":"217","author":"Lu","year":"2021","journal-title":"Energy"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"3446","DOI":"10.1016\/j.egyr.2020.12.010","article-title":"An electricity load forecasting model for Integrated Energy System based on BiGAN and transfer learning","volume":"6","author":"Zhou","year":"2020","journal-title":"Energy Rep."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"305","DOI":"10.1016\/j.knosys.2014.04.035","article-title":"An empirical evaluation of similarity measures for time series classification","volume":"67","author":"Serra","year":"2014","journal-title":"Knowl. Based Syst."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"306","DOI":"10.1109\/TPAMI.2008.76","article-title":"Time warp edit distance with stiffness adjustment for time series matching","volume":"31","author":"Marteau","year":"2008","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_28","first-page":"1","article-title":"Efficient and Privacy-Preserving Similarity Range Query over Encrypted Time Series Data","volume":"99","author":"Zheng","year":"2021","journal-title":"IEEE Trans. Dependable Secur. Comput."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Sun, S., Wei, Y., and Wang, S. (2018, January 11\u201313). AdaBoost-LSTM ensemble learning for financial time series forecasting. Proceedings of the International Conference on Computational Science, Wuxi, China.","DOI":"10.1007\/978-3-319-93713-7_55"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"1235","DOI":"10.1162\/neco_a_01199","article-title":"A review of recurrent neural networks: LSTM cells and network architectures","volume":"31","author":"Yu","year":"2019","journal-title":"Neural Comput."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"105551","DOI":"10.1016\/j.aap.2020.105551","article-title":"Improving the transferability of the crash prediction model using the TrAdaBoost. R2 algorithm","volume":"141","author":"Tang","year":"2020","journal-title":"Accid. Anal. Prev."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Bossomaier, T., Barnett, L., Harr\u00e9, M., and Lizier, J. (2016). An Introduction to Transfer Entropy, Springer.","DOI":"10.1007\/978-3-319-43222-9"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"408","DOI":"10.1016\/j.neucom.2014.02.016","article-title":"Localization and regularization of normalized transfer entropy","volume":"139","author":"Choi","year":"2014","journal-title":"Neurocomputing"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"772","DOI":"10.1016\/j.physa.2017.04.089","article-title":"Comparison of transfer entropy methods for financial time series","volume":"482","author":"He","year":"2017","journal-title":"Phys. Stat. Mech. Appl."},{"key":"ref_35","unstructured":"Pardoe, D., and Stone, P. (2010, January 21\u201324). Boosting for regression transfer. Proceedings of the 27th International Conference on International Conference on Machine Learning (ICML\u201910), Madison, WI, USA."},{"key":"ref_36","unstructured":"(2022, November 06). Google\/Cluster-Data. Available online: https:\/\/github.com\/google\/cluster-data."},{"key":"ref_37","unstructured":"Reiss, C., Wilkes, J., and Hellerstein, J.L. (2022, November 02). Google Cluster-Usage Traces: Format+ Schema, Google Inc. Available online: https:\/\/xueshu.studiodahu.com\/citations?view_op=view_citation&hl=zh-CN&user=tfKeplgAAAAJ&citation_for_view=tfKeplgAAAAJ:YsMSGLbcyi4C."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Guo, J., Chang, Z., Wang, S., Ding, H., Feng, Y., Mao, L., and Bao, Y. (2019, January 24\u201325). Who limits the resource efficiency of my datacenter: An analysis of alibaba datacenter traces. Proceedings of the 2019 IEEE\/ACM 27th International Symposium on Quality of Service (IWQoS), Phoenix, AZ, USA.","DOI":"10.1145\/3326285.3329074"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"449","DOI":"10.1109\/TCC.2014.2350475","article-title":"Workload prediction using ARIMA model and its impact on cloud applications\u2019 QoS","volume":"3","author":"Calheiros","year":"2014","journal-title":"IEEE Trans. Cloud Comput."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Shi, Q., Yin, J., Cai, J., Cichocki, A., Yokota, T., Chen, L., Yuan, M., and Zeng, J. (2020, January 7\u201312). Block Hankel tensor ARIMA for multiple short time series forecasting. Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA.","DOI":"10.1609\/aaai.v34i04.6032"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"107227","DOI":"10.1016\/j.cie.2021.107227","article-title":"A manufacturing quality prediction model based on AdaBoost-LSTM with rough knowledge","volume":"155","author":"Bai","year":"2021","journal-title":"Comput. Ind. Eng."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"669","DOI":"10.1016\/j.ijforecast.2015.12.003","article-title":"A new metric of absolute percentage error for intermittent demand forecasts","volume":"32","author":"Kim","year":"2021","journal-title":"Int. J. Forecast."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"79","DOI":"10.3354\/cr030079","article-title":"Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance","volume":"30","author":"Willmott","year":"2005","journal-title":"Clim. Res."},{"key":"ref_44","unstructured":"Gareth, J., Witten, D., Trevor, H., and Robert, T. (2014). An Introduction to Statistical Learning: With Applications in R, Springer."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"e623","DOI":"10.7717\/peerj-cs.623","article-title":"The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation","volume":"7","author":"Chicco","year":"2021","journal-title":"Peerj Comput. Sci."}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/24\/12\/1770\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:33:27Z","timestamp":1760146407000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/24\/12\/1770"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,3]]},"references-count":45,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2022,12]]}},"alternative-id":["e24121770"],"URL":"https:\/\/doi.org\/10.3390\/e24121770","relation":{},"ISSN":["1099-4300"],"issn-type":[{"value":"1099-4300","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,12,3]]}}}