{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,28]],"date-time":"2026-03-28T20:03:08Z","timestamp":1774728188373,"version":"3.50.1"},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,3,11]],"date-time":"2023-03-11T00:00:00Z","timestamp":1678492800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,3,11]],"date-time":"2023-03-11T00:00:00Z","timestamp":1678492800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"University of Miskolc"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cloud Comp"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Serverless computing is stepping forward to provide a cloud environment that mainly focuses on managing infrastructure, resources and configurations on the behalf of a user. Research in this field can\u2019t rely on commercial providers such as AWS and Azure, as their inflexibility and cost often limits the required levels of reproducibility and scalability. Therefore, simulators have been opted as an alternative solution by the research community. They offer a reduced-cost and easy-setup environment. To get respectable precision, simulators use real traces collected and offered by commercial providers. These traces represent comprehensive information of executed tasks that reflect user behaviour. Due to serverless computing\u2019s recency, typical workload traces employed by IaaS simulators are not well adoptable to the new computing model. <\/jats:p><jats:p>In this paper, we propose an approach for generating realistic serverless traces. We enhance our previous generator approach that was based on the Azure Functions dataset. Our new, genetic algorithm based approach improves the statistical properties of the generated traces. We also enabled arbitrary scaling of the workload, while maintaining real users\u2019 behaviour. These advances further support reproducibility in the serverless research community. We validated the results of our generator approach using the coefficient of determination (<jats:inline-formula><jats:alternatives><jats:tex-math>$$R^2$$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\">\n                  <mml:msup>\n                    <mml:mi>R<\/mml:mi>\n                    <mml:mn>2<\/mml:mn>\n                  <\/mml:msup>\n                <\/mml:math><\/jats:alternatives><\/jats:inline-formula>), which shows that our generated workload closely matches the original dataset\u2019s characteristics in terms of execution time, memory utilisation as well as user participation percentage. To demonstrate the benefits of the reusability of the generated traces, we applied them with a diverse set of simulators and shown that they offer reproducible results independently of the simulator used.<\/jats:p>","DOI":"10.1186\/s13677-023-00397-8","type":"journal-article","created":{"date-parts":[[2023,3,26]],"date-time":"2023-03-26T20:23:47Z","timestamp":1679862227000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Enriching computing simulators by generating realistic serverless traces"],"prefix":"10.1186","volume":"12","author":[{"given":"Dilshad Hassan","family":"Sallo","sequence":"first","affiliation":[]},{"given":"Gabor","family":"Kecskemeti","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,3,11]]},"reference":[{"key":"397_CR1","doi-asserted-by":"crossref","unstructured":"Shafiei H, Khonsari A, Mousavi P (2022) Serverless computing: A survey of opportunities, challenges, and applications.\u00a0ACM Comput Surv 54(11s):1\u201332","DOI":"10.1145\/3510611"},{"key":"397_CR2","unstructured":"Jounaid S (2020) Opendc serverless: Design, implementation and evaluation of a faas platform simulator. PhD thesis, Vrije Universiteit Amsterdam"},{"key":"397_CR3","doi-asserted-by":"crossref","unstructured":"Hasan M, Siddique MA (2021) A research-oriented mathematical model for cloud simulations. In: 2021 Fifth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud)(I-SMAC). IEEE, pp 875\u2013878","DOI":"10.1109\/I-SMAC52330.2021.9640902"},{"key":"397_CR4","doi-asserted-by":"crossref","unstructured":"Sallo DH, Kecskemeti G (2022) Towards generating realistic trace for simulating functions-as-a-service. In: European Conference on Parallel Processing. Springer, pp 428\u2013439","DOI":"10.1007\/978-3-031-06156-1_34"},{"issue":"1","key":"397_CR5","first-page":"23","volume":"41","author":"RN Calheiros","year":"2011","unstructured":"Calheiros RN, Ranjan R, Beloglazov A, De Rose CA, Buyya R (2011) Cloudsim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms. Softw: Pract Experience 41(1):23\u201350","journal-title":"Softw: Pract Experience"},{"key":"397_CR6","doi-asserted-by":"publisher","first-page":"188","DOI":"10.1016\/j.simpat.2015.05.009","volume":"58","author":"G Kecskemeti","year":"2015","unstructured":"Kecskemeti G (2015) Dissect-cf: a simulator to foster energy-aware scheduling in infrastructure clouds. Simul Model Pract Theory 58:188\u2013218","journal-title":"Simul Model Pract Theory"},{"issue":"13\u201315","key":"397_CR7","doi-asserted-by":"publisher","first-page":"1175","DOI":"10.1002\/cpe.710","volume":"14","author":"R Buyya","year":"2002","unstructured":"Buyya R, Murshed M (2002) Gridsim: A toolkit for the modeling and simulation of distributed resource management and scheduling for grid computing. Concurr Comput: Pract Experience 14(13\u201315):1175\u20131220","journal-title":"Concurr Comput: Pract Experience"},{"key":"397_CR8","doi-asserted-by":"crossref","unstructured":"Jeon H, Cho C, Shin S, Yoon S (2019) A cloudsim-extension for simulating distributed functions-as-a-service. In: 2019 20th International Conference on parallel and distributed computing, applications and technologies (PDCAT). IEEE, pp 386\u2013391","DOI":"10.1109\/PDCAT46702.2019.00076"},{"key":"397_CR9","doi-asserted-by":"publisher","first-page":"259","DOI":"10.1016\/j.future.2020.07.017","volume":"114","author":"T Rausch","year":"2021","unstructured":"Rausch T, Rashed A, Dustdar S (2021) Optimized container scheduling for data-intensive serverless edge computing. Futur Gener Comput Syst 114:259\u2013271","journal-title":"Futur Gener Comput Syst"},{"key":"397_CR10","doi-asserted-by":"crossref","unstructured":"Mahmoudi N, Khazaei H (2021) Simfaas: A performance simulator for serverless computing platforms. arXiv preprint arXiv:2102.08904","DOI":"10.5220\/0010376500230033"},{"key":"397_CR11","unstructured":"Shahrad M, Fonseca R, Goiri \u00cd, Chaudhry G, Batum P, Cooke J, Laureano E, Tresness C, Russinovich M, Bianchini R (2020) Serverless in the wild: Characterizing and optimizing the serverless workload at a large cloud provider. In: 2020\u00a0{USENIX} Annual Technical Conference ({USENIX}{ATC} 20).\u00a0\u00a0arXiv preprint arXiv:2003.03423. pp 205\u2013218"},{"issue":"4","key":"397_CR12","first-page":"1","volume":"20","author":"A Tangherloni","year":"2019","unstructured":"Tangherloni A, Spolaor S, Rundo L, Nobile MS, Cazzaniga P, Mauri G, Li\u00f2 P, Merelli I, Besozzi D (2019) Genhap: a novel computational method based on genetic algorithms for haplotype assembly. BMC Bioinformatics 20(4):1\u201314","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"397_CR13","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1007\/s10723-012-9208-5","volume":"10","author":"A N\u00fa\u00f1ez","year":"2012","unstructured":"N\u00fa\u00f1ez A, V\u00e1zquez-Poletti JL, Caminero AC, Casta\u00f1\u00e9 GG, Carretero J, Llorente IM (2012) icancloud: A flexible and scalable cloud infrastructure simulator. J Grid Comput 10(1):185\u2013209","journal-title":"J Grid Comput"},{"key":"397_CR14","doi-asserted-by":"crossref","unstructured":"Chen W, Deelman E (2012) Workflowsim: A toolkit for simulating scientific workflows in distributed environments. In: 2012 IEEE 8th international conference on E-science. IEEE, pp 1\u20138","DOI":"10.1109\/eScience.2012.6404430"},{"issue":"1","key":"397_CR15","doi-asserted-by":"publisher","first-page":"153","DOI":"10.1109\/TASE.2013.2266338","volume":"12","author":"W Tian","year":"2013","unstructured":"Tian W, Zhao Y, Xu M, Zhong Y, Sun X (2013) A toolkit for modeling and simulation of real-time virtual machine allocation in a cloud data center. IEEE Trans Autom Sci Eng 12(1):153\u2013161","journal-title":"IEEE Trans Autom Sci Eng"},{"key":"397_CR16","doi-asserted-by":"crossref","unstructured":"Wickremasinghe B, Calheiros RN, Buyya R (2010) Cloudanalyst: A cloudsim-based visual modeller for analysing cloud computing environments and applications. In: 2010 24th IEEE international conference on advanced information networking and applications. IEEE, pp 446\u2013452","DOI":"10.1109\/AINA.2010.32"},{"issue":"3","key":"397_CR17","doi-asserted-by":"publisher","first-page":"1263","DOI":"10.1007\/s11227-010-0504-1","volume":"62","author":"D Kliazovich","year":"2012","unstructured":"Kliazovich D, Bouvry P, Khan SU (2012) Greencloud: a packet-level simulator of energy-aware cloud computing data centers. J Supercomput 62(3):1263\u20131283","journal-title":"J Supercomput"},{"issue":"5","key":"397_CR18","doi-asserted-by":"publisher","first-page":"2535","DOI":"10.1007\/s11227-018-2636-7","volume":"75","author":"A Siavashi","year":"2019","unstructured":"Siavashi A, Momtazpour M (2019) Gpucloudsim: an extension of cloudsim for modeling and simulation of gpus in cloud data centers. J Supercomput 75(5):2535\u20132561","journal-title":"J Supercomput"},{"key":"397_CR19","doi-asserted-by":"crossref","unstructured":"Casanova H (2001) Simgrid: A toolkit for the simulation of application scheduling. In: Proceedings First IEEE\/ACM International Symposium on Cluster Computing and the Grid. IEEE, pp 430\u2013437","DOI":"10.1109\/CCGRID.2001.923223"},{"key":"397_CR20","doi-asserted-by":"crossref","unstructured":"Meisner D, Wu J, Wenisch TF (2012) Bighouse: A simulation infrastructure for data center systems. In: 2012 IEEE International Symposium on Performance Analysis of Systems & Software. IEEE, pp 35\u201345","DOI":"10.1109\/ISPASS.2012.6189204"},{"key":"397_CR21","doi-asserted-by":"crossref","unstructured":"Son J, Dastjerdi AV, Calheiros RN, Ji X, Yoon Y, Buyya R (2015) Cloudsimsdn: Modeling and simulation of software-defined cloud data centers. In: 2015 15th IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing. IEEE, pp 475\u2013484","DOI":"10.1109\/CCGrid.2015.87"},{"key":"397_CR22","doi-asserted-by":"crossref","unstructured":"Higashino WA, Capretz MA, Bittencourt LF (2015) Cepsim: A simulator for cloud-based complex event processing. In: 2015 IEEE International Congress on Big Data. IEEE, pp 182\u2013190","DOI":"10.1109\/BigDataCongress.2015.34"},{"key":"397_CR23","doi-asserted-by":"crossref","unstructured":"Kohne A, Spohr M, Nagel L, Spinczyk O (2014) Federatedcloudsim: a sla-aware federated cloud simulation framework. In: Proceedings of the 2nd International Workshop on CrossCloud Systems. pp 1\u20135","DOI":"10.1145\/2676662.2676674"},{"issue":"5","key":"397_CR24","first-page":"595","volume":"43","author":"RN Calheiros","year":"2013","unstructured":"Calheiros RN, Netto MA, De Rose CA, Buyya R (2013) Emusim: an integrated emulation and simulation environment for modeling, evaluation, and validation of performance of cloud computing applications. Softw: Pract Experience 43(5):595\u2013612","journal-title":"Softw: Pract Experience"},{"key":"397_CR25","doi-asserted-by":"crossref","unstructured":"Teixeira\u00a0S\u00e1 T, Calheiros RN, Gomes DG (2014) Cloudreports: An extensible simulation tool for energy-aware cloud computing environments. In: cloud computing. Springer, pp 127\u2013142","DOI":"10.1007\/978-3-319-10530-7_6"},{"key":"397_CR26","unstructured":"Keller G, Tighe M, Lutfiyya H, Bauer M (2013) Dcsim: A data centre simulation tool. In: 2013 IFIP\/IEEE International Symposium on Integrated Network Management (IM 2013). IEEE, pp 1090\u20131091"},{"issue":"2","key":"397_CR27","doi-asserted-by":"publisher","first-page":"257","DOI":"10.1007\/s10723-016-9390-y","volume":"15","author":"Z Cai","year":"2017","unstructured":"Cai Z, Li Q, Li X (2017) Elasticsim: A toolkit for simulating workflows with cloud resource runtime auto-scaling and stochastic task execution times. J Grid Comput 15(2):257\u2013272","journal-title":"J Grid Comput"},{"key":"397_CR28","doi-asserted-by":"crossref","unstructured":"Kathiravelu P, Veiga L (2014) Concurrent and distributed cloudsim simulations. In: 2014 IEEE 22nd International Symposium on Modelling, Analysis & Simulation of Computer and Telecommunication Systems. IEEE, pp 490\u2013493","DOI":"10.1109\/MASCOTS.2014.70"}],"container-title":["Journal of Cloud Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13677-023-00397-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13677-023-00397-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13677-023-00397-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,3,26]],"date-time":"2023-03-26T20:24:09Z","timestamp":1679862249000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofcloudcomputing.springeropen.com\/articles\/10.1186\/s13677-023-00397-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3,11]]},"references-count":28,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["397"],"URL":"https:\/\/doi.org\/10.1186\/s13677-023-00397-8","relation":{},"ISSN":["2192-113X"],"issn-type":[{"value":"2192-113X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,3,11]]},"assertion":[{"value":"27 August 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 January 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 March 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"36"}}