{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,29]],"date-time":"2025-11-29T08:01:20Z","timestamp":1764403280355,"version":"3.37.3"},"reference-count":13,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,9,6]],"date-time":"2023-09-06T00:00:00Z","timestamp":1693958400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,9,6]],"date-time":"2023-09-06T00:00:00Z","timestamp":1693958400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100018563","name":"University of Macedonia","doi-asserted-by":"crossref","id":[{"id":"10.13039\/100018563","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Comput Manag Sci"],"published-print":{"date-parts":[[2023,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In the big data era which we have entered, the development of smart scheduler has become a necessity. A Distributed Stream Processing System (DSPS) has the role of assigning processing tasks to the available resources (dynamically or not) and route streaming data between them. Smart and efficient task scheduling can reduce latencies and eliminate network congestions. The most commonly used scheduler is the default Storm scheduler, which has proven to have certain disadvantages, like the inability to handle system changes in a dynamic environment. In such cases, rescheduling is necessary. This paper is an extension of a previous work on dynamic task scheduling. In such a scenario, some type of rescheduling is necessary to have the system working in the most efficient way. In this paper, we extend our previous works Souravlas and Anastasiadou (Appl Sci 10(14):4796, 2020); Souravlas et\u00a0al. (Appl Sci 11(1):61, 2021) and present a mathematical model that offers better balance and produces fewer communication steps. The scheduler is based on the idea of generating larger sets of communication steps among the system nodes, which we call superclasses. Our experiments have shown that this scheme achieves better balancing and reduces the overall latency.<\/jats:p>","DOI":"10.1007\/s10287-023-00474-y","type":"journal-article","created":{"date-parts":[[2023,9,6]],"date-time":"2023-09-06T06:01:32Z","timestamp":1693980092000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Mathematical modeling for further improving task scheduling on Big Data systems"],"prefix":"10.1007","volume":"20","author":[{"given":"Stavros","family":"Souravlas","sequence":"first","affiliation":[]},{"given":"Sofia","family":"Anastasiadou","sequence":"additional","affiliation":[]},{"given":"Angelo","family":"Sifaleras","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,9,6]]},"reference":[{"key":"474_CR1","doi-asserted-by":"publisher","first-page":"9609","DOI":"10.1007\/s11227-020-03223-z","volume":"20","author":"A Al-Sinayyid","year":"2020","unstructured":"Al-Sinayyid A, Zhu M (2020) Job scheduler for streaming applications in heterogeneous distributed processing systems. J Supercomput 20:9609\u20139628","journal-title":"J Supercomput"},{"key":"474_CR2","doi-asserted-by":"crossref","unstructured":"Aniello L, Baldoni R, Querzoni L (2013) Adaptive online scheduling in Storm. In: Proceedings of the 7th ACM International Conference on Distributed Event-based Systems (DEBS \u201913), pp. 207\u2013218","DOI":"10.1145\/2488222.2488267"},{"key":"474_CR3","doi-asserted-by":"crossref","unstructured":"Cardellini V, Grassi V, Presti L, Nardelli M (2016) Optimal operator placement for distributed stream processing applications. In: Proceedings of the 10th ACM International Conference on Distributed and Event-based Systems. DEBS \u201916, pp. 69\u201380","DOI":"10.1145\/2933267.2933312"},{"issue":"12","key":"474_CR4","doi-asserted-by":"publisher","first-page":"1825","DOI":"10.14778\/3137765.3137786","volume":"10","author":"A Floratou","year":"2017","unstructured":"Floratou A, Agrawal A, Graham B, Rao S, Ramasamy K (2017) Dhalion: self-regulating stream processing in Heron. Proc VLDB Endow 10(12):1825\u20131836","journal-title":"Proc VLDB Endow"},{"key":"474_CR5","doi-asserted-by":"crossref","unstructured":"Meng-Meng C, Chuang Z, Zhao L, Ke-Fu X (2014) A task scheduling approach for real-time stream processing. In: Proceedings of the International Conference on Cloud Computing and Big Data, pp. 160\u2013167","DOI":"10.1109\/CCBD.2014.22"},{"key":"474_CR6","doi-asserted-by":"crossref","unstructured":"Peng B, Hosseini M, Hong Z, Farivar R, Campbell R (2015) R-Storm: Resource-aware scheduling in storm. In: Proceedings of the 16th Annual Middleware Conference. Middleware \u201915, pp. 149\u2013161. ACM, Vancouver, BC, Canada","DOI":"10.1145\/2814576.2814808"},{"key":"474_CR7","doi-asserted-by":"publisher","first-page":"98","DOI":"10.1016\/j.jpdc.2018.02.003","volume":"117","author":"A Shukla","year":"2018","unstructured":"Shukla A, Simmhan Y (2018) Model-driven scheduling for distributed stream processing systems. J Parall Distrib Comput 117:98\u2013114","journal-title":"J Parall Distrib Comput"},{"key":"474_CR8","doi-asserted-by":"crossref","unstructured":"Shukla A, Simmhan Y (2018) Toward reliable and rapid elasticity for streaming dataflows on clouds. In: Proceedings of the 38th International Conference on Distributed Computing Systems (ICDCS), pp. 1096\u20131106","DOI":"10.1109\/ICDCS.2018.00109"},{"issue":"14","key":"474_CR9","doi-asserted-by":"publisher","first-page":"4796","DOI":"10.3390\/app10144796","volume":"10","author":"S Souravlas","year":"2020","unstructured":"Souravlas S, Anastasiadou S (2020) Pipelined dynamic scheduling of big data streams. Appl Sci 10(14):4796","journal-title":"Appl Sci"},{"issue":"1","key":"474_CR10","doi-asserted-by":"publisher","first-page":"61","DOI":"10.3390\/app11010061","volume":"11","author":"S Souravlas","year":"2021","unstructured":"Souravlas S, Anastasiadou S, Katsavounis S (2021) More on pipelined dynamic scheduling of big data streams. Appl Sci 11(1):61","journal-title":"Appl Sci"},{"issue":"5","key":"474_CR11","doi-asserted-by":"publisher","first-page":"571","DOI":"10.1080\/17445760.2019.1585848","volume":"35","author":"N Tantalaki","year":"2020","unstructured":"Tantalaki N, Souravlas S, Roumeliotis M (2020) A review on big data real-time stream processing and its scheduling techniques. Int J Parall Emerg Distrib Syst 35(5):571\u2013601","journal-title":"Int J Parall Emerg Distrib Syst"},{"key":"474_CR12","unstructured":"Tom ZJ, Fu JD, Richard TBM, Winslett M, Yin Y, Zhang Z (2015) DRS: Dynamic resource scheduling for real-time analytics over fast streams. In: Proceedings of the 35th IEEE International Conference on Distributed Computing Systems, pp. 411\u2013420"},{"key":"474_CR13","doi-asserted-by":"crossref","unstructured":"Xu J, Chen Z, Tang J, Su S (2014) T-Storm: Traffic-aware online scheduling in Storm. ICDCS \u201914, pp. 535\u2013544","DOI":"10.1109\/ICDCS.2014.61"}],"container-title":["Computational Management Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10287-023-00474-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10287-023-00474-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10287-023-00474-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,10]],"date-time":"2023-11-10T16:12:32Z","timestamp":1699632752000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10287-023-00474-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,6]]},"references-count":13,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2023,12]]}},"alternative-id":["474"],"URL":"https:\/\/doi.org\/10.1007\/s10287-023-00474-y","relation":{},"ISSN":["1619-697X","1619-6988"],"issn-type":[{"type":"print","value":"1619-697X"},{"type":"electronic","value":"1619-6988"}],"subject":[],"published":{"date-parts":[[2023,9,6]]},"assertion":[{"value":"30 June 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 August 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 September 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"40"}}