{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,19]],"date-time":"2025-12-19T15:42:36Z","timestamp":1766158956419,"version":"3.41.0"},"reference-count":37,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2022,8,10]],"date-time":"2022-08-10T00:00:00Z","timestamp":1660089600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Manage. Inf. Syst."],"published-print":{"date-parts":[[2022,12,31]]},"abstract":"<jats:p>\n            With the improvement of global infrastructure,\n            <jats:bold>Cyber-Physical Systems (CPS)<\/jats:bold>\n            have become an important component of Industry 4.0. Both the application as well as the machine work together to improve the task of interdependencies. Machine learning methods in CPS require the monitoring of computational algorithms, including adopting optimizations, fine-tuning cyber systems, improving resource utilization, as well as reducing vulnerability and also computation time. By leveraging the tremendous parallelism provided by\n            <jats:bold>General-Purpose Graphics Processing Units (GPGPU)<\/jats:bold>\n            as well as OpenCL, it is possible to dramatically reduce the execution time of data-parallel programs. However, when running an application with tiny amounts of data on a GPU, GPU resources are wasted because the program may not be able to fully utilize the GPU cores. This is because there is no mechanism for kernels to share a GPU due to the lack of OS support for GPUs. Optimal device selection is required to reduce the high power of the GPU. In this paper, we propose an energy reduction method for heterogeneous clustering. This study focuses on load balancing; resource-aware processor selection based on machine learning is performed using code features. The proposed method identifies energy-efficient kernel candidates (from the employment pool). Then, it selects a pair of kernel candidates from all possibilities that lead to a reduction in both energy consumption as well as execution time. Experimental results show that the proposed kernel approach reduces execution time by 2.23 times compared to a baseline scheduling system. Experiments have also shown that the execution time is 1.2 times faster than state-of-the-art approaches.\n          <\/jats:p>","DOI":"10.1145\/3543859","type":"journal-article","created":{"date-parts":[[2022,6,11]],"date-time":"2022-06-11T22:42:21Z","timestamp":1654987341000},"page":"1-23","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":11,"title":["Heterogeneous Energy-aware Load Balancing for Industry 4.0 and IoT Environments"],"prefix":"10.1145","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3933-4273","authenticated-orcid":false,"given":"Usman","family":"Ahmed","sequence":"first","affiliation":[{"name":"Western Norway University of Applied Sciences, Bergen, Norway"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8768-9709","authenticated-orcid":false,"given":"Jerry Chun-Wei","family":"Lin","sequence":"additional","affiliation":[{"name":"Western Norway University of Applied Sciences, Bergen, Norway"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9851-4103","authenticated-orcid":false,"given":"Gautam","family":"Srivastava","sequence":"additional","affiliation":[{"name":"Brandon University, Brandon, MB, Canada and China Medical University, Taichung City, Taiwan"}]}],"member":"320","published-online":{"date-parts":[[2022,8,10]]},"reference":[{"key":"e_1_3_1_2_2","first-page":"e5606","article-title":"RALB-HC: A resource-aware load balancer for heterogeneous cluster","author":"Ahmed Usman","unstructured":"Usman Ahmed, Muhammad Aleem, Yasir Noman Khalid, Muhammad Arshad Islam, and Muhammad Azhar Iqbal. [n.d.]. RALB-HC: A resource-aware load balancer for heterogeneous cluster. Concurrency and Computation: Practice and Experience ([n. d.]), e5606.","journal-title":"Concurrency and Computation: Practice and Experience"},{"key":"e_1_3_1_3_2","first-page":"1242","volume-title":"The International Workshop on Semantic Evaluation","author":"Ahmed Usman","year":"2019","unstructured":"Usman Ahmed, Humera Liaquat, Luqman Ahmed, and Syed Jawad Hussain. 2019. Suggestion miner at SemEval-2019 task 9: Suggestion detection in online forum using word graph. In The International Workshop on Semantic Evaluation. 1242\u20131246."},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.suscom.2022.100683"},{"key":"e_1_3_1_5_2","first-page":"1","article-title":"A load balance multi-scheduling model for OpenCL kernel tasks in an integrated cluster","author":"Ahmed Usman","year":"2020","unstructured":"Usman Ahmed, Jerry Chun Wei Lin, Gautam Srivastava, and Muhammad Aleem. 2020. A load balance multi-scheduling model for OpenCL kernel tasks in an integrated cluster. Soft Computing (2020), 1\u201314.","journal-title":"Soft Computing"},{"key":"e_1_3_1_6_2","doi-asserted-by":"crossref","unstructured":"Usman Ahmed Jerry Chun-Wei Lin and Gautam Srivastava. 2021. Network-aware SDN load balancer with deep active learning based intrusion detection model. In 2021 International Joint Conference on Neural Networks (IJCNN\u201921) IEEE 1\u20136.","DOI":"10.1109\/IJCNN52387.2021.9534424"},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/CLUSTER.2015.15"},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2016.05.006"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICPPW.2012.14"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.1631"},{"key":"e_1_3_1_11_2","first-page":"82","volume-title":"ACM Symposium on Parallelism in Algorithms and Architectures","author":"Becchi Michela","year":"2010","unstructured":"Michela Becchi, Surendra Byna, Srihari Cadambi, and Srimat Chakradhar. 2010. Data-aware scheduling of legacy kernels on heterogeneous platforms with distributed memory. In ACM Symposium on Parallelism in Algorithms and Architectures. 82\u201391."},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1145\/2400682.2400716"},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/HPCC.2011.20"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11227-013-0870-6"},{"key":"e_1_3_1_15_2","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-9473(01)00065-2"},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.5555\/3378999"},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1145\/2856636.2856639"},{"key":"e_1_3_1_18_2","volume-title":"The Workshop on Applications for Multi-and Many-Core Processors","author":"Gregg Chris","year":"2011","unstructured":"Chris Gregg, Michael Boyer, Kim Hazelwood, and Kevin Skadron. 2011. Dynamic heterogeneous scheduling decisions using historical runtime data. In The Workshop on Applications for Multi-and Many-Core Processors."},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-19861-8_16"},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-43659-3_50"},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.1109\/IBCAST.2019.8667174"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2019.05.015"},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00607-021-00958-2"},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11227-018-2435-1"},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.1145\/3307363.3307377"},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1145\/2464996.2465007"},{"key":"e_1_3_1_27_2","volume-title":"The BSD Conference","volume":"5","author":"Lattner Chris","year":"2008","unstructured":"Chris Lattner. 2008. LLVM and clang: Next generation compiler technology. In The BSD Conference, Vol. 5."},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/PACT.2015.14"},{"key":"e_1_3_1_29_2","first-page":"45","volume-title":"The Annual IEEE\/ACM International Symposium on Microarchitecture","author":"Luk Chi-Keung","year":"2009","unstructured":"Chi-Keung Luk, Sunpyo Hong, and Hyesoon Kim. 2009. Qilin: Exploiting parallelism on heterogeneous multiprocessors with adaptive mapping. In The Annual IEEE\/ACM International Symposium on Microarchitecture. 45\u201355."},{"issue":"1","key":"e_1_3_1_30_2","first-page":"1","article-title":"Novel machine learning for big data analytics in intelligent support information management systems","volume":"13","author":"Lv Zhihan","year":"2021","unstructured":"Zhihan Lv, Ranran Lou, Hailin Feng, Dongliang Chen, and Haibin Lv. 2021. Novel machine learning for big data analytics in intelligent support information management systems. ACM Transactions on Management Information System (TMIS) 13, 1 (2021), 1\u201321.","journal-title":"ACM Transactions on Management Information System (TMIS)"},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-014-1511-6"},{"key":"e_1_3_1_32_2","doi-asserted-by":"crossref","first-page":"42","DOI":"10.1145\/2884045.2884051","volume-title":"Annual Workshop on General Purpose Processing using Graphics Processing Unit","author":"P\u00e9rez Borja","year":"2016","unstructured":"Borja P\u00e9rez, Jos\u00e9 Luis Bosque, and Ram\u00f3n Beivide. 2016. Simplifying programming and load balancing of data parallel applications on heterogeneous systems. In Annual Workshop on General Purpose Processing using Graphics Processing Unit. 42\u201351."},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","DOI":"10.1016\/B978-0-12-824557-6.00015-7"},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.1109\/MCSE.2010.69"},{"key":"e_1_3_1_35_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10723-015-9340-0"},{"key":"e_1_3_1_36_2","volume-title":"The General Purpose GPUs","author":"Wen Yuan","year":"2017","unstructured":"Yuan Wen and Michael F. P. O'Boyle. 2017. Merge or separate?. In The General Purpose GPUs."},{"key":"e_1_3_1_37_2","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1145\/3038228.3038235","volume-title":"The General Purpose GPUs","author":"Wen Yuan","year":"2017","unstructured":"Yuan Wen and Michael F. P. O\u2019Boyle. 2017. Merge or separate?: Multi-job scheduling for OpenCL kernels on CPU\/GPU platforms. In The General Purpose GPUs. 22\u201331."},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/HiPC.2014.7116910"}],"container-title":["ACM Transactions on Management Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3543859","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3543859","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T17:49:40Z","timestamp":1750268980000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3543859"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,10]]},"references-count":37,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2022,12,31]]}},"alternative-id":["10.1145\/3543859"],"URL":"https:\/\/doi.org\/10.1145\/3543859","relation":{},"ISSN":["2158-656X","2158-6578"],"issn-type":[{"type":"print","value":"2158-656X"},{"type":"electronic","value":"2158-6578"}],"subject":[],"published":{"date-parts":[[2022,8,10]]},"assertion":[{"value":"2021-12-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-06-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-08-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}