{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T04:13:22Z","timestamp":1760242402275,"version":"build-2065373602"},"reference-count":51,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2017,8,4]],"date-time":"2017-08-04T00:00:00Z","timestamp":1501804800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>This paper aims to develop a low-cost, high-performance and high-reliability computing system to process large-scale data using common data mining algorithms in the Internet of Things (IoT) computing environment. Considering the characteristics of IoT data processing, similar to mainstream high performance computing, we use a GPU (Graphics Processing Unit) cluster to achieve better IoT services. Firstly, we present an energy consumption calculation method (ECCM) based on WSNs. Then, using the CUDA (Compute Unified Device Architecture) Programming model, we propose a Two-level Parallel Optimization Model (TLPOM) which exploits reasonable resource planning and common compiler optimization techniques to obtain the best blocks and threads configuration considering the resource constraints of each node. The key to this part is dynamic coupling Thread-Level Parallelism (TLP) and Instruction-Level Parallelism (ILP) to improve the performance of the algorithms without additional energy consumption. Finally, combining the ECCM and the TLPOM, we use the Reliable GPU Cluster Architecture (RGCA) to obtain a high-reliability computing system considering the nodes\u2019 diversity, algorithm characteristics, etc. The results show that the performance of the algorithms significantly increased by 34.1%, 33.96% and 24.07% for Fermi, Kepler and Maxwell on average with TLPOM and the RGCA ensures that our IoT computing system provides low-cost and high-reliability services.<\/jats:p>","DOI":"10.3390\/s17081799","type":"journal-article","created":{"date-parts":[[2017,8,4]],"date-time":"2017-08-04T11:07:08Z","timestamp":1501844828000},"page":"1799","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":15,"title":["RGCA: A Reliable GPU Cluster Architecture for Large-Scale Internet of Things Computing Based on Effective Performance-Energy Optimization"],"prefix":"10.3390","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8684-4332","authenticated-orcid":false,"given":"Yuling","family":"Fang","sequence":"first","affiliation":[{"name":".University of Shanghai for Science and Technology, Shanghai 200093, China"}]},{"given":"Qingkui","family":"Chen","sequence":"additional","affiliation":[{"name":".University of Shanghai for Science and Technology, Shanghai 200093, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0394-4635","authenticated-orcid":false,"given":"Neal N.","family":"Xiong","sequence":"additional","affiliation":[{"name":".University of Shanghai for Science and Technology, Shanghai 200093, China"},{"name":".Department of Mathematics and Computer Science, Northeastern State University, Tahlequah, OK 74464, USA"}]},{"given":"Deyu","family":"Zhao","sequence":"additional","affiliation":[{"name":".University of Shanghai for Science and Technology, Shanghai 200093, China"}]},{"given":"Jingjuan","family":"Wang","sequence":"additional","affiliation":[{"name":".University of Shanghai for Science and Technology, Shanghai 200093, China"}]}],"member":"1968","published-online":{"date-parts":[[2017,8,4]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1007\/s10796-014-9492-7","article-title":"The internet of things: A survey","volume":"17","author":"Li","year":"2015","journal-title":"Inf. Syst. Front."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"2233","DOI":"10.1109\/TII.2014.2300753","article-title":"Internet of Things in Industries: A Survey","volume":"10","author":"Xu","year":"2014","journal-title":"IEEE Trans. Ind. Inform."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1645","DOI":"10.1016\/j.future.2013.01.010","article-title":"Internet of Things (IoT): A vision, architectural elements, and future directions","volume":"29","author":"Gubbi","year":"2013","journal-title":"Future Gener. Comput. Syst."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"4922","DOI":"10.3390\/s130404922","article-title":"Energy-efficient algorithm for broadcasting in ad hoc wireless sensor networks","volume":"13","author":"Xiong","year":"2013","journal-title":"Sensors"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.jnca.2015.02.001","article-title":"Information centric network: Research challenges and opportunities","volume":"52","author":"Vasilakos","year":"2015","journal-title":"J. Netw. Comput. Appl."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1596","DOI":"10.1109\/TII.2014.2302638","article-title":"An Integrated System for Regional Environmental Monitoring and Management Based on Internet of Things","volume":"10","author":"Fang","year":"2014","journal-title":"IEEE Trans. Ind. Inform."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1109\/JIOT.2014.2306328","article-title":"Internet of Things for Smart Cities","volume":"1","author":"Zanella","year":"2014","journal-title":"IEEE Int. Things J."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"112","DOI":"10.1109\/JIOT.2013.2296516","article-title":"An Information Framework for Creating a Smart City Through Internet of Things","volume":"1","author":"Jin","year":"2014","journal-title":"IEEE Int. Things J."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1002\/ett.2704","article-title":"Sensing as a service model for smart cities supported by Internet of Things","volume":"25","author":"Perera","year":"2014","journal-title":"Trans. Emerg. Telecommun. Technol."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"24818","DOI":"10.3390\/s151024818","article-title":"A Survey on Energy Conserving Mechanisms for the Internet of Things: Wireless Networking Aspects","volume":"15","author":"Abbas","year":"2015","journal-title":"Sensors"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"28603","DOI":"10.3390\/s151128603","article-title":"A Novel Scheme for an Energy Efficient Internet of Things Based on Wireless Sensor Networks","volume":"15","author":"Rani","year":"2015","journal-title":"Sensors"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Hussain, S., Matin, A.W., and Islam, O. (2007, January 2\u20134). Genetic algorithm for energy efficient clusters in wireless sensor networks. Proceedings of the Fourth International Conference on Information Technology (ITNG\u201907), Las Vegas, NV, USA.","DOI":"10.1109\/ITNG.2007.97"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Bruce, B.R., Petke, J., and Harman, M. (2015, January 11\u201315). Reducing energy consumption using genetic improvement. Proceedings of the 2015 Annual Conference on Genetic and Evolutionary Computation, Madrid, Spain.","DOI":"10.1145\/2739480.2754752"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1007\/s10710-016-9273-9","article-title":"Genetic improvement of GPU software","volume":"18","author":"Langdon","year":"2017","journal-title":"Genet. Program. Evolvable Mach."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Yi, M., Chen, Q.K., and Xiong, N.N. (2016). An Effective Massive Sensor Network Data Access Scheme Based on Topology Control for the Internet of Things. Sensors, 16.","DOI":"10.3390\/s16111846"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Prinsloo, J., and Malekian, R. (2016). Accurate vehicle location system using RFID, an Internet of Things approach. Sensors, 16.","DOI":"10.3390\/s16060825"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1109\/SURV.2013.103013.00206","article-title":"Data Mining for Internet of Things: A Survey","volume":"16","author":"Tsai","year":"2014","journal-title":"IEEE Commun. Surv. Tutor."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Chen, F., Deng, P., Wan, J.F., Zhang, D.Q., Vasilakos, A.V., and Rong, X.H. (2015). Data Mining for the Internet of Things: Literature Review and Challenges. Int. J. Distrib. Sens. Netw., 11.","DOI":"10.1155\/2015\/431047"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1109\/TKDE.2013.109","article-title":"Data Mining with Big Data","volume":"26","author":"Wu","year":"2014","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1109\/MIC.2012.50","article-title":"From Databases to Big Data","volume":"16","author":"Madden","year":"2012","journal-title":"IEEE Internet Comput."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"414","DOI":"10.1109\/SURV.2013.042313.00197","article-title":"Context Aware Computing for The Internet of Things: A Survey","volume":"16","author":"Perera","year":"2014","journal-title":"IEEE Commun. Surv. Tutor."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"717","DOI":"10.1126\/science.1197448","article-title":"More Is Less: Signal Processing and the Data Deluge","volume":"331","author":"Baraniuk","year":"2011","journal-title":"Science"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1003","DOI":"10.1109\/TKDE.2002.1033770","article-title":"CLARANS: A method for clustering objects for spatial data mining","volume":"14","author":"Ng","year":"2002","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Zhang, T., Ramakrishnan, R., and Livny, M. (1996, January 4\u20136). BIRCH: An efficient data clustering method for very large databases. Proceedings of the ACM SIGMOD International Conference on Management of Data, Montreal, QC, Canada.","DOI":"10.1145\/233269.233324"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1279","DOI":"10.1109\/TKDE.2004.58","article-title":"Efficient phrase-based document indexing for web document clustering","volume":"16","author":"Hammouda","year":"2004","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Shen, B., Liu, Y., and Wang, X. (2010, January 9\u201311). Research on data mining models for the Internet of Things. Proceedings of the International Conference on Image Analysis and Signal Processing (IASP), Hangzhou, China.","DOI":"10.1109\/IASP.2010.5476146"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"651","DOI":"10.1016\/j.patrec.2009.09.011","article-title":"Data clustering: 50 years beyond k-means","volume":"31","author":"Jain","year":"2010","journal-title":"Pattern Recognit. Lett."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s10115-007-0114-2","article-title":"Top 10 algorithms in data mining","volume":"14","author":"Wu","year":"2008","journal-title":"Knowl. Inf. Syst."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Choubey, P.K., Pateria, S., Saxena, A., Chirayil, S.B.V.P., Jha, K.K., and Basaiah, S. (2015, January 12\u201313). Power Efficient, Bandwidth Optimized and Fault Tolerant Sensor Management for IoT in Smart Home. Proceedings of the IEEE International Advance Computing Conference (IACC), Banglore, India.","DOI":"10.1109\/IADCC.2015.7154732"},{"key":"ref_30","unstructured":"Bouman, C.A., Shapiro, M., Cook, G., Atkins, C.B., and Cheng, H. (1997). Cluster: An Unsupervised Algorithm for Modeling GAUSSIAN Mixtures, Purdue University."},{"key":"ref_31","unstructured":"Xiao, H. (2010). Towards Parallel and Distributed Computing in Large-Scale Data Mining: A Survey, Technical University of Munich. Technical Report."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Bagheri, B., Ahmadi, H., and Labbafi, R. (2010, January 6\u20138). Application of data mining and feature extraction on intelligent fault diagnosis by artificial neural network and k-nearest neighbor. Proceedings of the XIX International Conference on Electrical Machines (ICEM), Rome, Italy.","DOI":"10.1109\/ICELMACH.2010.5607984"},{"key":"ref_33","unstructured":"Nvidia, C. (2012). C Best Practices Guide, NVIDIA."},{"key":"ref_34","unstructured":"(2012). GTX680: The Fastest, Most Efficient GPU Ever Built, Whitepaper."},{"key":"ref_35","unstructured":"Cook, S. (2012). CUDA Programming: A Developer's Guide to Parallel Computing with GPUs, Morgan Kaufmann Publishers Inc."},{"key":"ref_36","unstructured":"Kay\u0131ran, O., Jog, A., Kandemir, M.T., and Das, C.R. (2013, January 7\u201311). Neither more nor less: Optimizing thread-level parallelism for GPGPUs. Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques, Edinburgh, UK."},{"key":"ref_37","unstructured":"(2017, April 22). CPU Power Dissipation. Available online: https:\/\/en.wikipedia.org\/wiki\/CPU_power_dissipation."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"92","DOI":"10.1016\/j.ins.2015.03.027","article-title":"Re-Stream: Real-time and energy-efficient resource scheduling in big data stream computing environments","volume":"319","author":"Sun","year":"2015","journal-title":"Inf. Sci."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"755","DOI":"10.1016\/j.future.2011.04.017","article-title":"Energy-aware resource allocation heuristics for efficient management of data centers for Cloud computing","volume":"28","author":"Beloglazov","year":"2012","journal-title":"Future Gener. Comput. Syst. Int. J. Grid Comput. Esci."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1109\/MCAS.2009.935693","article-title":"Enabling Power-Efficient DVFS Operations on Silicon","volume":"10","author":"Ma","year":"2010","journal-title":"IEEE Circuits Syst. Mag."},{"key":"ref_41","unstructured":"Chen, Q.K., L, C.F., Cao, Q.C., and Yi, M. (2017). The Power Consumption Monitoring System of GPU Cluster. (Patent CN205983447U)."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Rogers, T.G., Johnson, D.R., O\u2019Connor, M., and Keckler, S.W. (2015, January 13\u201317). A variable warp size architecture. Proceedings of the 42nd Annual International Symposium on Computer Architecture, Portland, OR, USA.","DOI":"10.1145\/2749469.2750410"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Ausavarungnirun, R., Ghose, S., Kayiran, O., Loh, G.H., Das, C.R., Kandemir, M.T., and Mutlu, O. (2015, January 18\u201321). Exploiting Inter-Warp Heterogeneity to Improve GPGPU Performance. Proceedings of the International Conference on Parallel Architecture and Compilation (PACT), San Francisco, CA, USA.","DOI":"10.1109\/PACT.2015.38"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Yoon, M.K., Kim, K., Lee, S., Ro, W.W., and Annavaram, M. (2016, January 18\u201322). Virtual Thread: Maximizing thread-level parallelism beyond GPU scheduling limit. Proceedings of the ACM\/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA), Seoul, Korea.","DOI":"10.1109\/ISCA.2016.59"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Xiang, P., Yang, Y., Mantor, M., Rubin, N., and Zhou, H. (2012, January 19\u201323). Many-thread aware instruction-level parallelism: Architecting shader cores for GPU computing. Proceedings of the 21st International Conference on Parallel Architectures and Compilation Techniques, Minneapolis, MN, USA.","DOI":"10.1145\/2370816.2370890"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.ins.2015.02.016","article-title":"Large-scale fingerprint identification on GPU","volume":"306","author":"Cappelli","year":"2015","journal-title":"Inf. Sci."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Kim, K., Lee, S., Yoon, M.K., Koo, G., Ro, W.W., and Annavaram, M. (2016, January 12\u201316). Warped-Preexecution: A GPU Pre-execution Approach for Improving Latency Hiding. Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA), Barcelona, Spain.","DOI":"10.1109\/HPCA.2016.7446062"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Wong, H., Papadopoulou, M.M., Sadooghi-Alvandi, M., and Moshovos, A. (2010, January 28\u201330). Demystifying GPU Microarchitecture through Microbenchmarking. Proceedings of the 2010 IEEE International Symposium on Performance Analysis of Systems & Software (ISPASS), White Plains, NY, USA.","DOI":"10.1109\/ISPASS.2010.5452013"},{"key":"ref_49","unstructured":"Volkov, V. (2010, January 20\u201323). Better performance at lower occupancy. Proceedings of the GPU Technology Conference (GTC), San Jose, CA, USA."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Che, S., Boyer, M., Meng, J., Tarjan, D., Sheaffer, J.W., Lee, S.-H., and Skadron, K. (2009, January 4\u20136). Rodinia: A benchmark suite for heterogeneous computing. Proceedings of the IEEE International Symposium on Workload Characterization, 2009 (IISWC 2009), Austin, TX, USA.","DOI":"10.1109\/IISWC.2009.5306797"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Danalis, A., Marin, G., McCurdy, C., Meredith, J.S., Roth, P.C., Spafford, K., Tipparaju, V., and Vetter, J.S. (2010, January 14). The scalable heterogeneous computing (SHOC) benchmark suite. Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units, Pittsburgh, PA, USA.","DOI":"10.1145\/1735688.1735702"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/17\/8\/1799\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T18:41:27Z","timestamp":1760208087000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/17\/8\/1799"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,8,4]]},"references-count":51,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2017,8]]}},"alternative-id":["s17081799"],"URL":"https:\/\/doi.org\/10.3390\/s17081799","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2017,8,4]]}}}