{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T04:37:58Z","timestamp":1777696678162,"version":"3.51.4"},"reference-count":47,"publisher":"SAGE Publications","issue":"5","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IDA"],"published-print":{"date-parts":[[2024,9,19]]},"abstract":"<jats:p>Parallel power loads anomalies are processed by a fast-density peak clustering technique that capitalizes on the hybrid strengths of Canopy and K-means algorithms all within Apache Mahout\u2019s distributed machine-learning environment. The study taps into Apache Hadoop\u2019s robust tools for data storage and processing, including HDFS and MapReduce, to effectively manage and analyze big data challenges. The preprocessing phase utilizes Canopy clustering to expedite the initial partitioning of data points, which are subsequently refined by K-means to enhance clustering performance. Experimental results confirm that incorporating the Canopy as an initial step markedly reduces the computational effort to process the vast quantity of parallel power load abnormalities. The Canopy clustering approach, enabled by distributed machine learning through Apache Mahout, is utilized as a preprocessing step within the K-means clustering technique. The hybrid algorithm was implemented to minimise the length of time needed to address the massive scale of the detected parallel power load abnormalities. Data vectors are generated based on the time needed, sequential and parallel candidate feature data are obtained, and the data rate is combined. After classifying the time set using the canopy with the K-means algorithm and the vector representation weighted by factors, the clustering impact is assessed using purity, precision, recall, and F value. The results showed that using canopy as a preprocessing step cut the time it proceeds to deal with the significant number of power load abnormalities found in parallel using a fast density peak dataset and the time it proceeds for the k-means algorithm to run. Additionally, tests demonstrate that combining canopy and the K-means algorithm to analyze data performs consistently and dependably on the Hadoop platform and has a clustering result that offers a scalable and effective solution for power system monitoring.<\/jats:p>","DOI":"10.3233\/ida-230573","type":"journal-article","created":{"date-parts":[[2024,2,2]],"date-time":"2024-02-02T10:45:49Z","timestamp":1706870749000},"page":"1321-1346","source":"Crossref","is-referenced-by-count":3,"title":["Parallel power load abnormalities detection using fast density peak clustering with a hybrid canopy-K-means algorithm"],"prefix":"10.1177","volume":"28","author":[{"given":"Ahmed Hadi Ali","family":"Al-Jumaili","sequence":"first","affiliation":[{"name":"Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia"},{"name":"Computer Centre Department, University of Fallujah, Anbar, Iraq"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ravie Chandren","family":"Muniyandi","sequence":"additional","affiliation":[{"name":"Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mohammad Kamrul","family":"Hasan","sequence":"additional","affiliation":[{"name":"Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mandeep Jit","family":"Singh","sequence":"additional","affiliation":[{"name":"Department of Electrical, Electronic and System Engineering, Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Johnny Koh Siaw","family":"Paw","sequence":"additional","affiliation":[{"name":"Institute of Sustainable Energy, Universiti Tenaga Nasional (The Energy University), Jalan Ikram-Uniten, Kajang, Selangor, Malaysia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Abdulmajeed","family":"Al-Jumaily","sequence":"additional","affiliation":[{"name":"Department of Signal Theory and Communications, Universidad Carlos III de Madrid, Madrid, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","reference":[{"issue":"2","key":"10.3233\/IDA-230573_ref1","doi-asserted-by":"publisher","first-page":"1067","DOI":"10.1007\/s11277-020-07408-w","article-title":"A novel artificial intelligence based timing synchronization scheme for smart grid applications","volume":"114","author":"Hasan","year":"2020","journal-title":"Wirel. Pers. Commun"},{"issue":"21","key":"10.3233\/IDA-230573_ref2","doi-asserted-by":"publisher","first-page":"9820","DOI":"10.3390\/APP11219820","article-title":"A conceptual and systematics for intelligent power management system-based cloud computing: Prospects, and challenges","volume":"11","author":"AL-Jumaili","year":"2021","journal-title":"Appl. Sci"},{"issue":"17","key":"10.3233\/IDA-230573_ref3","doi-asserted-by":"publisher","first-page":"6124","DOI":"10.3390\/en15176124","article-title":"Day-ahead load demand forecasting in urban community cluster microgrids using machine learning methods","volume":"15","author":"Rao","year":"2022","journal-title":"Energies"},{"key":"10.3233\/IDA-230573_ref4","doi-asserted-by":"publisher","first-page":"2206","DOI":"10.1016\/j.egyr.2023.09.029","article-title":"Advancements in intelligent cloud computing for power optimization and battery management in hybrid renewable energy systems: A comprehensive review","volume":"10","author":"AL-Jumaili","year":"2023","journal-title":"Energy Reports"},{"issue":"3","key":"10.3233\/IDA-230573_ref5","doi-asserted-by":"publisher","first-page":"2651","DOI":"10.1109\/TPWRS.2012.2232316","article-title":"An efficient state estimation algorithm considering zero injection constraints","volume":"28","author":"Guo","year":"2013","journal-title":"IEEE Trans. Power Syst"},{"issue":"2","key":"10.3233\/IDA-230573_ref6","doi-asserted-by":"publisher","first-page":"2641","DOI":"10.1016\/j.aej.2021.01.004","article-title":"A novel design of fractional Meyer wavelet neural networks with application to the nonlinear singular fractional Lane-Emden systems","volume":"60","author":"Sabir","year":"2021","journal-title":"Alexandria Eng. J"},{"issue":"1","key":"10.3233\/IDA-230573_ref8","doi-asserted-by":"publisher","first-page":"57","DOI":"10.2514\/3.10094","article-title":"Computation of highly swirling confined flow with a reynolds stress turbulence model","volume":"27","author":"Hogg","year":"1989","journal-title":"AIAA J"},{"issue":"4","key":"10.3233\/IDA-230573_ref9","doi-asserted-by":"publisher","first-page":"003685042211321","DOI":"10.1177\/00368504221132144","article-title":"Intelligent based hybrid renewable energy resources forecasting and real time power demand management system for resilient energy systems","volume":"105","author":"Amir","year":"2022","journal-title":"Sci. Prog"},{"issue":"1","key":"10.3233\/IDA-230573_ref10","doi-asserted-by":"publisher","first-page":"59","DOI":"10.1115\/1.3243600","article-title":"Numerical computation of turbulent flow in a square-sectioned 180 deg bend","volume":"111","author":"Choi","year":"1989","journal-title":"J. Fluids Eng. Trans. ASME"},{"key":"10.3233\/IDA-230573_ref11","doi-asserted-by":"publisher","DOI":"10.1016\/j.esr.2020.100523"},{"issue":"1","key":"10.3233\/IDA-230573_ref12","doi-asserted-by":"publisher","first-page":"92","DOI":"10.3390\/iot1010006","article-title":"Time-pattern profiling from smart meter data to detect outliers in energy consumption","volume":"1","author":"Hurst","year":"2020","journal-title":"IoT"},{"issue":"1","key":"10.3233\/IDA-230573_ref13","doi-asserted-by":"publisher","first-page":"830","DOI":"10.1109\/TSG.2017.2753738","article-title":"A tunable fraud detection system for advanced metering infrastructure using short-lived patterns","volume":"10","author":"Zanetti","year":"2019","journal-title":"IEEE Trans. Smart Grid"},{"key":"10.3233\/IDA-230573_ref14","doi-asserted-by":"publisher","DOI":"10.1109\/EEEIC.2017.7977752"},{"issue":"2","key":"10.3233\/IDA-230573_ref15","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1049\/iet-cps.2017.0063","article-title":"Entropy-based electricity theft detection in AMI network","volume":"3","author":"Singh","year":"2018","journal-title":"IET Cyber-Physical Syst. Theory Appl"},{"issue":"c","key":"10.3233\/IDA-230573_ref16","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/jiot.2021.3131160","article-title":"Real-World Evaluation of Power Consumption and Performance of NB-IoT in Malaysia","volume":"4662","author":"Alobaidy","year":"2021","journal-title":"IEEE Internet Things J"},{"key":"10.3233\/IDA-230573_ref17","doi-asserted-by":"publisher","DOI":"10.1109\/IITCEE57236.2023.10091089"},{"key":"10.3233\/IDA-230573_ref18","doi-asserted-by":"publisher","first-page":"18459","DOI":"10.1109\/ACCESS.2017.2712258","article-title":"Multi-layered clustering for power consumption profiling in smart grids","volume":"5","author":"Al-Jarrah","year":"2017","journal-title":"IEEE Access"},{"issue":"6","key":"10.3233\/IDA-230573_ref19","doi-asserted-by":"publisher","first-page":"4393","DOI":"10.1109\/TPWRS.2019.2915283","article-title":"High-precision dynamic modeling of two-staged photovoltaic power station clusters","volume":"34","author":"Li","year":"2019","journal-title":"IEEE Trans. Power Syst"},{"issue":"3","key":"10.3233\/IDA-230573_ref20","doi-asserted-by":"publisher","first-page":"816","DOI":"10.35833\/MPCE.2021.000464","article-title":"Real-time subsynchronous control interaction monitoring using improved intrinsic time-scale decomposition","volume":"11","author":"Wang","year":"2023","journal-title":"J. Mod. Power Syst. Clean Energy"},{"key":"10.3233\/IDA-230573_ref21","doi-asserted-by":"publisher","DOI":"10.1016\/j.engappai.2023.107601"},{"key":"10.3233\/IDA-230573_ref22","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2023.121358"},{"key":"10.3233\/IDA-230573_ref23","doi-asserted-by":"crossref","first-page":"100864","DOI":"10.1016\/j.swevo.2021.100864","article-title":"A memetic algorithm based on two_Arch2 for multi-depot heterogeneous-vehicle capacitated arc routing problem","volume":"63","author":"Cao","year":"2021","journal-title":"Swarm Evol. Comput."},{"key":"10.3233\/IDA-230573_ref24","doi-asserted-by":"publisher","first-page":"79128","DOI":"10.1109\/ACCESS.2021.3083960","article-title":"Enhancement of frequency control for stand-alone multi-microgrids","volume":"9","author":"Singh","year":"2021","journal-title":"IEEE Access"},{"issue":"4","key":"10.3233\/IDA-230573_ref25","doi-asserted-by":"publisher","first-page":"10733","DOI":"10.1080\/15567036.2022.2158251","article-title":"Optimal dynamic frequency regulation of renewable energy based hybrid power system utilizing a novel TDF-TIDF controller","volume":"44","author":"Singh","year":"2022","journal-title":"Energy Sources, Part A Recover. Util. Environ. Eff"},{"key":"10.3233\/IDA-230573_ref26","doi-asserted-by":"publisher","first-page":"103190","DOI":"10.1016\/j.seta.2023.103190","article-title":"Dynamic load modeling for bulk load-using synchrophasors with wide area measurement system for smart grid real-time load monitoring and optimization","volume":"57","author":"Hasan","year":"2023","journal-title":"Sustain. Energy Technol. Assessments"},{"issue":"5","key":"10.3233\/IDA-230573_ref27","doi-asserted-by":"publisher","first-page":"2437","DOI":"10.1109\/TSG.2016.2548565","article-title":"Clustering of electricity consumption behavior dynamics toward big data applications","volume":"7","author":"Wang","year":"2016","journal-title":"IEEE Trans. Smart Grid"},{"key":"10.3233\/IDA-230573_ref28","doi-asserted-by":"publisher","first-page":"16","DOI":"10.1016\/j.is.2015.04.007","article-title":"Time-series clustering \u2013 A decade review","volume":"53","author":"Aghabozorgi","year":"2015","journal-title":"Inf. Syst"},{"issue":"1","key":"10.3233\/IDA-230573_ref29","first-page":"134","article-title":"An improved hybrid technique for energy and delay routing in mobile ad-hoc networks","volume":"12","author":"Hassan","year":"2017","journal-title":"Int. J. Appl. Eng. Res"},{"key":"10.3233\/IDA-230573_ref30","doi-asserted-by":"publisher","first-page":"116279","DOI":"10.1016\/j.eswa.2021.116279","article-title":"Distributed evidential clustering toward time series with big data issue","volume":"191","author":"Gong","year":"2022","journal-title":"Expert Syst. Appl"},{"key":"10.3233\/IDA-230573_ref31","doi-asserted-by":"publisher","DOI":"10.1109\/ICCMC56507.2023.10083636"},{"key":"10.3233\/IDA-230573_ref32","doi-asserted-by":"publisher","DOI":"10.1109\/GCAT55367.2022.9972088"},{"issue":"3","key":"10.3233\/IDA-230573_ref33","doi-asserted-by":"crossref","first-page":"119","DOI":"10.2991\/ijndc.k.200515.007","article-title":"High performance hadoop distributed file system","volume":"8","author":"Elkawkagy","year":"2020","journal-title":"Int. J. Networked Distrib. Comput"},{"issue":"6","key":"10.3233\/IDA-230573_ref34","doi-asserted-by":"publisher","first-page":"2952","DOI":"10.3390\/s23062952","article-title":"Big data analytics using cloud computing based frameworks for power management systems: Status, constraints, and future recommendations","volume":"23","author":"AL-Jumaili","year":"2023","journal-title":"Sensors"},{"issue":"1","key":"10.3233\/IDA-230573_ref35","first-page":"267","article-title":"Interval-valued neutrosophic soft expert set from real space to complex space","volume":"132","author":"Al-Sharqi","year":"2022","journal-title":"C. Model. Eng. Sci"},{"issue":"4","key":"10.3233\/IDA-230573_ref36","doi-asserted-by":"publisher","first-page":"431","DOI":"10.1016\/j.jksuci.2017.06.001","article-title":"Big Data technologies: A survey","volume":"30","author":"Oussous","year":"2018","journal-title":"J. King Saud Univ. \u2013 Comput. Inf. Sci"},{"issue":"1","key":"10.3233\/IDA-230573_ref37","first-page":"4999","article-title":"Apache mahout: Machine learning on distributed dataflow systems","volume":"21","author":"Anil","year":"2020","journal-title":"J. Mach. Learn. Res"},{"issue":"4","key":"10.3233\/IDA-230573_ref39","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1109\/msmc.2019.2961160","article-title":"The role of visual assessment of clusters for big data analysis: From real-world internet of things","volume":"6","author":"Palaniswami","year":"2020","journal-title":"IEEE Syst. Man, Cybern. Mag"},{"issue":"2","key":"10.3233\/IDA-230573_ref40","doi-asserted-by":"publisher","first-page":"263","DOI":"10.1007\/s10723-019-09504-z","article-title":"Research on Parallel Adaptive Canopy-K-Means Clustering Algorithm for Big Data Mining Based on Cloud Platform","volume":"18","author":"Xia","year":"2020","journal-title":"J. Grid Comput"},{"issue":"2","key":"10.3233\/IDA-230573_ref41","doi-asserted-by":"publisher","first-page":"226","DOI":"10.3390\/j2020016","article-title":"Research on K-Value Selection Method of K-Means Clustering Algorithm","volume":"2","author":"Yuan","year":"2019","journal-title":"J"},{"issue":"5","key":"10.3233\/IDA-230573_ref42","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s42979-020-00283-z","article-title":"Cross-validation approach to evaluate clustering algorithms: An experimental study using multi-label datasets","volume":"1","author":"Tarekegn","year":"2020","journal-title":"SN Comput. Sci"},{"key":"10.3233\/IDA-230573_ref43","doi-asserted-by":"publisher","first-page":"107804","DOI":"10.1016\/j.est.2023.107804","article-title":"An effective cascade control strategy for frequency regulation of renewable energy based hybrid power system with energy storage system","volume":"68","author":"Singh","year":"2023","journal-title":"J. Energy Storage"},{"issue":"4","key":"10.3233\/IDA-230573_ref44","doi-asserted-by":"publisher","first-page":"2381","DOI":"10.1007\/s10462-019-09736-1","article-title":"Spatiotemporal clustering: A review","volume":"53","author":"Ansari","year":"2020","journal-title":"Artif. Intell. Rev"},{"issue":"4","key":"10.3233\/IDA-230573_ref45","doi-asserted-by":"publisher","first-page":"379","DOI":"10.3233\/MGS-200336","article-title":"Parallel and fault-tolerant k-means clustering based on the actor model","volume":"16","author":"Taamneh","year":"2020","journal-title":"Multiagent Grid Syst"},{"issue":"3","key":"10.3233\/IDA-230573_ref46","doi-asserted-by":"publisher","first-page":"776","DOI":"10.1007\/s10618-020-00678-9","article-title":"An efficient K-means clustering algorithm for tall data","volume":"34","author":"Cap\u00f3","year":"2020","journal-title":"Data Min. Knowl. Discov"},{"key":"10.3233\/IDA-230573_ref47","doi-asserted-by":"publisher","first-page":"60","DOI":"10.1016\/j.simpat.2014.05.005","article-title":"Parallel and distributed computing models on a graphics processing unit to accelerate simulation of membrane systems","volume":"47","author":"Maroosi","year":"2014","journal-title":"Simul. Model. Pract. Theory"},{"key":"10.3233\/IDA-230573_ref48","doi-asserted-by":"publisher","first-page":"39","DOI":"10.1016\/j.tcs.2014.05.004","article-title":"Accelerated execution of P systems with active membranes to solve the N-queens problem","volume":"551","author":"Maroosi","year":"2014","journal-title":"Theor. Comput. Sci"},{"key":"10.3233\/IDA-230573_ref49","doi-asserted-by":"publisher","first-page":"106315","DOI":"10.1016\/j.ijepes.2020.106315","article-title":"Data mining for abnormal power consumption pattern detection based on local matrix reconstruction","volume":"123","author":"Feng","year":"2020","journal-title":"Int. J. Electr. Power Energy Syst"}],"container-title":["Intelligent Data Analysis"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/IDA-230573","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:20:40Z","timestamp":1777454440000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.medra.org\/servlet\/aliasResolver?alias=iospress&doi=10.3233\/IDA-230573"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,19]]},"references-count":47,"journal-issue":{"issue":"5"},"URL":"https:\/\/doi.org\/10.3233\/ida-230573","relation":{},"ISSN":["1088-467X","1571-4128"],"issn-type":[{"value":"1088-467X","type":"print"},{"value":"1571-4128","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,9,19]]}}}