{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,25]],"date-time":"2026-02-25T00:15:14Z","timestamp":1771978514938,"version":"3.50.1"},"reference-count":64,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2024,10,25]],"date-time":"2024-10-25T00:00:00Z","timestamp":1729814400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["BDCC"],"abstract":"<jats:p>Power generators are one of the critical assets of power grids. The early detection of faults in power generators is essential to prevent cutoffs of the electrical supply in the power grid. This work presents a comparative analysis of machine learning (ML) models for the generator fault diagnosis. The objective is to show the ability of simple and ensemble ML models to diagnose faults using as attributes partial discharges and dissipation factor data. For this purpose, a generator fault database was built, gathering information from operational data curated by power generator experts. The hyper-parameters of the ML models were selected using a grid search (GS) and cross-validation (CV) optimization. ML models were evaluated with class imbalance and multi-classification metrics, a correspondence analysis, and model performance by class (fault type). Furthermore, the selected ML model was validated by experts through a diagnosis system prototype. The results show that the gradient boosting model presented the best performance according to the performance metrics among single and ensemble ML models. Likewise, the model showed a good capacity to detect type 3 and 4 faults, which are the most catastrophic failures for the generator and must be detected in a timely manner for prompt correction. This work gives an insight into the need and effort required to implement an online diagnostic system that provides information about the power generator health index to help engineers reduce the time taken to find and repair incipient faults and avoid loss of power generation and catastrophic failures of power generators.<\/jats:p>","DOI":"10.3390\/bdcc8110145","type":"journal-article","created":{"date-parts":[[2024,10,25]],"date-time":"2024-10-25T10:50:08Z","timestamp":1729853408000},"page":"145","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Fault Diagnosis in Power Generators: A Comparative Analysis of Machine Learning Models"],"prefix":"10.3390","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0009-0005-3415-3747","authenticated-orcid":false,"given":"Quetzalli","family":"Amaya-Sanchez","sequence":"first","affiliation":[{"name":"Tecnologico Nacional de M\u00e9xico, Instituto Tecnologico de Orizaba, Orizaba 94320, Mexico"}]},{"given":"Marco Julio del Moral","family":"Argumedo","sequence":"additional","affiliation":[{"name":"Tecnologico Nacional de M\u00e9xico, Instituto Tecnologico de Orizaba, Orizaba 94320, Mexico"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9813-9657","authenticated-orcid":false,"given":"Alberto Alfonso","family":"Aguilar-Lasserre","sequence":"additional","affiliation":[{"name":"Tecnologico Nacional de M\u00e9xico, Instituto Tecnologico de Orizaba, Orizaba 94320, Mexico"}]},{"given":"Oscar Alfonso","family":"Reyes Martinez","sequence":"additional","affiliation":[{"name":"Instituto Nacional de Electricidad y Energias Limpias; Cuernavaca 62490, Mexico"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0764-045X","authenticated-orcid":false,"given":"Gustavo","family":"Arroyo-Figueroa","sequence":"additional","affiliation":[{"name":"Instituto Nacional de Electricidad y Energias Limpias; Cuernavaca 62490, Mexico"}]}],"member":"1968","published-online":{"date-parts":[[2024,10,25]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Onu, P., Mbohwa, C., and Pradhan, A. (2023, January 16\u201317). Machine Learning: A Comprehensive Exploration of Fault Detection and Diagnosis in Smart Grids. Proceedings of the 2023 International Conference on Electrical, Computer and Energy Technologies (ICECET), Cape Town, South Africa.","DOI":"10.1109\/ICECET58911.2023.10389596"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"5467","DOI":"10.1016\/j.egyr.2021.08.133","article-title":"A comprehensive review: Machine learning and its application in integrated power system","volume":"7","author":"Kumbhar","year":"2021","journal-title":"Energy Rep."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Baker, E., Nese, S.V., and Dursun, E. (2023). Hybrid Condition Monitoring System for Power Transformer Fault Diagnosis. Energies, 16.","DOI":"10.3390\/en16031151"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"11","DOI":"10.4067\/S0718-07642016000200003","article-title":"Diagnostico de Generadores Electricos de Potencia con Tecnicas de Monitoreo en Linea y Fuera de Linea","volume":"27","year":"2016","journal-title":"Inf. Tecnol."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"109947","DOI":"10.1016\/j.measurement.2021.109947","article-title":"A survey of fault prediction and location methods in electrical energy distribution networks","volume":"184","author":"Dashti","year":"2021","journal-title":"Measurement"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Yang, B., Li, F., Ye, J., and Song, W. (2019, January 4\u20138). Condition Monitoring and Fault Diagnosis of Generators in Power Networks. Proceedings of the 2019 IEEE Power & Energy Society General Meeting (PESGM), Atlanta, GA, USA.","DOI":"10.1109\/PESGM40551.2019.8973427"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Rigatos, G. (2016). Condition Monitoring and Fault Diagnosis for Electric Power Generators. Intelligent Renewable Energy Systems. Green Energy and Technology, Springer.","DOI":"10.1007\/978-3-319-39156-4_9"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Li, S., Li, X., Cui, Y., and Li, H. (2023). Review of Transformer Health Index from the Perspective of Survivability and Condition Assessment. Electronics, 12.","DOI":"10.3390\/electronics12112407"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Huang, L. (2023, January 14\u201316). Intelligent Condition Monitoring and Fault Diagnosis of Generator based on Internet of Things and Big Data Technology. In Proceeding of the 2023 IEEE 13th International Conference on Electronics Information and Emergency Communication (ICEIEC), Beijing, China.","DOI":"10.1109\/ICEIEC58029.2023.10200780"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Cocconcelli, M., Capelli, L., Cavalaglio Camargo Molano, J., and Borghi, D. (2018). Development of a Methodology for Condition-Based Maintenance in a Large-Scale Application Field. Machines, 6.","DOI":"10.3390\/machines6020017"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1109\/MEI.2008.4581636","article-title":"Insulation Failure Mechanisms of Power Generators [Feature Article]","volume":"24","author":"Tari","year":"2008","journal-title":"IEEE Electr. Insul. Mag."},{"key":"ref_12","unstructured":"Krieg-Wezelenburg, M.G., Krieg, J.H., Messel, R., Pant, S., Montano, O., Velazquez, G., Tremblay, R., Sedding, H., Shaikh, A., and Delucchi, S. (2023). DDF Measurements on Stator Windings. CIGRE Technical Brochure, CIGRE."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"104684","DOI":"10.1016\/j.engfailanal.2020.104684","article-title":"Root cause analysis improved with machine learning for failure analysis in power transformers","volume":"115","author":"Velasquez","year":"2020","journal-title":"Eng. Fail. Anal."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"114105","DOI":"10.1016\/j.eswa.2020.114105","article-title":"Improved few-shot learning method for transformer fault diagnosis based on approximation space and belief functions","volume":"167","author":"Xu","year":"2021","journal-title":"Expert. Syst. Appl."},{"key":"ref_15","unstructured":"Duque, J.F.P., Martinez, M.T.V., Hurtado, A.P., Carrasco, E.M., Sancho, B.L., Krommydas, K.F., Plakas, K.A., Karavas, C.G., Kurashvili, A.S., and Dikaiakos, C.N. (2021, January 18\u201321). Inter-Area Oscillation Study of the Greek Power System Using an Automatic Toolbox. Proceedings of the IEEE PES Innovative Smart Grid Technologies Europe (ISGT Europe), Espoo, Finland."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"2843","DOI":"10.1109\/TLA.2018.8804247","article-title":"Health index for transformer condition assessment","volume":"16","author":"Arias","year":"2018","journal-title":"IEEE Lat. Am. Trans."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Santamaria-Bonfil, G., Arroyo-Figueroa, G., Zuniga-Garcia, M.A., Azcarraga Ramos, C.G., and Bassam, A. (2024). Power Transformer Fault Detection: A Comparison of Standard Machine Learning and autoML Approaches. Energies, 17.","DOI":"10.3390\/en17010077"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1863","DOI":"10.1007\/s00521-014-1677-y","article-title":"Artificial neural network approaches for fault classification: Comparison and performance","volume":"25","author":"Nagpal","year":"2014","journal-title":"Neural Comput. Appl."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"106899","DOI":"10.1016\/j.epsr.2020.106899","article-title":"Transformer fault types and severity class prediction based on neural pattern-recognition techniques","volume":"191","author":"Taha","year":"2020","journal-title":"Electr. Power Syst. Res."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Mortada, M.A., Yacout, S., and Lakis, A. (2023). Power Transformer Fault Diagnosis Using Neural Network Optimization Techniques. Mathematics, 11.","DOI":"10.3390\/math11224693"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1109\/TDEI.2022.3148453","article-title":"Induction of Decision Trees to Diagnose Incipient Faults in Power Transformers","volume":"29","author":"Menezes","year":"2022","journal-title":"IEEE Trans. Dielectr. Electr. Insul."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1075","DOI":"10.1109\/TLA.2013.6601752","article-title":"Bayesian Networks applied to Failure Diagnosis in Power Transformer","volume":"11","author":"Carita","year":"2013","journal-title":"IEEE Lat. Am. Trans."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Wu, Q., and Zhang, H. (2019). A novel expertise-guided machine learning model for internal fault state diagnosis of power transformers. Sustainability, 11.","DOI":"10.3390\/su11061562"},{"key":"ref_24","first-page":"8441","article-title":"Application of k-Nearest Neighbor (kNN) Machine Algorithm for Transformer Fault Classification","volume":"29","author":"Kumar","year":"2020","journal-title":"Int. J. Adv. Sci. Technol."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Sekatane, P.M., and Bokoro, P. (2023). Partial Discharge Localization through k-NN and SVM. Energies, 16.","DOI":"10.3390\/en16217430"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"1429","DOI":"10.1007\/s10845-013-0750-1","article-title":"Fault diagnosis in power transformers using multi-class logical analysis of data","volume":"25","author":"Mortada","year":"2014","journal-title":"J. Intell. Manuf."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"1791","DOI":"10.1109\/TPWRD.2012.2197868","article-title":"Statistical machine learning and dissolved gas analysis: A review","volume":"27","author":"Mirowski","year":"2012","journal-title":"IEEE Trans. Power Deliv."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"11352","DOI":"10.1016\/j.eswa.2009.03.022","article-title":"Fault diagnosis of power transformer based on support vector machine with genetic algorithm","volume":"36","author":"Fei","year":"2009","journal-title":"Expert Syst. Appl."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Wang, Y., Fan, X., Zhang, W., Zhuo, R., Hao, J., and Shi, Z. (2020). An Integrated Model for Transformer Fault Diagnosis to Improve Sample Classification near Decision Boundary of Support Vector Machine. Energies, 13.","DOI":"10.3390\/en13246678"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Khalyasmaa, A.I., Matrenin, P.V., Eroshenko, S.A., Manusov, V.Z., Bramm, A.M., and Romanov, A.M. (2022). Romanov AM. Data Mining Applied to Decision Support Systems for Power Transformers\u2019 Health Diagnostics. Mathematics, 10.","DOI":"10.3390\/math10142486"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Wang, T., Li, Q., Yang, J., Xie, T., Wu, P., and Liang, J. (2023). Transformer Fault Diagnosis Method Based on Incomplete Data and TPE-XGBoost. Appl. Sci., 13.","DOI":"10.3390\/app13137539"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1734","DOI":"10.1002\/tee.23247","article-title":"Transformer Fault Diagnosis Based on Stacking Ensemble Learning","volume":"15","author":"Wang","year":"2020","journal-title":"IEEJ Trans. Electr. Electron. Eng."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"215","DOI":"10.4067\/S0718-07642020000500215","article-title":"Predicci\u00f3n de parada de m\u00e1quinas generadoras en una central hidroel\u00e9ctrica por medio de miner\u00eda de datos","volume":"31","author":"Piedrahita","year":"2020","journal-title":"Inf. Tecnol."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"106428","DOI":"10.1016\/j.engfailanal.2022.106428","article-title":"An interval type-2 fuzzy enhanced best\u2013worst method for the evaluation of ship diesel generator failures","volume":"138","author":"Yucesan","year":"2022","journal-title":"Eng. Fail. Anal."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"987","DOI":"10.1016\/j.renene.2022.04.061","article-title":"A feature extraction and machine learning framework for bearing fault diagnosis","volume":"191","author":"Cui","year":"2022","journal-title":"Renew. Energy"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Lin, W.-M. (2023). Generator Fault Diagnosis with Bit-Coding Support Vector Regression Algorithm. Energies, 16.","DOI":"10.3390\/en16083582"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"8929","DOI":"10.1016\/j.egyr.2021.11.058","article-title":"Open switch fault diagnosis of cascade H-bridge multi-level inverter in distributed power generators by machine learning algorithms","volume":"7","author":"Ali","year":"2021","journal-title":"Energy Rep."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"042015","DOI":"10.1088\/1757-899X\/518\/4\/042015","article-title":"Neural Networks for Synchronous Generator Fault Diagnosis","volume":"518","author":"Nashee","year":"2019","journal-title":"IOP Conf. Ser. Mater. Sci. Eng."},{"key":"ref_39","first-page":"1196","article-title":"Condition Monitoring and Faults Diagnosis for Synchronous Generator Using Neural Networks","volume":"12","author":"Elfaki","year":"2014","journal-title":"Telkomnika Indones. J. Electr. Eng."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Altaf, S., Al-Anbuky, A., and GholamHosseini, H. (2014, January 18\u201320). Fault diagnosis in a distributed motor network using Artificial Neural Network. In Proccedings of the International Symposium on Power Electronics, Electrical Drives, Automation and Motion, Ischia, Italy.","DOI":"10.1109\/SPEEDAM.2014.6871946"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"107614","DOI":"10.1016\/j.ress.2021.107614","article-title":"A fault detector\/classifier for closed-ring power generators using machine learning","volume":"212","author":"Quintanilha","year":"2021","journal-title":"Reliab. Eng. Syst. Saf."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"506","DOI":"10.1016\/j.net.2022.10.019","article-title":"Proposal of a new method for learning of diesel generator sounds and detecting abnormal sounds using an unsupervised deep learning algorithm","volume":"55","author":"Jo","year":"2023","journal-title":"Nucl. Eng. Technol."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1016\/j.rser.2018.09.012","article-title":"Deep Learning for fault detection in wind turbines","volume":"98","author":"Helbing","year":"2018","journal-title":"Renew. Sustain. Energy Rev."},{"key":"ref_44","unstructured":"S\u00e1nchez, Q.A. (2024). Desarrollo de un Modelo Inteligente Para el Diagn\u00f3stico de Fallas en Generadores de Potencia. [Master\u2019s Thesis, Technological Institute of Mexico Orizaba Campus]. (In Spanish)."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Bacevicius, M., and Paulauskaite-Taraseviciene, A. (2023). Machine Learning Algorithms for Raw and Unbalanced Intrusion Detection Data in a Multi-Class Classification Problem. Appl. Sci., 13.","DOI":"10.3390\/app13127328"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Scholkopf, B., Smola, A.J., and Bach, F. (2002). Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond, MIT Press.","DOI":"10.7551\/mitpress\/4175.001.0001"},{"key":"ref_47","first-page":"197","article-title":"Computational Analysis and Understanding of Natural Languages: Principles, Methods and Applications","volume":"Volume 38","author":"Gudivada","year":"2018","journal-title":"Handbook of Statistics"},{"key":"ref_48","unstructured":"Xiao, P. (2022). Artificial Intelligence Programming with Python: From Zero to Hero, Wiley."},{"key":"ref_49","unstructured":"Vapnik, V.N. (1998). Statistical Learning Theory, Wiley-Interscience."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Rokach, L., and Maimon, O.Z. (2007). Data Mining with Decision Trees: Theory and Applications, World Science.","DOI":"10.1142\/9789812771728"},{"key":"ref_51","unstructured":"Mitchell, T.M. (1997). Machine Learning, McGraw-Hill."},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Hastie, T., Tibshirani, R., and Friedman, J. (2001). The Elements of Statistical Learning, Springer Series in Statistics, Springer.","DOI":"10.1007\/978-0-387-21606-5"},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1007\/s11749-016-0481-7","article-title":"A random forest guided tour","volume":"25","author":"Biau","year":"2016","journal-title":"Test"},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"367","DOI":"10.1016\/S0167-9473(01)00065-2","article-title":"Stochastic gradient boosting; Computational statistics & data analysis","volume":"38","author":"Friedman","year":"2002","journal-title":"Comput. Stat. Data Anal."},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"1937","DOI":"10.1007\/s10462-020-09896-5","article-title":"A comparative analysis of gradient boosting algorithms","volume":"54","year":"2021","journal-title":"Artif. Intell. Rev."},{"key":"ref_56","unstructured":"Grandini, M., Bagli, E., and Visani, G. (2020). Metrics for multi-class classification. arXiv."},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Warrens, M.J. (2015). Five ways to look at cohen\u2019s kappa. J. Psychol. Psychother., 5.","DOI":"10.4172\/2161-0487.1000197"},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Markoulidakis, I., Rallis, I., Georgoulas, I., Kopsiaftis, G., Doulamis, A., and Doulamis, N. (2021). Multiclass confusion matrix reduction method and its application on net promoter score classification problem. Technologies, 9.","DOI":"10.3390\/technologies9040081"},{"key":"ref_59","unstructured":"Oliphant, T.E. (2024, October 12). A Guide to NumPy. Available online: https:\/\/web.mit.edu\/dvp\/Public\/numpybook.pdf."},{"key":"ref_60","first-page":"1","article-title":"Pandas: A foundational python library for data analysis and statistics","volume":"14","author":"McKinney","year":"2011","journal-title":"Python High Perform. Sci. Comput."},{"key":"ref_61","first-page":"2825","article-title":"Scikit-learn: Machine learning in Python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"J. Mach. Learn. Res."},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Chen, T., and Guestrin, C. (2016, January 13\u201317). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD \u201916, New York, NY, USA.","DOI":"10.1145\/2939672.2939785"},{"key":"ref_63","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1186\/s40537-020-00349-y","article-title":"Boosting methods for multi-class imbalanced data classification: An experimental review","volume":"7","author":"Tanha","year":"2020","journal-title":"J. Big Data"},{"key":"ref_64","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1145\/1046456.1046467","article-title":"A critical review of multi-objective optimization in data mining","volume":"6","author":"Freitas","year":"2004","journal-title":"ACM SIGKDD Explor. Newsl."}],"container-title":["Big Data and Cognitive Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2504-2289\/8\/11\/145\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T16:20:47Z","timestamp":1760113247000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2504-2289\/8\/11\/145"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,10,25]]},"references-count":64,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2024,11]]}},"alternative-id":["bdcc8110145"],"URL":"https:\/\/doi.org\/10.3390\/bdcc8110145","relation":{},"ISSN":["2504-2289"],"issn-type":[{"value":"2504-2289","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,10,25]]}}}