{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,22]],"date-time":"2026-04-22T16:50:47Z","timestamp":1776876647262,"version":"3.51.2"},"reference-count":41,"publisher":"Maximum Academic Press","license":[{"start":{"date-parts":[[2022,1,14]],"date-time":"2022-01-14T00:00:00Z","timestamp":1642118400000},"content-version":"unspecified","delay-in-days":13,"URL":"https:\/\/www.cambridge.org\/core\/terms"}],"content-domain":{"domain":["cambridge.org"],"crossmark-restriction":true},"short-container-title":["The Knowledge Engineering Review"],"published-print":{"date-parts":[[2022]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>This paper presents a methodology that permits to automate binary classification using the minimum possible number of attributes. In this methodology, the success of the binary prediction does not lie in the accuracy of an algorithm but in the evaluation metrics, which give information about the goodness of fit; which is an important factor when the data batch is unbalanced. The proposed methodology assesses the possible biases in identifying one algorithm as the best performer when considering the goodness of fit of an algorithm through evaluation metrics. The dimension of data has been reduced through the cumulative explained variance. Then, the performance of six machine learning classification models has been compared through Matthew correlation coefficient (MCC), area under curve \u2013 receiver operating characteristic (ROC-AUC), and area under curve \u2013 precision-recall (AUC-PR). The results show graphically and numerically how the evaluation metrics interfere with the most optimal outcome of an algorithm. The algorithms with the best performance in terms of evaluation metrics have been random forest and gradient boosting. In the imbalanced datasets, MCC has provided better prediction results than ROC-AUC or AUC-PR. The proposed methodology is adapted to the case of bankruptcy prediction.<\/jats:p>","DOI":"10.1017\/s026988892100014x","type":"journal-article","created":{"date-parts":[[2022,1,18]],"date-time":"2022-01-18T23:42:45Z","timestamp":1642549365000},"update-policy":"https:\/\/doi.org\/10.1017\/policypage","source":"Crossref","is-referenced-by-count":10,"title":["Evaluation metrics and dimensional reduction for binary classification algorithms: a case study on bankruptcy prediction"],"prefix":"10.48130","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2194-572X","authenticated-orcid":false,"given":"Mar\u00eda E.","family":"P\u00e9rez-Pons","sequence":"first","affiliation":[]},{"given":"Javier","family":"Parra-Dominguez","sequence":"additional","affiliation":[]},{"given":"Guillermo","family":"Hern\u00e1ndez","sequence":"additional","affiliation":[]},{"given":"Enrique","family":"Herrera-Viedma","sequence":"additional","affiliation":[]},{"given":"Juan M.","family":"Corchado","sequence":"additional","affiliation":[]}],"member":"27968","published-online":{"date-parts":[[2022,1,14]]},"reference":[{"key":"S026988892100014X_ref17","unstructured":"Chih-Wei, Hsu , Chih-Chung, Chang , Chih-Jen, Lin , et al. A practical guide to support vector classification, 2003."},{"key":"S026988892100014X_ref33","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1086\/209665","article-title":"Forecasting bankruptcy more accurately: A simple hazard model","volume":"74","author":"Tyler","year":"2001","journal-title":"The journal of business"},{"key":"S026988892100014X_ref18","doi-asserted-by":"crossref","first-page":"102254","DOI":"10.1016\/j.ipm.2020.102254","article-title":"Value assessment of companies by using an enterprise value assessment system based on their public transfer specification","volume":"57","author":"Win-Bin","year":"2020","journal-title":"Information Processing and Management"},{"key":"S026988892100014X_ref40","doi-asserted-by":"crossref","first-page":"364","DOI":"10.4236\/jfrm.2017.64026","article-title":"Machine learning approaches to predicting company bankruptcy","volume":"6","author":"Wenhao","year":"2017","journal-title":"Journal of Financial Risk Management"},{"key":"S026988892100014X_ref20","unstructured":"Utkarsh Mahadeo Khaire and Dhanalakshmi, R . Stability of feature selection algorithm: A review. Journal of King Saud University-Computer and Information Sciences, 2019."},{"key":"S026988892100014X_ref38","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2020.102388"},{"key":"S026988892100014X_ref34","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-019-09682-y"},{"key":"S026988892100014X_ref30","doi-asserted-by":"crossref","first-page":"758","DOI":"10.1016\/j.ipm.2018.01.010","article-title":"A survey towards an integration of big data analytics to big insights for value-creation","volume":"54","author":"Mandeep Kaur","year":"2018","journal-title":"Information Processing and Management"},{"key":"S026988892100014X_ref7","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1016\/j.compeleceng.2013.11.024","article-title":"A survey on feature selection methods","volume":"40","author":"Girish","year":"2014","journal-title":"Computers and Electrical Engineering"},{"key":"S026988892100014X_ref19","first-page":"605","article-title":"Application of k-nearest neighbor (knn) approach for predicting economic events: Theoretical background","volume":"3","author":"Sadegh Bafandeh","year":"2013","journal-title":"International Journal of Engineering Research and Applications"},{"key":"S026988892100014X_ref11","doi-asserted-by":"publisher","DOI":"10.18178\/ijmlc.2018.8.2.676"},{"key":"S026988892100014X_ref25","unstructured":"OECD. Country statistical profile: Spain 2020. OECD ilibrary, 2018. URL https:\/\/www.oecd-ilibrary.org\/."},{"key":"S026988892100014X_ref23","doi-asserted-by":"crossref","unstructured":"Larry, Li and Silvia, Z Islam . Firm and industry specific determinants of capital structure: Evidence from the australian market. International Review of Economics & Finance, 59: 425\u2013437, 2019.","DOI":"10.1016\/j.iref.2018.10.007"},{"key":"S026988892100014X_ref1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2017.10.040"},{"key":"S026988892100014X_ref5","doi-asserted-by":"publisher","DOI":"10.1007\/s11142-004-6341-9"},{"key":"S026988892100014X_ref36","first-page":"3","article-title":"The asian crisis: the high debt model versus the wall street-treasury-imf complex","author":"Robert","year":"1998","journal-title":"New left review"},{"key":"S026988892100014X_ref41","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2016.04.001"},{"key":"S026988892100014X_ref13","doi-asserted-by":"crossref","unstructured":"Daryush, Foroghi , Amirhassan, Monadjemi , et al. Applying decision tree to predict bankruptcy. In 2011 IEEE International Conference on Computer Science and Automation Engineering, volume 4, pages 165\u2013169. IEEE, 2011.","DOI":"10.1109\/CSAE.2011.5952826"},{"key":"S026988892100014X_ref28","doi-asserted-by":"publisher","DOI":"10.1016\/S0378-4371(02)01882-4"},{"key":"S026988892100014X_ref4","doi-asserted-by":"publisher","DOI":"10.2307\/2490171"},{"key":"S026988892100014X_ref35","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1016\/j.dss.2018.06.011","article-title":"An investigation of bankruptcy prediction in imbalanced datasets","volume":"112","author":"David","year":"2018","journal-title":"Decision Support Systems"},{"key":"S026988892100014X_ref31","article-title":"The precision-recall plot is more informative than the roc plot when evaluating binary classifiers on imbalanced datasets","volume":"10","author":"Takaya","year":"2015","journal-title":"PloS one"},{"key":"S026988892100014X_ref14","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2008.239"},{"key":"S026988892100014X_ref39","doi-asserted-by":"publisher","DOI":"10.1109\/TCBB.2010.103"},{"key":"S026988892100014X_ref37","doi-asserted-by":"crossref","first-page":"908","DOI":"10.4236\/jmf.2017.74049","article-title":"Bankruptcy prediction using machine learning","volume":"7","author":"Nanxi","year":"2017","journal-title":"Journal of Mathematical Finance"},{"key":"S026988892100014X_ref22","doi-asserted-by":"publisher","DOI":"10.1080\/10106049.2016.1170892"},{"key":"S026988892100014X_ref27","doi-asserted-by":"publisher","DOI":"10.1016\/j.dss.2011.10.007"},{"key":"S026988892100014X_ref9","doi-asserted-by":"publisher","DOI":"10.1145\/1143844.1143874"},{"key":"S026988892100014X_ref2","doi-asserted-by":"publisher","DOI":"10.1111\/j.1540-6261.1968.tb00843.x"},{"key":"S026988892100014X_ref15","doi-asserted-by":"publisher","DOI":"10.1023\/B:RAST.0000013627.90884.b7"},{"key":"S026988892100014X_ref29","doi-asserted-by":"crossref","first-page":"895","DOI":"10.1016\/j.procs.2019.12.065","article-title":"Review of bankruptcy prediction using machine learning and deep learning techniques","volume":"162","author":"Yi","year":"2019","journal-title":"Procedia Computer Science"},{"key":"S026988892100014X_ref24","doi-asserted-by":"crossref","first-page":"102210","DOI":"10.1016\/j.ipm.2020.102210","article-title":"Machine learning classification of entrepreneurs in british historical census data","volume":"57","author":"Piero","year":"2020","journal-title":"Information Processing and Management"},{"key":"S026988892100014X_ref12","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-19222-2_13"},{"key":"S026988892100014X_ref6","unstructured":"Bellovary, Jodi L , Giacomino, Don E , and Akers, Michael D . A review of bankruptcy prediction studies: 1930 to present. Journal of Financial education, pages 1\u201342, 2007."},{"key":"S026988892100014X_ref21","doi-asserted-by":"crossref","first-page":"6325","DOI":"10.3390\/su12166325","article-title":"Corporate default predictions using machine learning: Literature review","volume":"12","author":"Hyeongjun","year":"2020","journal-title":"Sustainability"},{"key":"S026988892100014X_ref26","doi-asserted-by":"publisher","DOI":"10.2307\/2490395"},{"key":"S026988892100014X_ref32","doi-asserted-by":"publisher","DOI":"10.1145\/1980022.1980064"},{"key":"S026988892100014X_ref3","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1016\/j.eswa.2017.04.006","article-title":"Machine learning models and bankruptcy prediction","volume":"83","author":"Flavio","year":"2017","journal-title":"Expert Systems with Applications"},{"key":"S026988892100014X_ref16","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1016\/j.eswa.2018.09.039","article-title":"Bankruptcy prediction using imaged financial ratios and convolutional neural networks","volume":"117","author":"Tadaaki","year":"2019","journal-title":"Expert systems with applications"},{"key":"S026988892100014X_ref10","doi-asserted-by":"crossref","first-page":"1954","DOI":"10.1016\/j.jbankfin.2007.12.034","article-title":"Capital structure around the world: The roles of firm-and country-specific determinants","volume":"32","author":"Abe De","year":"2008","journal-title":"Journal of Banking and Finance"},{"key":"S026988892100014X_ref8","first-page":"1","article-title":"The advantages of the matthews correlation coefficient (mcc) over f1 score and accuracy in binary classification evaluation","volume":"21","author":"Davide","year":"2020","journal-title":"BMC genomics"}],"container-title":["The Knowledge Engineering Review"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.cambridge.org\/core\/services\/aop-cambridge-core\/content\/view\/S026988892100014X","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,5]],"date-time":"2026-01-05T14:42:22Z","timestamp":1767624142000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.cambridge.org\/core\/product\/identifier\/S026988892100014X\/type\/journal_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022]]},"references-count":41,"alternative-id":["S026988892100014X"],"URL":"https:\/\/doi.org\/10.1017\/s026988892100014x","relation":{},"ISSN":["0269-8889","1469-8005"],"issn-type":[{"value":"0269-8889","type":"print"},{"value":"1469-8005","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022]]},"assertion":[{"value":"\u00a9 The Author(s), 2022. Published by Cambridge University Press","name":"copyright","label":"Copyright","group":{"name":"copyright_and_licensing","label":"Copyright and Licensing"}}],"article-number":"e1"}}