{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T14:40:16Z","timestamp":1777646416601,"version":"3.51.4"},"reference-count":53,"publisher":"MDPI AG","issue":"12","license":[{"start":{"date-parts":[[2024,12,23]],"date-time":"2024-12-23T00:00:00Z","timestamp":1734912000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Natural Science Foundation of China","award":["62376074"],"award-info":[{"award-number":["62376074"]}]},{"name":"National Natural Science Foundation of China","award":["RKX20231110090859012"],"award-info":[{"award-number":["RKX20231110090859012"]}]},{"name":"National Natural Science Foundation of China","award":["SGDX20230116091244004"],"award-info":[{"award-number":["SGDX20230116091244004"]}]},{"name":"National Natural Science Foundation of China","award":["KCXST20221021111404010"],"award-info":[{"award-number":["KCXST20221021111404010"]}]},{"name":"National Natural Science Foundation of China","award":["JSGGKQTD20221101115655027"],"award-info":[{"award-number":["JSGGKQTD20221101115655027"]}]},{"name":"National Natural Science Foundation of China","award":["KJZD20231023095959002"],"award-info":[{"award-number":["KJZD20231023095959002"]}]},{"name":"National Natural Science Foundation of China","award":["KP191001"],"award-info":[{"award-number":["KP191001"]}]},{"name":"Shenzhen Science and Technology Program","award":["62376074"],"award-info":[{"award-number":["62376074"]}]},{"name":"Shenzhen Science and Technology Program","award":["RKX20231110090859012"],"award-info":[{"award-number":["RKX20231110090859012"]}]},{"name":"Shenzhen Science and Technology Program","award":["SGDX20230116091244004"],"award-info":[{"award-number":["SGDX20230116091244004"]}]},{"name":"Shenzhen Science and Technology Program","award":["KCXST20221021111404010"],"award-info":[{"award-number":["KCXST20221021111404010"]}]},{"name":"Shenzhen Science and Technology Program","award":["JSGGKQTD20221101115655027"],"award-info":[{"award-number":["JSGGKQTD20221101115655027"]}]},{"name":"Shenzhen Science and Technology Program","award":["KJZD20231023095959002"],"award-info":[{"award-number":["KJZD20231023095959002"]}]},{"name":"Shenzhen Science and Technology Program","award":["KP191001"],"award-info":[{"award-number":["KP191001"]}]},{"name":"Shenzhen Humanities and Social Sciences Key Research Bases","award":["62376074"],"award-info":[{"award-number":["62376074"]}]},{"name":"Shenzhen Humanities and Social Sciences Key Research Bases","award":["RKX20231110090859012"],"award-info":[{"award-number":["RKX20231110090859012"]}]},{"name":"Shenzhen Humanities and Social Sciences Key Research Bases","award":["SGDX20230116091244004"],"award-info":[{"award-number":["SGDX20230116091244004"]}]},{"name":"Shenzhen Humanities and Social Sciences Key Research Bases","award":["KCXST20221021111404010"],"award-info":[{"award-number":["KCXST20221021111404010"]}]},{"name":"Shenzhen Humanities and Social Sciences Key Research Bases","award":["JSGGKQTD20221101115655027"],"award-info":[{"award-number":["JSGGKQTD20221101115655027"]}]},{"name":"Shenzhen Humanities and Social Sciences Key Research Bases","award":["KJZD20231023095959002"],"award-info":[{"award-number":["KJZD20231023095959002"]}]},{"name":"Shenzhen Humanities and Social Sciences Key Research Bases","award":["KP191001"],"award-info":[{"award-number":["KP191001"]}]},{"name":"Harbin Institute of Technology (Shenzhen) Joint Basic Education Cultivation Project","award":["62376074"],"award-info":[{"award-number":["62376074"]}]},{"name":"Harbin Institute of Technology (Shenzhen) Joint Basic Education Cultivation Project","award":["RKX20231110090859012"],"award-info":[{"award-number":["RKX20231110090859012"]}]},{"name":"Harbin Institute of Technology (Shenzhen) Joint Basic Education Cultivation Project","award":["SGDX20230116091244004"],"award-info":[{"award-number":["SGDX20230116091244004"]}]},{"name":"Harbin Institute of Technology (Shenzhen) Joint Basic Education Cultivation Project","award":["KCXST20221021111404010"],"award-info":[{"award-number":["KCXST20221021111404010"]}]},{"name":"Harbin Institute of Technology (Shenzhen) Joint Basic Education Cultivation Project","award":["JSGGKQTD20221101115655027"],"award-info":[{"award-number":["JSGGKQTD20221101115655027"]}]},{"name":"Harbin Institute of Technology (Shenzhen) Joint Basic Education Cultivation Project","award":["KJZD20231023095959002"],"award-info":[{"award-number":["KJZD20231023095959002"]}]},{"name":"Harbin Institute of Technology (Shenzhen) Joint Basic Education Cultivation Project","award":["KP191001"],"award-info":[{"award-number":["KP191001"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Systems"],"abstract":"<jats:p>With the rapid development of the capital market, financial fraud cases are becoming increasingly common. The evolving fraud strategies pose significant threats to financial regulation, market order, and the interests of ordinary investors. In order to combine the generalization performance of different machine learning methods and improve the effectiveness of financial fraud prediction, this paper proposes a novel financial fraud prediction framework based on stacking ensemble learning. This framework, based on data from listed companies, comprehensively considers financial ratio indicators and non-financial indicators. It uses the stacking ensemble technique to integrate numerous base models of machine learning algorithms for predicting financial fraud. Furthermore, the proposed framework has high versatility and is suitable for various tasks related to financial fraud prediction, addressing the problem of model selection difficulties in previous research due to different scenarios and data. We also conducted case studies on specific companies and industries, confirming the significant interpretability and practical applicability of the proposed framework. The results show that the recall rate and Area Under Curve (AUC) of our framework reached 0.8246 and 0.8146, respectively, surpassing mainstream machine learning models such as XGBoost and LightGBM in existing studies. This research study is of great significance for predicting the increasing number of financial fraud cases, providing a reliable tool for financial regulatory institutions and investors.<\/jats:p>","DOI":"10.3390\/systems12120588","type":"journal-article","created":{"date-parts":[[2024,12,24]],"date-time":"2024-12-24T02:50:56Z","timestamp":1735008656000},"page":"588","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":12,"title":["A Financial Fraud Prediction Framework Based on Stacking Ensemble Learning"],"prefix":"10.3390","volume":"12","author":[{"given":"Shanshan","family":"Zhu","sequence":"first","affiliation":[{"name":"School of Economics and Management, Harbin Institute of Technology, Shenzhen 518000, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-7609-0623","authenticated-orcid":false,"given":"Haotian","family":"Wu","sequence":"additional","affiliation":[{"name":"Faculty of Computer, Harbin Institute of Technology, Harbin 150001, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Eric W. T.","family":"Ngai","sequence":"additional","affiliation":[{"name":"Department of Management and Marketing, The Hong Kong Polytechnic University, Hong Kong 00852, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4593-802X","authenticated-orcid":false,"given":"Jifan","family":"Ren","sequence":"additional","affiliation":[{"name":"School of Economics and Management, Harbin Institute of Technology, Shenzhen 518000, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daojing","family":"He","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen 518000, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tengyun","family":"Ma","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen 518000, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4003-3571","authenticated-orcid":false,"given":"Yubin","family":"Li","sequence":"additional","affiliation":[{"name":"School of Economics and Management, Harbin Institute of Technology, Shenzhen 518000, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2024,12,23]]},"reference":[{"key":"ref_1","unstructured":"ACFE (2020). Report to the Nations 2020 Global Study on Occupational Fraud and Abuse, Association of Certified Fraud Examiners. Available online: https:\/\/legacy.acfe.com\/report-to-the-nations\/2020\/."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Kwok, B.K. (2017). Accounting Irregularities in Financial Statements: A Definitive Guide for Litigators, Auditors and Fraud Investigators, Routledge.","DOI":"10.4324\/9781315263441"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"100559","DOI":"10.1016\/j.accinf.2022.100559","article-title":"Detecting accounting fraud in companies reporting under US GAAP through data mining","volume":"45","year":"2022","journal-title":"Int. J. Account. Inf. Syst."},{"key":"ref_4","unstructured":"Cressey, D. (1953). Other People\u2019s Money, Patterson Smith. A Study of the Social Psychology of Embezzlement."},{"key":"ref_5","first-page":"143","article-title":"Patterns of similarity of corporate frauds","volume":"21","author":"Imoniana","year":"2016","journal-title":"Qual. Rep."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"384","DOI":"10.51594\/farj.v6i3.899","article-title":"Reviewing the role of big data analytics in financial fraud detection","volume":"6","author":"Shoetan","year":"2024","journal-title":"Financ. Account. Res. J."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"109118","DOI":"10.1016\/j.cie.2023.109118","article-title":"Tracking down financial statement fraud by analyzing the supplier-customer relationship network","volume":"178","author":"Li","year":"2023","journal-title":"Comput. Ind. Eng."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"113402","DOI":"10.1016\/j.dss.2020.113402","article-title":"Drivers of and barriers to decision support technology use by financial report auditors","volume":"139","author":"Meredith","year":"2020","journal-title":"Decis. Support Syst."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1111\/1475-679X.12292","article-title":"Detecting accounting fraud in publicly traded US firms using a machine learning approach","volume":"58","author":"Bao","year":"2020","journal-title":"J. Account. Res."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"116148","DOI":"10.1016\/j.eswa.2021.116148","article-title":"Fraud detection in publicly traded US firms using Beetle Antennae Search: A machine learning approach","volume":"191","author":"Khan","year":"2022","journal-title":"Expert Syst. Appl."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"113913","DOI":"10.1016\/j.dss.2022.113913","article-title":"Attentive statement fraud detection: Distinguishing multimodal financial data with fine-grained attention","volume":"167","author":"Wang","year":"2023","journal-title":"Decis. Support Syst."},{"key":"ref_12","first-page":"507","article-title":"Why do tree-based models still outperform deep learning on typical tabular data?","volume":"35","author":"Grinsztajn","year":"2022","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1111\/j.1911-3846.2010.01041.x","article-title":"Predicting material accounting misstatements","volume":"28","author":"Dechow","year":"2011","journal-title":"Contemp. Account. Res."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"19","DOI":"10.2308\/ajpt-50009","article-title":"Financial statement fraud detection: An analysis of statistical and machine learning algorithms","volume":"30","author":"Perols","year":"2011","journal-title":"Audit. A J. Pract. Theory"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"1293","DOI":"10.2307\/41703508","article-title":"Metafraud: A meta-learning framework for detecting financial fraud","volume":"36","author":"Abbasi","year":"2012","journal-title":"Mis. Q."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1016\/j.dss.2010.11.006","article-title":"Detection of financial statement fraud and feature selection using data mining techniques","volume":"50","author":"Ravisankar","year":"2011","journal-title":"Decis. Support Syst."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Hassanniakalager, A., Perotti, P., and Tsoligkas, F. (2022). A Machine Learning Approach to Detect Accounting Frauds. SSRN Electron. J.","DOI":"10.2139\/ssrn.4117764"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1109\/TSMCA.2009.2029559","article-title":"RUSBoost: A hybrid approach to alleviating class imbalance","volume":"40","author":"Seiffert","year":"2009","journal-title":"IEEE Trans. Syst. Man Cybern.-Part A Syst. Humans"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"71","DOI":"10.2307\/2490171","article-title":"Financial ratios as predictors of failure","volume":"4","author":"Beaver","year":"1966","journal-title":"J. Account. Res."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1002\/j.1099-1174.1995.tb00084.x","article-title":"Detection of management fraud: A neural network approach","volume":"4","author":"Fanning","year":"1995","journal-title":"Intell. Syst. Account. Financ. Manag."},{"key":"ref_21","first-page":"14","article-title":"Assessing the risk of management fraud through neural network technology","volume":"16","author":"Green","year":"1997","journal-title":"Auditing"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1002\/(SICI)1099-1174(199803)7:1<21::AID-ISAF138>3.0.CO;2-K","article-title":"Neural network detection of management fraud using published financial data","volume":"7","author":"Fanning","year":"1998","journal-title":"Intell. Syst. Account. Financ. Manag."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1146","DOI":"10.1287\/mnsc.1100.1174","article-title":"Detecting management fraud in public companies","volume":"56","author":"Cecchini","year":"2010","journal-title":"Manag. Sci."},{"key":"ref_24","first-page":"349","article-title":"A Bayesian approach for predicting material accounting misstatements","volume":"21","author":"Xu","year":"2014","journal-title":"Asia-Pac. J. Account. Econ."},{"key":"ref_25","first-page":"185","article-title":"Application of selected data mining techniques in unintentional accounting error detection","volume":"16","author":"Papik","year":"2021","journal-title":"Equilib. Q. J. Econ. Econ. Policy"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1016\/j.eswa.2016.06.016","article-title":"Detecting financial misstatements with fraud intention using multi-class cost-sensitive learning","volume":"62","author":"Kim","year":"2016","journal-title":"Expert Syst. Appl."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1016\/j.knosys.2017.05.001","article-title":"Mining corporate annual reports for intelligent detection of financial statement fraud\u2014A comparative study of machine learning methods","volume":"128","author":"Hajek","year":"2017","journal-title":"Knowl.-Based Syst."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1111\/1475-679X.12294","article-title":"What are you saying? Using topic to detect financial misreporting","volume":"58","author":"Brown","year":"2020","journal-title":"J. Account. Res."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"585","DOI":"10.1016\/j.dss.2010.08.009","article-title":"Identification of fraudulent financial statements using linguistic credibility analysis","volume":"50","author":"Humpherys","year":"2011","journal-title":"Decis. Support Syst."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"113421","DOI":"10.1016\/j.dss.2020.113421","article-title":"Deep learning for detecting financial statement fraud","volume":"139","author":"Craja","year":"2020","journal-title":"Decis. Support Syst."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Jan, C.L. (2021). Detection of financial statement fraud using deep learning for sustainable development of capital markets under information asymmetry. Sustainability, 13.","DOI":"10.3390\/su13179879"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"64","DOI":"10.3846\/jbem.2019.10179","article-title":"Detection models for unintentional financial restatements","volume":"21","author":"Papik","year":"2020","journal-title":"J. Bus. Econ. Manag."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"769","DOI":"10.2991\/ijcis.d.210203.007","article-title":"Comparing performances and effectiveness of machine learning classifiers in detecting financial accounting fraud for Turkish SMEs","volume":"14","author":"Hamal","year":"2021","journal-title":"Int. J. Comput. Intell. Syst."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"107487","DOI":"10.1016\/j.asoc.2021.107487","article-title":"A financial statement fraud model based on synthesized attribute selection and a dataset with missing values and imbalanced classes","volume":"108","author":"Cheng","year":"2021","journal-title":"Appl. Soft Comput."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"4601","DOI":"10.1111\/acfi.12742","article-title":"Lifting the numbers game: Identifying key input variables and a best-performing model to detect financial statement fraud","volume":"61","author":"Gepp","year":"2021","journal-title":"Account. Financ."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1016\/j.dss.2015.04.006","article-title":"Financial fraud detection using vocal, linguistic and financial cues","volume":"74","author":"Throckmorton","year":"2015","journal-title":"Decis. Support Syst."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Yao, J., Pan, Y., Yang, S., Chen, Y., and Li, Y. (2019). Detecting fraudulent financial statements for the sustainable development of the socio-economy in China: A multi-analytic approach. Sustainability, 11.","DOI":"10.3390\/su11061579"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"114231","DOI":"10.1016\/j.dss.2024.114231","article-title":"The information content of financial statement fraud risk: An ensemble learning approach","volume":"174","author":"Duan","year":"2024","journal-title":"Decis. Support Syst."},{"key":"ref_39","first-page":"93","article-title":"Developing a model to predict fraudulent financial reporting","volume":"15","author":"Khaksari","year":"2024","journal-title":"Int. J. Nonlinear Anal. Appl."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1108\/JFC-09-2022-0219","article-title":"Fraud detection using fraud triangle theory: Evidence from China","volume":"31","author":"Rahman","year":"2024","journal-title":"J. Financ. Crime"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"100682","DOI":"10.1016\/j.accinf.2024.100682","article-title":"Accounting fraud detection using contextual language learning","volume":"53","author":"Bhattacharya","year":"2024","journal-title":"Int. J. Account. Inf. Syst."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"468","DOI":"10.1007\/s11142-020-09563-8","article-title":"Using machine learning to detect misstatements","volume":"26","author":"Bertomeu","year":"2021","journal-title":"Rev. Account. Stud."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1007\/s10551-022-05120-2","article-title":"Using machine learning to predict corporate fraud: Evidence based on the GONE framework","volume":"186","author":"Xu","year":"2023","journal-title":"J. Bus. Ethics"},{"key":"ref_44","first-page":"1","article-title":"Preventing the unpleasant: Fraudulent financial statement detection using financial ratios","volume":"17","author":"Pazarskis","year":"2021","journal-title":"J. Oper. Risk"},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1016\/j.accinf.2017.06.004","article-title":"Enhancement of fraud detection for narratives in annual reports","volume":"26","author":"Chen","year":"2017","journal-title":"Int. J. Account. Inf. Syst."},{"key":"ref_46","first-page":"104","article-title":"Forecasting fraudulent financial statements using data mining","volume":"3","author":"Kotsiantis","year":"2006","journal-title":"Int. J. Comput. Intell."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1007\/BF00116037","article-title":"The strength of weak learnability","volume":"5","author":"Schapire","year":"1990","journal-title":"Mach. Learn."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1006\/jcss.1997.1504","article-title":"A decision-theoretic generalization of on-line learning and an application to boosting","volume":"55","author":"Freund","year":"1997","journal-title":"J. Comput. Syst. Sci."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Chen, T., and Guestrin, C. (2016, January 13\u201317). Xgboost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA.","DOI":"10.1145\/2939672.2939785"},{"key":"ref_50","first-page":"3146","article-title":"Lightgbm: A highly efficient gradient boosting decision tree","volume":"30","author":"Ke","year":"2017","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"505","DOI":"10.1111\/j.1467-8640.2012.00425.x","article-title":"Machine learning methods for detecting patterns of management fraud","volume":"28","author":"Whiting","year":"2012","journal-title":"Comput. Intell."},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1007\/s10994-006-6226-1","article-title":"Extremely randomized trees","volume":"63","author":"Geurts","year":"2006","journal-title":"Mach. Learn."}],"container-title":["Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2079-8954\/12\/12\/588\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T16:58:52Z","timestamp":1760115532000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2079-8954\/12\/12\/588"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,12,23]]},"references-count":53,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["systems12120588"],"URL":"https:\/\/doi.org\/10.3390\/systems12120588","relation":{},"ISSN":["2079-8954"],"issn-type":[{"value":"2079-8954","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,12,23]]}}}