{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,20]],"date-time":"2026-01-20T09:16:06Z","timestamp":1768900566938,"version":"3.49.0"},"reference-count":39,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,11,26]],"date-time":"2024-11-26T00:00:00Z","timestamp":1732579200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0"},{"start":{"date-parts":[[2024,11,26]],"date-time":"2024-11-26T00:00:00Z","timestamp":1732579200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0"}],"funder":[{"DOI":"10.13039\/501100024055","name":"Nova Scotia Health Authority","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100024055","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000038","name":"Natural Sciences and Engineering Research Council of Canada","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100000038","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Discov Artif Intell"],"DOI":"10.1007\/s44163-024-00199-0","type":"journal-article","created":{"date-parts":[[2024,11,26]],"date-time":"2024-11-26T13:00:47Z","timestamp":1732626047000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["The effect of resampling techniques on the performances of machine learning clinical risk prediction models in the setting of severe class imbalance: development and internal validation in a retrospective cohort"],"prefix":"10.1007","volume":"4","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3121-9780","authenticated-orcid":false,"given":"Janny Xue Chen","family":"Ke","sequence":"first","affiliation":[]},{"given":"Arunachalam","family":"DhakshinaMurthy","sequence":"additional","affiliation":[]},{"given":"Ronald B.","family":"George","sequence":"additional","affiliation":[]},{"given":"Paula","family":"Branco","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,11,26]]},"reference":[{"issue":"10170","key":"199_CR1","doi-asserted-by":"publisher","first-page":"401","DOI":"10.1016\/S0140-6736(18)33139-8","volume":"393","author":"D Nepogodiev","year":"2019","unstructured":"Nepogodiev D, et al. Global burden of postoperative death. Lancet. 2019;393(10170):401.","journal-title":"Lancet"},{"issue":"4","key":"199_CR2","doi-asserted-by":"publisher","first-page":"959","DOI":"10.1097\/ALN.0b013e3182a4e94d","volume":"119","author":"SR Moonesinghe","year":"2013","unstructured":"Moonesinghe SR, Mythen MG, Das P, Rowan KM, Grocott MPW. Risk stratification tools for predicting morbidity and mortality in adult patients undergoing major surgery: qualitative systematic review. Anesthesiology. 2013;119(4):959\u201381.","journal-title":"Anesthesiology"},{"issue":"10","key":"199_CR3","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pmed.1003253","volume":"17","author":"DJN Wong","year":"2020","unstructured":"Wong DJN, Harris S, Sahni A, Bedford JR, Cortes L, Shawyer R, et al. Developing and validating subjective and objective risk-assessment measures for predicting mortality after major surgery: an international prospective cohort study. PLOS Med. 2020;17(10): e1003253.","journal-title":"PLOS Med"},{"issue":"3","key":"199_CR4","first-page":"525","volume":"119","author":"MJG Sigakis","year":"2013","unstructured":"Sigakis MJG, Bittner EA, Wanderer JP. Validation of a risk stratification index and risk quantification index for predicting patient outcomesin-hospital mortality, 30-day mortality, 1-year mortality, and length-of-stay. Anesthesiol J Am Soc Anesthesiol. 2013;119(3):525\u201340.","journal-title":"Anesthesiol J Am Soc Anesthesiol"},{"issue":"4","key":"199_CR5","doi-asserted-by":"publisher","first-page":"649","DOI":"10.1097\/ALN.0000000000002186","volume":"129","author":"CK Lee","year":"2018","unstructured":"Lee CK, Hofer I, Gabel E, Baldi P, Cannesson M. Development and validation of a deep neural network model for prediction of postoperative in-hospital mortality. Anesthesiology. 2018;129(4):649\u201362.","journal-title":"Anesthesiology"},{"issue":"6","key":"199_CR6","doi-asserted-by":"publisher","first-page":"877","DOI":"10.1016\/j.bja.2019.07.030","volume":"123","author":"BL Hill","year":"2019","unstructured":"Hill BL, Brown R, Gabel E, Rakocz N, Lee C, Cannesson M, et al. An automated machine learning-based model predicts postoperative mortality using readily-extractable preoperative electronic health record data. Br J Anaesth. 2019;123(6):877\u201386.","journal-title":"Br J Anaesth"},{"issue":"5","key":"199_CR7","doi-asserted-by":"publisher","first-page":"688","DOI":"10.1016\/j.bja.2019.07.025","volume":"123","author":"BA Fritz","year":"2019","unstructured":"Fritz BA, Cui Z, Zhang M, He Y, Chen Y, Kronzer A, et al. Deep-learning model for predicting 30-day postoperative mortality. Br J Anaesth. 2019;123(5):688\u201395.","journal-title":"Br J Anaesth"},{"key":"199_CR8","doi-asserted-by":"publisher","DOI":"10.1007\/s12630-022-02287-0","author":"JXC Ke","year":"2022","unstructured":"Ke JXC, McIsaac DI, George RB, Branco P, Cook EF, Beattie WS, et al. Postoperative mortality risk prediction that incorporates intraoperative vital signs: development and internal validation in a historical cohort. Can J Anesth. 2022. https:\/\/doi.org\/10.1007\/s12630-022-02287-0.","journal-title":"Can J Anesth"},{"key":"199_CR9","doi-asserted-by":"crossref","unstructured":"Kazemi P, Lau F, Simpao AF, Williams RJ, Matava C. The state of adoption of anesthesia information management systems in Canadian academic anesthesia departments: a survey. Can J Anaesth J Can Anesth [Internet]. 2021 Jan 29; Available from: https:\/\/rdcu.be\/cesb5","DOI":"10.1007\/s12630-021-01924-4"},{"issue":"11","key":"199_CR10","doi-asserted-by":"publisher","first-page":"1270","DOI":"10.1038\/s41592-021-01302-4","volume":"18","author":"FM Megahed","year":"2021","unstructured":"Megahed FM, Chen YJ, Megahed A, Ong Y, Altman N, Krzywinski M. The class imbalance problem. Nat Methods. 2021;18(11):1270\u20132.","journal-title":"Nat Methods"},{"issue":"2","key":"199_CR11","first-page":"31:1","volume":"49","author":"P Branco","year":"2016","unstructured":"Branco P, Torgo L, Ribeiro RP. A survey of predictive modeling on imbalanced domains. ACM Comput Surv. 2016;49(2):31:1-31:50.","journal-title":"ACM Comput Surv"},{"issue":"3","key":"199_CR12","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0118432","volume":"10","author":"T Saito","year":"2015","unstructured":"Saito T, Rehmsmeier M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS ONE. 2015;10(3): e0118432.","journal-title":"PLoS ONE"},{"key":"199_CR13","doi-asserted-by":"publisher","first-page":"67","DOI":"10.1007\/978-3-030-88942-5_6","volume-title":"Discovery science: lecture notes in computer science","author":"JG Gaudreault","year":"2021","unstructured":"Gaudreault JG, Branco P, Gama J. An analysis of performance metrics for imbalanced classification. In: Soares C, Torgo L, editors. Discovery science: lecture notes in computer science. Cham: Springer International Publishing; 2021. p. 67\u201377."},{"issue":"2","key":"199_CR14","doi-asserted-by":"publisher","DOI":"10.1001\/jamanetworkopen.2019.20733","volume":"3","author":"N Brajer","year":"2020","unstructured":"Brajer N, Cozzi B, Gao M, Nichols M, Revoir M, Balu S, et al. Prospective and external evaluation of a machine learning model to predict in-hospital mortality of adults at time of admission. JAMA Netw Open. 2020;3(2): e1920733.","journal-title":"JAMA Netw Open"},{"key":"199_CR15","doi-asserted-by":"publisher","first-page":"48","DOI":"10.1016\/j.jbi.2018.02.008","volume":"79","author":"R Davoodi","year":"2018","unstructured":"Davoodi R, Moradi MH. Mortality prediction in intensive care units (ICUs) using a deep rule-based fuzzy classifier. J Biomed Inform. 2018;79:48\u201359.","journal-title":"J Biomed Inform"},{"key":"199_CR16","doi-asserted-by":"crossref","unstructured":"Kabir MF, Ludwig S. Classification of Breast Cancer Risk Factors Using Several Resampling Approaches. In: 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA) [Internet]. 2018 [cited 2024 Sep 22]. p. 1243\u20138. Available from: https:\/\/ieeexplore.ieee.org\/document\/8614227","DOI":"10.1109\/ICMLA.2018.00202"},{"key":"199_CR17","doi-asserted-by":"publisher","first-page":"109960","DOI":"10.1109\/ACCESS.2021.3102399","volume":"9","author":"M Khushi","year":"2021","unstructured":"Khushi M, Shaukat K, Alam TM, Hameed IA, Uddin S, Luo S, et al. A comparative performance analysis of data resampling methods on imbalance medical data. IEEE Access. 2021;9:109960\u201375.","journal-title":"IEEE Access"},{"issue":"136","key":"199_CR18","doi-asserted-by":"publisher","first-page":"190","DOI":"10.1016\/j.patrec.2020.05.035","volume":"1","author":"C Wang","year":"2020","unstructured":"Wang C, Deng C, Wang S. Imbalance-XGBoost: leveraging weighted and focal losses for binary label-imbalanced classification with XGBoost. Pattern Recognit Lett. 2020;1(136):190\u20137.","journal-title":"Pattern Recognit Lett"},{"issue":"6","key":"199_CR19","doi-asserted-by":"publisher","first-page":"155013292211069","DOI":"10.1177\/15501329221106935","volume":"18","author":"P Zhang","year":"2022","unstructured":"Zhang P, Jia Y, Shang Y. Research and application of XGBoost in imbalanced data. Int J Distrib Sens Netw. 2022;18(6):15501329221106936.","journal-title":"Int J Distrib Sens Netw"},{"key":"199_CR20","unstructured":"Dakshinamurthy A. GitHub In hospital Mortality Prediction Research Project [Internet]. 2022 [cited 2022 Sep 8]. Available from: https:\/\/github.com\/Arunachalam4505\/In-hospital-Mortality-Prediction-Research-Project"},{"key":"199_CR21","unstructured":"scikit-learn: machine learning in Python\u2014scikit-learn 1.1.2 documentation [Internet]. [cited 2022 Sep 8]. Available from: https:\/\/scikit-learn.org\/stable\/"},{"key":"199_CR22","doi-asserted-by":"crossref","unstructured":"Bergstra J, Yamins D, Cox DD. Making a science of model search: hyperparameter optimization in hundreds of dimensions for vision architectures. In: Proceedings of the 30th International Conference on International Conference on Machine Learning - Volume 28. Atlanta, GA. JMLR.org; 2013. p. I-115\u2013I-123. (ICML\u201913).","DOI":"10.25080\/Majora-8b375195-003"},{"key":"199_CR23","unstructured":"imbalanced-learn documentation\u2014Version 0.9.1 [Internet]. [cited 2022 Sep 8]. Available from: https:\/\/imbalanced-learn.org\/stable\/"},{"issue":"1","key":"199_CR24","first-page":"1","volume":"97","author":"M Kubat","year":"1997","unstructured":"Kubat M, Matwin S. Addressing the curse of imbalanced training sets: one-sided selection. Fourteenth Int Conf Mach Learn. 1997;97(1):1\u20138.","journal-title":"Fourteenth Int Conf Mach Learn"},{"issue":"1","key":"199_CR25","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1111\/j.0824-7935.2004.t01-1-00228.x","volume":"20","author":"A Estabrooks","year":"2004","unstructured":"Estabrooks A, Jo T, Japkowicz N. A multiple resampling method for learning from imbalanced data sets. Comput Intell. 2004;20(1):18\u201336.","journal-title":"Comput Intell"},{"key":"199_CR26","unstructured":"Zhang J, Mani I. KNN approach to unbalanced data distributions: a case study involving information extraction. In: Proceedings of the ICML\u20192003 Workshop on Learning from Imbalanced Datasets; 2003."},{"issue":"1","key":"199_CR27","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1613\/jair.953","volume":"16","author":"NV Chawla","year":"2002","unstructured":"Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. 2002;16(1):321\u201357.","journal-title":"J Artif Intell Res"},{"issue":"61","key":"199_CR28","doi-asserted-by":"publisher","first-page":"863","DOI":"10.1613\/jair.1.11192","volume":"20","author":"A Fernandez","year":"2018","unstructured":"Fernandez A, Garcia S, Herrera F, Chawla NV. SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary. J Artif Intell Res. 2018;20(61):863\u2013905.","journal-title":"J Artif Intell Res"},{"issue":"2","key":"199_CR29","doi-asserted-by":"publisher","first-page":"301","DOI":"10.1111\/j.1467-9868.2005.00503.x","volume":"67","author":"H Zou","year":"2005","unstructured":"Zou H, Hastie T. Regularization and variable selection via the elastic net. J R Stat Soc Ser B Stat Methodol. 2005;67(2):301\u201320.","journal-title":"J R Stat Soc Ser B Stat Methodol"},{"key":"199_CR30","volume-title":"Classification and regression trees","author":"L Breiman","year":"1984","unstructured":"Breiman L, Friedman JH, Olshen RA, Stone CJ. Classification and regression trees. Monterey, CA: Wadsworth and Brooks; 1984."},{"issue":"1","key":"199_CR31","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman L. Random forests. Mach Learn. 2001;45(1):5\u201332.","journal-title":"Mach Learn"},{"key":"199_CR32","doi-asserted-by":"crossref","unstructured":"Chen T, Guestrin C. XGBoost: A scalable tree boosting system. Proc 22nd ACM SIGKDD Int Conf Knowl Discov Data Min. 2016.","DOI":"10.1145\/2939672.2939785"},{"key":"199_CR33","unstructured":"Pelikan M, Goldberg DE, Cant\u00fa-Paz E. BOA: the Bayesian optimization algorithm. In: Proceedings of the 1st Annual Conference on Genetic and Evolutionary Computation - Volume 1. San Francisco, CA: Morgan Kaufmann Publishers Inc.; 1999. p. 525\u201332. (GECCO\u201999)."},{"key":"199_CR34","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4614-7138-7","volume-title":"An introduction to statistical learning: with applications in R. Springer texts in statistics","author":"G James","year":"2013","unstructured":"James G, Witten D, Hastie T, Tibshirani R. An introduction to statistical learning: with applications in R. Springer texts in statistics. New York: Springer; 2013."},{"issue":"1","key":"199_CR35","doi-asserted-by":"publisher","first-page":"128","DOI":"10.1097\/EDE.0b013e3181c30fb2","volume":"21","author":"EW Steyerberg","year":"2010","unstructured":"Steyerberg EW, Vickers AJ, Cook NR, Gerds T, Gonen M, Obuchowski N, et al. Assessing the performance of prediction models: a framework for some traditional and novel measures. Epidemiol Camb Mass. 2010;21(1):128\u201338.","journal-title":"Epidemiol Camb Mass"},{"issue":"9","key":"199_CR36","doi-asserted-by":"publisher","first-page":"1525","DOI":"10.1093\/jamia\/ocac093","volume":"29","author":"R van den Goorbergh","year":"2022","unstructured":"van den Goorbergh R, van Smeden M, Timmerman D, Van Calster B. The harm of class imbalance corrections for risk prediction models: illustration and simulation using logistic regression. J Am Med Inform Assoc. 2022;29(9):1525\u201334.","journal-title":"J Am Med Inform Assoc"},{"issue":"1","key":"199_CR37","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1186\/s41512-019-0064-7","volume":"3","author":"AJ Vickers","year":"2019","unstructured":"Vickers AJ, van Calster B, Steyerberg EW. A simple, step-by-step guide to interpreting decision curve analysis. Diagn Progn Res. 2019;3(1):18.","journal-title":"Diagn Progn Res"},{"issue":"1","key":"199_CR38","doi-asserted-by":"publisher","first-page":"106","DOI":"10.1186\/1471-2105-14-106","volume":"14","author":"R Blagus","year":"2013","unstructured":"Blagus R, Lusa L. SMOTE for high-dimensional class-imbalanced data. BMC Bioinformatics. 2013;14(1):106.","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"199_CR39","doi-asserted-by":"publisher","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","volume":"57","author":"Y Benjamini","year":"1995","unstructured":"Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat Soc Ser B. 1995;57(1):289\u2013300.","journal-title":"J Roy Stat Soc Ser B"}],"container-title":["Discover Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s44163-024-00199-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s44163-024-00199-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s44163-024-00199-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,26]],"date-time":"2024-11-26T13:03:11Z","timestamp":1732626191000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s44163-024-00199-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,11,26]]},"references-count":39,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["199"],"URL":"https:\/\/doi.org\/10.1007\/s44163-024-00199-0","relation":{},"ISSN":["2731-0809"],"issn-type":[{"value":"2731-0809","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,11,26]]},"assertion":[{"value":"9 July 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 November 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 November 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"We obtained research ethics approval (Nova Scotia Health Authority Research Ethics Board, Halifax, NS, Canada; file # 1024251), and conducted this study according to the principles of the Declaration of Helsinki.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Dr. Janny Ke received salary support as the Clinical Data Lead, St. Paul\u2019s Hospital, Canada, for Project \u201cReducing Opioid Use for Pain Management\u201d from Canadian Digital Supercluster DIGITAL and Consortium (Careteam Technologies Inc, Thrive Health Inc, Excelar Technologies (Connected Displays Inc), Providence Health Care Ventures Inc, and Xerus Inc). Dr Ke provided paid consulting for Careflow Technologies (Connected Displays Inc), funded via Providence Health Care Ventures. Dr. Ke receives research and salary support for Project \"Continuous Connected Patient Care\" in patients with high perioperative risk, funded by DIGITAL and Consortium: Medtronic Canada ULC, Cloud Diagnostics Canada ULC, Excelar Technologies (Connected Displays Inc), Providence Health Care Ventures Inc, 3D Bridge Solutions Inc, and FluidAI (NERv Technology Inc). Dr. Paula Branco received research funding from NSERC, Mitacs, IBM and University of Ottawa.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"91"}}