{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,6]],"date-time":"2026-03-06T06:22:04Z","timestamp":1772778124070,"version":"3.50.1"},"reference-count":37,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2014,10,17]],"date-time":"2014-10-17T00:00:00Z","timestamp":1413504000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Knowl Inf Syst"],"published-print":{"date-parts":[[2015,10]]},"DOI":"10.1007\/s10115-014-0794-3","type":"journal-article","created":{"date-parts":[[2014,10,16]],"date-time":"2014-10-16T06:14:49Z","timestamp":1413440089000},"page":"247-270","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":155,"title":["Class imbalance revisited: a new experimental setup to assess the performance of treatment methods"],"prefix":"10.1007","volume":"45","author":[{"given":"Ronaldo C.","family":"Prati","sequence":"first","affiliation":[]},{"given":"Gustavo E. A. P. A.","family":"Batista","sequence":"additional","affiliation":[]},{"given":"Diego F.","family":"Silva","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2014,10,17]]},"reference":[{"issue":"1","key":"794_CR1","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1145\/1007730.1007735","volume":"6","author":"GEAPA Batista","year":"2004","unstructured":"Batista GEAPA, Prati RC, Monard MC (2004) A study of the behavior of several methods for balancing machine learning training data. SIGKDD Explor 6(1):20\u201329","journal-title":"SIGKDD Explor"},{"issue":"1","key":"794_CR2","doi-asserted-by":"crossref","first-page":"231","DOI":"10.2307\/2528368","volume":"21","author":"BM Bennett","year":"1965","unstructured":"Bennett BM (1965) Confidence limits for a ratio using Wilcoxon\u2019s signed rank test. Biometics 21(1):231\u2013234","journal-title":"Biometics"},{"key":"794_CR3","doi-asserted-by":"crossref","unstructured":"Berrar D, Lozano JA (2013) Significance tests or confidence intervals: which are preferable for the comparison of classifiers?. J Exp Theor Artif Intell 25(2):189\u2013206. http:\/\/www.ingentaconnect.com\/content\/tandf\/teta\/2013\/00000025\/00000002\/art00003","DOI":"10.1080\/0952813X.2012.680252"},{"key":"794_CR4","unstructured":"Borgelt C (2012) Christian borgelt web page. http:\/\/www.borgelt.net\/"},{"key":"794_CR5","unstructured":"Chang C-C, Lin C-J (2012) Libsvm\u2014a library for support vector machines. http:\/\/www.csie.ntu.edu.tw\/cjlin\/libsvm\/"},{"key":"794_CR6","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1613\/jair.953","volume":"16","author":"NV Chawla","year":"2002","unstructured":"Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321\u2013357","journal-title":"J Artif Intell Res"},{"key":"794_CR7","doi-asserted-by":"crossref","unstructured":"Cieslak D, Chawla N (2008) Analyzing pets on imbalanced datasets when training and testing class distributions differ. In: Pacific-Asia conference on advances in knowledge discovery and data mining, pp 519\u2013526","DOI":"10.1007\/978-3-540-68125-0_46"},{"key":"794_CR8","doi-asserted-by":"crossref","unstructured":"Clark P, Boswell R (1991) Rule induction with CN2: some recent improvements. In: European working session on machine learning, pp 151\u2013163","DOI":"10.1007\/BFb0017011"},{"issue":"1","key":"794_CR9","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1016\/j.artmed.2005.03.002","volume":"37","author":"G Cohen","year":"2006","unstructured":"Cohen G, Hilario M, Sax H, Hugonnet S, Geissbhler A (2006) Learning from imbalanced data in surveillance of nosocomial infection. Artif Intell Med 37(1):7\u201318","journal-title":"Artif Intell Med"},{"key":"794_CR10","doi-asserted-by":"crossref","unstructured":"Cohen WW (1995) Fast effective rule induction. In: International conference on machine learning. Morgan Kaufmann, Los Altos, CA, pp 115\u2013123","DOI":"10.1016\/B978-1-55860-377-6.50023-2"},{"key":"794_CR11","doi-asserted-by":"crossref","unstructured":"Domingos P (1999) Metacost: a general method for making classifiers cost-sensitive. In: Fayyad UM, Chaudhuri S, Madigan D (eds) ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 155\u2013164","DOI":"10.1145\/312129.312220"},{"issue":"8","key":"794_CR12","doi-asserted-by":"crossref","first-page":"861","DOI":"10.1016\/j.patrec.2005.10.010","volume":"27","author":"T Fawcett","year":"2006","unstructured":"Fawcett T (2006) An introduction to ROC analysis. Pattern Recognit Lett 27(8):861\u2013874","journal-title":"Pattern Recognit Lett"},{"key":"794_CR13","doi-asserted-by":"crossref","unstructured":"Foody GM (2009) Classification accuracy comparison: Hypothesis tests and the use of confidence intervals in evaluations of difference, equivalence and non-inferiority. Remote Sens Environ 113(8):1658\u20131663. http:\/\/www.sciencedirect.com\/science\/article\/pii\/S0034425709000923","DOI":"10.1016\/j.rse.2009.03.014"},{"key":"794_CR14","unstructured":"Frank A, Asuncion A (2010) UCI machine learning repository. http:\/\/archive.ics.uci.edu\/ml"},{"key":"794_CR15","unstructured":"Froemke C, Hothorn L, Schneider M (2012) Confidence intervals for the ratio of locations and for the ratio of scales of two paired samples. Technical report, The Comprehensive R Archive Network. http:\/\/cran.r-project.org\/web\/packages\/pairedCI\/index.html"},{"issue":"4","key":"794_CR16","doi-asserted-by":"crossref","first-page":"463","DOI":"10.1109\/TSMCC.2011.2161285","volume":"42","author":"M Galar","year":"2012","unstructured":"Galar M, Fernandez A, Barrenechea E, Bustince H, Herrera F (2012) A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches. IEEE Trans Syst Man Cybern Part C 42(4):463\u2013484","journal-title":"IEEE Trans Syst Man Cybern Part C"},{"issue":"1","key":"794_CR17","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1145\/1007730.1007736","volume":"6","author":"H Guo","year":"2004","unstructured":"Guo H, Viktor HL (2004) Learning from imbalanced data sets with boosting and data generation: the databoost-im approach. SIGKDD Explor 6(1):30\u201339","journal-title":"SIGKDD Explor"},{"key":"794_CR18","doi-asserted-by":"crossref","unstructured":"Han H, Wang W-Y, Mao B-H (2005) Borderline-smote: a new over-sampling method in imbalanced data sets learning. In: International conference on advances in intelligent computing. Lecture notes in computer science. Springer, Berlin, pp 878\u2013887. doi: 10.1007\/11538059_91","DOI":"10.1007\/11538059_91"},{"key":"794_CR19","unstructured":"He H, Bai Y, Garcia E, Li S (2008) Adasyn: adaptive synthetic sampling approach for imbalanced learning. In: IEEE international joint conference on neural networks, pp 1322\u20131328"},{"issue":"9","key":"794_CR20","doi-asserted-by":"crossref","first-page":"1263","DOI":"10.1109\/TKDE.2008.239","volume":"21","author":"H He","year":"2009","unstructured":"He H, Garcia EA (2009) Learning from imbalanced data. IEEE Trans Knowl Data Eng 21(9):1263\u20131284","journal-title":"IEEE Trans Knowl Data Eng"},{"issue":"5","key":"794_CR21","doi-asserted-by":"crossref","first-page":"429","DOI":"10.3233\/IDA-2002-6504","volume":"6","author":"N Japkowicz","year":"2002","unstructured":"Japkowicz N, Stephen S (2002) The class imbalance problem: a systematic study. Intell Data Anal 6(5):429\u2013449","journal-title":"Intell Data Anal"},{"key":"794_CR22","doi-asserted-by":"crossref","unstructured":"Khoshgoftaar TM, Seiffert C, Hulse JV, Napolitano A, Folleco A (2007) Learning with limited minority class data. In: International conference on machine learning and applications, pp 348\u2013353","DOI":"10.1109\/ICMLA.2007.76"},{"issue":"2\u20133","key":"794_CR23","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1023\/A:1007452223027","volume":"30","author":"M Kubat","year":"1998","unstructured":"Kubat M, Holte RC, Matwin S (1998) Machine learning for the detection of oil spills in satellite radar images. Mach Learn 30(2\u20133):195\u2013215","journal-title":"Mach Learn"},{"key":"794_CR24","doi-asserted-by":"crossref","unstructured":"Liu X-Y, Wu J, Zhou Z-H (2006) Exploratory under-sampling for class-imbalance learning. In: IEEE international conference on data mining, pp 965\u2013969","DOI":"10.1109\/ICDM.2006.68"},{"key":"794_CR25","doi-asserted-by":"crossref","unstructured":"Liu X-Y, Zhou Z-H (2006) The influence of class imbalance on cost-sensitive learning: an empirical study. In: \u2018ICDM\u2019, IEEE Computer Society, pp 970\u2013974","DOI":"10.1109\/ICDM.2006.158"},{"key":"794_CR26","unstructured":"Michie D, Spiegelhalter DJ, Taylor CC (1994) Machine learning, neural and statistical classification. Ellis Horwood, New york"},{"issue":"1","key":"794_CR27","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1145\/1007730.1007738","volume":"6","author":"C Phua","year":"2004","unstructured":"Phua C, Alahakoon D, Lee V (2004) Minority report in fraud detection: classification of skewed data. SIGKDD Explor 6(1):50\u201359","journal-title":"SIGKDD Explor"},{"issue":"11","key":"794_CR28","doi-asserted-by":"crossref","first-page":"1601","DOI":"10.1109\/TKDE.2011.59","volume":"23","author":"RC Prati","year":"2011","unstructured":"Prati RC, Batista GEAPA, Monard MC (2011) A survey on graphical methods for classification predictive performance evaluation. IEEE Trans Knowl Data Eng 23(11):1601\u20131618","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"794_CR29","unstructured":"Prati RC, Batista GEAPA, Silva DF (2013) Paper website. http:\/\/sites.labic.icmc.usp.br\/ClassImbalanceRevisited\/"},{"key":"794_CR30","first-page":"445","volume-title":"International conference on machine learning","author":"FJ Provost","year":"1998","unstructured":"Provost FJ, Fawcett T, Kohavi R (1998) The case against accuracy estimation for comparing induction algorithms. In: Shavlik JW (ed) International conference on machine learning. Morgan Kaufmann, Los Altos, CA, pp 445\u2013453"},{"key":"794_CR31","unstructured":"Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc, Los Altos, CA"},{"key":"794_CR32","doi-asserted-by":"crossref","unstructured":"Wallace B, Small K, Brodley C, Trikalinos T (2011) Class imbalance, redux. In: IEEE international conference on data mining, pp 754\u2013763","DOI":"10.1109\/ICDM.2011.33"},{"key":"794_CR33","doi-asserted-by":"crossref","unstructured":"Wang X, Matwin S, Japkowicz N, Liu X (2013) Cost-sensitive boosting algorithms for imbalanced multi-instance datasets. In: Za\u00efane OR, Zilles S (eds) Canadian conference on artificial intelligence, vol 7884 of lecture notes in computer science. Springer, Berlin, pp 174\u2013186","DOI":"10.1007\/978-3-642-38457-8_15"},{"issue":"1","key":"794_CR34","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1145\/1007730.1007734","volume":"6","author":"GM Weiss","year":"2004","unstructured":"Weiss GM (2004) Mining with rarity: a unifying framework. SIGKDD Explor 6(1):7\u201319","journal-title":"SIGKDD Explor"},{"key":"794_CR35","unstructured":"Weiss GM, McCarthy K, Zabar B (2007) Cost-sensitive learning vs. sampling: which is best for handling unbalanced classes with unequal error costs? In: IEEE international conference on data mining, pp 35\u201341"},{"key":"794_CR36","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1613\/jair.1199","volume":"19","author":"GM Weiss","year":"2003","unstructured":"Weiss GM, Provost F (2003) Learning when training data are costly: the effect of class distribution on tree induction. J Artif Intell Res 19:315\u2013354","journal-title":"J Artif Intell Res"},{"key":"794_CR37","unstructured":"Wu G, Chang EY (2003) Class-boundary alignment for imbalanced dataset learning. In: Workshop on learning from imbalanced Datasets in international conference on machine learning"}],"container-title":["Knowledge and Information Systems"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10115-014-0794-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10115-014-0794-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10115-014-0794-3","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,8,16]],"date-time":"2019-08-16T07:35:55Z","timestamp":1565940955000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10115-014-0794-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,10,17]]},"references-count":37,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2015,10]]}},"alternative-id":["794"],"URL":"https:\/\/doi.org\/10.1007\/s10115-014-0794-3","relation":{},"ISSN":["0219-1377","0219-3116"],"issn-type":[{"value":"0219-1377","type":"print"},{"value":"0219-3116","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,10,17]]}}}