{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,15]],"date-time":"2026-06-15T14:45:50Z","timestamp":1781534750164,"version":"3.54.5"},"reference-count":41,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2026,5,8]],"date-time":"2026-05-08T00:00:00Z","timestamp":1778198400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62176107"],"award-info":[{"award-number":["62176107"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"award":["62176107"],"award-info":[{"award-number":["62176107"]}],"id":[{"id":"https:\/\/ror.org\/01h0zpd94","id-type":"ROR","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>In real-world scenarios, large-scale datasets often exhibit a long-tailed data distribution. Training deep neural networks on such data typically leads to a bias towards head classes. Existing studies have demonstrated that the reweighting strategy is an effective means to alleviate the long-tailed issue. Recent studies suggest that incorporating class difficulty into reweighting can yield superior results. However, the method of quantifying class difficulty by an independent validation set has shown limitations in practical applications, i.e., wasting training samples and inaccurate estimations. To address this issue, this study proposes a novel model based on K-fold cross-validation, called the adaptive combination validation model, which contains two main innovations: first, both class and sample difficulty are quantified by using a more comprehensive and authentic estimation strategy, i.e., K-fold cross-validation, to obtain accurate and robust estimations; second, we extract the prediction probability distributions of samples, which reflect sample difficulty, from different model branches and design a distribution-harmonized loss to simultaneously focus on the effects of reweighted and original distributions. Extensive experiments on several popular long-tailed image recognition datasets (CIFAR10-LT and CIFAR100-LT, with several varying imbalance rates, and ImageNet-LT) demonstrate that the proposed method can effectively alleviate the long-tailed issue and achieve state-of-the-art performance on most datasets.<\/jats:p>","DOI":"10.3390\/info17050455","type":"journal-article","created":{"date-parts":[[2026,5,8]],"date-time":"2026-05-08T19:10:52Z","timestamp":1778267452000},"page":"455","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["ACVM: An Adaptive Combination Validation Mechanism for Long-Tailed Image Recognition"],"prefix":"10.3390","volume":"17","author":[{"given":"Tianci","family":"Sun","sequence":"first","affiliation":[{"name":"School of Computer, Jiangsu University of Science and Technology, Zhenjiang 212003, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-6667-3537","authenticated-orcid":false,"given":"Wanqiu","family":"He","sequence":"additional","affiliation":[{"name":"School of Computer, Jiangsu University of Science and Technology, Zhenjiang 212003, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3417-2727","authenticated-orcid":false,"given":"Changbin","family":"Shao","sequence":"additional","affiliation":[{"name":"School of Computer, Jiangsu University of Science and Technology, Zhenjiang 212003, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Shang","family":"Zheng","sequence":"additional","affiliation":[{"name":"School of Computer, Jiangsu University of Science and Technology, Zhenjiang 212003, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9621-4158","authenticated-orcid":false,"given":"Hualong","family":"Yu","sequence":"additional","affiliation":[{"name":"School of Computer, Jiangsu University of Science and Technology, Zhenjiang 212003, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2026,5,8]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_2","first-page":"1097","article-title":"Imagenet classification with deep convolutional neural networks","volume":"25","author":"Krizhevsky","year":"2012","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollar, P., and Zitnick, C.L. (2014, January 6\u201312). Microsoft coco: Common objects in context. Proceedings of the European Conference of Computer Vision (ECCV 2014), Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1007\/s11263-015-0816-y","article-title":"Imagenet large scale visual recognition challenge","volume":"115","author":"Russakovsky","year":"2015","journal-title":"Int. J. Comput. Vis."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Wang, J., Zhang, W., Zang, Y., Cao, Y., Pang, J., Gong, T., Chen, K., Liu, Z., Loy, C.C., and Lin, D. (2021, January 20\u201325). Seesaw loss for long-tailed instance segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00957"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"10795","DOI":"10.1109\/TPAMI.2023.3268118","article-title":"Deep long-tailed learning: A survey","volume":"45","author":"Zhang","year":"2023","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_7","first-page":"4175","article-title":"Balanced meta-softmax for long-tailed visual recognition","volume":"33","author":"Ren","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Malisiewicz, T., Gupta, A., and Efros, A.A. (2011, January 6\u201313). Ensemble of exemplar-svms for object detection and beyond. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126229"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Alshammari, S., Wang, Y.-X., Ramanan, D., and Kong, S. (2022, January 18\u201324). Long-tailed recognition via weight balancing. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00677"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Jamal, M.A., Brown, M., Yang, M.-H., Wang, L., and Gong, B. (2020, January 17\u201324). Rethinking class-balanced methods for long-tailed visual recognition from a domain adaptation perspective. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00763"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Li, M., Cheung, Y.-M., and Lu, Y. (2022, January 18\u201324). Long-tailed visual recognition via gaussian clouded logit adjustment. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA.","DOI":"10.36227\/techrxiv.17031920.v1"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Zhou, B., Cui, Q., Wei, X.-S., and Chen, Z.-M. (2020, January 14\u201319). BBN: Bilateral-branch network with cumulative learning for long-tailed visual recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00974"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1016\/j.neunet.2018.07.011","article-title":"A Systematic Study of the Class Imbalance Problem in Convolutional Neural Networks","volume":"106","author":"Buda","year":"2018","journal-title":"Neural Netw."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1263","DOI":"10.1109\/TKDE.2008.239","article-title":"Learning from Imbalanced Data","volume":"21","author":"He","year":"2009","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Goyal, P., Girshick, R., He, K., and Dollar, P. (2017, January 21\u201326). Focal Loss for Dense Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Zhang, Z., and Pfister, T. (2021, January 11\u201317). Learning Fast Sample Re-Weighting Without Reward Data. Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00076"},{"key":"ref_17","first-page":"1917","article-title":"Meta-Weight-Net: Learning an Explicit Mapping for Sample Weighting","volume":"32","author":"Shu","year":"2019","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Cui, Y., Jia, M., Lin, T.-Y., Song, Y., and Belongie, S. (2019, January 16\u201321). Class-Balanced Loss Based on Effective Number of Samples. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00949"},{"key":"ref_19","first-page":"1567","article-title":"Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss","volume":"32","author":"Cao","year":"2019","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"2517","DOI":"10.1007\/s11263-022-01643-3","article-title":"Class-Difficulty Based Methods for Long-Tailed Visual Recognition","volume":"130","author":"Sinha","year":"2022","journal-title":"Int. J. Comput. Vis."},{"key":"ref_21","unstructured":"Anguita, D., Ghelardoni, L., Ghio, A., Oneto, L., and Ridella, S. (2012, January 25\u201327). The \u2019K\u2019 in K-Fold Cross Validation. Proceedings of the European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN), Bruges, Belgium. Available online: https:\/\/www.esann.org\/sites\/default\/files\/proceedings\/legacy\/es2012-62.pdf."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Shen, L., Lin, Z., and Huang, Q. (2016, January 11\u201314). Relay Backpropagation for Effective Learning of Deep Convolutional Neural Networks. Proceedings of the European Conference of Computer Vision (ECCV 2016), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46478-7_29"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"429","DOI":"10.3233\/IDA-2002-6504","article-title":"The Class Imbalance Problem: A Systematic Study","volume":"6","author":"Japkowicz","year":"2002","journal-title":"Intell. Data Anal."},{"key":"ref_24","unstructured":"Byrd, J., and Lipton, Z. (2019, January 9\u201315). What Is the Effect of Importance Weighting in Deep Learning?. Proceedings of the International Conference on Machine Learning (ICML), Long Beach, CA, USA."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1613\/jair.953","article-title":"SMOTE: Synthetic Minority Over-Sampling Technique","volume":"16","author":"Chawla","year":"2002","journal-title":"J. Artif. Intell. Res."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Wu, T., Huang, Q., Liu, Z., Wang, Y., and Lin, D. (2020, January 23\u201328). Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets. Proceedings of the European Conference of Computer Vision (ECCV 2020), Glasgow, UK.","DOI":"10.1007\/978-3-030-58548-8_10"},{"key":"ref_27","unstructured":"Kang, B., Xie, S., Rohrbach, M., Yan, Z., Gordo, A., Feng, J., and Kalantidis, Y. (2019). Decoupling Representation and Classifier for Long-Tailed Recognition. arXiv."},{"key":"ref_28","unstructured":"Chen, X., Zhou, Y., Wu, D., Zhang, W., Zhou, Y., Li, B., and Wang, W. (March, January 22). Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification. Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, BC, Canada."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Wang, T., Zhu, Y., Zhao, C., Zeng, W., Wang, J., and Tang, M. (2021, January 19\u201325). Adaptive Class Suppression Loss for Long-Tail Object Detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00312"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Li, B., Yao, Y., Tan, J., Zhang, G., Yu, F., Lu, J., and Luo, Y. (2022, January 18\u201324). Equalized Focal Loss for Dense Long-Tailed Object Detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00686"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Tan, J., Wang, C., Li, B., Li, Q., Ouyang, W., Yin, C., and Yan, J. (2020, January 17\u201324). Equalization Loss for Long-Tailed Object Recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01168"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Huang, C., Li, Y., Loy, C.C., and Tang, X. (2016, January 27\u201330). Learning Deep Representation for Imbalanced Classification. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.580"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Mahajan, D., Girshick, R., Ramanathan, V., He, K., Paluri, M., Li, Y., Bharambe, A., and Van Der Maaten, L. (2018, January 8\u201314). Exploring the Limits of Weakly Supervised Pretraining. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01216-8_12"},{"key":"ref_34","unstructured":"Sinha, S., and Ohashi, H. (March, January 28). Difficulty-Net: Learning to Predict Difficulty for Long-Tailed Recognition. Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1109\/5.726791","article-title":"Gradient-Based Learning Applied to Document Recognition","volume":"86","author":"LeCun","year":"1998","journal-title":"Proc. IEEE"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Liu, Z., Miao, Z., Zhan, X., Wang, J., Gong, B., and Yu, S.X. (2019, January 16\u201321). Large-Scale Long-Tailed Recognition in an Open World. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00264"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Park, S., Lim, J., Jeon, Y., and Choi, J.Y. (2021, January 11\u201317). Influence-Balanced Loss for Imbalanced Visual Classification. Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00077"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Zhang, X., Fang, Z., Wen, Y., Li, Z., and Qiao, Y. (2017, January 22\u201329). Range Loss for Deep Face Recognition with Long-Tailed Training Data. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.578"},{"key":"ref_39","first-page":"3069","article-title":"Simple and scalable predictive uncertainty estimation using deep ensembles","volume":"30","author":"Lakshminarayanan","year":"2017","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_40","unstructured":"Hinton, G., Vinyals, O., and Dean, J. (2015). Distilling the knowledge in a neural network. arXiv."},{"key":"ref_41","unstructured":"Guo, C., Pleiss, G., Sun, Y., and Weinberger, K.Q. (2017, January 6\u201311). On calibration of modern neural networks. Proceedings of the 34th International Conference on Machine Learning (ICML), Sydney, NSW, Australia."}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/17\/5\/455\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,5,8]],"date-time":"2026-05-08T19:32:10Z","timestamp":1778268730000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/17\/5\/455"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,5,8]]},"references-count":41,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2026,5]]}},"alternative-id":["info17050455"],"URL":"https:\/\/doi.org\/10.3390\/info17050455","relation":{},"ISSN":["2078-2489"],"issn-type":[{"value":"2078-2489","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,5,8]]}}}