{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T01:38:05Z","timestamp":1760060285607,"version":"build-2065373602"},"reference-count":59,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2025,8,11]],"date-time":"2025-08-11T00:00:00Z","timestamp":1754870400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Laboratory of Big Data and Decision Making of the National University of Defense Technology"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["BDCC"],"abstract":"<jats:p>In real-world visual recognition tasks, long-tailed distribution is a pervasive challenge, where the extreme class imbalance severely limits the representation learning capability of deep models. Although supervised learning has demonstrated certain potential in long-tailed visual recognition, these models\u2019 gradient updates dominated by head classes often lead to insufficient representation of tail classes, resulting in ambiguous decision boundaries. While existing Supervised Contrastive Learning variants mitigate class bias through instance-level similarity comparison, they are still limited by biased negative sample selection and insufficient modeling of the feature space structure. To address this, we propose Rebalancing Supervised Contrastive Learning (Reb-SupCon), which constructs a balanced and discriminative feature space during model training to alleviate performance deviation. Our method consists of two key components: (1) a dynamic rebalancing factor that automatically adjusts sample contributions through differentiable weighting, thereby establishing class-balanced feature representations; (2) a prototype-aware enhancement module that further improves feature discriminability by explicitly constraining the geometric structure of the feature space through introduced feature prototypes, enabling locally discriminative feature reconstruction. This breaks through the limitations of conventional instance contrastive learning and helps the model to identify more reasonable decision boundaries. Experimental results show that this method demonstrates superior performance on mainstream long-tailed benchmark datasets, with ablation studies and feature visualizations validating the modules\u2019 synergistic effects.<\/jats:p>","DOI":"10.3390\/bdcc9080204","type":"journal-article","created":{"date-parts":[[2025,8,11]],"date-time":"2025-08-11T08:10:32Z","timestamp":1754899832000},"page":"204","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Rebalancing in Supervised Contrastive Learning for Long-Tailed Visual Recognition"],"prefix":"10.3390","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0009-0001-3214-0310","authenticated-orcid":false,"given":"Jiahui","family":"Lv","sequence":"first","affiliation":[{"name":"Laboratory for Big Data and Decision, National University of Defense Technology, Changsha 410073, China"}]},{"given":"Jun","family":"Lei","sequence":"additional","affiliation":[{"name":"Laboratory for Big Data and Decision, National University of Defense Technology, Changsha 410073, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1804-9198","authenticated-orcid":false,"given":"Jun","family":"Zhang","sequence":"additional","affiliation":[{"name":"Laboratory for Big Data and Decision, National University of Defense Technology, Changsha 410073, China"}]},{"given":"Chao","family":"Chen","sequence":"additional","affiliation":[{"name":"Laboratory for Big Data and Decision, National University of Defense Technology, Changsha 410073, China"}]},{"given":"Shuohao","family":"Li","sequence":"additional","affiliation":[{"name":"Laboratory for Big Data and Decision, National University of Defense Technology, Changsha 410073, China"}]}],"member":"1968","published-online":{"date-parts":[[2025,8,11]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1145\/3065386","article-title":"ImageNet classification with deep convolutional neural networks","volume":"60","author":"Krizhevsky","year":"2017","journal-title":"Commun. ACM"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"2477","DOI":"10.1109\/TMM.2024.3521699","article-title":"Fastersal: Robust and real-time single-stream architecture for rgb-d salient object detection","volume":"27","author":"Zhang","year":"2024","journal-title":"IEEE Trans. Multimed."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Khadraoui, A., and Zemmouri, E. (2023, January 23\u201325). Pyramid scene parsing network for driver distraction classification. Proceedings of the International Conference on Artificial Intelligence and Smart Environment, Errachidia, Morocco.","DOI":"10.56294\/dm2023154"},{"key":"ref_4","first-page":"1","article-title":"Neural architecture search: A survey","volume":"20","author":"Elsken","year":"2019","journal-title":"J. Mach. Learn. Res."},{"key":"ref_5","first-page":"3634","article-title":"Nats-bench: Benchmarking nas algorithms for architecture topology and size","volume":"44","author":"Dong","year":"2021","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1007\/s11263-015-0816-y","article-title":"Imagenet large scale visual recognition challenge","volume":"115","author":"Russakovsky","year":"2015","journal-title":"Int. J. Comput. Vis."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Doll\u00e1r, P., and Zitnick, C.L. (2014, January 6\u201312). Microsoft coco: Common objects in context. Proceedings of the Computer Vision\u2013ECCV 2014: 13th European Conference, Zurich, Switzerland. Proceedings, Part v 13.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Liu, Z., Miao, Z., Zhan, X., Wang, J., Gong, B., and Yu, S.X. (2019, January 115\u201320). Large-scale long-tailed recognition in an open world. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00264"},{"key":"ref_9","first-page":"1567","article-title":"Learning imbalanced datasets with label-distribution-aware margin loss","volume":"32","author":"Cao","year":"2019","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Li, T., Cao, P., Yuan, Y., Fan, L., Yang, Y., Feris, R.S., Indyk, P., and Katabi, D. (2022, January 18\u201324). Targeted supervised contrastive learning for long-tailed recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00679"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Mojahed, A.J., Moattar, M.H., and Ghaffari, H. (2024). Supervised Density-Based Metric Learning Based on Bhattacharya Distance for Imbalanced Data Classification Problems. Big Data Cogn. Comput., 8.","DOI":"10.3390\/bdcc8090109"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Tan, J., Wang, C., Li, B., Li, Q., Ouyang, W., Yin, C., and Yan, J. (2020, January 14\u201319). Equalization loss for long-tailed object recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01168"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Cui, J., Zhong, Z., Liu, S., Yu, B., and Jia, J. (2021, January 11\u201317). Parametric contrastive learning. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00075"},{"key":"ref_14","first-page":"18661","article-title":"Supervised contrastive learning","volume":"33","author":"Khosla","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"110415","DOI":"10.1016\/j.asoc.2023.110415","article-title":"A broad review on class imbalance learning techniques","volume":"143","author":"Rezvani","year":"2023","journal-title":"Appl. Soft Comput."},{"key":"ref_16","first-page":"4216","article-title":"Out-of-distribution detection in long-tailed recognition with calibrated outlier class learning","volume":"38","author":"Miao","year":"2024","journal-title":"AAAI Conf. Artif. Intell."},{"key":"ref_17","first-page":"1577","article-title":"Probability guided loss for long-tailed multi-label image classification","volume":"37","author":"Lin","year":"2023","journal-title":"AAAI Conf. Artif. Intell."},{"key":"ref_18","unstructured":"Kang, B., Xie, S., Rohrbach, M., Yan, Z., Gordo, A., Feng, J., and Kalantidis, Y. (2019). Decoupling representation and classifier for long-tailed recognition. arXiv."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"13538","DOI":"10.1109\/TNNLS.2023.3269907","article-title":"Divide and retain: A dual-phase modeling for long-tailed visual recognition","volume":"35","author":"Zhang","year":"2023","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Ye, C., Tsuchida, R., Petersson, L., and Barnes, N. (2024, January 3\u20138). Label shift estimation for class-imbalance problem: A bayesian approach. Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA.","DOI":"10.1109\/WACV57701.2024.00111"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"2761","DOI":"10.1007\/s11831-023-09884-2","article-title":"Self-supervised learning: A succinct review","volume":"30","author":"Rani","year":"2023","journal-title":"Arch. Comput. Methods Eng."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"9052","DOI":"10.1109\/TPAMI.2024.3415112","article-title":"A survey on self-supervised learning: Algorithms, applications, and future trends","volume":"46","author":"Gui","year":"2024","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"He, K., Fan, H., Wu, Y., Xie, S., and Girshick, R. (2020, January 14\u201319). Momentum contrast for unsupervised visual representation learning. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00975"},{"key":"ref_24","unstructured":"Chen, X., Fan, H., Girshick, R., and He, K. (2020). Improved baselines with momentum contrastive learning. arXiv."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Zhu, J., Wang, Z., Chen, J., Chen, Y.P.P., and Jiang, Y.G. (2022, January 18\u201324). Balanced contrastive learning for long-tailed visual recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00678"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"5890","DOI":"10.1109\/TPAMI.2024.3369102","article-title":"Probabilistic contrastive learning for long-tailed visual recognition","volume":"46","author":"Du","year":"2024","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Park, S., Hong, Y., Heo, B., Yun, S., and Choi, J.Y. (2022, January 18\u201324). The majority can help the minority: Context-rich minority oversampling for long-tailed classification. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00676"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"103951","DOI":"10.1016\/j.dsp.2023.103951","article-title":"CRmix: A regularization by clip** images and replacing mixed samples for imbalanced classification","volume":"135","author":"Li","year":"2023","journal-title":"Digit. Signal Process."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"10554","DOI":"10.1109\/TKDE.2023.3266648","article-title":"Cost-sensitive online adaptive kernel learning for large-scale imbalanced classification","volume":"35","author":"Chen","year":"2023","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Zhou, B., Cui, Q., Wei, X.S., and Chen, Z.M. (2020, January 14\u201319). Bbn: Bilateral-branch network with cumulative learning for long-tailed visual recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00974"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Wang, P., Han, K., Wei, X.S., Zhang, L., and Wang, L. (2021, January 19\u201325). Contrastive learning based hybrid networks for long-tailed image classification. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00100"},{"key":"ref_32","unstructured":"Kang, B., Li, Y., Xie, S., Yuan, Z., and Feng, J. (2020, January 26\u201330). Exploring balanced feature spaces for representation learning. Proceedings of the International Conference on Learning Representations, Addis Ababa, Ethiopia."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"He, X., Fu, S., Ding, X., Cao, Y., and Wang, H. (2023, January 29). Uniformly distributed category prototype-guided vision-language framework for long-tail recognition. Proceedings of the 31st ACM International Conference on Multimedia, Ottawa, ON, Canada.","DOI":"10.1145\/3581783.3611904"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Lin, X., Wang, S., Cai, R., Liu, Y., Fu, Y., Yu, Z., Tang, W., and Kot, A. (2024, January 16\u201322). Suppress and rebalance: Towards generalized multi-modal face anti-spoofing. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR52733.2024.00028"},{"key":"ref_35","first-page":"21271","article-title":"Bootstrap your own latent-a new approach to self-supervised learning","volume":"33","author":"Grill","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_36","first-page":"6396","article-title":"Decoupled contrastive learning for long-tailed recognition","volume":"38","author":"Xuan","year":"2024","journal-title":"AAAI Conf. Artif. Intell."},{"key":"ref_37","unstructured":"Baik, J.S., Yoon, I.Y., Kim, K.H., and Choi, J.W. (October, January 29). Distribution-Aware Robust Learning from Long-Tailed Data with Noisy Labels. Proceedings of the European Conference on Computer Vision, Milan, Italy."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Zhu, L., and Yang, Y. (2020, January 14\u201319). Inflated episodic memory with region self-attention for long-tailed visual recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00440"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"160105","DOI":"10.1007\/s11432-021-3489-1","article-title":"Prototype-based classifier learning for long-tailed visual recognition","volume":"65","author":"Wei","year":"2022","journal-title":"Sci. China Inf. Sci."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"455","DOI":"10.1109\/TMM.2024.3521665","article-title":"Prototype Alignment with Dedicated Experts for Test-Agnostic Long-Tailed Recognition","volume":"27","author":"Guo","year":"2024","journal-title":"IEEE Trans. Multimed."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Jin, Y., Li, M., Lu, Y., Cheung, Y.M., and Wang, H. (2023, January 18\u201322). Long-tailed visual recognition via self-heterogeneous integration with knowledge excavation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.02269"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"684","DOI":"10.1109\/TIP.2013.2293423","article-title":"Gradient magnitude similarity deviation: A highly efficient perceptual image quality index","volume":"23","author":"Xue","year":"2013","journal-title":"IEEE Trans. Image Process."},{"key":"ref_43","first-page":"9912","article-title":"Unsupervised learning of visual features by contrasting cluster assignments","volume":"33","author":"Caron","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Cui, Y., Jia, M., Lin, T.Y., Song, Y., and Belongie, S. (2019, January 15\u201320). Class-balanced loss based on effective number of samples. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00949"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Van Horn, G., Mac Aodha, O., Song, Y., Cui, Y., Sun, C., Shepard, A., Adam, H., Perona, P., and Belongie, S. (2018, January 18\u201322). The inaturalist species classification and detection dataset. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00914"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Xie, S., Girshick, R., Doll\u00e1r, P., Tu, Z., and He, K. (2017, January 21\u201326). Aggregated residual transformations for deep neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.634"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Cubuk, E.D., Zoph, B., Shlens, J., and Le, Q.V. (2020, January 14\u201319). Randaugment: Practical automated data augmentation with a reduced search space. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops, Seattle, WA, USA.","DOI":"10.1109\/CVPRW50498.2020.00359"},{"key":"ref_49","first-page":"3695","article-title":"Reslt: Residual learning for long-tailed recognition","volume":"45","author":"Cui","year":"2022","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Hou, C., Zhang, J., Wang, H., and Zhou, T. (2023, January 2\u20136). Subclass-balancing contrastive learning for long-tailed recognition. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV51070.2023.00497"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Zhou, Z., Li, L., Zhao, P., Heng, P.A., and Gong, W. (2023, January 18\u201322). Class-conditional sharpness-aware minimization for deep long-tailed recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.00341"},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"5026","DOI":"10.1109\/TAI.2024.3401102","article-title":"Adjusting logit in Gaussian form for long-tailed visual recognition","volume":"5","author":"Li","year":"2024","journal-title":"IEEE Trans. Artif. Intell."},{"key":"ref_53","unstructured":"Vu, D.Q., and Thu, M.T.H. (2024, January 16\u201317). Smooth Balance Softmax for Long-Tailed Image Classification. Proceedings of the International Conference on Advances in Information and Communication Technology, Thai Nguyen, Vietnam."},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., and Doll\u00e1r, P. (2017, January 22\u201329). Focal loss for dense object detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Mahajan, D., Girshick, R., Ramanathan, V., He, K., Paluri, M., Li, Y., Bharambe, A., and Van Der Maaten, L. (2018, January 8\u201314). Exploring the limits of weakly supervised pretraining. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01216-8_12"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Hong, Y., Han, S., Choi, K., Seo, S., Kim, B., and Chang, B. (2021, January 19\u201325). Disentangling label distribution for long-tailed visual recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00656"},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Chou, H.P., Chang, S.C., Pan, J.Y., Wei, W., and Juan, D.C. (2020, January 23\u201328). Remix: Rebalanced mixup. Proceedings of the European Conference on Computer Vision, Glasgow, UK.","DOI":"10.1007\/978-3-030-65414-6_9"},{"key":"ref_58","unstructured":"Wang, X., Lian, L., Miao, Z., Liu, Z., and Yu, S.X. (2020). Long-tailed recognition by routing diverse distribution-aware experts. arXiv."},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Sharma, S., Xian, Y., Yu, N., and Singh, A. (2023). Learning prototype classifiers for long-tailed recognition. arXiv.","DOI":"10.24963\/ijcai.2023\/151"}],"container-title":["Big Data and Cognitive Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2504-2289\/9\/8\/204\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T18:27:51Z","timestamp":1760034471000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2504-2289\/9\/8\/204"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,11]]},"references-count":59,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2025,8]]}},"alternative-id":["bdcc9080204"],"URL":"https:\/\/doi.org\/10.3390\/bdcc9080204","relation":{},"ISSN":["2504-2289"],"issn-type":[{"type":"electronic","value":"2504-2289"}],"subject":[],"published":{"date-parts":[[2025,8,11]]}}}