{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T13:30:32Z","timestamp":1770989432566,"version":"3.50.1"},"reference-count":48,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2024,1,11]],"date-time":"2024-01-11T00:00:00Z","timestamp":1704931200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Project of Guangxi Science and Technology","award":["GuiKeAB23026040"],"award-info":[{"award-number":["GuiKeAB23026040"]}]},{"name":"Research Fund of Guangxi Key Lab of Multi-source Information Mining & Security","award":["20-A-01-02"],"award-info":[{"award-number":["20-A-01-02"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2024,4,30]]},"abstract":"<jats:p>It is a significant issue to deal with long-tailed data when classifying images. A nonlocal hybrid network (NHN) that takes into account both feature learning and classifier learning is proposed. The NHN can capture the existence of dependencies between two locations that are far away from each other as well as alleviate the impact of long-tailed data on the model to some extent. The dependency relationship between distant pixels is obtained first through a nonlocal module to extract richer feature representations. Then, a learnable soft class center is proposed to balance the supervised contrastive loss and reduce the impact of long-tailed data on feature learning. For efficiency, a logit adjustment strategy is adopted to correct the bias caused by the different label distributions between the training and test sets and obtain a classifier that is more suitable for long-tailed data. Finally, extensive experiments are conducted on two benchmark datasets, the long-tailed CIFAR and the large-scale real-world iNaturalist 2018, both of which have imbalanced label distributions. The experimental results show that the proposed NHN model is efficient and promising.<\/jats:p>","DOI":"10.1145\/3630256","type":"journal-article","created":{"date-parts":[[2023,11,2]],"date-time":"2023-11-02T22:24:42Z","timestamp":1698963882000},"page":"1-22","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":8,"title":["Nonlocal Hybrid Network for Long-tailed Image Classification"],"prefix":"10.1145","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8514-9263","authenticated-orcid":false,"given":"Rongjiao","family":"Liang","sequence":"first","affiliation":[{"name":"Guangxi Key Lab of Multi-Source Information Mining and Security, Guangxi Normal University, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4052-1823","authenticated-orcid":false,"given":"Shichao","family":"Zhang","sequence":"additional","affiliation":[{"name":"Guangxi Key Lab of Multi-Source Information Mining and Security, Guangxi Normal University, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3368-857X","authenticated-orcid":false,"given":"Wenzhen","family":"Zhang","sequence":"additional","affiliation":[{"name":"Guangxi Key Lab of Multi-Source Information Mining and Security, Guangxi Normal University, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7632-8411","authenticated-orcid":false,"given":"Guixian","family":"Zhang","sequence":"additional","affiliation":[{"name":"Guangxi Key Lab of Multi-Source Information Mining and Security, Guangxi Normal University, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-1442-5980","authenticated-orcid":false,"given":"Jinyun","family":"Tang","sequence":"additional","affiliation":[{"name":"Guangxi Key Lab of Multi-Source Information Mining and Security, Guangxi Normal University, China"}]}],"member":"320","published-online":{"date-parts":[[2024,1,11]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2005.38"},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2018.07.011"},{"key":"e_1_3_1_4_2","article-title":"Learning imbalanced datasets with label-distribution-aware margin loss","volume":"32","author":"Cao Kaidi","year":"2019","unstructured":"Kaidi Cao, Colin Wei, Adrien Gaidon, Nikos Arechiga, and Tengyu Ma. 2019. Learning imbalanced datasets with label-distribution-aware margin loss. Advances in Neural Information Processing Systems 32 (2019).","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_5_2","article-title":"Global context networks","author":"Cao Yue","year":"2020","unstructured":"Yue Cao, Jiarui Xu, Stephen Lin, Fangyun Wei, and Han Hu. 2020. Global context networks. IEEE Transactions on Pattern Analysis and Machine Intelligence (2020).","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1145\/3231742"},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00075"},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00949"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00432"},{"key":"e_1_3_1_10_2","first-page":"10503","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Deng Zongyong","year":"2021","unstructured":"Zongyong Deng, Hao Liu, Yaoxing Wang, Chenyang Wang, Zekuan Yu, and Xuehong Sun. 2021. PML: Progressive margin loss for long-tailed age classification. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 10503\u201310512."},{"key":"e_1_3_1_11_2","article-title":"Accurate, large minibatch SGD: Training ImageNet in 1 hour","author":"Goyal Priya","year":"2017","unstructured":"Priya Goyal, Piotr Doll\u00e1r, Ross Girshick, Pieter Noordhuis, Lukasz Wesolowski, Aapo Kyrola, Andrew Tulloch, Yangqing Jia, and Kaiming He. 2017. Accurate, large minibatch SGD: Training ImageNet in 1 hour. arXiv preprint arXiv:1706.02677 (2017).","journal-title":"arXiv preprint arXiv:1706.02677"},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00975"},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00656"},{"key":"e_1_3_1_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00745"},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2019.2914680"},{"key":"e_1_3_1_17_2","volume-title":"International Conference on Learning Representations","author":"Kang Bingyi","year":"2020","unstructured":"Bingyi Kang, Yu Li, Sa Xie, Zehuan Yuan, and Jiashi Feng. 2020. Exploring balanced feature spaces for representation learning. In International Conference on Learning Representations."},{"key":"e_1_3_1_18_2","article-title":"Decoupling representation and classifier for long-tailed recognition","author":"Kang Bingyi","year":"2019","unstructured":"Bingyi Kang, Saining Xie, Marcus Rohrbach, Zhicheng Yan, Albert Gordo, Jiashi Feng, and Yannis Kalantidis. 2019. Decoupling representation and classifier for long-tailed recognition. arXiv preprint arXiv:1910.09217 (2019).","journal-title":"arXiv preprint arXiv:1910.09217"},{"issue":"8","key":"e_1_3_1_19_2","doi-asserted-by":"crossref","first-page":"3573","DOI":"10.1109\/TNNLS.2017.2732482","article-title":"Cost-sensitive learning of deep feature representations from imbalanced data","volume":"29","author":"Khan Salman H.","year":"2017","unstructured":"Salman H. Khan, Munawar Hayat, Mohammed Bennamoun, Ferdous A. Sohel, and Roberto Togneri. 2017. Cost-sensitive learning of deep feature representations from imbalanced data. IEEE Transactions on Neural Networks and Learning Systems 29, 8 (2017), 3573\u20133587.","journal-title":"IEEE Transactions on Neural Networks and Learning Systems"},{"key":"e_1_3_1_20_2","first-page":"18661","article-title":"Supervised contrastive learning","volume":"33","author":"Khosla Prannay","year":"2020","unstructured":"Prannay Khosla, Piotr Teterwak, Chen Wang, Aaron Sarna, Yonglong Tian, Phillip Isola, Aaron Maschinot, Ce Liu, and Dilip Krishnan. 2020. Supervised contrastive learning. Advances in Neural Information Processing Systems 33 (2020), 18661\u201318673.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_21_2","article-title":"Learning multiple layers of features from tiny images","author":"Krizhevsky Alex","year":"2009","unstructured":"Alex Krizhevsky and Geoffrey Hinton. 2009. Learning multiple layers of features from tiny images. Handbook of Systemic Autoimmune Diseases 1, 4 (2009).","journal-title":"Handbook of Systemic Autoimmune Diseases"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.324"},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1145\/3465220"},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00264"},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01216-8_12"},{"key":"e_1_3_1_26_2","article-title":"Long-tail learning via logit adjustment","author":"Menon Aditya Krishna","year":"2020","unstructured":"Aditya Krishna Menon, Sadeep Jayasumana, Ankit Singh Rawat, Himanshu Jain, Andreas Veit, and Sanjiv Kumar. 2020. Long-tail learning via logit adjustment. arXiv preprint arXiv:2007.07314 (2020).","journal-title":"arXiv preprint arXiv:2007.07314"},{"key":"e_1_3_1_27_2","article-title":"Survey of resampling techniques for improving classification performance in unbalanced datasets","author":"More Ajinkya","year":"2016","unstructured":"Ajinkya More. 2016. Survey of resampling techniques for improving classification performance in unbalanced datasets. arXiv preprint arXiv:1608.06048 (2016).","journal-title":"arXiv preprint arXiv:1608.06048"},{"key":"e_1_3_1_28_2","first-page":"4175","article-title":"Balanced meta-softmax for long-tailed visual recognition","volume":"33","author":"Ren Jiawei","year":"2020","unstructured":"Jiawei Ren, Cunjun Yu, Xiao Ma, Haiyu Zhao, Shuai Yi, et\u00a0al. 2020. Balanced meta-softmax for long-tailed visual recognition. Advances in Neural Information Processing Systems 33 (2020), 4175\u20134186.","journal-title":"Advances in Neural Information Processing Systems"},{"issue":"20","key":"e_1_3_1_29_2","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","article-title":"A graphical aid to the interpretation and validation of cluster analysis","author":"Rousseeuw Silhouettes","year":"1987","unstructured":"Silhouettes Rousseeuw. 1987. A graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20 (1987), 53.","journal-title":"J. Comput. Appl. Math."},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1002\/j.1538-7305.1948.tb01338.x"},{"key":"e_1_3_1_31_2","article-title":"Meta-weight-net: Learning an explicit mapping for sample weighting","volume":"32","author":"Shu Jun","year":"2019","unstructured":"Jun Shu, Qi Xie, Lixuan Yi, Qian Zhao, Sanping Zhou, Zongben Xu, and Deyu Meng. 2019. Meta-weight-net: Learning an explicit mapping for sample weighting. Advances in Neural Information Processing Systems 32 (2019).","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_32_2","first-page":"1513","article-title":"Long-tailed classification by keeping the good and removing the bad momentum causal effect","volume":"33","author":"Tang Kaihua","year":"2020","unstructured":"Kaihua Tang, Jianqiang Huang, and Hanwang Zhang. 2020. Long-tailed classification by keeping the good and removing the bad momentum causal effect. Advances in Neural Information Processing Systems 33 (2020), 1513\u20131524.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_33_2","first-page":"3784","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Wang Jianfeng","year":"2021","unstructured":"Jianfeng Wang, Thomas Lukasiewicz, Xiaolin Hu, Jianfei Cai, and Zhenghua Xu. 2021. RSG: A simple but effective module for learning imbalanced datasets. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 3784\u20133793."},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00100"},{"key":"e_1_3_1_35_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58568-6_43"},{"key":"e_1_3_1_36_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00813"},{"key":"e_1_3_1_37_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00512"},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","DOI":"10.1145\/3314202"},{"key":"e_1_3_1_39_2","first-page":"162","volume-title":"European Conference on Computer Vision","author":"Wu Tong","year":"2020","unstructured":"Tong Wu, Qingqiu Huang, Ziwei Liu, Yu Wang, and Dahua Lin. 2020. Distribution-balanced loss for multi-label classification in long-tailed datasets. In European Conference on Computer Vision. Springer, 162\u2013178."},{"key":"e_1_3_1_40_2","first-page":"8659","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Wu Tong","year":"2021","unstructured":"Tong Wu, Ziwei Liu, Qingqiu Huang, Yu Wang, and Dahua Lin. 2021. Adversarial robustness under long-tailed distribution. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8659\u20138668."},{"key":"e_1_3_1_41_2","first-page":"19290","article-title":"Rethinking the value of labels for improving class-imbalanced learning","volume":"33","author":"Yang Yuzhe","year":"2020","unstructured":"Yuzhe Yang and Zhi Xu. 2020. Rethinking the value of labels for improving class-imbalanced learning. Advances in Neural Information Processing Systems 33 (2020), 19290\u201319301.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_42_2","article-title":"Identifying and compensating for feature deviation in imbalanced deep learning","author":"Ye Han-Jia","year":"2020","unstructured":"Han-Jia Ye, Hong-You Chen, De-Chuan Zhan, and Wei-Lun Chao. 2020. Identifying and compensating for feature deviation in imbalanced deep learning. arXiv preprint arXiv:2001.01385 (2020).","journal-title":"arXiv preprint arXiv:2001.01385"},{"key":"e_1_3_1_43_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00585"},{"key":"e_1_3_1_44_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00344"},{"key":"e_1_3_1_45_2","article-title":"To balance or not to balance: A simple-yet-effective approach for learning with long-tailed distributions","author":"Zhang Junjie","year":"2019","unstructured":"Junjie Zhang, Lingqiao Liu, Peng Wang, and Chunhua Shen. 2019. To balance or not to balance: A simple-yet-effective approach for learning with long-tailed distributions. arXiv preprint arXiv:1912.04486 (2019).","journal-title":"arXiv preprint arXiv:1912.04486"},{"key":"e_1_3_1_46_2","first-page":"2361","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Zhang Songyang","year":"2021","unstructured":"Songyang Zhang, Zeming Li, Shipeng Yan, Xuming He, and Jian Sun. 2021. Distribution alignment: A unified framework for long-tail visual recognition. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 2361\u20132370."},{"key":"e_1_3_1_47_2","article-title":"Deep long-tailed learning: A survey","author":"Zhang Yifan","year":"2021","unstructured":"Yifan Zhang, Bingyi Kang, Bryan Hooi, Shuicheng Yan, and Jiashi Feng. 2021. Deep long-tailed learning: A survey. arXiv preprint arXiv:2110.04596 (2021).","journal-title":"arXiv preprint arXiv:2110.04596"},{"key":"e_1_3_1_48_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01622"},{"key":"e_1_3_1_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00974"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3630256","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3630256","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:45:53Z","timestamp":1750178753000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3630256"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1,11]]},"references-count":48,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,4,30]]}},"alternative-id":["10.1145\/3630256"],"URL":"https:\/\/doi.org\/10.1145\/3630256","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,1,11]]},"assertion":[{"value":"2023-01-18","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-10-21","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-01-11","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}