{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,18]],"date-time":"2026-07-18T18:28:50Z","timestamp":1784399330329,"version":"3.55.0"},"reference-count":34,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2022,4,22]],"date-time":"2022-04-22T00:00:00Z","timestamp":1650585600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"the Opening Foundation of Yunnan Key Laboratory of Computer Technologies Application","award":["202101BE070001-013"],"award-info":[{"award-number":["202101BE070001-013"]}]},{"name":"the Yunnan Fundamental Research Projects","award":["202101BE070001-008"],"award-info":[{"award-number":["202101BE070001-008"]}]},{"DOI":"10.13039\/501100001809","name":"the National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61971208"],"award-info":[{"award-number":["61971208"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"the Yunnan Reserve Talents of Young and Middle-aged Academic and Technical Leaders","award":["2019HB005"],"award-info":[{"award-number":["2019HB005"]}]},{"name":"the Yunnan Young Top Talents of Ten Thousands Plan","award":["201873"],"award-info":[{"award-number":["201873"]}]},{"name":"the Major Science and Technology Projects in Yunnan Province","award":["202002AB080001-8"],"award-info":[{"award-number":["202002AB080001-8"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>In the field of remote sensing image applications, RGB and infrared image object detection is an important technology. The object detection performance can be improved and the robustness of the algorithm will be enhanced by making full use of their complementary information. Existing RGB-infrared detection methods do not explicitly encourage RGB and infrared images to achieve effective multimodal learning. We find that when fusing RGB and infrared images, cross-modal redundant information weakens the degree of complementary information fusion. Inspired by this observation, we propose a redundant information suppression network (RISNet) which suppresses cross-modal redundant information and facilitates the fusion of RGB-Infrared complementary information. Specifically, we design a novel mutual information minimization module to reduce the redundancy between RGB appearance features and infrared radiation features, which enables the network to take full advantage of the complementary advantages of multimodality and improve the object detection performance. In addition, in view of the drawbacks of the current artificial classification of lighting conditions, such as the subjectivity of artificial classification and the lack of comprehensiveness (divided into day and night only), we propose a method based on histogram statistics to classify lighting conditions in more detail. Experimental results on two public RGB-infrared object detection datasets demonstrate the superiorities of our proposed method over the state-of-the-art approaches, especially under challenging conditions such as poor illumination, complex background, and low contrast.<\/jats:p>","DOI":"10.3390\/rs14092020","type":"journal-article","created":{"date-parts":[[2022,4,24]],"date-time":"2022-04-24T00:45:21Z","timestamp":1650761121000},"page":"2020","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":94,"title":["Improving RGB-Infrared Object Detection by Reducing Cross-Modality Redundancy"],"prefix":"10.3390","volume":"14","author":[{"given":"Qingwang","family":"Wang","sequence":"first","affiliation":[{"name":"Yunnan Key Laboratory of Computer Technologies Application, Kunming University of Science and Technology, Kunming 650500, China"},{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yongke","family":"Chi","sequence":"additional","affiliation":[{"name":"Yunnan Key Laboratory of Computer Technologies Application, Kunming University of Science and Technology, Kunming 650500, China"},{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Tao","family":"Shen","sequence":"additional","affiliation":[{"name":"Yunnan Key Laboratory of Computer Technologies Application, Kunming University of Science and Technology, Kunming 650500, China"},{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jian","family":"Song","sequence":"additional","affiliation":[{"name":"Yunnan Key Laboratory of Computer Technologies Application, Kunming University of Science and Technology, Kunming 650500, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zifeng","family":"Zhang","sequence":"additional","affiliation":[{"name":"Yunnan Key Laboratory of Computer Technologies Application, Kunming University of Science and Technology, Kunming 650500, China"},{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yan","family":"Zhu","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2022,4,22]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast r-cnn. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Washington, DC, USA.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_2","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015, January 7\u201312). Faster r-cnn: Towards real-time object detection with region proposal networks. Proceedings of the 28th International Conference on Neural Information Processing Systems, Cambridge, MA, USA."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Lin, T., Goyal, P., Girshick, R., He, K., and Dollar, P. (2017, January 22\u201329). Focal loss for dense object detection. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C., and Berg, A.C. (2016, January 8\u201316). SSD: Single Shot MultiBox Detector. Proceedings of the European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Liu, W., Ren, G., Yu, R., Guo, S., Zhu, J., and Zhang, L. (2021). Image-Adaptive YOLO for Object Detection in Adverse Weather Conditions. arXiv.","DOI":"10.1609\/aaai.v36i2.20072"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Chen, Y., Li, W., Sakaridis, C., Dai, D., and Gool, L.V. (2018, January 18\u201322). Domain adaptive faster r-cnn for object detection in the wild. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00352"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Zhao, C., Wang, J., Su, N., Yan, Y., and Xing, X. (2022). Low Contrast Infrared Target Detection Method Based on Residual Thermal Backbone Network and Weighting Loss Function. Remote Sens., 14.","DOI":"10.3390\/rs14010177"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"9813","DOI":"10.1109\/TGRS.2020.3044958","article-title":"Attentional Local Contrast Networks for Infrared Small Target Detection","volume":"59","author":"Dai","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_9","first-page":"1","article-title":"Detect Larger at Once: Large-Area Remote-Sensing Image Arbitrary-Oriented Ship Detection","volume":"19","author":"Su","year":"2022","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Pan, X., Ren, Y., Sheng, K., Dong, W., Yuan, H., Guo, X., Ma, C., and Xu, X. (2020, January 16\u201318). Dynamic refinement network for oriented and densely packed object detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01122"},{"key":"ref_11","unstructured":"Minaee, S., Luo, P., Lin, Z., and Bowyer, K. (2021). Going deeper into face detection: A survey. arXiv."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Dang, L.M., Wang, H., Li, Y., Min, K., Kwak, J.T., Lee, O.N., Park, H., and Moon, H. (2020). Fusarium Wilt of Radish Detection Using RGB and Near Infrared Images from Unmanned Aerial Vehicles. Remote Sens., 12.","DOI":"10.3390\/rs12172863"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Iwashita, Y., Nakashima, K., Stoica, A., and Kurazume, R. (2019, January 28\u201330). TU-Net and TDeepLab: Deep Learning-Based Terrain Classification Robust to Illumination Changes, Combining Visible and Thermal Imagery. Proceedings of the IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), San Jose, CA, USA.","DOI":"10.1109\/MIPR.2019.00057"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"269","DOI":"10.1109\/TITS.2016.2567418","article-title":"A unified framework for concurrent pedestrian and cyclist detection","volume":"18","author":"Li","year":"2016","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Tian, W., Deng, Z., Yin, D., Zheng, Z., Huang, Y., and Bi, X. (2021). 3D Pedestrian Detection in Farmland by Monocular RGB Image and Far-Infrared Sensing. Remote Sens., 13.","DOI":"10.3390\/rs13152896"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1016\/j.patcog.2018.08.005","article-title":"Illumination-aware faster R-CNN for robust multispectral pedestrian detection","volume":"85","author":"Li","year":"2019","journal-title":"Pattern Recognit."},{"key":"ref_17","first-page":"509","article-title":"Multispectral Pedestrian Detection using Deep Fusion Convolutional Neural Networks","volume":"587","author":"Wagner","year":"2016","journal-title":"ESANN"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1016\/j.inffus.2018.09.015","article-title":"Cross-modality interactive attention network for multispectral pedestrian detection","volume":"50","author":"Zhang","year":"2019","journal-title":"Inf. Fusion"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"165071","DOI":"10.1109\/ACCESS.2020.3022623","article-title":"Attention Based Multi-Layer Fusion of Multispectral Images for Pedestrian Detection","volume":"8","author":"Zhang","year":"2020","journal-title":"IEEE Access"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"103770","DOI":"10.1016\/j.infrared.2021.103770","article-title":"Adaptive spatial pixel-level feature fusion network for multispectral pedestrian detection","volume":"116","author":"Fu","year":"2021","journal-title":"Infrared Phys. Technol."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Cao, Z., Yang, H., Zhao, J., Guo, S., and Li, L. (2021). Attention Fusion for One-Stage Multispectral Pedestrian Detection. Sensors, 21.","DOI":"10.3390\/s21124184"},{"key":"ref_22","unstructured":"Zhang, L., Liu, Z., Chen, X., and Yang, X. (2019). The cross-modality disparity problem in multispectral pedestrian detection. arXiv."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Zhang, H., Fromont, E., Lefevre, S., and Avignon, B. (2021, January 3\u20138). Guided attentive feature fusion for multispectral pedestrian detection. Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA.","DOI":"10.1109\/WACV48630.2021.00012"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Fu, J., Liu, J., Tian, H., Li, Y., Bao, Y., Fang, Z., and Lu, H. (2019, January 15\u201320). Dual attention network for scene segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00326"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Zhu, X., Cheng, D., Zhang, Z., Lin, S., and Dai, J. (2019, January 27\u201328). An empirical study of spatial attention mechanisms in deep networks. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Korea.","DOI":"10.1109\/ICCV.2019.00679"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201322). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Hwang, S., Park, J., Kim, N., Choi, Y., and So Kweon, I. (2015, January 7\u201312). Multispectral Pedestrian Detection: Benchmark Dataset and Baseline. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298706"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Sun, Y., Cao, B., Zhu, P., and Hu, Q. (2021). Drone-based RGB-Infrared Cross-Modality Vehicle Detection via Uncertainty-Aware Learning. arXiv.","DOI":"10.1109\/TCSVT.2022.3168279"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1002\/j.1538-7305.1948.tb01338.x","article-title":"A mathematical theory of communication","volume":"27","author":"Shannon","year":"1948","journal-title":"Bell Syst. Tech. J."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Zhang, J., Fan, D.P., Dai, Y., Yu, X., Zhong, Y., Barnes, N., and Shao, L. (2021, January 10\u201317). RGB-D Saliency Detection via Cascaded Mutual Information Minimization. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00430"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Zhou, K., Chen, L., and Cao, X. (2020, January 23\u201328). Improving multispectral pedestrian detection by addressing modality imbalance problems. Proceedings of the European Conference on Computer Vision (ECCV), Glasgow, UK.","DOI":"10.1007\/978-3-030-58523-5_46"},{"key":"ref_32","unstructured":"Li, C., Song, D., Tong, R., and Tang, M. (2018). Multispectral pedestrian detection via simultaneous detection and segmentation. arXiv."},{"key":"ref_33","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Zhang, L., Zhu, X., Chen, X., Yang, X., Lei, Z., and Liu, Z. (2019, January 27\u201328). Weakly aligned cross-modal learning for multispectral pedestrian detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Korea.","DOI":"10.1109\/ICCV.2019.00523"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/9\/2020\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T22:59:01Z","timestamp":1760137141000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/9\/2020"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,4,22]]},"references-count":34,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2022,5]]}},"alternative-id":["rs14092020"],"URL":"https:\/\/doi.org\/10.3390\/rs14092020","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,4,22]]}}}