{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:19:56Z","timestamp":1760145596449,"version":"build-2065373602"},"reference-count":35,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2024,8,13]],"date-time":"2024-08-13T00:00:00Z","timestamp":1723507200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Double First-Class Innovation Research Project for the People\u2019s Public Security University of China","award":["2023SYL08"],"award-info":[{"award-number":["2023SYL08"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>In order to minimize the disparity between visible and infrared modalities and enhance pedestrian feature representation, a cross-modality person re-identification method is proposed, which integrates modality generation and feature enhancement. Specifically, a lightweight network is used for dimension reduction and augmentation of visible images, and intermediate modalities are generated to bridge the gap between visible images and infrared images. The Convolutional Block Attention Module is embedded into the ResNet50 backbone network to selectively emphasize key features sequentially from both channel and spatial dimensions. Additionally, the Gradient Centralization algorithm is introduced into the Stochastic Gradient Descent optimizer to accelerate convergence speed and improve generalization capability of the network model. 
Experimental results on SYSU-MM01 and RegDB datasets demonstrate that our improved network model achieves significant performance gains, with an increase in Rank-1 accuracy of 7.12% and 6.34%, as well as an improvement in mAP of 4.00% and 6.05%, respectively.<\/jats:p>","DOI":"10.3390\/e26080681","type":"journal-article","created":{"date-parts":[[2024,8,13]],"date-time":"2024-08-13T09:08:10Z","timestamp":1723540090000},"page":"681","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Cross-Modality Person Re-Identification Method with Joint-Modality Generation and Feature Enhancement"],"prefix":"10.3390","volume":"26","author":[{"given":"Yihan","family":"Bi","sequence":"first","affiliation":[{"name":"School of Information and Cyber Security, People\u2019s Public Security University of China, Beijing 100038, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rong","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Information and Cyber Security, People\u2019s Public Security University of China, Beijing 100038, China"},{"name":"Key Laboratory of Security Prevention Technology and Risk Assessment of Ministry of Public Security, Beijing 100038, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qianli","family":"Zhou","sequence":"additional","affiliation":[{"name":"Beijing Public Security Bureau, Beijing 100038, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhaolong","family":"Zeng","sequence":"additional","affiliation":[{"name":"School of Information and Cyber Security, People\u2019s Public Security University of China, Beijing 100038, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ronghui","family":"Lin","sequence":"additional","affiliation":[{"name":"School of Information and Cyber Security, People\u2019s Public Security University of China, Beijing 100038, 
China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mingjie","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Information and Cyber Security, People\u2019s Public Security University of China, Beijing 100038, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2024,8,13]]},"reference":[{"key":"ref_1","first-page":"57","article-title":"A Review of Person Re-Identification Based on Deep Learning","volume":"23","author":"Yang","year":"2023","journal-title":"China Water Transp. (Second. Half Mon.)"},{"key":"ref_2","first-page":"1100","article-title":"Review of Person Re-identification","volume":"48","author":"Wang","year":"2022","journal-title":"J. Beijing Inst. Technol."},{"key":"ref_3","unstructured":"Liu, T., and Liu, Z. (2021). Overview of Cross Modality Person Re-Identification Research. Mod. Comput. Sci., 135\u2013139."},{"key":"ref_4","first-page":"2018","article-title":"A cross-modality person re-identification method for visible-infrared images","volume":"50","author":"Sun","year":"2022","journal-title":"J. Beijing Univ. Aeronaut. Astronaut."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Han, C., Pan, P., Zheng, A., and Tang, J. (2021). Cross-Modality Person Re-Identification Based on Heterogeneous Center Loss and Non-Local Features. Entropy, 23.","DOI":"10.3390\/e23070919"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Wu, A., Zheng, W.S., Yu, H.X., Gong, S., and Lai, J. (2017, January 22\u201329). RGB-infrared cross-modality person re-identification. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.575"},{"key":"ref_7","first-page":"10","article-title":"Cross-modal pedestrian re-recognition based on color randomization and full related attention","volume":"42","author":"Yu","year":"2023","journal-title":"Foreign Electron. Meas. 
Technol."},{"key":"ref_8","first-page":"94","article-title":"Cross-modal person re-identification algorithm based on multi-level joint clustering with feature enhancement","volume":"38","author":"Fan","year":"2024","journal-title":"J. Electron. Meas. Instrum."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Wang, C., Zhang, C., Feng, Y., Ji, Y., and Ding, J. (2022). Learning Visible Thermal Person Re-Identification via Spatial Dependence and Dual Constraint Loss. Entropy, 24.","DOI":"10.3390\/e24040443"},{"key":"ref_10","first-page":"1210","article-title":"Multi-granularity cross-modality person re-identification with hetero-center angular constraints","volume":"45","author":"Zou","year":"2024","journal-title":"Comput. Eng. Des."},{"key":"ref_11","first-page":"221","article-title":"Visible-infrared Person Re-Identification Via Feature Constrained Learning","volume":"61","author":"Zhang","year":"2024","journal-title":"Prog. Laser Optoelectron."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Wang, Z., Wang, Z., Zheng, Y., Chuang, Y.Y., and Satoh, S.I. (2019, January 15\u201320). Learning to reduce dual-level discrepancy for infrared-visible person re-identification. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00071"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Wang, G., Zhang, T., Cheng, J., Liu, S., Yang, Y., and Hou, Z. (2019, January 27\u201328). RGB-infrared cross-modality person re-identification via joint pixel and feature alignment. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00372"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1016\/j.patrec.2021.07.006","article-title":"RGB-IR cross-modality person ReID based on teacher-student GAN model","volume":"150","author":"Zhang","year":"2021","journal-title":"Pattern Recognit. 
Lett."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"2872","DOI":"10.1109\/TPAMI.2021.3054775","article-title":"Deep learning for person re-identification: A survey and outlook","volume":"44","author":"Ye","year":"2021","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_16","first-page":"9","article-title":"A Review of Cross-Modal Person Re-Identification","volume":"46","author":"Liu","year":"2022","journal-title":"Telev. Technol."},{"key":"ref_17","unstructured":"Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., and Bengio, Y. (2014). Generative adversarial networks. arXiv."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Li, D., Wei, X., Hong, X., and Gong, Y. (2020, January 7\u201312). Infrared-visible cross-modal person re-identification with an x modality. Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA.","DOI":"10.1609\/aaai.v34i04.5891"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.Y., and Kweon, I.S. (2018, January 8\u201314). CBAM: Convolutional block attention module. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_20","unstructured":"Yong, H., Huang, J., Hua, X., and Zhang, L. (2020, January 23\u201328). Gradient centralization: A new optimization technique for deep neural networks. Proceedings of the Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK. Proceedings, Part I 16."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Wang, X., Girshick, R., Gupta, A., and He, K. 
(2018, January 18\u201323). Non-local neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00813"},{"key":"ref_23","first-page":"1655","article-title":"Fine-tuning CNN image retrieval with no human annotation","volume":"41","author":"Tolias","year":"2018","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_24","unstructured":"Ioffe, S., and Szegedy, C. (2015). Batch normalization: Accelerating deep network training by reducing internal covariate shift. Proceedings of the International Conference on Machine Learning, PMLR."},{"key":"ref_25","unstructured":"Glorot, X., Bordes, A., and Bengio, Y. (2011). Deep sparse rectifier neural networks. Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, PMLR. JMLR Workshop and Conference Proceedings."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1109\/5.726791","article-title":"Gradient-based learning applied to document recognition","volume":"86","author":"LeCun","year":"1998","journal-title":"Proc. IEEE"},{"key":"ref_27","unstructured":"Hermans, A., Beyer, L., and Leibe, B. (2017). In defense of the triplet loss for person re-identification. arXiv."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Nguyen, D.T., Hong, H.G., Kim, K.W., and Park, K.R. (2017). Person recognition system based on a combination of body images from visible light and thermal cameras. Sensors, 17.","DOI":"10.3390\/s17030605"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1661","DOI":"10.1109\/TCSVT.2016.2515309","article-title":"An asymmetric distance model for cross-view feature mapping in person reidentification","volume":"27","author":"Chen","year":"2016","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Ye, M., Lan, X., Li, J., and Yuen, P. 
(2018, January 2\u20137). Hierarchical discriminative learning for visible thermal person re-identification. Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, LA, USA.","DOI":"10.1609\/aaai.v32i1.12293"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"407","DOI":"10.1109\/TIFS.2019.2921454","article-title":"Bi-directional center-constrained top-ranking for visible thermal person re-identification","volume":"15","author":"Ye","year":"2019","journal-title":"IEEE Trans. Inf. Forensics Secur."},{"key":"ref_32","unstructured":"Hao, Y., Wang, N., Li, J., and Gao, X. (2019, January 27\u2013February 1). HSME: Hypersphere manifold embedding for visible thermal person re-identification. Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Ye, M., Lan, X., and Leng, Q. (2019, January 21\u201325). Modality-aware collaborative learning for visible thermal person re-identification. Proceedings of the 27th ACM International Conference on Multimedia, Nice, France.","DOI":"10.1145\/3343031.3351043"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"579","DOI":"10.1109\/TIP.2019.2928126","article-title":"Learning modality-specific representations for visible-infrared person re-identification","volume":"29","author":"Feng","year":"2019","journal-title":"IEEE Trans. Image Process."},{"key":"ref_35","unstructured":"Choi, S., Lee, S., Kim, Y., Kim, T., and Kim, C. (2022, January 18\u201324). Hi-CMD: Hierarchical Cross-Modality Disentanglement for Visible-Infrared Person Re-Identification. 
Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA."}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/26\/8\/681\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T15:35:41Z","timestamp":1760110541000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/26\/8\/681"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,8,13]]},"references-count":35,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2024,8]]}},"alternative-id":["e26080681"],"URL":"https:\/\/doi.org\/10.3390\/e26080681","relation":{},"ISSN":["1099-4300"],"issn-type":[{"type":"electronic","value":"1099-4300"}],"subject":[],"published":{"date-parts":[[2024,8,13]]}}}