{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,19]],"date-time":"2026-03-19T23:27:42Z","timestamp":1773962862376,"version":"3.50.1"},"reference-count":44,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2022,3,23]],"date-time":"2022-03-23T00:00:00Z","timestamp":1647993600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>Visible thermal person re-identification (VT Re-ID) is the task of matching pedestrian images collected by thermal and visible light cameras. The two main challenges presented by VT Re-ID are the intra-class variation between pedestrian images and the cross-modality difference between visible and thermal images. Existing works have principally focused on local representation through cross-modality feature distribution, but ignore the internal connection of the local features of pedestrian body parts. Therefore, this paper proposes a dual-path attention network model to establish the spatial dependency relationship between the local features of the pedestrian feature map and to effectively enhance the feature extraction. Meanwhile, we propose cross-modality dual-constraint loss, which adds the center and boundary constraints for each class distribution in the embedding space to promote compactness within the class and enhance the separability between classes. Our experimental results show that our proposed approach has advantages over the state-of-the-art methods on the two public datasets SYSU-MM01 and RegDB. The result for the SYSU-MM01 is Rank-1\/mAP 57.74%\/54.35%, and the result for the RegDB is Rank-1\/mAP 76.07%\/69.43%.<\/jats:p>","DOI":"10.3390\/e24040443","type":"journal-article","created":{"date-parts":[[2022,3,23]],"date-time":"2022-03-23T12:20:25Z","timestamp":1648038025000},"page":"443","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Learning Visible Thermal Person Re-Identification via Spatial Dependence and Dual-Constraint Loss"],"prefix":"10.3390","volume":"24","author":[{"given":"Chuandong","family":"Wang","sequence":"first","affiliation":[{"name":"School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China"}]},{"given":"Chi","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China"}]},{"given":"Yujian","family":"Feng","sequence":"additional","affiliation":[{"name":"School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China"}]},{"given":"Yimu","family":"Ji","sequence":"additional","affiliation":[{"name":"School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China"}]},{"given":"Jianyu","family":"Ding","sequence":"additional","affiliation":[{"name":"School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China"}]}],"member":"1968","published-online":{"date-parts":[[2022,3,23]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Zheng, L., Bie, Z., Sun, Y., Wang, J., Su, C., Wang, S., and Tian, Q. (2016). Mars: A video benchmark for large-scale person re-identification. European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-319-46466-4_52"},{"key":"ref_2","unstructured":"Zheng, L., Yang, Y., and Hauptmann, A.G. (2016). Person re-identification: Past, present and future. arXiv."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Yi, D., Lei, Z., Liao, S., and Li, S.Z. (2014, January 24\u201328). Deep metric learning for person re-identification. Proceedings of the International Conference on Pattern Recognition, Stockholm, Sweden.","DOI":"10.1109\/ICPR.2014.16"},{"key":"ref_4","unstructured":"Wu, L., Shen, C., and Hengel, A.v.d. (2016). Personnet: Person re-identification with deep convolutional neural networks. arXiv."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Chang, X., Hospedales, T.M., and Xiang, T. (2018, January 18\u201323). Multi-level factorisation net for person re-identification. Proceedings of the Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00225"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1016\/j.neucom.2019.12.100","article-title":"Hetero-center loss for cross-modality person re-identification","volume":"386","author":"Zhu","year":"2020","journal-title":"Neurocomputing"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"653","DOI":"10.1109\/LSP.2021.3065903","article-title":"Strong but simple baseline with dual-granularity triplet loss for visible-thermal person re-identification","volume":"28","author":"Liu","year":"2021","journal-title":"IEEE Signal Process. Lett."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Wang, Z., Wang, Z., Zheng, Y., Chuang, Y.-Y., and Satoh, S. (2019, January 15\u201320). Learning to reduce dual-level discrepancy for infrared-visible person re-identification. Proceedings of the Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00071"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Lu, Y., Wu, Y., Liu, B., Zhang, T., Li, B., Chu, Q., and Yu, N. (2020, January 13\u201319). Cross-modality person re-identification with shared-specific feature transfer. Proceedings of the Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01339"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"4414","DOI":"10.1109\/TMM.2020.3042080","article-title":"Parameter sharing exploration and hetero-center triplet loss for visible-thermal person re-identification","volume":"23","author":"Liu","year":"2020","journal-title":"IEEE Trans. Multimed."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 23\u201330). Deep residual learning for image recognition. Proceedings of the Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Peng, C., Zhang, X., Yu, G., Luo, G., and Sun, J. (2017, January 21\u201326). Large kernel matters\u2013improve semantic segmentation by global convolutional network. Proceedings of the Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.189"},{"key":"ref_14","unstructured":"Ioffe, S., and Szegedy, C. (2015, January 6\u201311). Batch normalization: Accelerating deep network training by reducing internal covariate shift. Proceedings of the International Conference on Machine Learning, Lille, France."},{"key":"ref_15","unstructured":"Bochkovskiy, A., Wang, C.-Y., and Liao, H.-Y.M. (2020). Yolov4: Optimal speed and accuracy of object detection. arXiv."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Wang, X., Girshick, R., Gupta, A., and He, K. (2018, January 18\u201323). Non-local neural networks. Proceedings of the Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00813"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Cao, Y., Xu, J., Lin, S., Wei, F., and Hu, H. (2019, January 27\u201328). Gcnet: Non-local networks meet squeeze-excitation networks and beyond. Proceedings of the Computer Vision Workshops, Seoul, Korea.","DOI":"10.1109\/ICCVW.2019.00246"},{"key":"ref_19","unstructured":"Stollenga, M., Masci, J., Gomez, F., and Schmidhuber, J. (2014). Deep networks with internal selective attention through feedback connections. arXiv."},{"key":"ref_20","first-page":"2017","article-title":"Spatial transformer networks","volume":"28","author":"Jaderberg","year":"2015","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_21","first-page":"207","article-title":"Distance metric learning for large margin nearest neighbor classification","volume":"10","author":"Weinberger","year":"2009","journal-title":"J. Mach. Learn. Res."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Zheng, W.-S., Gong, S., and Xiang, T. (2011, January 20\u201325). Person re-identification by probabilistic relative distance comparison. Proceedings of the Computer Vision and Pattern Recognition, Colorado Springs, CO, USA.","DOI":"10.1109\/CVPR.2011.5995598"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Wu, A., Zheng, W.-S., Yu, H.-X., Gong, S., and Lai, J. (2017, January 22\u201329). Rgb-infrared cross-modality person re-identification. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.575"},{"key":"ref_24","first-page":"8385","article-title":"Hsme: Hypersphere manifold embedding for visible thermal person re-identification","volume":"33","author":"Hao","year":"2019","journal-title":"Assoc. Adv. Artif. Intell."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Dai, P., Ji, R., Wang, H., Wu, Q., and Huang, Y. (2018, January 13\u201319). Cross-modality person re-identification with generative adversarial training. Proceedings of the International Joint Conference on Artificial Intelligence, Stockholm, Sweden.","DOI":"10.24963\/ijcai.2018\/94"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.neucom.2020.01.089","article-title":"Enhancing the discriminative feature learning for visible-thermal cross-modality person re-identification","volume":"398","author":"Liu","year":"2020","journal-title":"Neurocomputing"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"8861","DOI":"10.1109\/TIP.2021.3120881","article-title":"Attend to the difference: Cross-modality person re-identification via contrastive correlation","volume":"30","author":"Zhang","year":"2021","journal-title":"IEEE Trans. Image Process."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Basaran, E., G\u00f6kmen, M., and Kamasak, M.E. (2020). An efficient framework for visible\u2013infrared cross modality person re-identification. Signal Process. Image Commun., 87.","DOI":"10.1016\/j.image.2020.115933"},{"key":"ref_29","first-page":"12144","article-title":"Cross-modality paired-images generation for rgb-infrared person re-identification","volume":"34","author":"Wang","year":"2020","journal-title":"Assoc. Adv. Artif. Intell."},{"key":"ref_30","unstructured":"Elsayed, G.F., Kornblith, S., and Le, Q.V. (2019). Saccader: Improving accuracy of hard attention models for vision. arXiv."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Ye, M., Wang, Z., Lan, X., and Yuen, P.C. (2018, January 13\u201319). Visible thermal person re-identification via dual-constrainted top-ranking. Proceedings of the International Joint Conference on Artificial Intelligence, Stockholm, Sweden.","DOI":"10.24963\/ijcai.2018\/152"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Zang, X., Li, G., Gao, W., and Shu, X. (2021). Learning to disentangle scenes for person re-identification. Image Vis. Comput., 116.","DOI":"10.1016\/j.imavis.2021.104330"},{"key":"ref_33","first-page":"2897","article-title":"Hpiln: A feature learning framework for cross-modality person re-identification","volume":"13","author":"Zhao","year":"2019","journal-title":"Inst. Eng. Technol."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Ye, M., Lan, X., and Leng, Q. (2019, January 21\u201325). Modality-aware collaborative learning for visible thermal person re-identification. Proceedings of the Association for Computing Machinery, Nice, France.","DOI":"10.1145\/3343031.3351043"},{"key":"ref_35","first-page":"7501","article-title":"Hierarchical discriminative learning for visible thermal person re-identification","volume":"32","author":"Ye","year":"2018","journal-title":"Assoc. Adv. Artif. Intell."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Nguyen, D.T., Hong, H.G., Kim, K.W., and Park, K.R. (2017). Person recognition system based on a combination of body images from visible light and thermal cameras. Sensors, 17.","DOI":"10.3390\/s17030605"},{"key":"ref_37","unstructured":"Wang, G., Zhang, T., Cheng, J., Liu, S., Yang, Y., and Hou, Z. (November, January 27). Rgb-infrared cross-modality person re-identification via joint pixel and feature alignment. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Korea."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"1765","DOI":"10.1007\/s11263-019-01290-1","article-title":"Rgb-ir person re-identification by cross-modality similarity preservation","volume":"128","author":"Wu","year":"2020","journal-title":"Int. J. Comput. Vis."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Ye, M., Shen, J., Lin, G., Xiang, T., Shao, L., and Hoi, S.C. (2021). Deep learning for person re-identification: A survey and outlook. IEEE Trans. Pattern Anal. Mach. Intell.","DOI":"10.1109\/TPAMI.2021.3054775"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Hao, Y., Wang, N., Gao, X., Li, J., and Wang, X. (2019, January 21\u201325). Dual-alignment feature embedding for cross-modality person re-identification. Proceedings of the Association for Computing Machinery, Nice, France.","DOI":"10.1145\/3343031.3351006"},{"key":"ref_41","first-page":"4610","article-title":"Infrared-visible cross-modal person re-identification with an x modality","volume":"34","author":"Li","year":"2020","journal-title":"Assoc. Adv. Artif. Intell."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Ye, M., Shen, J., Crandall, D.J., Shao, L., and Luo, J. (2020). Dynamic dual-attentive aggregation learning for visible-infrared person re-identification. European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-030-58520-4_14"},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"728","DOI":"10.1109\/TIFS.2020.3001665","article-title":"Visible-infrared person re-identification via homogeneous augmented tri-modal learning","volume":"16","author":"Ye","year":"2020","journal-title":"IEEE Trans. Inf. Forensics Secur."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Chen, Y., Wan, L., Li, Z., Jing, Q., and Sun, Z. (2021, January 20\u201325). Neural feature search for rgb-infrared person re-identification. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00065"}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/24\/4\/443\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T22:41:32Z","timestamp":1760136092000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/24\/4\/443"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,3,23]]},"references-count":44,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2022,4]]}},"alternative-id":["e24040443"],"URL":"https:\/\/doi.org\/10.3390\/e24040443","relation":{},"ISSN":["1099-4300"],"issn-type":[{"value":"1099-4300","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,3,23]]}}}