{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,12]],"date-time":"2026-06-12T16:58:45Z","timestamp":1781283525192,"version":"3.54.1"},"reference-count":59,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,10,13]],"date-time":"2023-10-13T00:00:00Z","timestamp":1697155200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,10,13]],"date-time":"2023-10-13T00:00:00Z","timestamp":1697155200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62101092"],"award-info":[{"award-number":["62101092"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"publisher","award":["DUT20RC(3)083"],"award-info":[{"award-number":["DUT20RC(3)083"]}],"id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Vis. Intell."],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>As a vital vision task, person re-identification (Re-ID) aims to retrieve the same person under non-overlapping cameras. It is a very challenging task due to the presence of complex backgrounds, diverse illuminations and different perspectives. In this work, we integrate the advantages of convolutional neural networks (CNNs) and transformers, and propose a novel learning framework named convolutional multi-level transformer (CMT) for image-based person Re-ID. More specifically, we first propose a scale-aware feature enhancement (SFE) module to extract multi-scale local features from a pre-trained CNN backbone. Then, we introduce a part-aware transformer encoder (PTE) to further mine discriminative local information guided by global semantics. Finally, a deeply-supervised learning (DSL) technique is adopted to optimize the proposed CMT and improve its training efficiency. Extensive experiments on four large-scale Re-ID benchmarks demonstrate that our method performs favorably against several state-of-the-art methods.<\/jats:p>","DOI":"10.1007\/s44267-023-00025-8","type":"journal-article","created":{"date-parts":[[2023,10,13]],"date-time":"2023-10-13T09:01:28Z","timestamp":1697187688000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":37,"title":["Learning convolutional multi-level transformers for image-based person re-identification"],"prefix":"10.1007","volume":"1","author":[{"given":"Peilei","family":"Yan","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xuehu","family":"Liu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1206-1444","authenticated-orcid":false,"given":"Pingping","family":"Zhang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Huchuan","family":"Lu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2023,10,13]]},"reference":[{"key":"25_CR1","first-page":"1988","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"C. C. Loy","year":"2009","unstructured":"Loy, C. C., Xiang, T., & Gong, S. (2009). Multi-camera activity correlation analysis. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1988\u20131995). Piscataway: IEEE."},{"key":"25_CR2","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1016\/j.patrec.2012.07.005","volume":"34","author":"X. Wang","year":"2013","unstructured":"Wang, X. (2013). Intelligent multi-camera video surveillance: a review. Pattern Recognition Letters, 34, 3\u201319.","journal-title":"Pattern Recognition Letters"},{"issue":"5","key":"25_CR3","doi-asserted-by":"publisher","first-page":"1224","DOI":"10.1109\/TPAMI.2017.2709749","volume":"40","author":"L. Zheng","year":"2017","unstructured":"Zheng, L., Yang, Y., & Tian, Q. (2017). Sift meets CNN: a decade survey of instance retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(5), 1224\u20131244.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"25_CR4","unstructured":"Zheng, L., Yang, Y., & Hauptmann, A. G. (2016). Person re-identification: past, present and future. Preprint. arXiv:1610.02984."},{"key":"25_CR5","first-page":"2360","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"M. Farenzena","year":"2010","unstructured":"Farenzena, M., Bazzani, L., Perina, A., Murino, V., & Cristani, M. (2010). Person re-identification by symmetry-driven accumulation of local features. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2360\u20132367). Piscataway: IEEE."},{"key":"25_CR6","first-page":"391","volume-title":"Proceedings of the 12th European conference on computer vision","author":"C. Liu","year":"2012","unstructured":"Liu, C., Gong, S., Loy, C. C., & Lin, X. (2012). Person re-identification: what features are important? In A. Fusiello, V. Murino, & R. Cucchiara (Eds.), Proceedings of the 12th European conference on computer vision (pp. 391\u2013401). Cham: Springer."},{"key":"25_CR7","first-page":"4184","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"Z. Shi","year":"2015","unstructured":"Shi, Z., Hospedales, T. M., & Xiang, T. (2015). Transferring a semantic representation for person re-identification and search. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 4184\u20134193). Piscataway: IEEE."},{"key":"25_CR8","first-page":"480","volume-title":"Proceedings of the 15th European conference on computer vision","author":"Y. Sun","year":"2018","unstructured":"Sun, Y., Zheng, L., Yang, Y., Tian, Q., & Wang, S. (2018). Beyond part models: person retrieval with refined part pooling (and a strong convolutional baseline). In F. Manhardt, W. Kehl, N. Navab, et al. (Eds.), Proceedings of the 15th European conference on computer vision (pp. 480\u2013496). Cham: Springer."},{"key":"25_CR9","first-page":"3702","volume-title":"2019 IEEE international conference on computer vision","author":"K. Zhou","year":"2019","unstructured":"Zhou, K., Yang, Y., Cavallaro, A., & Xiang, T. (2019). Omni-scale feature learning for person re-identification. In 2019 IEEE international conference on computer vision (pp. 3702\u20133712). Piscataway: IEEE."},{"key":"25_CR10","doi-asserted-by":"publisher","first-page":"274","DOI":"10.1145\/3240508.3240552","volume-title":"Proceedings of the 26th ACM international conference on multimedia","author":"G. Wang","year":"2018","unstructured":"Wang, G., Yuan, Y., Chen, X., Li, J., & Zhou, X. (2018). Learning discriminative features with multiple granularities for person re-identification. In S. Boll, K. M. Lee, J. Luo, et al. (Eds.), Proceedings of the 26th ACM international conference on multimedia (pp. 274\u2013282). New York: ACM."},{"key":"25_CR11","first-page":"8351","volume-title":"2019 IEEE international conference on computer vision","author":"T. Chen","year":"2019","unstructured":"Chen, T., Ding, S., Xie, J., Yuan, Y., Chen, W., Yang, Y., et al. (2019). ABD-net: attentive but diverse person re-identification. In 2019 IEEE international conference on computer vision (pp. 8351\u20138361). Piscataway: IEEE."},{"key":"25_CR12","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., et\u00a0al. (2017). Attention is all you need. Preprint. arXiv:1706.03762."},{"key":"25_CR13","unstructured":"Zhu, K., Guo, H., Zhang, S., Wang, Y., Huang, G., Qiao, H., et\u00a0al. (2021). Aaformer: auto-aligned transformer for person re-identification. Preprint. arXiv:2104.00921."},{"key":"25_CR14","first-page":"15013","volume-title":"2021 IEEE international conference on computer vision","author":"S. He","year":"2021","unstructured":"He, S., Luo, H., Wang, P., Wang, F., Li, H., & Jiang, W. (2021). TransReID: transformer-based object re-identification. In 2021 IEEE international conference on computer vision (pp. 15013\u201315022). Piscataway: IEEE."},{"key":"25_CR15","first-page":"8514","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"F. Zheng","year":"2019","unstructured":"Zheng, F., Deng, C., Sun, X., Jiang, X., Guo, X., Yu, Z., et al. (2019). Pyramidal person re-identification via multi-loss dynamic training. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 8514\u20138522). Piscataway: IEEE."},{"key":"25_CR16","first-page":"3633","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"Q. Yang","year":"2019","unstructured":"Yang, Q., Yu, H.-X., Wu, A., & Zheng, W.-S. (2019). Patch-based discriminative feature learning for unsupervised person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3633\u20133642). Piscataway: IEEE."},{"key":"25_CR17","first-page":"7308","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"Y. Cho","year":"2022","unstructured":"Cho, Y., Kim, W. J., Hong, S., & Yoon, S. E. (2022). Part-based pseudo label refinement for unsupervised person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 7308\u20137318). Piscataway: IEEE."},{"key":"25_CR18","first-page":"371","volume-title":"2019 IEEE international conference on computer vision","author":"B. Chen","year":"2019","unstructured":"Chen, B., Deng, W., & Hu, J. (2019). In 2019 IEEE international conference on computer vision (pp. 371\u2013381). Piscataway: IEEE."},{"key":"25_CR19","first-page":"1025","volume-title":"2021 IEEE international conference on computer vision","author":"Y. Rao","year":"2021","unstructured":"Rao, Y., Chen, G., Lu, J., & Zhou, J. (2021). Counterfactual attention learning for fine-grained visual categorization and re-identification. In 2021 IEEE international conference on computer vision (pp. 1025\u20131034). Piscataway: IEEE."},{"key":"25_CR20","doi-asserted-by":"publisher","first-page":"7663","DOI":"10.1109\/TIP.2021.3107211","volume":"30","author":"G. Chen","year":"2021","unstructured":"Chen, G., Gu, T., Lu, J., Bao, J.-A., & Zhou, J. (2021). Person re-identification via attention pyramid. IEEE Transactions on Image Processing, 30, 7663\u20137676.","journal-title":"IEEE Transactions on Image Processing"},{"key":"25_CR21","first-page":"3186","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"Z. Zhang","year":"2020","unstructured":"Zhang, Z., Lan, C., Zeng, W., Jin, X., & Chen, Z. (2020). Relation-aware global attention for person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3186\u20133195). Piscataway: IEEE."},{"key":"25_CR22","first-page":"2285","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"W. Li","year":"2018","unstructured":"Li, W., Zhu, X., & Gong, S. (2018). Harmonious attention network for person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2285\u20132294). Piscataway: IEEE."},{"key":"25_CR23","first-page":"1","volume-title":"Proceedings of the 10th international conference on learning representations","author":"A. Dosovitskiy","year":"2020","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., et al. (2020). An image is worth 16x16 words: transformers for image recognition at scale. In Proceedings of the 10th international conference on learning representations (pp. 1\u201313). Retrieved August 25, 2023, from https:\/\/openreview.net\/pdf?id=YicbFdNTTy."},{"key":"25_CR24","first-page":"4150","volume-title":"2019 IEEE international conference on computer vision","author":"S. Lai","year":"2021","unstructured":"Lai, S., Chai, Z., & Wei, X. (2021). Transformer meets part model: adaptive part division for person re-identification. In 2019 IEEE international conference on computer vision (pp. 4150\u20134157). Piscataway: IEEE."},{"key":"25_CR25","first-page":"2898","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"Y. Li","year":"2021","unstructured":"Li, Y., He, J., Zhang, T., Liu, X., Zhang, Y. D., & Wu, F. (2021). Diverse part discovery: occluded person re-identification with part-aware transformer. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2898\u20132907). Piscataway: IEEE."},{"key":"25_CR26","first-page":"1992","volume-title":"Advances in neural information processing systems 34","author":"S. Liao","year":"2021","unstructured":"Liao, S., & Shao, L. (2021). Transmatcher: deep image matching through transformers for generalizable person re-identification. In M. Ranzato, A. Beygelzimer, Y. Dauphin, et al. (Eds.), Advances in neural information processing systems 34 (pp. 1992\u20132003). Red Hook: Curran Associates."},{"key":"25_CR27","doi-asserted-by":"publisher","first-page":"1155","DOI":"10.1109\/LSP.2021.3087079","volume":"28","author":"G. Wang","year":"2021","unstructured":"Wang, G., Chen, X., Gao, J., Zhou, X., & Ge, S. (2021). Self-guided body part alignment with relation transformers for occluded person re-identification. IEEE Signal Processing Letters, 28, 1155\u20131159.","journal-title":"IEEE Signal Processing Letters"},{"key":"25_CR28","unstructured":"Chen, X., Xu, J., Xu, J., & Gao, S. (2021). OH-Former: omni-relational high-order transformer for person re-identification. Preprint. arXiv:2109.11159."},{"key":"25_CR29","doi-asserted-by":"publisher","first-page":"1487","DOI":"10.1145\/3474085.3475283","volume-title":"Proceedings of the 29th ACM international conference on multimedia","author":"Z. Ma","year":"2021","unstructured":"Ma, Z., Zhao, Y., & Li, J. (2021). Pose-guided inter-and intra-part relational transformer for occluded person re-identification. In H. T. Shen, Y. Zhuang, J. R. Smith, et al. (Eds.), Proceedings of the 29th ACM international conference on multimedia (pp. 1487\u20131496). New York: ACM."},{"key":"25_CR30","unstructured":"Liu, X., Zhang, P., Yu, C., Lu, H., Qian, X., & Yang, X. (2021). A video is worth three views: trigeminal transformers for video-based person re-identification. Preprint. arXiv:2104.01745."},{"key":"25_CR31","first-page":"770","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"K. He","year":"2016","unstructured":"He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770\u2013778). Piscataway: IEEE."},{"key":"25_CR32","unstructured":"Yu, F., & Koltun, V. (2015). Multi-scale context aggregation by dilated convolutions. Preprint. arXiv:1511.07122."},{"key":"25_CR33","doi-asserted-by":"publisher","first-page":"516","DOI":"10.1145\/3474085.3475202","volume-title":"Proceedings of the 29th ACM international conference on multimedia","author":"G. Zhang","year":"2021","unstructured":"Zhang, G., Zhang, P., Qi, J., & Lu, H. (2021). HAT: hierarchical aggregation transformers for person re-identification. In H. T. Shen, Y. Zhuang, J. R. Smith, et al. (Eds.), Proceedings of the 29th ACM international conference on multimedia (pp. 516\u2013525). New York: ACM."},{"key":"25_CR34","first-page":"202","volume-title":"2017 IEEE international conference on computer vision","author":"P. Zhang","year":"2017","unstructured":"Zhang, P., Wang, D., Lu, H., Wang, H., & Ruan, X. (2017). Amulet: aggregating multi-level convolutional features for salient object detection. In 2017 IEEE international conference on computer vision (pp. 202\u2013211). Piscataway: IEEE."},{"key":"25_CR35","first-page":"2818","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"C. Szegedy","year":"2016","unstructured":"Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., & Wojna, Z. (2016). Rethinking the inception architecture for computer vision. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2818\u20132826). Piscataway: IEEE."},{"key":"25_CR36","unstructured":"Hermans, A., Beyer, L., & Leibe, B. (2017). In defense of the triplet loss for person re-identification. Preprint. arXiv:1703.07737."},{"key":"25_CR37","first-page":"1116","volume-title":"2015 IEEE international conference on computer vision","author":"L. Zheng","year":"2015","unstructured":"Zheng, L., Shen, L., Tian, L., Wang, S., Wang, J. D., & Tian, Q. (2015). Scalable person re-identification: a benchmark. In 2015 IEEE international conference on computer vision (pp. 1116\u20131124). Piscataway: IEEE."},{"key":"25_CR38","first-page":"3754","volume-title":"2017 IEEE international conference on computer vision","author":"Z. Zheng","year":"2017","unstructured":"Zheng, Z., Zheng, L., & Yang, Y. (2017). Unlabeled samples generated by GAN improve the person re-identification baseline in vitro. In 2017 IEEE international conference on computer vision (pp. 3754\u20133762). Piscataway: IEEE."},{"key":"25_CR39","first-page":"152","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"W. Li","year":"2014","unstructured":"Li, W., Zhao, R., Xiao, T., & Wang, X. (2014). Deepreid: deep filter pairing neural network for person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 152\u2013159). Piscataway: IEEE."},{"key":"25_CR40","first-page":"79","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"L. Wei","year":"2018","unstructured":"Wei, L., Zhang, S., Gao, W., & Tian, Q. (2018). Person transfer GAN to bridge domain gap for person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 79\u201388). Piscataway: IEEE."},{"key":"25_CR41","first-page":"13001","volume-title":"Proceedings of the 34th AAAI conference on artificial intelligence","author":"Z. Zhong","year":"2020","unstructured":"Zhong, Z., Zheng, L., Kang, G., Li, S., & Yang, Y. (2020). Random erasing data augmentation. In Proceedings of the 34th AAAI conference on artificial intelligence (pp. 13001\u201313008). Palo Alto: AAAI Press."},{"key":"25_CR42","unstructured":"Kingma, D. P., & Ba, J. (2014). Adam: a method for stochastic optimization. Preprint. arXiv:1412.6980."},{"key":"25_CR43","first-page":"3300","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"X. Chen","year":"2020","unstructured":"Chen, X., Fu, C., Zhao, Y., Zheng, F., Song, J., Ji, R., et al. (2020). Salience-guided cascaded suppression network for person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3300\u20133310). Piscataway: IEEE."},{"key":"25_CR44","first-page":"5363","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"J. Si","year":"2018","unstructured":"Si, J., Zhang, H., Li, C.-G., Kuen, J., Kong, X., Kot, A. C., et al. (2018). Dual attention matching network for context-aware feature sequence based person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 5363\u20135372). Piscataway: IEEE."},{"key":"25_CR45","first-page":"365","volume-title":"Proceedings of the 15th European conference on computer vision","author":"C. Wang","year":"2018","unstructured":"Wang, C., Zhang, Q., Huang, C., Liu, W., & Wang, X. (2018). Mancs: a multi-task attentional network with curriculum sampling for person re-identification. In F. Manhardt, W. Kehl, N. Navab, et al. (Eds.), Proceedings of the 15th European conference on computer vision (pp. 365\u2013381). Cham: Springer."},{"key":"25_CR46","first-page":"9317","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"R. Hou","year":"2019","unstructured":"Hou, R., Ma, B., Chang, H., Gu, X., Shan, S., & Chen, X. (2019). Interaction-and-aggregation network for person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 9317\u20139326). Piscataway: IEEE."},{"key":"25_CR47","first-page":"1487","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"H. Luo","year":"2019","unstructured":"Luo, H., Gu, Y., Liao, X., Lai, S., & Jiang, W. (2019). Bag of tricks and a strong baseline for deep person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1487\u20131495). Piscataway: IEEE."},{"key":"25_CR48","first-page":"1062","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"M. M. Kalayeh","year":"2018","unstructured":"Kalayeh, M. M., Basaran, E., G\u00f6kmen, M., Kamasak, M. E., & Shah, M. (2018). Human semantic parsing for person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1062\u20131071). Piscataway: IEEE."},{"key":"25_CR49","first-page":"7134","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"C.-P. Tay","year":"2019","unstructured":"Tay, C.-P., Roy, S., & Yap, K.-H. (2019). AANet: attribute attention network for person re-identifications. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 7134\u20137143). Piscataway: IEEE."},{"key":"25_CR50","first-page":"5735","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"M. Zheng","year":"2019","unstructured":"Zheng, M., Karanam, S., Wu, Z., & Radke, R. J. (2019). Re-identification with consistent attentive Siamese networks. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 5735\u20135744). Piscataway: IEEE."},{"key":"25_CR51","first-page":"1389","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"W. Yang","year":"2019","unstructured":"Yang, W., Huang, H., Zhang, Z., Chen, X., Huang, K., & Zhang, S. (2019). Towards rich feature discovery with class activation maps augmentation for person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1389\u20131398). Piscataway: IEEE."},{"key":"25_CR52","first-page":"8030","volume-title":"2019 IEEE international conference on computer vision","author":"P. Fang","year":"2019","unstructured":"Fang, P., Zhou, J., Roy, S. K., Petersson, L., & Harandi, M. (2019). Bilinear attention networks for person retrieval. In 2019 IEEE international conference on computer vision (pp. 8030\u20138039). Piscataway: IEEE."},{"key":"25_CR53","first-page":"3691","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"Z. Dai","year":"2019","unstructured":"Dai, Z., Chen, M., Gu, X., Zhu, S., & Tan, P. (2019). Batch dropblock network for person re-identification and beyond. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3691\u20133701). Piscataway: IEEE."},{"key":"25_CR54","first-page":"2138","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"Z. Zheng","year":"2019","unstructured":"Zheng, Z., Yang, X., Yu, Z., Zheng, L., Yang, Y., & Kautz, J. (2019). Joint discriminative and generative learning for person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2138\u20132147). Piscataway: IEEE."},{"key":"25_CR55","first-page":"3143","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"X. Jin","year":"2020","unstructured":"Jin, X., Lan, C., Zeng, W., Chen, Z., & Zhang, L. (2020). Style normalization and restitution for generalizable person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3143\u20133152). Piscataway: IEEE."},{"key":"25_CR56","doi-asserted-by":"crossref","unstructured":"Zhu, K., Guo, H., Liu, Z., Tang, M., & Wang, J. (2020). Identity-guided human semantic parsing for person re-identification. Preprint. arXiv:2007.13467.","DOI":"10.1007\/978-3-030-58580-8_21"},{"key":"25_CR57","doi-asserted-by":"publisher","first-page":"673","DOI":"10.1145\/3394171.3414056","volume-title":"Proceedings of the 28th ACM international conference on multimedia","author":"B. Xu","year":"2020","unstructured":"Xu, B., He, L., Liao, X., Liu, W., Sun, Z., & Mei, T. (2020). Black Re-ID: a head-shoulder descriptor for the challenging problem of person re-identification. In C. W. Chen, R. Cucchiara, X.-S. Hua, et al. (Eds.), Proceedings of the 28th ACM international conference on multimedia (pp. 673\u2013681). New York: ACM."},{"key":"25_CR58","doi-asserted-by":"crossref","unstructured":"Li, H., Wu, G., & Zheng, W.-S. (2021). Combined depth space based architecture search for person re-identification. Preprint. arXiv:2104.04163.","DOI":"10.1109\/CVPR46437.2021.00666"},{"key":"25_CR59","first-page":"2579","volume":"9","author":"L. van der Maaten","year":"2008","unstructured":"van der Maaten, L., & Hinton, G. (2008). Visualizing data using t-SNE. Journal of Machine Learning Research, 9, 2579\u20132605.","journal-title":"Journal of Machine Learning Research"}],"container-title":["Visual Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s44267-023-00025-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s44267-023-00025-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s44267-023-00025-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,19]],"date-time":"2023-11-19T20:03:59Z","timestamp":1700424239000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s44267-023-00025-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,13]]},"references-count":59,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["25"],"URL":"https:\/\/doi.org\/10.1007\/s44267-023-00025-8","relation":{},"ISSN":["2731-9008"],"issn-type":[{"value":"2731-9008","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,10,13]]},"assertion":[{"value":"17 April 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 September 2023","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 September 2023","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 October 2023","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"24"}}