{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T22:09:39Z","timestamp":1740175779233,"version":"3.37.3"},"reference-count":85,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2024,5,23]],"date-time":"2024-05-23T00:00:00Z","timestamp":1716422400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,5,23]],"date-time":"2024-05-23T00:00:00Z","timestamp":1716422400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62076033"],"award-info":[{"award-number":["62076033"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2024,8]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Thanks to the success of deep learning over the past few years, the video person re-identification (ReID) algorithms have achieved high accuracy on multiple public benchmark datasets. However, the available video person ReID datasets cover a limited range of real-world scenarios, and they have several obvious limitations: limited camera viewing angles, tiny variations of the shooting scene, and even errors in manual labels. These disadvantages prevent video person ReID from being widely used in real-life scenarios. In this work, a new high-quality multi-situation video person ReID dataset, named MSA-BUPT, is built to promote the video person ReID task in large-scale urban surveillance. 
Specifically, MSA-BUPT contains 684 identities, 2,665 trajectories, and nearly 250,000 frames from 200-h videos across various complex scenarios. Person attribute annotations and unannotated video data are also provided for other research perspectives, such as cross-modality ReID and cross-domain ReID. Furthermore, two plug-and-play components are used to improve retrieval capabilities: a new scenario-based data augmentation method is proposed to alleviate the person misalignment problem; a re-ranking strategy based on person attributes is applied to make secondary adjustments to the results of the model. The extensive experimental results show that the above methods improve the performance of some representative state-of-the-art models on the new dataset.<\/jats:p>","DOI":"10.1007\/s40747-024-01474-4","type":"journal-article","created":{"date-parts":[[2024,5,23]],"date-time":"2024-05-23T04:01:55Z","timestamp":1716436915000},"page":"5865-5881","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Situational diversity in video person re-identification: introducing MSA-BUPT dataset"],"prefix":"10.1007","volume":"10","author":[{"given":"Ruining","family":"Zhao","sequence":"first","affiliation":[]},{"given":"Jiaxuan","family":"Liu","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6506-7298","authenticated-orcid":false,"given":"Zhicheng","family":"Zhao","sequence":"additional","affiliation":[]},{"given":"Ziqi","family":"He","sequence":"additional","affiliation":[]},{"given":"Fei","family":"Su","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,5,23]]},"reference":[{"key":"1474_CR1","doi-asserted-by":"publisher","unstructured":"Song G, Leng B, Liu Y, Hetang C, Cai S (2018) Region-based quality estimation network for large-scale person re-identification. 
In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32 . https:\/\/doi.org\/10.1609\/aaai.v32i1.12305","DOI":"10.1609\/aaai.v32i1.12305"},{"key":"1474_CR2","doi-asserted-by":"publisher","unstructured":"Wu Y, Lin Y, Dong X, Yan Y, Ouyang W, Yang Y (2018) Exploit the unknown gradually: One-shot video-based person re-identification by stepwise learning. In: 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 5177\u20135186. https:\/\/doi.org\/10.1109\/CVPR.2018.00543","DOI":"10.1109\/CVPR.2018.00543"},{"key":"1474_CR3","doi-asserted-by":"publisher","first-page":"3436","DOI":"10.1007\/s10489-019-01459-8","volume":"49","author":"J Liu","year":"2019","unstructured":"Liu J, Sun C, Xu X, Xu B, Yu S (2019) A spatial and temporal features mixture model with body parts for video-based person re-identification. Appl Intell 49:3436\u20133446. https:\/\/doi.org\/10.1007\/s10489-019-01459-8","journal-title":"Appl Intell"},{"key":"1474_CR4","doi-asserted-by":"publisher","unstructured":"He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 770\u2013778. https:\/\/doi.org\/10.1109\/CVPR.2016.90","DOI":"10.1109\/CVPR.2016.90"},{"key":"1474_CR5","doi-asserted-by":"publisher","unstructured":"Hou R, Chang H, Ma B, Huang R, Shan S (2021) Bicnet-tks: learning efficient spatial-temporal representation for video person re-identification. In: 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 2014\u20132023. https:\/\/doi.org\/10.1109\/CVPR46437.2021.00205","DOI":"10.1109\/CVPR46437.2021.00205"},{"issue":"3","key":"1474_CR6","doi-asserted-by":"publisher","first-page":"1366","DOI":"10.1109\/TIP.2018.2878505","volume":"28","author":"J Dai","year":"2019","unstructured":"Dai J, Zhang P, Wang D, Lu H, Wang H (2019) Video person re-identification by temporal residual learning. IEEE Trans Image Process 28(3):1366\u20131377. 
https:\/\/doi.org\/10.1109\/TIP.2018.2878505","journal-title":"IEEE Trans Image Process"},{"key":"1474_CR7","doi-asserted-by":"publisher","unstructured":"Wang Y, Zhang P, Gao S, Geng X, Lu H, Wang D (2021) Pyramid spatial-temporal aggregation for video-based person re-identification. In: 2021 IEEE\/CVF International Conference on Computer Vision, pp. 12006\u201312015. https:\/\/doi.org\/10.1109\/ICCV48922.2021.01181","DOI":"10.1109\/ICCV48922.2021.01181"},{"key":"1474_CR8","doi-asserted-by":"publisher","DOI":"10.1016\/j.jprocont.2023.103112","volume":"132","author":"H Tao","year":"2023","unstructured":"Tao H, Zheng J, Wei J, Paszke W, Rogers E, Stojanovic V (2023) Repetitive process based indirect-type iterative learning control for batch processes with model uncertainty and input delay. J Process Control 132:103112. https:\/\/doi.org\/10.1016\/j.jprocont.2023.103112","journal-title":"J Process Control"},{"key":"1474_CR9","doi-asserted-by":"publisher","first-page":"101","DOI":"10.1016\/j.ins.2023.03.070","volume":"634","author":"H Wan","year":"2023","unstructured":"Wan H, Luan X, Stojanovic V, Liu F (2023) Self-triggered finite-time control for discrete-time markov jump systems. Inform Sci 634:101\u2013121. https:\/\/doi.org\/10.1016\/j.ins.2023.03.070","journal-title":"Inform Sci"},{"key":"1474_CR10","doi-asserted-by":"publisher","unstructured":"Zheng L, Bie Z, Sun Y, Wang J, Su C, Wang S, Tian Q (2016) Mars: a video benchmark for large-scale person re-identification. In: Leibe B, Matas J, Sebe N, Welling M (eds) Computer vision\u2013ECCV 2016. Lecture Notes in Computer Science, vol. 9910, pp. 868\u2013884. https:\/\/doi.org\/10.1007\/978-3-319-46466-4_52","DOI":"10.1007\/978-3-319-46466-4_52"},{"key":"1474_CR11","doi-asserted-by":"publisher","unstructured":"Wang T, Gong S, Zhu X, Wang S (2014) Person re-identification by video ranking. In: Computer Vision\u2013ECCV 2014. Lecture Notes in Computer Science, vol. 8692, pp. 688\u2013703. Springer. 
https:\/\/doi.org\/10.1007\/978-3-319-10593-2_45","DOI":"10.1007\/978-3-319-10593-2_45"},{"key":"1474_CR12","doi-asserted-by":"publisher","unstructured":"Li J, Wang G, Yan Y, Yu F, Jia Q, Qin J, Ding S, Yang X (2023) Generalizable person search on open-world user-generated video content. https:\/\/doi.org\/10.48550\/arXiv.2310.10068. arXiv preprint arXiv:2310.10068","DOI":"10.48550\/arXiv.2310.10068"},{"key":"1474_CR13","doi-asserted-by":"publisher","unstructured":"Guo P, Liu H, Wu J, Wang G, Wang T (2023) Semantic-aware consistency network for cloth-changing person re-identification. In: Proceedings of the 31st ACM International Conference on Multimedia, pp. 8730\u20138739 . https:\/\/doi.org\/10.1145\/3581783.3612416","DOI":"10.1145\/3581783.3612416"},{"key":"1474_CR14","doi-asserted-by":"publisher","unstructured":"Xiang S, You G, Li L, Guan M, Liu T, Qian D, Fu Y (2022) Rethinking illumination for person re-identification: A unified view. In: 2022 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp. 4730\u20134738. https:\/\/doi.org\/10.1109\/CVPRW56347.2022.00519","DOI":"10.1109\/CVPRW56347.2022.00519"},{"key":"1474_CR15","doi-asserted-by":"publisher","unstructured":"Jiao J, Zheng W-S, Wu A, Zhu X, Gong S (2018) Deep low-resolution person re-identification. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32. https:\/\/doi.org\/10.1609\/aaai.v32i1.12284","DOI":"10.1609\/aaai.v32i1.12284"},{"key":"1474_CR16","doi-asserted-by":"publisher","unstructured":"Sun X, Zheng L (2019) Dissecting person re-identification from the viewpoint of viewpoint. In: 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 608\u2013617. 
https:\/\/doi.org\/10.1109\/CVPR.2019.00070","DOI":"10.1109\/CVPR.2019.00070"},{"key":"1474_CR17","doi-asserted-by":"publisher","unstructured":"Davila D, Du D, Lewis B, Funk C, Van\u00a0Pelt J, Collins R, Corona K, Brown M, McCloskey S, Hoogs A, Clipp B (2023) Mevid: Multi-view extended videos with identities for video person re-identification. In: 2023 IEEE\/CVF Winter Conference on Applications of Computer Vision, pp. 1634\u20131643. https:\/\/doi.org\/10.1109\/WACV56688.2023.00168","DOI":"10.1109\/WACV56688.2023.00168"},{"key":"1474_CR18","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2023.121305","volume":"237","author":"P Wu","year":"2024","unstructured":"Wu P, Wang Z, Li H, Zeng N (2024) Kd-par: a knowledge distillation-based pedestrian attribute recognition model with multi-label mixed feature learning network. Expert Syst Appl 237:121305. https:\/\/doi.org\/10.1016\/j.eswa.2023.121305","journal-title":"Expert Syst Appl"},{"key":"1474_CR19","doi-asserted-by":"publisher","unstructured":"Huang H, Yang W, Chen X, Zhao X, Huang K, Lin J, Huang G, Du D (2018) Eanet: enhancing alignment for cross-domain person re-identification. https:\/\/doi.org\/10.48550\/arXiv.1812.11369. arXiv:1812.11369","DOI":"10.48550\/arXiv.1812.11369"},{"key":"1474_CR20","doi-asserted-by":"publisher","unstructured":"Kumar D, Siva P, Marchwica P, Wong A (2019) Fairest of them all: establishing a strong baseline for cross-domain person reid. https:\/\/doi.org\/10.48550\/arXiv.1907.12016. arXiv preprint arXiv:1907.12016","DOI":"10.48550\/arXiv.1907.12016"},{"key":"1474_CR21","doi-asserted-by":"publisher","unstructured":"Wang D, Zhang S (2020) Unsupervised person re-identification via multi-label classification. In: 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 10978\u201310987. 
https:\/\/doi.org\/10.1109\/CVPR42600.2020.01099","DOI":"10.1109\/CVPR42600.2020.01099"},{"key":"1474_CR22","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2023.126498","volume":"550","author":"X Song","year":"2023","unstructured":"Song X, Wu N, Song S, Zhang Y, Stojanovic V (2023) Bipartite synchronization for cooperative-competitive neural networks with reaction\u2013diffusion terms via dual event-triggered mechanism. Neurocomputing 550:126498. https:\/\/doi.org\/10.1016\/j.neucom.2023.126498","journal-title":"Neurocomputing"},{"issue":"8","key":"1474_CR23","doi-asserted-by":"publisher","first-page":"3368","DOI":"10.1109\/TIP.2014.2330763","volume":"23","author":"L Zheng","year":"2014","unstructured":"Zheng L, Wang S, Tian Q (2014) Coupled binary embedding for large-scale image retrieval. IEEE Transa Image Process 23(8):3368\u20133380. https:\/\/doi.org\/10.1109\/TIP.2014.2330763","journal-title":"IEEE Transa Image Process"},{"issue":"10","key":"1474_CR24","doi-asserted-by":"publisher","first-page":"2597","DOI":"10.1109\/TMM.2019.2958756","volume":"22","author":"H Luo","year":"2020","unstructured":"Luo H, Jiang W, Gu Y, Liu F, Liao X, Lai S, Gu J (2020) A strong baseline and batch normalization neck for deep person re-identification. IEEE Trans Multimed 22(10):2597\u20132609. https:\/\/doi.org\/10.1109\/TMM.2019.2958756","journal-title":"IEEE Trans Multimed"},{"key":"1474_CR25","doi-asserted-by":"publisher","unstructured":"Ren J, Ma X, Xu C, Zhao H, Yi S (2021) Havana: hierarchical and variation-normalized autoencoder for person re-identification. https:\/\/doi.org\/10.48550\/arXiv.2101.02568. arXiv preprint arXiv:2101.02568","DOI":"10.48550\/arXiv.2101.02568"},{"key":"1474_CR26","doi-asserted-by":"publisher","unstructured":"Suh Y, Wang J, Tang S, Mei T, Lee KM (2018) Part-aligned bilinear representations for person re-identification. In: Proceedings of the European Conference on Computer Vision, pp. 402\u2013419. 
https:\/\/doi.org\/10.1109\/TCSVT.2020.3037179","DOI":"10.1109\/TCSVT.2020.3037179"},{"key":"1474_CR27","doi-asserted-by":"publisher","unstructured":"Wang X, Zhao R (2018) Person re-identification: System design and evaluation overview. In: Person Re-Identification, pp. 351\u2013370. https:\/\/doi.org\/10.1007\/978-1-4471-6296-4_17","DOI":"10.1007\/978-1-4471-6296-4_17"},{"key":"1474_CR28","doi-asserted-by":"publisher","unstructured":"Si J, Zhang H, Li C-G, Kuen J, Kong X, Kot AC, Wang G (2018) Dual attention matching network for context-aware feature sequence based person re-identification. In: 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 5363\u20135372. https:\/\/doi.org\/10.1109\/CVPR.2018.00562","DOI":"10.1109\/CVPR.2018.00562"},{"key":"1474_CR29","doi-asserted-by":"publisher","unstructured":"Li J, Zhang S, Huang T (2019) Multi-scale 3d convolution network for video based person re-identification. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 8618\u20138625. https:\/\/doi.org\/10.1609\/aaai.v33i01.33018618","DOI":"10.1609\/aaai.v33i01.33018618"},{"key":"1474_CR30","doi-asserted-by":"publisher","unstructured":"Fu Y, Wang X, Wei Y, Huang T (2019) Sta: spatial-temporal attention for large-scale video-based person re-identification. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 8287\u20138294. https:\/\/doi.org\/10.1609\/aaai.v33i01.33018287","DOI":"10.1609\/aaai.v33i01.33018287"},{"key":"1474_CR31","doi-asserted-by":"publisher","unstructured":"Zhou Z, Huang Y, Wang W, Wang L, Tan T (2017) See the forest for the trees: joint spatial and temporal recurrent neural networks for video-based person re-identification. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition, pp. 6776\u20136785. 
https:\/\/doi.org\/10.1109\/CVPR.2017.717","DOI":"10.1109\/CVPR.2017.717"},{"key":"1474_CR32","doi-asserted-by":"publisher","unstructured":"Li S, Bak S, Carr P, Wang X (2018) Diversity regularized spatiotemporal attention for video-based person re-identification. In: 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 369\u2013378. https:\/\/doi.org\/10.1109\/CVPR.2018.00046","DOI":"10.1109\/CVPR.2018.00046"},{"issue":"12","key":"1474_CR33","doi-asserted-by":"publisher","first-page":"8776","DOI":"10.1109\/TII.2022.3151766","volume":"18","author":"X Zang","year":"2022","unstructured":"Zang X, Li G, Gao W (2022) Multidirection and multiscale pyramid in transformer for video-based pedestrian retrieval. IEEE Trans Ind Inform 18(12):8776\u20138785. https:\/\/doi.org\/10.1109\/TII.2022.3151766","journal-title":"IEEE Trans Ind Inform"},{"key":"1474_CR34","doi-asserted-by":"publisher","unstructured":"Liu X, Zhang P, Lu H (2023) Video-based person re-identification with long short-term representation learning. In: International Conference on Image and Graphics. Lecture Notes in Computer Science, vol. 14355, pp. 55\u201367. https:\/\/doi.org\/10.1007\/978-3-031-46305-1_5","DOI":"10.1007\/978-3-031-46305-1_5"},{"issue":"4","key":"1474_CR35","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3573203","volume":"19","author":"K Wang","year":"2023","unstructured":"Wang K, Ding C, Pang J, Xu X (2023) Context sensing attention network for video-based person re-identification. ACM Trans Multimed Comput Commun Appl 19(4):1\u201320. https:\/\/doi.org\/10.1145\/3573203","journal-title":"ACM Trans Multimed Comput Commun Appl"},{"key":"1474_CR36","doi-asserted-by":"publisher","unstructured":"Liu C-T, Wu C-W, Wang Y-CF, Chien S-Y (2019) Spatially and temporally efficient non-local attention network for video-based person re-identification. https:\/\/doi.org\/10.48550\/arXiv.1908.01683. 
arXiv preprint arXiv:1908.01683","DOI":"10.48550\/arXiv.1908.01683"},{"key":"1474_CR37","doi-asserted-by":"publisher","unstructured":"Yan Y, Qin J, Chen J, Liu L, Zhu F, Tai Y, Shao L (2020) Learning multi-granular hypergraphs for video-based person re-identification. In: 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 2899\u20132908. https:\/\/doi.org\/10.1109\/CVPR42600.2020.00297","DOI":"10.1109\/CVPR42600.2020.00297"},{"key":"1474_CR38","doi-asserted-by":"publisher","unstructured":"Yang J, Zheng W-S, Yang Q, Chen Y-C, Tian Q (2020) Spatial-temporal graph convolutional network for video-based person re-identification. In: 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 3286\u20133296. https:\/\/doi.org\/10.1109\/CVPR42600.2020.00335","DOI":"10.1109\/CVPR42600.2020.00335"},{"key":"1474_CR39","doi-asserted-by":"publisher","first-page":"22","DOI":"10.1016\/j.neunet.2022.12.015","volume":"160","author":"H Pan","year":"2023","unstructured":"Pan H, Chen Y, He Z (2023) Multi-granularity graph pooling for video-based person re-identification. Neural Netw 160:22\u201333. https:\/\/doi.org\/10.1016\/j.neunet.2022.12.015","journal-title":"Neural Netw"},{"key":"1474_CR40","doi-asserted-by":"publisher","unstructured":"Simonyan K, Zisserman A (2014) Two-stream convolutional networks for action recognition in videos. Adv Neural Inform Process Syst 27: 568\u2013576. https:\/\/doi.org\/10.48550\/arXiv.1406.2199","DOI":"10.48550\/arXiv.1406.2199"},{"key":"1474_CR41","doi-asserted-by":"publisher","unstructured":"Chung D, Tahboub K, Delp EJ (2017) A two stream siamese convolutional neural network for person re-identification. In: 2017 IEEE International Conference on Computer Vision, pp. 1992\u20132000. 
https:\/\/doi.org\/10.1109\/ICCV.2017.218","DOI":"10.1109\/ICCV.2017.218"},{"key":"1474_CR42","doi-asserted-by":"publisher","unstructured":"Feichtenhofer C, Pinz A, Wildes RP (2017) Spatiotemporal multiplier networks for video action recognition. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition, pp. 7445\u20137454. https:\/\/doi.org\/10.1109\/CVPR.2017.787","DOI":"10.1109\/CVPR.2017.787"},{"key":"1474_CR43","doi-asserted-by":"publisher","unstructured":"McLaughlin N, Rincon J, Miller P (2016) Recurrent convolutional network for video-based person re-identification. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1325\u20131334. https:\/\/doi.org\/10.1109\/CVPR.2016.148","DOI":"10.1109\/CVPR.2016.148"},{"key":"1474_CR44","doi-asserted-by":"publisher","unstructured":"Liu Y, Yuan Z, Zhou W, Li H (2019) Spatial and temporal mutual promotion for video-based person re-identification. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 8786\u20138793. https:\/\/doi.org\/10.1609\/aaai.v33i01.33018786","DOI":"10.1609\/aaai.v33i01.33018786"},{"key":"1474_CR45","doi-asserted-by":"publisher","unstructured":"Gu X, Chang H, Ma B, Zhang H, Chen X (2020) Appearance-preserving 3d convolution for video-based person re-identification. In: Vedaldi A, Bischof H, Brox T, Frahm J-M (eds) Computer Vision \u2013 ECCV 2020. Lecture Notes in Computer Science, vol. 12347, pp. 228\u2013243. https:\/\/doi.org\/10.1007\/978-3-030-58536-5_14","DOI":"10.1007\/978-3-030-58536-5_14"},{"key":"1474_CR46","doi-asserted-by":"publisher","unstructured":"He S, Luo H, Wang P, Wang F, Li H, Jiang W (2021) Transreid: transformer-based object re-identification. In: 2021 IEEE\/CVF International Conference on Computer Vision, pp. 14993\u201315002. 
https:\/\/doi.org\/10.1109\/ICCV48922.2021.01474","DOI":"10.1109\/ICCV48922.2021.01474"},{"key":"1474_CR47","doi-asserted-by":"publisher","unstructured":"Zhang G, Zhang Y, Zhang T, Li B, Pu S (2023) Pha: Patch-wise high-frequency augmentation for transformer-based person re-identification. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 14133\u201314142. https:\/\/doi.org\/10.1109\/CVPR52729.2023.01358","DOI":"10.1109\/CVPR52729.2023.01358"},{"key":"1474_CR48","doi-asserted-by":"publisher","first-page":"7917","DOI":"10.1109\/TMM.2022.3231103","volume":"25","author":"Z Tang","year":"2023","unstructured":"Tang Z, Zhang R, Peng Z, Chen J, Lin L (2023) Multi-stage spatio-temporal aggregation transformer for video person re-identification. IEEE Trans Multimed 25:7917\u20137929. https:\/\/doi.org\/10.1109\/TMM.2022.3231103","journal-title":"IEEE Trans Multimed"},{"key":"1474_CR49","doi-asserted-by":"publisher","unstructured":"Yu C, Liu X, Wang Y, Zhang P, Lu H (2023) Tf-clip: Learning text-free clip for video-based person re-identification (2023). https:\/\/doi.org\/10.48550\/arXiv.2312.09627. arXiv preprint arXiv:2312.09627","DOI":"10.48550\/arXiv.2312.09627"},{"key":"1474_CR50","doi-asserted-by":"publisher","unstructured":"Bai S, Ma B, Chang H, Huang R, Chen X (2022) Salient-to-broad transition for video person re-identification. In: 2022 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 7329\u20137338. https:\/\/doi.org\/10.1109\/CVPR52688.2022.00719","DOI":"10.1109\/CVPR52688.2022.00719"},{"key":"1474_CR51","doi-asserted-by":"publisher","unstructured":"Hou R, Chang H, Ma B, Shan S, Chen X (2020) Temporal complementary learning for video person re-identification. In: Computer Vision\u2013ECCV 2020. Lecture Notes in Computer Science, vol. 12370, pp. 388\u2013405. 
https:\/\/doi.org\/10.1007\/978-3-030-58595-2_24","DOI":"10.1007\/978-3-030-58595-2_24"},{"key":"1474_CR52","doi-asserted-by":"publisher","unstructured":"Chen D, Li H, Xiao T, Yi S, Wang X (2019) Video person re-identification with competitive snippet-similarity aggregation and co-attentive snippet embedding. In: 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 1169\u20131178. https:\/\/doi.org\/10.1109\/CVPR.2018.00128","DOI":"10.1109\/CVPR.2018.00128"},{"key":"1474_CR53","doi-asserted-by":"publisher","unstructured":"Hou R, Ma B, Chang H, Gu X, Shan S, Chen X (2019) Vrstc: Occlusion-free video person re-identification. In: 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 7176\u20137185. https:\/\/doi.org\/10.1109\/CVPR.2019.00735","DOI":"10.1109\/CVPR.2019.00735"},{"key":"1474_CR54","doi-asserted-by":"publisher","unstructured":"Kim M, Cho M, Lee S (2023) Feature disentanglement learning with switching and aggregation for video-based person re-identification. In: 2023 IEEE\/CVF Winter Conference on Applications of Computer Vision, pp. 1603\u20131612. https:\/\/doi.org\/10.1109\/WACV56688.2023.00165","DOI":"10.1109\/WACV56688.2023.00165"},{"key":"1474_CR55","doi-asserted-by":"publisher","unstructured":"Huang Y, Zha Z-J, Fu X, Hong R, Li L (2020) Real-world person re-identification via degradation invariance learning. In: 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 14072\u201314082. https:\/\/doi.org\/10.1109\/CVPR42600.2020.01409","DOI":"10.1109\/CVPR42600.2020.01409"},{"key":"1474_CR56","doi-asserted-by":"publisher","unstructured":"Wang Y, Liao S, Shao L (2020) Surpassing real-world source training data: random 3d characters for generalizable person re-identification. In: Proceedings of the 28th ACM International Conference on Multimedia, pp. 3422\u20133430. 
https:\/\/doi.org\/10.48550\/arXiv.2006.12774","DOI":"10.48550\/arXiv.2006.12774"},{"key":"1474_CR57","doi-asserted-by":"publisher","unstructured":"Hirzer M, Beleznai C, Roth PM, Bischof H (2011) Person re-identification by descriptive and discriminative classification. In: Heyden A, Kahl F (eds) Image analysis. SCIA 2011. Lecture Notes in Computer Science, vol. 6688, pp. 91\u2013102. https:\/\/doi.org\/10.1007\/978-3-642-21227-7_9","DOI":"10.1007\/978-3-642-21227-7_9"},{"key":"1474_CR58","doi-asserted-by":"publisher","unstructured":"Li J, Zhang S, Wang J, Gao W, Tian Q (2019) Global-local temporal representations for video person re-identification. In: 2019 IEEE\/CVF International Conference on Computer Vision, pp. 3958\u20133967. https:\/\/doi.org\/10.1109\/ICCV.2019.00406","DOI":"10.1109\/ICCV.2019.00406"},{"key":"1474_CR59","doi-asserted-by":"publisher","unstructured":"Ristani E, Solera F, Zou R, Cucchiara R, Tomasi C (2016) Performance measures and a data set for multi-target, multi-camera tracking. In: Hua G, J\u00e9gou H (eds) Computer vision\u2013ECCV 2016 Workshops. Lecture Notes in Computer Science, vol. 9914, pp. 17\u201335. https:\/\/doi.org\/10.1007\/978-3-319-48881-3_2","DOI":"10.1007\/978-3-319-48881-3_2"},{"key":"1474_CR60","doi-asserted-by":"publisher","unstructured":"Gou M, Karanam S, Liu W, Camps O, Radke RJ (2017) Dukemtmc4reid: a large-scale multi-camera person re-identification dataset. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 1425\u20131434 (2017). https:\/\/doi.org\/10.1109\/CVPRW.2017.185","DOI":"10.1109\/CVPRW.2017.185"},{"key":"1474_CR61","doi-asserted-by":"publisher","unstructured":"Nguyen H, Nguyen K, Sridharan S, Fookes C (2023) Aerial-ground person re-id. In: 2023 IEEE International Conference on Multimedia and Expo, pp. 2585\u20132590. 
https:\/\/doi.org\/10.1109\/ICME55011.2023.00440","DOI":"10.1109\/ICME55011.2023.00440"},{"key":"1474_CR62","doi-asserted-by":"publisher","unstructured":"Ge Z, Liu S, Wang F, Li Z, Sun J (2021) Yolox: exceeding yolo series in 2021. https:\/\/doi.org\/10.48550\/arXiv.2107.08430. arXiv preprint arXiv:2107.08430","DOI":"10.48550\/arXiv.2107.08430"},{"key":"1474_CR63","doi-asserted-by":"publisher","first-page":"285","DOI":"10.1016\/j.patcog.2018.11.025","volume":"88","author":"X Xin","year":"2019","unstructured":"Xin X, Wang J, Xie R, Zhou S, Huang W, Zheng N (2019) Semi-supervised person re-identification using multi-view clustering. Pattern Recogn 88:285\u2013297. https:\/\/doi.org\/10.1016\/j.patcog.2018.11.025","journal-title":"Pattern Recogn"},{"key":"1474_CR64","doi-asserted-by":"publisher","unstructured":"Ye M, Lan X, Yuen PC (2018) Robust anchor embedding for unsupervised video person re-identification in the wild. In: Computer Vision \u2013 ECCV 2018. Lecture Notes in Computer Science, pp. 176\u2013193. https:\/\/doi.org\/10.1007\/978-3-030-01234-2_11","DOI":"10.1007\/978-3-030-01234-2_11"},{"key":"1474_CR65","doi-asserted-by":"publisher","unstructured":"Dou Z, Wang Z, Li Y, Wang S (2023) Identity-seeking self-supervised representation learning for generalizable person re-identification. In: Proceedings of the IEEE\/CVF International Conference on Computer Vision, pp. 15847\u201315858. https:\/\/doi.org\/10.1109\/ICCV51070.2023.01452","DOI":"10.1109\/ICCV51070.2023.01452"},{"key":"1474_CR66","doi-asserted-by":"publisher","unstructured":"Choi S, Kim T, Jeong M, Park H, Kim C (2021) Meta batch-instance normalization for generalizable person re-identification. In: 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 3424\u20133434. 
https:\/\/doi.org\/10.1109\/CVPR46437.2021.00343","DOI":"10.1109\/CVPR46437.2021.00343"},{"key":"1474_CR67","doi-asserted-by":"publisher","unstructured":"Mekhazni D, Dufau M, Desrosiers C, Pedersoli M, Granger E (2023) Camera alignment and weighted contrastive learning for domain adaptation in video person reid. In: 2023 IEEE\/CVF Winter Conference on Applications of Computer Vision, pp. 1624\u20131633. https:\/\/doi.org\/10.1109\/WACV56688.2023.00167","DOI":"10.1109\/WACV56688.2023.00167"},{"key":"1474_CR68","doi-asserted-by":"publisher","unstructured":"Zhang S, Yang Q, Cheng D, Xing Y, Liang G, Wang P, Zhang Y (2023) Ground-to-aerial person search: Benchmark dataset and approach. In: Proceedings of the 31st ACM International Conference on Multimedia, pp. 789\u2013799. https:\/\/doi.org\/10.1145\/3581783.3612105","DOI":"10.1145\/3581783.3612105"},{"key":"1474_CR69","doi-asserted-by":"publisher","unstructured":"Arkushin D, Cohen B, Peleg S, Fried O (2024) Geff: improving any clothes-changing person reid model using gallery enrichment with face features. In: Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, pp. 152\u2013162. https:\/\/doi.org\/10.48550\/arXiv.2211.13807","DOI":"10.48550\/arXiv.2211.13807"},{"key":"1474_CR70","doi-asserted-by":"publisher","unstructured":"Wang Y, Xu K, Chai Y, Jiang Y, Qi G (2023) Semantic consistent feature construction and multi-granularity feature learning for visible-infrared person re-identification. Visual Comput:1\u201317. https:\/\/doi.org\/10.1007\/s00371-023-02923-w","DOI":"10.1007\/s00371-023-02923-w"},{"key":"1474_CR71","doi-asserted-by":"publisher","first-page":"3182","DOI":"10.1109\/TIP.2022.3165376","volume":"31","author":"C Liang","year":"2022","unstructured":"Liang C, Zhang Z, Zhou X, Li B, Zhu S, Hu W (2022) Rethinking the competition between detection and reid in multiobject tracking. IEEE Trans Image Process 31:3182\u20133196. 
https:\/\/doi.org\/10.1109\/TIP.2022.3165376","journal-title":"IEEE Trans Image Process"},{"issue":"1","key":"1474_CR72","doi-asserted-by":"publisher","first-page":"547","DOI":"10.1007\/s10489-021-02390-7","volume":"52","author":"Q Liu","year":"2022","unstructured":"Liu Q, Teng Q, Chen H, Li B, Qing L (2022) Dual adaptive alignment and partitioning network for visible and infrared cross-modality person re-identification. Appl Intell 52(1):547\u2013563. https:\/\/doi.org\/10.1007\/s10489-021-02390-7","journal-title":"Appl Intell"},{"issue":"1","key":"1474_CR73","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-019-0197-0","volume":"6","author":"C Shorten","year":"2019","unstructured":"Shorten C, Khoshgoftaar TM (2019) A survey on image data augmentation for deep learning. J Big Data 6(1):1\u201348. https:\/\/doi.org\/10.1186\/s40537-019-0197-0","journal-title":"J Big Data"},{"issue":"5","key":"1474_CR74","doi-asserted-by":"publisher","first-page":"545","DOI":"10.1111\/1754-9485.13261","volume":"65","author":"P Chlap","year":"2021","unstructured":"Chlap P, Min H, Vandenberg N, Dowling J, Holloway L, Haworth A (2021) A review of medical image data augmentation techniques for deep learning applications. J Med Imaging Radiat Oncol 65(5):545\u2013563. https:\/\/doi.org\/10.1111\/1754-9485.13261","journal-title":"J Med Imaging Radiat Oncol"},{"issue":"12","key":"1474_CR75","doi-asserted-by":"publisher","first-page":"254","DOI":"10.3390\/jimaging7120254","volume":"7","author":"L Nanni","year":"2021","unstructured":"Nanni L, Paci M, Brahnam S, Lumini A (2021) Comparison of different image data augmentation approaches. J Imaging 7(12):254. https:\/\/doi.org\/10.3390\/jimaging7120254","journal-title":"J Imaging"},{"key":"1474_CR76","doi-asserted-by":"publisher","unstructured":"McLaughlin N, Del\u00a0Rincon JM, Miller P (2015) Data-augmentation for reducing dataset bias in person re-identification. 
In: 2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance, pp. 1\u20136. https:\/\/doi.org\/10.1109\/AVSS.2015.7301739","DOI":"10.1109\/AVSS.2015.7301739"},{"key":"1474_CR77","doi-asserted-by":"publisher","unstructured":"Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1\u20139. https:\/\/doi.org\/10.1109\/CVPR.2015.7298594","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"1474_CR78","doi-asserted-by":"publisher","unstructured":"Niu K, Huang Y, Ouyang W, Wang L (2020) Improving description-based person re-identification by multi-granularity image-text alignments. IEEE Trans Image Process 29:5542\u20135556. https:\/\/doi.org\/10.1109\/TIP.2020.2984883","DOI":"10.1109\/TIP.2020.2984883"},{"issue":"1","key":"1474_CR79","doi-asserted-by":"publisher","first-page":"108","DOI":"10.1006\/jmps.1999.1279","volume":"44","author":"MW Browne","year":"2000","unstructured":"Browne MW (2000) Cross-validation methods. J Math Psychol 44(1):108\u2013132. https:\/\/doi.org\/10.1006\/jmps.1999.1279","journal-title":"J Math Psychol"},{"key":"1474_CR80","doi-asserted-by":"publisher","unstructured":"Gu X, Ma B, Chang H, Shan S, Chen X (2019) Temporal knowledge propagation for image-to-video person re-identification. In: Proceedings of the IEEE\/CVF International Conference on Computer Vision, pp. 9647\u20139656. https:\/\/doi.org\/10.1109\/ICCV.2019.00974","DOI":"10.1109\/ICCV.2019.00974"},{"key":"1474_CR81","doi-asserted-by":"publisher","first-page":"8821","DOI":"10.1109\/TIP.2020.3001693","volume":"29","author":"Y Wu","year":"2020","unstructured":"Wu Y, Bourahla OEF, Li X, Wu F, Tian Q, Zhou X (2020) Adaptive graph representation learning for video person re-identification. IEEE Trans Image Process 29:8821\u20138830. 
https:\/\/doi.org\/10.1109\/TIP.2020.3001693","journal-title":"IEEE Trans Image Process"},{"key":"1474_CR82","doi-asserted-by":"publisher","unstructured":"Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: a benchmark. In: 2015 IEEE International Conference on Computer Vision, pp. 1116\u20131124. https:\/\/doi.org\/10.1109\/ICCV.2015.133","DOI":"10.1109\/ICCV.2015.133"},{"key":"1474_CR83","doi-asserted-by":"publisher","unstructured":"Wu L, Shen C, Hengel AVD (2016) Deep recurrent convolutional networks for video-based person re-identification: an end-to-end approach. https:\/\/doi.org\/10.48550\/arXiv.1606.01609. arXiv preprint arXiv:1606.01609","DOI":"10.48550\/arXiv.1606.01609"},{"key":"1474_CR84","doi-asserted-by":"publisher","unstructured":"Zhong Z, Zheng L, Kang G, Li S, Yang Y (2020) Random erasing data augmentation. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 13001\u201313008. https:\/\/doi.org\/10.1609\/aaai.v34i07.7000","DOI":"10.1609\/aaai.v34i07.7000"},{"issue":"1","key":"1474_CR85","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1016\/0169-7439(87)80084-9","volume":"2","author":"S Wold","year":"1987","unstructured":"Wold S, Esbensen K, Geladi P (1987) Principal component analysis. Chemomet Intell Lab Syst 2(1):37\u201352. 
https:\/\/doi.org\/10.1016\/0169-7439(87)80084-9","journal-title":"Chemomet Intell Lab Syst"}],"container-title":["Complex & Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-024-01474-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-024-01474-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-024-01474-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,17]],"date-time":"2024-07-17T17:30:13Z","timestamp":1721237413000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-024-01474-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,5,23]]},"references-count":85,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,8]]}},"alternative-id":["1474"],"URL":"https:\/\/doi.org\/10.1007\/s40747-024-01474-4","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"type":"print","value":"2199-4536"},{"type":"electronic","value":"2198-6053"}],"subject":[],"published":{"date-parts":[[2024,5,23]]},"assertion":[{"value":"13 December 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 March 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 May 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no known competing financial interests or personal 
relationships that could have appeared to influence the work reported in this paper.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}