{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,12]],"date-time":"2026-02-12T17:28:31Z","timestamp":1770917311700,"version":"3.50.1"},"reference-count":52,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2022,3,5]],"date-time":"2022-03-05T00:00:00Z","timestamp":1646438400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,3,5]],"date-time":"2022-03-05T00:00:00Z","timestamp":1646438400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61603233"],"award-info":[{"award-number":["61603233"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2022,10]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Vehicle re-identification (ReID) means to identify the target vehicle in large-scale surveillance videos captured by multiple cameras, where robust and distinctive visual features of vehicles are critical to the performance. Recently, the researchers have approached the problem with attention based models. However, most of these models use strongly-supervised methods, which rely on expensive extra labels, e.g., keypoints(vehicle wheels , logo and lamps) and attributes(e.g., color and type). Therefore, we propose a joint metric learning approach to solve the problem. We present an end-to-end Partition and Fusion Multi-branch Network (PFMN), a novel approach to effectively learn discriminative features without any annotations or additional attributes. For hard samples, which means different vehicles with similar appearance or the same vehicle with different appearances, a novel variant of hard sampling triplet loss is proposed. Based on extensive experiments, we have proved the effectiveness of our proposed method. On the challenging public data sets VeRi-776 and VehicleID, our model outperforms most state-of-the-art algorithms on mAP and rank-1. Especially on mINP, which measures the cost of model retrieval hard samples, we can achieve a significant improvement.<\/jats:p>","DOI":"10.1007\/s40747-022-00692-y","type":"journal-article","created":{"date-parts":[[2022,3,5]],"date-time":"2022-03-05T09:02:42Z","timestamp":1646470962000},"page":"4005-4020","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":15,"title":["Joint metric learning of local and global features for vehicle re-identification"],"prefix":"10.1007","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6563-9206","authenticated-orcid":false,"given":"Junge","family":"Shen","sequence":"first","affiliation":[]},{"given":"Jian","family":"Sun","sequence":"additional","affiliation":[]},{"given":"Xin","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Zhaoyong","family":"Mao","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,3,5]]},"reference":[{"issue":"4","key":"692_CR1","doi-asserted-by":"publisher","first-page":"1624","DOI":"10.1109\/TITS.2011.2158001","volume":"12","author":"J Zhang","year":"2011","unstructured":"Zhang J, Wang FY, Wang K, Lin WH, Xu X, Chen C (2011) Data-driven intelligent transportation systems: a survey. IEEE Trans Intell Transp Syst 12(4):1624\u20131639","journal-title":"IEEE Trans Intell Transp Syst"},{"key":"692_CR2","doi-asserted-by":"crossref","unstructured":"Wang Z, Li X, Zhu X et al (2021) Big data-driven public transportation network: a simulation approach. Complex Intell Syst, 1-13","DOI":"10.1007\/s40747-021-00462-2"},{"key":"692_CR3","doi-asserted-by":"crossref","unstructured":"Alsufyani A, Alotaibi Y, Almagrabi A.O et al.(2021) Optimized intelligent data management framework for a cyber-physical system for computational applications. Complex Intell Syst, 1-13","DOI":"10.1007\/s40747-021-00511-w"},{"key":"692_CR4","doi-asserted-by":"crossref","unstructured":"FERENC Z A, LEARNED-MILLER E G, MALIK J (2005) Building a classification cascade for visual identification from one example. In: The IEEE International Conference on Computer Vision, pp 286-293","DOI":"10.1109\/ICCV.2005.52"},{"key":"692_CR5","doi-asserted-by":"crossref","unstructured":"LIU X, LIU W, MEI T, et al (2016) A deep learning-based approach to progressive vehicle re-identification for urban surveillance. In: The European Conference on Computer Vision. Heidelberg: Springer, pp 869-884","DOI":"10.1007\/978-3-319-46475-6_53"},{"key":"692_CR6","doi-asserted-by":"crossref","unstructured":"Liu X, Liu W, Ma H, et al. (2016) Large-scale vehicle re-identification in urban surveillance videos[C]\/\/2016 IEEE International Conference on Multimedia and Expo (ICME). IEEE, pp 1-6","DOI":"10.1109\/ICME.2016.7553002"},{"key":"692_CR7","doi-asserted-by":"crossref","unstructured":"Zhang N, Ju Z, Yang C et al (2021) Special issue on interpretation of deep learning: prediction, representation, quantification and visualization. Complex Intell Syst, 1-3","DOI":"10.1007\/s40747-021-00539-y"},{"key":"692_CR8","doi-asserted-by":"crossref","unstructured":"Priya S, Uthra RA (2021) Deep learning framework for handling concept drift and class imbalanced complex decision-making on streaming data. Complex Intell Syst, 1-17","DOI":"10.1007\/s40747-021-00456-0"},{"key":"692_CR9","doi-asserted-by":"crossref","unstructured":"Xia Y, Zhang J, Jiang T et al.(2021) HatchEnsemble: an efficient and practical uncertainty quantification method for deep neural networks. Complex Intell Syst, 1-15","DOI":"10.1007\/s40747-021-00463-1"},{"key":"692_CR10","doi-asserted-by":"publisher","first-page":"221","DOI":"10.1007\/s40747-016-0024-6","volume":"2","author":"A Anuse","year":"2016","unstructured":"Anuse A, Vyas V (2016) A novel training algorithm for convolutional neural network. Complex Intell Syst 2:221\u2013234","journal-title":"Complex Intell Syst"},{"key":"692_CR11","doi-asserted-by":"crossref","unstructured":"Meng D , Li L , Liu X , et al (2020) Parsing-Based View-Aware Embedding Network for Vehicle Re-Identification. In: IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 7101-7110","DOI":"10.1109\/CVPR42600.2020.00713"},{"key":"692_CR12","doi-asserted-by":"crossref","unstructured":"Wang Z , Tang L , Liu X , et al (2017) Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-identification. In: IEEE International Conference on Computer Vision (ICCV). IEEE, pp 379-387","DOI":"10.1109\/ICCV.2017.49"},{"key":"692_CR13","doi-asserted-by":"crossref","unstructured":"Pasupa K, Kittiworapanya P, Hongngern N et al (2021) Evaluation of deep learning algorithms for semantic segmentation of car parts. Complex Intell Syst, 1-13","DOI":"10.1007\/s40747-021-00397-8"},{"key":"692_CR14","doi-asserted-by":"crossref","unstructured":"Saleem S, Amin J, Sharif M et al (2021) A deep network designed for segmentation and classification of leukemia using fusion of the transfer learning models. Complex Intell Syst, 1-16","DOI":"10.1007\/s40747-021-00473-z"},{"key":"692_CR15","unstructured":"Hermans A, Lucas B, Bastian L (2017) In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737"},{"key":"692_CR16","unstructured":"Zhun Z, Liang Z, Donglin C, Shaozi L (2017) Re-ranking person re-identification with k-reciprocal encoding. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 3652-3661"},{"key":"692_CR17","doi-asserted-by":"crossref","unstructured":"Liu H, Tian Y, Yang Y, Pang L and Huang T (2016) Deep relative distance learning: Tell the difference between similar vehicles. In: Proceedings of the Conference on Computer Vision and Pattern Recognition (Piscataway, NJ: IEEE), pp 2167-2175","DOI":"10.1109\/CVPR.2016.238"},{"key":"692_CR18","doi-asserted-by":"crossref","unstructured":"Tang Z, Naphade M, Liu M, Yang X, Birchfield S, Wang S, Kumar R,Anastasiu D.C, Hwang J (2019) Cityflow: A city-scale benchmark for multi-target multi-camera vehicle tracking and re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR, pp 8797-8806","DOI":"10.1109\/CVPR.2019.00900"},{"key":"692_CR19","doi-asserted-by":"crossref","unstructured":"Yang L, Luo P, Loy C.C, Tang X (2015) A large-scale car dataset for fine-grained categorization and verification. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR, pp 3973-3981","DOI":"10.1109\/CVPR.2015.7299023"},{"key":"692_CR20","doi-asserted-by":"crossref","unstructured":"Liu X, Liu W, Mei T (2018) PROVID: progressive and multimodal vehicle reidentification for large-scale urban surveillance. In: IEEE Transactions on Multimedia, pp 645-658","DOI":"10.1109\/TMM.2017.2751966"},{"key":"692_CR21","unstructured":"Aihua Z, Xianmin L, Chenglong L, Ran H, Jin T (2019) Attributes guided feature learning for vehicle re-identification"},{"key":"692_CR22","doi-asserted-by":"crossref","unstructured":"Yan K, Tian Y, Wang Y (2017) Exploiting multi-grain ranking constraints for precisely searching visually-similar vehicles. In: IEEE International Conference on Computer Vision, pp 562-570","DOI":"10.1109\/ICCV.2017.68"},{"key":"692_CR23","unstructured":"Bing H, Jia L, Yifan Z, Yonghong T (2019) Partregularized near-duplicate vehicle re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 3997\u20134005"},{"key":"692_CR24","doi-asserted-by":"crossref","unstructured":"Liu H, Feng J, Qi M, Jiang J, Yan S(2017) End-to-end comparative attention networks for person re-identification.\u2019 IEEE Transactions on Image Processing, , pp 3492-3506","DOI":"10.1109\/TIP.2017.2700762"},{"key":"692_CR25","doi-asserted-by":"crossref","unstructured":"Liu H, Tian Y, Yang Y (2016) Deep relative distance learning: tell the difference between similar vehicles. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 2167\u20132175","DOI":"10.1109\/CVPR.2016.238"},{"key":"692_CR26","doi-asserted-by":"crossref","unstructured":"GUO H, ZHAO C, LIU Z (2018) Learning coarse-to-fine structured feature embedding for vehicle re-identification. In: The 3nd AAAI Conference on Artificial Intelligence, pp 6853\u20136860","DOI":"10.1609\/aaai.v32i1.12237"},{"key":"692_CR27","doi-asserted-by":"crossref","unstructured":"Bai Y, Lou Y, Gao F (2018) Group-sensitive triplet embedding for vehicle re-identification. In: IEEE Transactions on Multimedia, pp 2385\u20132399","DOI":"10.1109\/TMM.2018.2796240"},{"key":"692_CR28","doi-asserted-by":"crossref","unstructured":"Zheng Z, Ruan T, Wei Y, Yang Y, Mei T(2020) VehicleNet: Learning Robust Visual Representation for Vehicle Re-identification. IEEE Transactions on Multimedia","DOI":"10.1109\/TMM.2020.3014488"},{"key":"692_CR29","doi-asserted-by":"crossref","unstructured":"Wang G, Yuan Y, Chen X, et al (2018) Learning discriminative features with multiple granularities for person re-identification. In: The 2018 ACM Multimedia Conference on Multimedia Conference, pp 274\u2013282","DOI":"10.1145\/3240508.3240552"},{"key":"692_CR30","doi-asserted-by":"crossref","unstructured":"Luo H, Gu Y, Liao X, Lai S, Jiang W (2019) Bag of tricks and a strong baseline for deep person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp 1487\u20131495","DOI":"10.1109\/CVPRW.2019.00190"},{"key":"692_CR31","doi-asserted-by":"crossref","unstructured":"Yifan S, Liang Z, Yi Y, Qi T, Shengjin W (2018) Beyond part models: Person retrieval with refined part pooling (and A strong convolutional baseline). In ECCV, pp 501\u2013518","DOI":"10.1007\/978-3-030-01225-0_30"},{"key":"692_CR32","doi-asserted-by":"crossref","unstructured":"Wang G, Yuan Y, Li J, Ge S, Zhou X(2020) Receptive Multi-Granularity Representation for Person Re-Identification. In: IEEE Transactions on Image Processing, pp 6096\u20136109","DOI":"10.1109\/TIP.2020.2986878"},{"key":"692_CR33","unstructured":"Kaiming H, Zhang X, Ren S, Sun J (2016) Deep Residual Learning for Image Recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 770\u2013778"},{"key":"692_CR34","unstructured":"Maxim B et al (2019) Multigrain: a unified image embedding for classes and instances. arXiv preprint arXiv:1902.05509"},{"key":"692_CR35","doi-asserted-by":"crossref","unstructured":"Rosasco L, De Vito E, Caponnetto A, Piana M, Verri A.(2004) Are loss functions all the same? Neural Computation,pp 1063\u20131076","DOI":"10.1162\/089976604773135104"},{"key":"692_CR36","doi-asserted-by":"crossref","unstructured":"Ye M, Shen J, Lin G, Xiang T, Shao L, Hoi SCH (2021) Deep Learning for Person Re-identification: A Survey and Outlook. IEEE Trans Pattern Anal Mach Intell","DOI":"10.1109\/TPAMI.2021.3054775"},{"key":"692_CR37","doi-asserted-by":"crossref","unstructured":"- Liao S, Hu Y, Zhu X, Li S Z (2015) Person re-identification by local maximal occurrence representation and metric learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2197\u20132206","DOI":"10.1109\/CVPR.2015.7298832"},{"key":"692_CR38","doi-asserted-by":"crossref","unstructured":"Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: A benchmark. In: Proceedings of the IEEE International Conference on Computer Vision, pp 1116\u20131124","DOI":"10.1109\/ICCV.2015.133"},{"key":"692_CR39","doi-asserted-by":"crossref","unstructured":"Yang L, Luo P, Change Loy C, Tang X (2015) A large-scale car dataset for fine-grained categorization and verification. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp 3973\u20133981","DOI":"10.1109\/CVPR.2015.7299023"},{"key":"692_CR40","doi-asserted-by":"crossref","unstructured":"Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 4700\u20134708","DOI":"10.1109\/CVPR.2017.243"},{"key":"692_CR41","doi-asserted-by":"crossref","unstructured":"Shen Y, Xiao T, Li H, Yi S, Wang X (2017) Learning deep neural networks for vehicle re-ID with visual-spatio-temporal path proposals. In: Proceedings of the IEEE International Conference on Computer Vision, pp 1900-1909","DOI":"10.1109\/ICCV.2017.210"},{"key":"692_CR42","doi-asserted-by":"crossref","unstructured":"Zhou Y, Shao L (2018) Aware attentive multi-view inference for vehicle re-identification. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp 6489\u20136498","DOI":"10.1109\/CVPR.2018.00679"},{"key":"692_CR43","doi-asserted-by":"crossref","unstructured":"Guo H, Zhu K, Tang M, Wang J (2019) Two-level attention network with multi-grain ranking loss for vehicle re-identification. IEEE Trans. Image Process, pp 4328\u20134338","DOI":"10.1109\/TIP.2019.2910408"},{"key":"692_CR44","doi-asserted-by":"crossref","unstructured":"He B, Li J, Zhao Y, Tian Y (2019) Part-regularized near-duplicate vehicle re-identification. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp 3997\u20134005","DOI":"10.1109\/CVPR.2019.00412"},{"key":"692_CR45","unstructured":"Zhu J, Zeng H, Huang J, Liao S, Lei Z, Cai C, Zheng L (2019) Vehicle re-identification using quadruple directional deep learning features. IEEE Trans Intell Transp Syst, pp 2410\u20132420"},{"key":"692_CR46","doi-asserted-by":"crossref","unstructured":"Khorramshahi P, Kumar A, Peri N, Rambhatla SS, Chen JC, Chellappa R (2019) A dual-path model with adaptive attention for vehicle re-identification. In: The IEEE International Conference on Computer Vision (ICCV), pp 6131\u20136140","DOI":"10.1109\/ICCV.2019.00623"},{"key":"692_CR47","unstructured":"Chen H, Lagadec B, Bremond F (2019) A Two-Branch Neural Network for Vehicle Re-identification. In: CVPR Workshops, Partition and Reunion, pp 184\u2013192"},{"key":"692_CR48","doi-asserted-by":"crossref","unstructured":"Yao Y, Zheng L, Yang X, Naphade M, Gedeon T (2020) Simulating content consistent vehicle datasets with attribute descent. In: Computer Vision-ECCV 2020. Springer International Publishing, Part VI 16:775\u2013791","DOI":"10.1007\/978-3-030-58539-6_46"},{"key":"692_CR49","doi-asserted-by":"crossref","unstructured":"Zhedong Z, Zheng L, Yang Y (2017) A discriminatively learned cnn embedding for person reidentification. In: ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 14.1, pp 1-20","DOI":"10.1145\/3159171"},{"key":"692_CR50","doi-asserted-by":"crossref","unstructured":"Chen TS, Lee MY, Liu CT, Chien SY (2020) Aware Channel-Wise Attentive Network for Vehicle Re-Identification. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 574-575","DOI":"10.1109\/CVPRW50498.2020.00295"},{"key":"692_CR51","doi-asserted-by":"crossref","unstructured":"Khorramshahi P, Peri N, Chen JC, Chellappa R. (2020) The devil is in the details: Self-supervised attention for vehicle re-identification. In: European Conference on Computer Vision, Springer, Cham, pp 369-386","DOI":"10.1007\/978-3-030-58568-6_22"},{"key":"692_CR52","unstructured":"Ramprasaath RS, Michael C, Abhishek D, Ramakrishna V, Devi P, Dhruv B (2017) Grad-cam: Visual explanations from deep networks via gradient-based localization. In: ICCV, pp 618-626"}],"updated-by":[{"DOI":"10.1007\/s40747-022-00768-9","type":"correction","label":"Correction","source":"publisher","updated":{"date-parts":[[2022,5,23]],"date-time":"2022-05-23T00:00:00Z","timestamp":1653264000000}}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-022-00692-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-022-00692-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-022-00692-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,9,27]],"date-time":"2022-09-27T13:49:59Z","timestamp":1664286599000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-022-00692-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,3,5]]},"references-count":52,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,10]]}},"alternative-id":["692"],"URL":"https:\/\/doi.org\/10.1007\/s40747-022-00692-y","relation":{"correction":[{"id-type":"doi","id":"10.1007\/s40747-022-00768-9","asserted-by":"object"}]},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"value":"2199-4536","type":"print"},{"value":"2198-6053","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,3,5]]},"assertion":[{"value":"6 October 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 February 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 March 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 May 2022","order":4,"name":"change_date","label":"Change Date","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Correction","order":5,"name":"change_type","label":"Change Type","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"A Correction to this paper has been published:","order":6,"name":"change_details","label":"Change Details","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"https:\/\/doi.org\/10.1007\/s40747-022-00768-9","URL":"https:\/\/doi.org\/10.1007\/s40747-022-00768-9","order":7,"name":"change_details","label":"Change Details","group":{"name":"ArticleHistory","label":"Article History"}}]}}