{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,8]],"date-time":"2026-05-08T07:12:22Z","timestamp":1778224342975,"version":"3.51.4"},"reference-count":54,"publisher":"Springer Science and Business Media LLC","issue":"6","license":[{"start":{"date-parts":[[2022,5,13]],"date-time":"2022-05-13T00:00:00Z","timestamp":1652400000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,5,13]],"date-time":"2022-05-13T00:00:00Z","timestamp":1652400000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61902301"],"award-info":[{"award-number":["61902301"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100010030","name":"China National Textile and Apparel Council","doi-asserted-by":"publisher","award":["No.2018097"],"award-info":[{"award-number":["No.2018097"]}],"id":[{"id":"10.13039\/501100010030","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Science and Technology Plan Project of Shaanxi Province","award":["2022JM-146"],"award-info":[{"award-number":["2022JM-146"]}]},{"DOI":"10.13039\/501100009103","name":"Shaanxi Provincial Education Department","doi-asserted-by":"crossref","award":["19JK036418JK0334"],"award-info":[{"award-number":["19JK036418JK0334"]}],"id":[{"id":"10.13039\/501100009103","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. 
Syst."],"published-print":{"date-parts":[[2022,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The paper summarizes the research progress on critical region recognition and deep metric learning to achieve accurate clothing image retrieval in cross-domain situations. Critical region recognition is of great value for the clothing feature extraction, effectively improving retrieval accuracy. The accuracy will decrease when solving difficult samples with similar features but different categories. Nowadays, deep metric learning is an effective way to solve this problem, which utilizes the optimization of different loss functions and ensemble network to strengthen the discrimination of clothing features. Therefore, through comparison of the experimental results of different algorithms and analysis of the accuracy of cross-domain clothing retrieval, it is demonstrated that the improvement of the retrieval accuracy in the future mainly depends on clothing important feature extraction and clothing feature discrimination.<\/jats:p>","DOI":"10.1007\/s40747-022-00750-5","type":"journal-article","created":{"date-parts":[[2022,5,13]],"date-time":"2022-05-13T06:02:57Z","timestamp":1652421777000},"page":"5531-5544","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Survey on clothing image retrieval with 
cross-domain"],"prefix":"10.1007","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0056-0337","authenticated-orcid":false,"given":"Chen","family":"Ning","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yang","family":"Di","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Li","family":"Menglu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2022,5,13]]},"reference":[{"key":"750_CR1","unstructured":"Korea Federation of Textile Industries (2019) Korea fashion market trend 2019 Report; Korea Federation of Textile Industries: Seoul, Korea"},{"key":"750_CR2","unstructured":"Korea Fashion Association (2019) Global fashion industry survey. Seoul, Korea, Korea Fashion Association"},{"key":"750_CR3","unstructured":"Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. CoRR. arXiv:1409.1556"},{"key":"750_CR4","unstructured":"Lin M, Chen Q, Yan S (2013) Network in network. arXiv:1312.4400"},{"key":"750_CR5","unstructured":"Chen L, Papandreou G, Schroff F, Adam H (2017) Rethinking atrous convolution for semantic image segmentation. CoRR. arXiv:1706.05587"},{"key":"750_CR6","doi-asserted-by":"crossref","unstructured":"He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770\u2013778","DOI":"10.1109\/CVPR.2016.90"},{"key":"750_CR7","first-page":"3343","volume":"2015","author":"MH Kiapour","year":"2015","unstructured":"Kiapour MH, Han X, Lazebnik S (2015) Where to buy it: matching street clothing photos in online shops. 
IEEE Int Conf Comput Vis 2015:3343\u20133351","journal-title":"IEEE Int Conf Comput Vis"},{"key":"750_CR8","unstructured":"Sande KEA\u00a0van\u00a0de, Uijlings JRR, Gevers T, Smeulders AWM (2011) Segmentation as selective search for object recognition. In: ICCV"},{"key":"750_CR9","doi-asserted-by":"crossref","unstructured":"Chen Q, Huang J, Feris R, Brown L, Dong J, Yan S (2015) Deep domain adaptation for describing people based on fine-grained clothing attributes. In: CVPR","DOI":"10.1109\/CVPR.2015.7299169"},{"key":"750_CR10","doi-asserted-by":"crossref","unstructured":"Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: CVPR","DOI":"10.1109\/CVPR.2014.81"},{"key":"750_CR11","doi-asserted-by":"crossref","unstructured":"Huang J, Feris RS, Chen Q (2015) Cross-domain image retrieval with a dual attribute-aware ranking network. In: Proceedings of 2015 international conference on computer vision, pp 1062\u20131070","DOI":"10.1109\/ICCV.2015.127"},{"key":"750_CR12","first-page":"5","volume":"2020","author":"C Ning","year":"2020","unstructured":"Ning C, Menglu L, Hao Y, Xueping S, Yunhong L (2020) Survey of pedestrian detection with occlusion. Compl Intell Syst 2020:5","journal-title":"Compl Intell Syst"},{"key":"750_CR13","first-page":"38","volume":"2012","author":"M Eichner","year":"2012","unstructured":"Eichner M, Ferrar V (2012) Appearance sharing for collective human pose estimation. Comput Vis ACCV 2012:38\u2013151","journal-title":"Comput Vis ACCV"},{"key":"750_CR14","doi-asserted-by":"crossref","unstructured":"Chen H, Andrew G, Bernd G (2012) Describing clothing by semantic attributes. In: Proceedings of the 12th European conference on computer vision, pp 609\u2013623","DOI":"10.1007\/978-3-642-33712-3_44"},{"key":"750_CR15","doi-asserted-by":"crossref","unstructured":"Chen K, Luo T, Jai B (2017) When fashion meets big data: discriminative mining of best selling clothing features. 
In: Proceedings of the 26th international conference on world wide web companion, pp 15\u201322","DOI":"10.1145\/3041021.3054141"},{"key":"750_CR16","first-page":"1627","volume":"2010","author":"FF Pedro","year":"2010","unstructured":"Pedro FF, Ross BG, David AM (2010) Object detection with discriminatively trained part-based models. IEEE Trans 2010:1627\u20131645","journal-title":"IEEE Trans"},{"key":"750_CR17","first-page":"2","volume":"57","author":"P Viola","year":"2001","unstructured":"Viola P, Jones M (2001) Robust real-time object detection. Int J Comput Vis 57:2","journal-title":"Int J Comput Vis"},{"issue":"3","key":"750_CR18","first-page":"309","volume":"23","author":"R Carten","year":"2014","unstructured":"Carten R, Vladimir K, Aadrew B (2014) \u201cGrabCut\u2019\u2019: interactive foreground extraction using iterated graph cuts. ACM Trans 23(3):309\u2013314","journal-title":"ACM Trans"},{"key":"750_CR19","first-page":"1096","volume":"2016","author":"Z Liu","year":"2016","unstructured":"Liu Z, Luo P, Qiu S, Wang X, Tang X (2016) Deepfashion: powering robust clothes recognition and retrieval with rich annotations. Comput Vis Pattern Recogn 2016:1096\u20131104","journal-title":"Comput Vis Pattern Recogn"},{"key":"750_CR20","doi-asserted-by":"crossref","unstructured":"Liu Z, Yan S, Lou P (2016) Fashion landmark detection in the wild. In: proceedings of the 14th European conference of computer vision (ECCV), pp 229\u2013245","DOI":"10.1007\/978-3-319-46475-6_15"},{"key":"750_CR21","doi-asserted-by":"crossref","unstructured":"Ge Y, Zhang R, Wu L (2019) DeepFashion2: a versatile benchmark for detection,pose estimation,segmentation and re-identification of clothing images. In: proceedings of the 2019 conference on computer vision and pattern recognition, pp 5337\u20135345","DOI":"10.1109\/CVPR.2019.00548"},{"key":"750_CR22","doi-asserted-by":"crossref","unstructured":"Ji X, Wang W, Liu MH, Yang Y (2017) Cross-domain image retrieval with attention modeling. 
In: Proceedings ACM on multimedia conference, ACM, pp 1654\u20131662","DOI":"10.1145\/3123266.3123429"},{"key":"750_CR23","first-page":"5","volume":"2017","author":"Z Wang","year":"2017","unstructured":"Wang Z, Gu Y, Zhang Y, Zhou J, Gu X (2017) Clothing retrieval with visual attention model. IEEE Vis Commun Image Process 2017:5","journal-title":"IEEE Vis Commun Image Process"},{"key":"750_CR24","first-page":"640","volume":"2015","author":"J Long","year":"2015","unstructured":"Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. IEEE Trans Pattern Anal Mach Intell 2015:640","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"750_CR25","doi-asserted-by":"crossref","unstructured":"Zheng Y, Huang D, Liu S, Wang Y (2020) Cross-domain object detection through coarse-to-fine feature adaptation. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pp 13766\u201313775","DOI":"10.1109\/CVPR42600.2020.01378"},{"key":"750_CR26","doi-asserted-by":"crossref","unstructured":"Luo Z,Yuan J, Yang J, Wen W (2019) Spatial constraint multiple granularity attention network for clothes retrieval. In: 2019 IEEE international conference on image processing (ICIP), IEEE, pp 859\u2013863","DOI":"10.1109\/ICIP.2019.8802938"},{"key":"750_CR27","unstructured":"Xu K, Ba J, Kiros R, Cho K, Courville AC, Salakhutdinov R, Zemel RS, Bengio Y (2015) Show, attend and tell: Neural image caption generation with visual attention. In: Proceedings of the 32nd international conference on machine learning, ICML 2015, Lille, France, pp 2048\u20132057"},{"key":"750_CR28","first-page":"5","volume":"2019","author":"Y Luo","year":"2019","unstructured":"Luo Y, Wang Z, Huang Z, Yang Y, Lu H (2019) Snap and find: deep discrete cross-domain garment image retrieval. 
IEEE Trans Image Process 2019:5","journal-title":"IEEE Trans Image Process"},{"key":"750_CR29","doi-asserted-by":"crossref","unstructured":"Chopra S, Hadsell R, LeCun Y (2005) Learning a similarity metric discriminatively,with application to face verification. In: Computer vision and pattern recognition (CVPR), pp 539\u2013546","DOI":"10.1109\/CVPR.2005.202"},{"issue":"4","key":"750_CR30","doi-asserted-by":"publisher","first-page":"98","DOI":"10.1145\/2766959","volume":"34","author":"S Bell","year":"2015","unstructured":"Bell S, Bala K (2015) Learning visual similarity for product design with convolutional neural networks. ACM Trans Graph 34(4):98","journal-title":"ACM Trans Graph"},{"key":"750_CR31","doi-asserted-by":"crossref","unstructured":"Jia Y, Shelhamer E, Donahue J, Karayev S, Long J, Girshick R, Guadarrama S, Darrell T (2014) Caffe: Convolutional architecture for fast feature embedding. In: Proceedings of the ACM international conference on multimedia, ACM, pp 675\u2013678","DOI":"10.1145\/2647868.2654889"},{"key":"750_CR32","doi-asserted-by":"crossref","unstructured":"Xiong Y, Liu N, Xu Z, Zhang Y (2016) A parameter partial-sharing cnn architecture for cross-domain clothing retrieval. In: Visual communications and image processing (VCIP), pp 1\u20134","DOI":"10.1109\/VCIP.2016.7805463"},{"key":"750_CR33","doi-asserted-by":"crossref","unstructured":"Wangxi SZ, Zhang W et al (2016) Matching user photos to online products with robust deep features. In: Proceedings of the 2016 ACM on international conference on multimedia retrieval, ACM, pp 7\u201314","DOI":"10.1145\/2911996.2912002"},{"key":"750_CR34","doi-asserted-by":"crossref","unstructured":"Schroff F, Kalenichenko D, Philbin J (2015) Facenet: A unified embedding for face recognition and clustering. 
In: 2015 IEEE conference on computer vision and pattern recognition (CVPR), pp 815\u2013823","DOI":"10.1109\/CVPR.2015.7298682"},{"key":"750_CR35","doi-asserted-by":"crossref","unstructured":"Wang X, Gupta A (2015) Unsupervised learning of visual representations using videos. In: ICCV","DOI":"10.1109\/ICCV.2015.320"},{"key":"750_CR36","doi-asserted-by":"crossref","unstructured":"Cui Y , Zhou F, Lin Y, Belongie S (2015) Fine-grained categorization and dataset bootstrapping using deep metric learning with humans in the loop. arXiv:1512.05227","DOI":"10.1109\/CVPR.2016.130"},{"key":"750_CR37","doi-asserted-by":"crossref","unstructured":"Simo-Serra E, Trulls E, Ferraz L, Kokkinos I, Fua P, Moreno-Noguer F (2015) Discriminative learning of deep convolutional feature point descriptors. In: ICCV","DOI":"10.1109\/ICCV.2015.22"},{"key":"750_CR38","doi-asserted-by":"crossref","unstructured":"Song HO, Xiang Y, Jegelka S, Savarese S (2016) Deep metric learning via lifted structured feature embedding. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR), pp 4004\u20134012","DOI":"10.1109\/CVPR.2016.434"},{"key":"750_CR39","doi-asserted-by":"crossref","unstructured":"Liu H, Tian Y, Yang Y, Pang L, Huang T (2016) Deep relative distance learning: tell the difference between similar vehicles. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR), pp 2167\u20132175","DOI":"10.1109\/CVPR.2016.238"},{"key":"750_CR40","doi-asserted-by":"crossref","unstructured":"Ge W, Huang W, Dong D, Scott MR (2018) Deep metric learning with hierarchical triplet loss. In: ECCV, pp 269\u2013285","DOI":"10.1007\/978-3-030-01231-1_17"},{"key":"750_CR41","doi-asserted-by":"crossref","unstructured":"Song O, Xiang H, Jegelka Y, Savarese S (2016) S: deep metric learning via lifted structured feature embedding. 
In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4004\u20134012","DOI":"10.1109\/CVPR.2016.434"},{"key":"750_CR42","doi-asserted-by":"crossref","unstructured":"Zhao Y, Jin Z, Qi G-J, Lu H, Hua X-S (2018) An adversarial approach to hard triplet generation. In: ECCV, pp 501\u2013517","DOI":"10.1007\/978-3-030-01240-3_31"},{"key":"750_CR43","doi-asserted-by":"crossref","unstructured":"Chopra A, Sinha A, Gupta H, Sarkar M, Ayush K, Krishnamurthy B (2019) Powering robust fashion retrieval with information rich feature embeddings. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops","DOI":"10.1109\/CVPRW.2019.00045"},{"key":"750_CR44","doi-asserted-by":"crossref","unstructured":"Kuang Z, Gao Y, Li G, Luo P, Chen Y, Lin L, Zhang W (2019) Fashion retrieval via graph reasoning networks on a similarity pyramid. In: The IEEE international conference on computer vision (ICCV), pp 3066\u20133075","DOI":"10.1109\/ICCV.2019.00316"},{"key":"750_CR45","doi-asserted-by":"crossref","unstructured":"Lin Z, Yang Z, Huang F, Chen J (2018) Regional maximum activations of convolutions with attention for cross-domain beauty and personal care product retrieval. In: 2018 ACM multimedia conference on multimedia conference, pp 2073\u20132077","DOI":"10.1145\/3240508.3266436"},{"key":"750_CR46","doi-asserted-by":"crossref","unstructured":"Xuan H, Souvenir R, Pless R (2018) Deep randomized ensembles for metric learning. In: The European conference on computer vision (ECCV), pp 723\u2013734","DOI":"10.1007\/978-3-030-01270-0_44"},{"key":"750_CR47","doi-asserted-by":"crossref","unstructured":"Yuan Y, Yang K, Zhang C (2017) Hard-aware deeply cascaded embedding. In: The IEEE international conference on computer vision (ICCV), pp 814\u2013823","DOI":"10.1109\/ICCV.2017.94"},{"key":"750_CR48","unstructured":"Lee C-Y, Xie S, Gallagher PW, Zhang Z, Tu Z (2015) Deeply-supervised nets. In: Proc. 
AISTATS"},{"key":"750_CR49","doi-asserted-by":"crossref","unstructured":"Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proc. CVPR","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"750_CR50","doi-asserted-by":"crossref","unstructured":"Opitz M, Waltner G, Possegger H, Bischof H (2017) Bier\u2014boosting independent embeddings robustly. In: ICCV, pp 5189\u20135198","DOI":"10.1109\/ICCV.2017.555"},{"key":"750_CR51","doi-asserted-by":"crossref","unstructured":"Xuan H, Souvenir R, Pless R (2018) Deep randomized ensembles for metric learning. In: ECCV, pp 723\u2013734","DOI":"10.1007\/978-3-030-01270-0_44"},{"key":"750_CR52","doi-asserted-by":"crossref","unstructured":"Kim W, Goyal B, Chawla K, Lee J, Kwon K (2018) Attention-based ensemble for deep metric learning. In: ECCV, pp 760\u2013777","DOI":"10.1007\/978-3-030-01246-5_45"},{"key":"750_CR53","doi-asserted-by":"crossref","unstructured":"Zheng S, Yang F, Kiapour MH, Piramuthu R (2018) Modanet: a large-scale street fashion dataset with polygon annotations. In: ACM multimedia","DOI":"10.1145\/3240508.3240652"},{"key":"750_CR54","doi-asserted-by":"crossref","unstructured":"Zou X, Kong X, Wong W, Wang C, Liu Y, Cao Y (2019) Fashionai: a hierarchical dataset for fashion understanding. 
In: CVPR workshop","DOI":"10.1109\/CVPRW.2019.00039"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-022-00750-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-022-00750-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-022-00750-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,10,27]],"date-time":"2022-10-27T12:21:14Z","timestamp":1666873274000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-022-00750-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,5,13]]},"references-count":54,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2022,12]]}},"alternative-id":["750"],"URL":"https:\/\/doi.org\/10.1007\/s40747-022-00750-5","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"value":"2199-4536","type":"print"},{"value":"2198-6053","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,5,13]]},"assertion":[{"value":"5 June 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 April 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 May 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"On behalf of all authors, the corresponding author states that there is no conflict of 
interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}},{"value":"This work is supported in part by China National Textile and Apparel Council No. 2018097, National Natural Science Foundation of China 61902301, Shaanxi Provincial Education Department 19JK036418JK0334 and Science and Technology Plan Project of Shaanxi Province 2022JM-146. Thanks to all reviewers.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Funding"}}]}}