{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,16]],"date-time":"2026-02-16T16:56:16Z","timestamp":1771260976794,"version":"3.50.1"},"reference-count":54,"publisher":"Springer Science and Business Media LLC","issue":"6","license":[{"start":{"date-parts":[[2023,8,24]],"date-time":"2023-08-24T00:00:00Z","timestamp":1692835200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,8,24]],"date-time":"2023-08-24T00:00:00Z","timestamp":1692835200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Science Foundation of China","doi-asserted-by":"crossref","award":["62203133"],"award-info":[{"award-number":["62203133"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100012166","name":"National Key R &D Program of China","doi-asserted-by":"crossref","award":["2021YFB3901300"],"award-info":[{"award-number":["2021YFB3901300"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100011381","name":"State Key Laboratory of Robotics and System","doi-asserted-by":"publisher","award":["SKLRS-2023-KF-17"],"award-info":[{"award-number":["SKLRS-2023-KF-17"]}],"id":[{"id":"10.13039\/501100011381","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Multimedia Systems"],"published-print":{"date-parts":[[2023,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Unsupervised 2D image-based 3D model retrieval aims at retrieving images from the gallery of 3D models by the given 2D images. Despite the encouraging progress made in this task, there are still two significant limitations: (1) feature alignment of 2D images and 3D model gallery is still difficult due to the huge gap between the two modalities. (2) The important view information in the 3D model gallery was ignored by the prior arts, which led to inaccurate results. To alleviate these limitations, inspired by the success of vision transformers (ViT) in a great variety of vision tasks, in this paper, we propose an end-to-end 3D model retrieval architecture on top of ViT, termly transformer-based 3D model retrieval network (T3DRN). In addition, to take advantage of the valuable view information of 3D models, we present an attentive module in T3DRN named shared view-guided attentive module (SVAM) to guide the learning of the alignment features. The proposed method is tested on the challenging dataset, MI3DOR-1. The extensive experimental results have proved the superiority of our proposed method to state-of-the-art methods.<\/jats:p>","DOI":"10.1007\/s00530-023-01166-y","type":"journal-article","created":{"date-parts":[[2023,8,24]],"date-time":"2023-08-24T09:02:43Z","timestamp":1692867763000},"page":"3891-3901","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":17,"title":["View-target relation-guided unsupervised 2D image-based 3D model retrieval via transformer"],"prefix":"10.1007","volume":"29","author":[{"given":"Jiacheng","family":"Chang","sequence":"first","affiliation":[]},{"given":"Lanyong","family":"Zhang","sequence":"additional","affiliation":[]},{"given":"Zhuang","family":"Shao","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,8,24]]},"reference":[{"key":"1166_CR1","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1007\/s00530-020-00699-w","volume":"27","author":"B Veerasamy","year":"2021","unstructured":"Veerasamy, B., Annadurai, S.: Video compression using hybrid hexagon search and teaching\u2013learning-based optimization technique for 3D reconstruction. Multimed. Syst. 27, 45\u201359 (2021)","journal-title":"Multimed. Syst."},{"issue":"1","key":"1166_CR2","doi-asserted-by":"publisher","first-page":"97","DOI":"10.1007\/s10055-022-00659-x","volume":"27","author":"M Kirya","year":"2023","unstructured":"Kirya, M., Debattista, K., Chalmers, A.: Using virtual environments to facilitate refugee integration in third countries. Virtual Real. 27(1), 97\u2013107 (2023)","journal-title":"Virtual Real."},{"issue":"6","key":"1166_CR3","doi-asserted-by":"publisher","first-page":"2694","DOI":"10.1002\/mrm.29400","volume":"88","author":"X Liu","year":"2022","unstructured":"Liu, X., Pang, Y., Jin, R., Liu, Y., Wang, Z.: Dual-domain reconstruction network with V-Net and K-Net for fast MRI. Magn. Reson. Med. 88(6), 2694\u20132708 (2022)","journal-title":"Magn. Reson. Med."},{"key":"1166_CR4","doi-asserted-by":"publisher","first-page":"213","DOI":"10.1016\/j.neucom.2022.09.048","volume":"517","author":"Y Liu","year":"2023","unstructured":"Liu, Y., Pang, Y., Liu, X., Liu, Y., Nie, J.: DIIK-Net: a full-resolution cross-domain deep interaction convolutional neural network for MR image reconstruction. Neurocomputing 517, 213\u2013222 (2023)","journal-title":"Neurocomputing"},{"key":"1166_CR5","first-page":"1","volume":"2022","author":"A Gao","year":"2022","unstructured":"Gao, A., Pang, Y., Nie, J., Shao, Z., Cao, J., Guo, Y., Li, X.: ESGN: efficient stereo geometry network for fast 3D object detection. IEEE Trans. Circuits Syst. Video Technol. 2022, 1 (2022)","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"1166_CR6","unstructured":"Qi, C.R., Su, H., Mo, K., Guibas, L.J.: Pointnet: deep learning on point sets for 3D classification and segmentation. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, July 21\u201326, 2017, pp. 77\u201385 (2017)"},{"key":"1166_CR7","unstructured":"Wu, Z., Song, S., Khosla, A., Yu, F., Zhang, L., Tang, X., Xiao, J.: 3D shapenets: a deep representation for volumetric shapes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1912\u20131920 (2015)"},{"key":"1166_CR8","doi-asserted-by":"crossref","unstructured":"Furuya, T., Ohbuchi, R.: Deep aggregation of local 3D geometric features for 3D model retrieval. In: Wilson, R.C., Hancock, E.R., Smith, W.A.P. (eds.) Proceedings of the British Machine Vision Conference, BMVC (2016)","DOI":"10.5244\/C.30.121"},{"key":"1166_CR9","doi-asserted-by":"crossref","unstructured":"Feng, Y., Feng, Y., You, H., Zhao, X., Gao, Y.: Meshnet: mesh neural network for 3D shape representation. In: The 33rd AAAI Conference on Artificial Intelligence, The 31st Innovative Applications of Artificial Intelligence Conference, IAAI, The Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI, pp. 8279\u20138286 (2019)","DOI":"10.1609\/aaai.v33i01.33018279"},{"key":"1166_CR10","unstructured":"Ganin, Y., Lempitsky, V.S.: Unsupervised domain adaptation by backpropagation. In: Bach, F.R., Blei, D.M. (eds.) Proceedings of the 32nd International Conference on Machine Learning, ICML 2015, Lille, France, 6\u201311 July 2015. JMLR Workshop and Conference Proceedings, vol. 37, pp. 1180\u20131189 (2015)"},{"key":"1166_CR11","doi-asserted-by":"crossref","unstructured":"Wang, J., Feng, W., Chen, Y., Yu, H., Huang, M., Yu, P.S.: Visual domain adaptation with manifold embedded distribution alignment. In: Boll, S., Lee, K.M., Luo, J., Zhu, W., Byun, H., Chen, C.W., Lienhart, R., Mei, T. (eds.) 2018 ACM Multimedia Conference on Multimedia Conference, MM, pp. 402\u2013410 (2018)","DOI":"10.1145\/3240508.3240512"},{"key":"1166_CR12","doi-asserted-by":"crossref","unstructured":"Zhang, J., Li, W., Ogunbona, P.: Joint geometrical and statistical alignment for visual domain adaptation. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, July 21\u201326, 2017, pp. 5150\u20135158 (2017)","DOI":"10.1109\/CVPR.2017.547"},{"key":"1166_CR13","doi-asserted-by":"crossref","unstructured":"Yue, X., Zhang, Y., Zhao, S., Sangiovanni-Vincentelli, A.L., Keutzer, K., Gong, B.: Domain randomization and pyramid consistency: simulation-to-real generalization without accessing target domain data. CoRR arXiv:abs\/1909.00889 (2019)","DOI":"10.1109\/ICCV.2019.00219"},{"key":"1166_CR14","doi-asserted-by":"crossref","unstructured":"Chen, Y., Ouyang, X., Zhu, K., Agam, G.: Semi-supervised domain adaptation for semantic segmentation. CoRR arXiv:abs\/2110.10639 (2021)","DOI":"10.1109\/ICPR56361.2022.9956524"},{"key":"1166_CR15","doi-asserted-by":"crossref","unstructured":"Peng, D., Lei, Y., Hayat, M., Guo, Y., Li, W.: Semantic-aware domain generalized segmentation. CoRR arXiv:abs\/2204.00822 (2022)","DOI":"10.1109\/CVPR52688.2022.00262"},{"key":"1166_CR16","doi-asserted-by":"publisher","first-page":"677","DOI":"10.1007\/s11042-017-5532-x","volume":"78","author":"A-A Liu","year":"2019","unstructured":"Liu, A.-A., Shao, Z., Wong, Y., Li, J., Su, Y.-T., Kankanhalli, M.: LSTM-based multi-label video event detection. Multimed. Tools Appl. 78, 677\u2013695 (2019)","journal-title":"Multimed. Tools Appl."},{"key":"1166_CR17","doi-asserted-by":"crossref","unstructured":"Chu, F., Cao, J., Shao, Z., Pang, Y.: Illumination-guided transformer-based network for multispectral pedestrian detection. In: Artificial Intelligence: Second CAAI International Conference, CICAI 2022, Beijing, China, August 27\u201328, 2022, Revised Selected Papers, Part I, pp. 343\u2013355 (2022). Springer, London","DOI":"10.1007\/978-3-031-20497-5_28"},{"issue":"1","key":"1166_CR18","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1007\/s00530-022-00982-y","volume":"29","author":"Y Li","year":"2023","unstructured":"Li, Y., Yin, K., Liang, J., Tan, Z., Wang, X., Yin, G., Wang, Z.: A multitask joint framework for real-time person search. Multimed. Syst. 29(1), 211\u2013222 (2023)","journal-title":"Multimed. Syst."},{"key":"1166_CR19","doi-asserted-by":"crossref","unstructured":"Wang, J., Pang, Y., Cao, J., Sun, H., Shao, Z., Li, X.: Deep intra-image contrastive learning for weakly supervised one-step person search. Preprint arXiv:2302.04607 (2023)","DOI":"10.1016\/j.patcog.2023.110047"},{"key":"1166_CR20","doi-asserted-by":"publisher","unstructured":"Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019, Minneapolis, MN, USA, June 2\u20137, 2019, Volume 1 (Long and Short Papers), pp. 4171\u20134186 (2019). https:\/\/doi.org\/10.18653\/v1\/n19-1423","DOI":"10.18653\/v1\/n19-1423"},{"key":"1166_CR21","first-page":"1","volume":"2015","author":"D Maturana","year":"2015","unstructured":"Maturana, D., Scherer, S.: Voxnet: a 3D convolutional neural network for real-time object recognition. Intell. Robots Syst 2015, 1 (2015)","journal-title":"Intell. Robots Syst"},{"key":"1166_CR22","unstructured":"Wu, Z., Song, S., Khosla, A., Yu, F., Zhang, L., Tang, X., Xiao, J.: 3D shapenets: a deep representation for volumetric shapes. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, Boston, MA, USA, June 7\u201312, 2015, pp. 1912\u20131920 (2015)"},{"key":"1166_CR23","unstructured":"Qi, C.R., Yi, L., Su, H., Guibas, L.J.: Pointnet++: deep hierarchical feature learning on point sets in a metric space. Computer Vision and Pattern Recognition, arXiv (2017)"},{"key":"1166_CR24","doi-asserted-by":"crossref","unstructured":"Su, H., Maji, S., Kalogerakis, E., Learned-Miller, E.: Multi-view convolutional neural networks for 3D shape recognition. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 945\u2013953 (2015)","DOI":"10.1109\/ICCV.2015.114"},{"key":"1166_CR25","doi-asserted-by":"crossref","unstructured":"Gao, Z., Shao, Y., Guan, W., Liu, M., Cheng, Z., Chen, S.: A novel patch convolutional neural network for view-based 3D model retrieval. Computer Vision and Pattern Recognition, arXiv (2021)","DOI":"10.1145\/3474085.3475450"},{"key":"1166_CR26","doi-asserted-by":"crossref","unstructured":"Watanabe, S., Takahashi, S., Wang, L.: Aggregating viewpoints for effective view-based 3D model retrieval. In: 2021 25th International Conference Information Visualisation (IV) (2021)","DOI":"10.1109\/IV53921.2021.00058"},{"key":"1166_CR27","doi-asserted-by":"crossref","unstructured":"Kim, D., Seo, M., Park, J., Choi, D.: Source domain subset sampling for semi-supervised domain adaptation in semantic segmentation. CoRR arXiv:abs\/2205.00312 (2022)","DOI":"10.2139\/ssrn.3990644"},{"key":"1166_CR28","doi-asserted-by":"crossref","unstructured":"Li, B., Wang, Y., Zhang, S., Li, D., Keutzer, K., Darrell, T., Zhao, H.: Learning invariant representations and risks for semi-supervised domain adaptation. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 1104\u20131113 (2021)","DOI":"10.1109\/CVPR46437.2021.00116"},{"key":"1166_CR29","doi-asserted-by":"crossref","unstructured":"Zhou, H., Liu, A., Nie, W.: Dual-level embedding alignment network for 2D image-based 3D object retrieval. In: Amsaleg, L., Huet, B., Larson, M.A., Gravier, G., Hung, H., Ngo, C., Ooi, W.T. (eds.) Proceedings of the 27th ACM International Conference on Multimedia, MM, pp. 1667\u20131675 (2019)","DOI":"10.1145\/3343031.3351011"},{"key":"1166_CR30","unstructured":"Long, M., Zhu, H., Wang, J., Jordan, M.I.: Deep transfer learning with joint adaptation networks. In: Precup, D., Teh, Y.W. (eds.) Proceedings of the 34th International Conference on Machine Learning, ICML. Proceedings of Machine Learning Research, vol. 70, pp. 2208\u20132217 (2017)"},{"key":"1166_CR31","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.sigpro.2019.03.008","volume":"2019","author":"X Li","year":"2019","unstructured":"Li, X., Zhang, W., Ding, Q., Sun, J.-Q.: Multi-layer domain adaptation method for rolling bearing fault diagnosis. Signal Process. 2019, 1 (2019)","journal-title":"Signal Process."},{"key":"1166_CR32","first-page":"1","volume":"2020","author":"Q Wang","year":"2020","unstructured":"Wang, Q., Du, P., Liu, X., Yang, J., Wang, G.: Adversarial unsupervised domain adaptation for cross scenario waveform recognition. Signal Process. 2020, 1 (2020)","journal-title":"Signal Process."},{"key":"1166_CR33","doi-asserted-by":"crossref","unstructured":"Saporta, A., Douillard, A., Vu, T., P\u00e9rez, P., Cord, M.: Multi-head distillation for continual unsupervised domain adaptation in semantic segmentation. CoRR arXiv:abs\/2204.11667 (2022)","DOI":"10.1109\/CVPRW56347.2022.00419"},{"key":"1166_CR34","doi-asserted-by":"crossref","unstructured":"Hoyer, L., Dai, D., Gool, L.V.: HRDA: context-aware high-resolution domain-adaptive semantic segmentation. CoRR arXiv:abs\/2204.13132 (2022)","DOI":"10.1007\/978-3-031-20056-4_22"},{"key":"1166_CR35","doi-asserted-by":"publisher","first-page":"109246","DOI":"10.1016\/j.ress.2023.109246","volume":"236","author":"K Zhao","year":"2023","unstructured":"Zhao, K., Hu, J., Shao, H., Hu, J.: Federated multi-source domain adversarial adaptation framework for machinery fault diagnosis with data privacy. Reliab. Eng. Syst. Saf. 236, 109246 (2023)","journal-title":"Reliab. Eng. Syst. Saf."},{"key":"1166_CR36","doi-asserted-by":"publisher","first-page":"110203","DOI":"10.1016\/j.knosys.2022.110203","volume":"262","author":"K Zhao","year":"2023","unstructured":"Zhao, K., Jia, F., Shao, H.: A novel conditional weighting transfer Wasserstein auto-encoder for rolling bearing fault diagnosis with multi-source domains. Knowl.-Based Syst. 262, 110203 (2023)","journal-title":"Knowl.-Based Syst."},{"issue":"6","key":"1166_CR37","doi-asserted-by":"publisher","first-page":"2761","DOI":"10.3233\/BME-141094","volume":"24","author":"B Jin","year":"2014","unstructured":"Jin, B., Vai, M.I.: An adaptive ultrasonic backscattered signal processing technique for instantaneous characteristic frequency detection. Bio-Med. Mater. Eng. 24(6), 2761\u20132770 (2014)","journal-title":"Bio-Med. Mater. Eng."},{"issue":"13","key":"1166_CR38","doi-asserted-by":"publisher","first-page":"7723","DOI":"10.1007\/s00521-020-05514-1","volume":"33","author":"Q Zheng","year":"2021","unstructured":"Zheng, Q., Zhao, P., Li, Y., Wang, H., Yang, Y.: Spectrum interference-based two-level data augmentation method in deep learning for automatic modulation classification. Neural Comput. Appl. 33(13), 7723\u20137745 (2021)","journal-title":"Neural Comput. Appl."},{"issue":"6","key":"1166_CR39","doi-asserted-by":"publisher","first-page":"1298","DOI":"10.1109\/LCOMM.2022.3145647","volume":"26","author":"Q Zheng","year":"2022","unstructured":"Zheng, Q., Zhao, P., Wang, H., Elhanashi, A., Saponara, S.: Fine-grained modulation classification using multi-scale radio transformer with dual-channel representation. IEEE Commun. Lett. 26(6), 1298\u20131302 (2022)","journal-title":"IEEE Commun. Lett."},{"key":"1166_CR40","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., Polosukhin, I.: Attention is all you need. In: Advances in Neural Information Processing Systems (NeurIPS), pp. 5998\u20136008 (2017)"},{"key":"1166_CR41","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., Houlsby, N.: An image is worth $$16\\times 16$$ words: transformers for image recognition at scale. In: ICLR (2021)"},{"key":"1166_CR42","doi-asserted-by":"crossref","unstructured":"Sun, T., Lu, C., Zhang, T., Ling, H.: Safe self-refinement for transformer-based domain adaptation. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 7191\u20137200 (2022)","DOI":"10.1109\/CVPR52688.2022.00705"},{"issue":"1","key":"1166_CR43","doi-asserted-by":"publisher","first-page":"102","DOI":"10.1109\/TPAMI.2016.2537337","volume":"39","author":"A-A Liu","year":"2016","unstructured":"Liu, A.-A., Su, Y.-T., Nie, W.-Z., Kankanhalli, M.: Hierarchical clustering multi-task learning for joint human action grouping and recognition. IEEE Trans. Pattern Anal. Mach. Intell. 39(1), 102\u2013114 (2016)","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"issue":"5","key":"1166_CR44","doi-asserted-by":"publisher","first-page":"1372","DOI":"10.1109\/TMM.2019.2941820","volume":"22","author":"N Xu","year":"2019","unstructured":"Xu, N., Zhang, H., Liu, A.-A., Nie, W., Su, Y., Nie, J., Zhang, Y.: Multi-level policy and reward-based deep reinforcement learning framework for image captioning. IEEE Trans. Multimed. 22(5), 1372\u20131383 (2019)","journal-title":"IEEE Trans. Multimed."},{"key":"1166_CR45","doi-asserted-by":"publisher","first-page":"4515","DOI":"10.1109\/TMM.2020.3043084","volume":"23","author":"A-A Liu","year":"2020","unstructured":"Liu, A.-A., Wang, Y., Xu, N., Nie, W., Nie, J., Zhang, Y.: Adaptively clustering-driven learning for visual relationship detection. IEEE Trans. Multimed. 23, 4515\u20134525 (2020)","journal-title":"IEEE Trans. Multimed."},{"key":"1166_CR46","doi-asserted-by":"publisher","first-page":"104585","DOI":"10.1016\/j.imavis.2022.104585","volume":"128","author":"Z Ji","year":"2022","unstructured":"Ji, Z., Hu, Z., Wang, Y., Shao, Z., Pang, Y.: Reinforced pedestrian attribute recognition with group optimization reward. Image Vis. Comput. 128, 104585 (2022)","journal-title":"Image Vis. Comput."},{"key":"1166_CR47","first-page":"1","volume":"2022","author":"Z Shao","year":"2022","unstructured":"Shao, Z., Han, J., Marnerides, D., Debattista, K.: Region-object relation-aware dense captioning via transformer. IEEE Trans. Neural Netw. Learn. Syst. 2022, 1 (2022)","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"1166_CR48","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TMM.2023.3310259","volume":"2023","author":"Z Shao","year":"2023","unstructured":"Shao, Z., Han, J., Debattista, K., Pang, Y.: Textual context-aware dense captioning with diverse words. IEEE Trans. Multimed. 2023, 1 (2023)","journal-title":"IEEE Trans. Multimed."},{"key":"1166_CR49","doi-asserted-by":"crossref","unstructured":"Liu, Z., Ning, J., Cao, Y., Wei, Y., Zhang, Z., Lin, S., Hu, H.: Video swin transformer. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 3202\u20133211 (2022)","DOI":"10.1109\/CVPR52688.2022.00320"},{"key":"1166_CR50","doi-asserted-by":"crossref","unstructured":"Cao, H., Wang, Y., Chen, J., Jiang, D., Zhang, X., Tian, Q., Wang, M.: Swin-unet: unet-like pure transformer for medical image segmentation. In: European Conference on Computer Vision, pp. 205\u2013218. Springer, London (2022)","DOI":"10.1007\/978-3-031-25066-8_9"},{"key":"1166_CR51","unstructured":"Ruder, S.: An overview of gradient descent optimization algorithms. Preprint arXiv:1609.04747 (2016)"},{"issue":"6","key":"1166_CR52","doi-asserted-by":"publisher","first-page":"84","DOI":"10.1145\/3065386","volume":"60","author":"A Krizhevsky","year":"2017","unstructured":"Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Commun. ACM 60(6), 84\u201390 (2017)","journal-title":"Commun. ACM"},{"key":"1166_CR53","doi-asserted-by":"crossref","unstructured":"Zhou, H., Nie, W., Li, W., Song, D., Liu, A.-A.: Hierarchical instance feature alignment for 2D image-based 3D shape retrieval. In: Proceedings of the 29th International Conference on International Joint Conferences on Artificial Intelligence, pp. 839\u2013845 (2021)","DOI":"10.24963\/ijcai.2020\/117"},{"key":"1166_CR54","unstructured":"Xie, S., Zheng, Z., Chen, L., Chen, C.: Learning semantic representations for unsupervised domain adaptation. In: International Conference on Machine Learning, pp. 5423\u20135432. PMLR (2018)"}],"container-title":["Multimedia Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00530-023-01166-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00530-023-01166-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00530-023-01166-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,12,19]],"date-time":"2023-12-19T20:59:52Z","timestamp":1703019592000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00530-023-01166-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,24]]},"references-count":54,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2023,12]]}},"alternative-id":["1166"],"URL":"https:\/\/doi.org\/10.1007\/s00530-023-01166-y","relation":{},"ISSN":["0942-4962","1432-1882"],"issn-type":[{"value":"0942-4962","type":"print"},{"value":"1432-1882","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,8,24]]},"assertion":[{"value":"14 April 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 August 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"24 August 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare no conflict of interest.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}