{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,5]],"date-time":"2026-02-05T09:08:33Z","timestamp":1770282513808,"version":"3.49.0"},"reference-count":60,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2020,2,17]],"date-time":"2020-02-17T00:00:00Z","timestamp":1581897600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["61872270 and 61572357"],"award-info":[{"award-number":["61872270 and 61572357"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"National Key R8D Program of China","award":["2019YFBB1404700"],"award-info":[{"award-number":["2019YFBB1404700"]}]},{"name":"Jinan's innovation team","award":["2018GXRC014"],"award-info":[{"award-number":["2018GXRC014"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2020,2,29]]},"abstract":"<jats:p>In recent years, view-based 3D model retrieval has become one of the research focuses in the field of computer vision and machine learning. In fact, the 3D model retrieval algorithm consists of feature extraction and similarity measurement, and the robust features play a decisive role in the similarity measurement. Although deep learning has achieved comprehensive success in the field of computer vision, deep learning features are used for 3D model retrieval only in a small number of works. To the best of our knowledge, there is no benchmark to evaluate these deep learning features. To tackle this problem, in this work we systematically evaluate the performance of deep learning features in view-based 3D model retrieval on four popular datasets (ETH, NTU60, PSB, and MVRED) by different kinds of similarity measure methods. In detail, the performance of hand-crafted features and deep learning features are compared, and then the robustness of deep learning features is assessed. Finally, the difference between single-view deep learning features and multi-view deep learning features is also evaluated. By quantitatively analyzing the performances on different datasets, it is clear that these deep learning features can consistently outperform all of the hand-crafted features, and they are also more robust than the hand-crafted features when different degrees of noise are added into the image. The exploration of latent relationships among different views in multi-view deep learning network architectures shows that the performance of multi-view deep learning outperforms that of single-view deep learning features with low computational complexity.<\/jats:p>","DOI":"10.1145\/3377876","type":"journal-article","created":{"date-parts":[[2020,3,4]],"date-time":"2020-03-04T10:23:32Z","timestamp":1583317412000},"page":"1-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":90,"title":["Exploring Deep Learning for View-Based 3D Model Retrieval"],"prefix":"10.1145","volume":"16","author":[{"given":"Zan","family":"Gao","sequence":"first","affiliation":[{"name":"Tianjin University of Technology and Qilu University of Technology, Jinan, P.R China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yinming","family":"Li","sequence":"additional","affiliation":[{"name":"Tianjin University of Technology and Qilu University of Technology, Jinan, P.R China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7013-9081","authenticated-orcid":false,"given":"Shaohua","family":"Wan","sequence":"additional","affiliation":[{"name":"Zhongnan University of Economics and Law, Wuhan, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2020,2,17]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10916-015-0282-7"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-017-4384-8"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1016\/0031-3203(95)00147-6"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSMC.1977.4309681"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2014.03.079"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.isprsjprs.2015.01.010"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2015.05.048"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2015.04.042"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2016.2540802"},{"key":"e_1_2_1_10_1","volume-title":"C","author":"Gao Zan","year":"2019"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/MMUL.2018.2873844"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2015.2419973"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1111\/1467-8659.00669"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2006.04.034"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2005.177"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.55109"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2009.5457716"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.1999.790410"},{"key":"e_1_2_1_19_1","volume-title":"Proceedings of the International Conference on Neural Information Processing Systems. 1097--1105","author":"Krizhevsky Alex"},{"key":"e_1_2_1_20_1","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556.  Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556."},{"key":"e_1_2_1_21_1","doi-asserted-by":"crossref","unstructured":"Christian Szegedy Wei Liu Yangqing Jia Pierre Sermanet Scott Reed Dragomir Anguelov Dumitru Erhan Vincent Vanhoucke and Andrew Rabinovich. 2014. Going deeper with convolutions. arXiv:1409.4842.  Christian Szegedy Wei Liu Yangqing Jia Pierre Sermanet Scott Reed Dragomir Anguelov Dumitru Erhan Vincent Vanhoucke and Andrew Rabinovich. 2014. Going deeper with convolutions. arXiv:1409.4842.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_2_1_23_1","volume-title":"Yu","author":"Wu Zonghan","year":"2019"},{"key":"e_1_2_1_24_1","volume-title":"Deep learning models for real-time human activity recognition with smartphones. Mobile Networks and Applications. Epub ahead of print (Dec. 30","author":"Wan Shaohua","year":"2019"},{"key":"e_1_2_1_25_1","unstructured":"Jie Zhou Ganqu Cui Zhengyan Zhang Cheng Yang Zhiyuan Liu and Maosong Sun. 2018. Graph neural networks: A review of methods and applications. arXiv:1812.08434.  Jie Zhou Ganqu Cui Zhengyan Zhang Cheng Yang Zhiyuan Liu and Maosong Sun. 2018. Graph neural networks: A review of methods and applications. arXiv:1812.08434."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.comcom.2019.10.012"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2018.2867286"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.comnet.2019.107036"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2019.2894422"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00035"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240508.3240702"},{"key":"e_1_2_1_32_1","doi-asserted-by":"crossref","unstructured":"Alexander Grabner Peter M. Roth and Vincent Lepetit. 2018. 3D pose estimation and 3D model retrieval for objects in the wild. arXiv:1803.11493.  Alexander Grabner Peter M. Roth and Vincent Lepetit. 2018. 3D pose estimation and 3D model retrieval for objects in the wild. arXiv:1803.11493.","DOI":"10.1109\/CVPR.2018.00319"},{"key":"e_1_2_1_33_1","first-page":"1","article-title":"Multi-view and multivariate Gaussian descriptor for 3D object retrieval","volume":"1","author":"Gao Zan","year":"2017","journal-title":"Multimedia Tools and Applications"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2011.2170081"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-009-0277-2"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2009.07.012"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-011-0873-3"},{"key":"e_1_2_1_38_1","volume-title":"3D model retrieval. In 3D Video: From Capture to Diffusion","author":"Lucas Laurent"},{"key":"e_1_2_1_39_1","doi-asserted-by":"crossref","unstructured":"S. Haykin and B. Kosko. 2001. Gradient-based learning applied to document recognition. In Intelligent Signal Processing. IEEE Los Alamitos CA 306--351.  S. Haykin and B. Kosko. 2001. Gradient-based learning applied to document recognition. In Intelligent Signal Processing. IEEE Los Alamitos CA 306--351.","DOI":"10.1109\/9780470544976"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00530-015-0485-2"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCYB.2017.2664503"},{"key":"e_1_2_1_42_1","volume-title":"Proceedings of the International Conference on Pattern Recognition","volume":"1","author":"Dubuisson M. P."},{"key":"e_1_2_1_43_1","volume-title":"Proceedings of the KDD Workshop on Text Mining.","author":"Steinbach M."},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2006.886359"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCYB.2014.2374755"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2016.2540802"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2015.2395961"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.image.2010.10.006"},{"key":"e_1_2_1_49_1","volume-title":"Proceedings of the 10th IEEE International Conference on Computer Vision. 1482--1489","author":"Leordeanu M."},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.5555\/1888150.1888189"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/JIOT.2019.2911669"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.114"},{"key":"e_1_2_1_54_1","doi-asserted-by":"crossref","unstructured":"Zan Gao Deyu Wang Y. B. Xue G. P Xu H. Zhang and Y. L. Wang. 2018. 3D object recognition based on pairwise multi-view convolutional neural networks. Journal of Visual Communication and Image Representation 56 C (2018) 305--315.  Zan Gao Deyu Wang Y. B. Xue G. P Xu H. Zhang and Y. L. Wang. 2018. 3D object recognition based on pairwise multi-view convolutional neural networks. Journal of Visual Communication and Image Representation 56 C (2018) 305--315.","DOI":"10.1016\/j.jvcir.2018.10.007"},{"key":"e_1_2_1_55_1","volume-title":"Proceedings of the 32nd AAAI Conference on Artificial Intelligence. 1--8.","author":"Gao Zan","year":"2018"},{"key":"e_1_2_1_56_1","volume-title":"Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition","volume":"2","author":"Schiele Bernt","year":"2003"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/SMI.2004.1314504"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIE.2013.2262760"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2011.2160619"},{"key":"e_1_2_1_60_1","volume-title":"C","author":"Nie Wei-Zhi","year":"2016"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3377876","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3377876","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:38:52Z","timestamp":1750199932000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3377876"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,2,17]]},"references-count":60,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,2,29]]}},"alternative-id":["10.1145\/3377876"],"URL":"https:\/\/doi.org\/10.1145\/3377876","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,2,17]]},"assertion":[{"value":"2019-08-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-01-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-02-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}