{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T16:49:10Z","timestamp":1774630150535,"version":"3.50.1"},"reference-count":50,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2021,5,6]],"date-time":"2021-05-06T00:00:00Z","timestamp":1620259200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Natural Science Foundation of China","award":["No.61803221 and No.U1813216"],"award-info":[{"award-number":["No.61803221 and No.U1813216"]}]},{"name":"The Basic Research Program of Shenzhen of China","award":["JCYJ20160301100921349, JCYJ20170817152701660"],"award-info":[{"award-number":["JCYJ20160301100921349, JCYJ20170817152701660"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Point clouds with rich local geometric information have potentially huge implications in several applications, especially in areas of robotic manipulation and autonomous driving. However, most point cloud processing methods cannot extract enough geometric features from a raw point cloud, which restricts the performance of their downstream tasks such as point cloud classification, shape retrieval and part segmentation. In this paper, the authors propose a new method where a convolution based on geometric primitives is adopted to accurately represent the elusive shape in the form of a point cloud to fully extract hidden geometric features. The key idea of the proposed approach is building a brand-new convolution net named ResSANet on the basis of geometric primitives to learn hierarchical geometry information. Two different modules are devised in our network, Res-SA and Res\u00adSA\u00ad2, to achieve feature fusion at different levels in ResSANet. This work achieves classification accuracy up to 93.2% on the ModelNet40 dataset and the shape retrieval with an effect of 87.4%. The part segmentation experiment also achieves an accuracy of 83.3% (class mIoU) and 85.3% (instance mIoU) on ShapeNet dataset. It is worth mentioning that the number of parameters in this work is just 1.04 M while the network depth is minimal. Experimental results and comparisons with state-of-the-art methods demonstrate that our approach can achieve superior performance.<\/jats:p>","DOI":"10.3390\/s21093227","type":"journal-article","created":{"date-parts":[[2021,5,6]],"date-time":"2021-05-06T11:10:27Z","timestamp":1620299427000},"page":"3227","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["ResSANet: Learning Geometric Information for Point Cloud Processing"],"prefix":"10.3390","volume":"21","author":[{"given":"Xiaojun","family":"Zhu","sequence":"first","affiliation":[{"name":"School of Automation, Beijing University of Posts and Telecommunications, Beijing 100876, China"}]},{"given":"Zheng","family":"Zhang","sequence":"additional","affiliation":[{"name":"Center for Artificial Intelligence and Robotics, Shenzhen International Graduate School, Tsinghua University, Shenzhen 518005, China"}]},{"given":"Jian","family":"Ruan","sequence":"additional","affiliation":[{"name":"Center for Artificial Intelligence and Robotics, Shenzhen International Graduate School, Tsinghua University, Shenzhen 518005, China"}]},{"given":"Houde","family":"Liu","sequence":"additional","affiliation":[{"name":"Center for Artificial Intelligence and Robotics, Shenzhen International Graduate School, Tsinghua University, Shenzhen 518005, China"}]},{"given":"Hanxu","family":"Sun","sequence":"additional","affiliation":[{"name":"School of Automation, Beijing University of Posts and Telecommunications, Beijing 100876, China"}]}],"member":"1968","published-online":{"date-parts":[[2021,5,6]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Qi, C.R., Liu, W., Wu, C., Su, H., and Guibas, L.J. (2018, January 18\u201322). Frustum pointnets for 3d object detection from rgb-d data. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00102"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Liang, H., Ma, X., Li, S., Gorner, M., Tang, S., Fang, B., Sun, F., and Zhang, J. (2019, January 20\u201324). Pointnetgpd: Detecting grasp configurations from point sets. Proceedings of the 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada.","DOI":"10.1109\/ICRA.2019.8794435"},{"key":"ref_3","unstructured":"Qi, C.R., Su, H., Mo, K., and Guibas, L.J. (2017, January 21\u201326). Pointnet: Deep learning on point sets for 3d classification and segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA."},{"key":"ref_4","unstructured":"Qi, C.R., Yi, L., Su, H., and Guibas, L.J. (2017). Pointnet++: Deep hierarchical feature learning on point sets in a metric space. arXiv."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., and Fei-Fei, L. (2009, January 20\u201325). Imagenet: A large-scale hierarchical image database. Proceedings of the 2009 IEEE conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Chiu, C.-C., Sainath, T.N., Wu, Y., Prabhavalkar, R., Nguyen, P., Chen, Z., Kannan, A., Weiss, R.J., Rao, K., and Gonina, E. (2018, January 15\u201320). State-of-the-art speech recognition with sequence-to-sequence models. Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, AB, Canada.","DOI":"10.1109\/ICASSP.2018.8462105"},{"key":"ref_7","first-page":"820","article-title":"Pointcnn: Convolution on x-transformed points","volume":"31","author":"Li","year":"2018","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_8","first-page":"1","article-title":"Dynamic graph cnn for learning on point clouds","volume":"38","author":"Wang","year":"2019","journal-title":"ACM Trans. Graph. (TOG)"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Zhao, H., Jiang, L., Fu, C.W., and Jia, J. (2019, January 16\u201320). Pointweb: Enhancing local neighborhood features for point cloud processing. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00571"},{"key":"ref_10","unstructured":"Wu, Z., Song, S., Khosla, A., Yu, F., Zhang, L., Tang, X., and Xiao, J. (2015, January 7\u201312). 3d shapenets: A deep representation for volumetric shapes. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA."},{"key":"ref_11","unstructured":"Savva, M., Yu, F., Su, H., Aono, M., Chen, B., Cohen-Or, D., Deng, W., Su, H., Bai, S., and Bai, X. (2016, January 8). Shrec16 track: Largescale 3d shape retrieval from shapenet core55. Proceedings of the Eurographics Workshop on 3D Object Retrieval, Lisbon, Portugal."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Zhang, W., and Xiao, C. (2019, January 16\u201320). PCAN: 3d attention map learning using contextual information for point cloud based retrieval. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.01272"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Liu, Y., Fan, B., Xiang, S., and Pan, C. (2019, January 16\u201320). Relation-shape convolutional neural network for point cloud analysis. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00910"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Lan, S., Yu, R., Yu, G., and Davis, L.S. (2019, January 16\u201320). Modeling local geometric structure of 3d point clouds using geo-cnn. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00109"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Jian, S. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision & Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Su, H., Maji, S., Kalogerakis, E., and Learned-Miller, E. (2015, January 7\u201313). Multi-view convolutional neural networks for 3d shape recognition. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.114"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Feng, Y., Zhang, Z., Zhao, X., Ji, R., and Gao, Y. (2018, January 18\u201322). Gvcnn: Group-view convolutional neural networks for 3d shape recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00035"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Maturana, D., and Scherer, S. (October, January 28). Voxnet: A 3d convolutional neural network for real-time object recognition. Proceedings of the 2015 IEEERSJ International Conference on Intelligent Robots and Systems (IROS), Hamburg, Germany.","DOI":"10.1109\/IROS.2015.7353481"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Zhou, Y., and Tuzel, O. (2018, January 18\u201322). Voxelnet: End-to-end learning for point cloud based 3d object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00472"},{"key":"ref_20","unstructured":"Liu, Y., Fan, B., Meng, G., Lu, J., Xiang, S., and Pan, C. (November, January 27). Densepoint: Learning densely contextual representation for efficient point cloud processing. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Seoul, Korea."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Lei, H., Akhtar, N., and Mian, A. (2019, January 16\u201320). Octree guided cnn with spherical kernels for 3d point clouds. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00986"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Komarichev, A., Zhong, Z., and Hua, J. (2019, January 16\u201320). A-cnn: Annularly convolutional neural networks on point clouds. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00760"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Shen, Y., Feng, C., Yang, Y., and Tian, D. (2018, January 18\u201322). Mining point cloud local structures by kernel correlation and graph pooling. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00478"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Chen, C., Li, G., Xu, R., Chen, T., Wang, M., and Lin, L. (2019, January 16\u201320). Clusternet: Deep hierarchical cluster network with rigorously rotation-invariant representation for point cloud analysis. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00513"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"2644","DOI":"10.3390\/s21082644","article-title":"A Deep Learning Method for 3D Object Classification and Retrieval Using the Global Point Signature Plus and Deep Wide Residual Network","volume":"21","author":"Long","year":"2021","journal-title":"Sensors"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"3145","DOI":"10.1109\/LRA.2018.2850061","article-title":"3DmFV: Three-Dimensional Point Cloud Classification in Real-Time Using Convolutional Neural Networks","volume":"3","author":"Lindenbaum","year":"2018","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"313","DOI":"10.1007\/s10846-019-01016-y","article-title":"Visual Object Categorization Based on Hierarchical Shape Motifs Learned From Noisy Point Cloud Decompositions","volume":"97","author":"Christian","year":"2020","journal-title":"J. Intell. Robot. Syst."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"300","DOI":"10.1016\/j.neucom.2020.12.067","article-title":"Geometric attentional dynamic graph convolutional neural networks for point cloud analysis","volume":"432","author":"Cui","year":"2021","journal-title":"Neurocomputing"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1049\/cvi2.12039","article-title":"Point cloud classification by dynamic graph CNN with adaptive feature fusion","volume":"15","author":"Guo","year":"2021","journal-title":"IET Comput. Vis."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2980179.2980238","article-title":"A scalable active frame-work for region annotation in 3d shape collections","volume":"35","author":"Yi","year":"2016","journal-title":"ACM Trans. Graph. (TOG)"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Xie, S., Liu, S., Chen, Z., and Tu, Z. (2018, January 18\u201322). Attentional shapecontextnet for point cloud recognition. Proceedings of the IEEE Confer-ence on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00484"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Klokov, R., and Lempitsky, V. (2017, January 22\u201329). Escape from cells: Deep kd-networks for the recognition of 3d point cloud models. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.99"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Hermosilla, P., Ritschel, T., Vazquez, P.P., Vinacua, A., and Ropinski, T. (2018). Monte carlo convolution for learning on non-uniformly sampled point clouds. SIGGRAPH Asia 2018 Technical Papers, ACM.","DOI":"10.1145\/3272127.3275110"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Gadelha, M., Wang, R., and Maji, S. (2018, January 8\u201314). Multiresolution tree networks for 3d point cloud processing. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_7"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Wang, C., Samari, B., and Siddiqi, K. (2018, January 8\u201314). Local spectral graph convolution for point set feature learning. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01225-0_4"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Atzmon, M., Maron, H., and Lipman, Y. (2018). Point convolutional neural networks by extension operators. arXiv.","DOI":"10.1145\/3197517.3201301"},{"key":"ref_37","unstructured":"Liu, X., Han, Z., Liu, Y.-S., and Zwicker, M. (February, January 27). Point2sequence: Learning the shape representation of 3d point clouds with an atten-tion-based sequence to sequence network. Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA."},{"key":"ref_38","unstructured":"Zhang, K., Hao, M., Wang, J., de Silva, C.W., and Fu, C. (2019). Linked dynamic graph cnn: Learning on point cloud via linking hierar-chical features. arXiv."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Yan, X., Zheng, C., Li, Z., Wang, S., and Cui, S. (2020, January 13\u201319). Pointasnl: Robust point clouds processing using nonlocal neural networks with adaptive sampling. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00563"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Yang, Y., Feng, C., Shen, Y., and Tian, D. (2018, January 18\u201322). Foldingnet: Point cloud auto-encoder via deep grid deformation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00029"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Li, J., Chen, B.M., and Lee, G.H. (2018, January 18\u201322). So-net: Self-organizing network for point cloud analysis. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00979"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Wu, W., Qi, Z., and Fuxin, L. (2019, January 16\u201320). Pointconv: Deep convolutional networks on 3d point clouds. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00985"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Xu, Y., Fan, T., Xu, M., Zeng, L., and Qiao, Y. (2018, January 8\u201314). Spidercnn: Deep learning on point sets with parameterized convolutional filters. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01237-3_6"},{"key":"ref_44","unstructured":"Jiang, J., Bao, D., Chen, Z., Zhao, X., and Gao, Y. (February, January 27). Mlvcnn: Multi-loop-view convolutional neural network for 3d shape retrieval. Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"You, H., Feng, Y., Ji, R., and Gao, Y. (2018, January 22\u201326). Pvnet: A joint convolutional network of point cloud and multi-view for 3d shape recogni-tion. Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, Seoul, Korea.","DOI":"10.1145\/3240508.3240702"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Bai, S., Bai, X., Zhou, Z., Zhang, Z., and jan Latecki, L. (2016, January 27\u201330). Gift: A real-time and scalable 3d shape search engine. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition(CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.543"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"208","DOI":"10.1016\/j.cag.2017.12.001","article-title":"Ensemble of panorama-based convolutional neural networks for 3d model classifica-tion and retrieval","volume":"71","author":"Sfikas","year":"2018","journal-title":"Comput. Graph."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"He, X., Zhou, Y., Zhou, Z., Bai, S., and Bai, X. (2018, January 18\u201322). Triplet-center loss for multi-view 3d object retrieval. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00208"},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"658","DOI":"10.1109\/TIP.2018.2868426","article-title":"Seqviews2seqlabels: Learning 3d global features via aggregating sequential views by rnn with attention","volume":"28","author":"Han","year":"2018","journal-title":"IEEE Trans. Image Process."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Te, G., Hu, W., Zheng, A., and Guo, Z. (2018, January 22\u201326). Rgcnn: Regularized graph cnn for point cloud segmentation. Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, Seoul, Korea.","DOI":"10.1145\/3240508.3240621"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/9\/3227\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T05:57:44Z","timestamp":1760162264000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/9\/3227"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,6]]},"references-count":50,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2021,5]]}},"alternative-id":["s21093227"],"URL":"https:\/\/doi.org\/10.3390\/s21093227","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,5,6]]}}}