{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,22]],"date-time":"2025-11-22T11:27:34Z","timestamp":1763810854607,"version":"3.41.0"},"reference-count":65,"publisher":"Association for Computing Machinery (ACM)","issue":"1s","license":[{"start":{"date-parts":[[2023,1,23]],"date-time":"2023-01-23T00:00:00Z","timestamp":1674432000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62172417, 62101555, 61806206, 61772530"],"award-info":[{"award-number":["62172417, 62101555, 61806206, 61772530"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100004608","name":"Natural Science Foundation of Jiangsu Province","doi-asserted-by":"crossref","award":["BK20180639, BK20201346, BK20210488"],"award-info":[{"award-number":["BK20180639, BK20201346, BK20210488"]}],"id":[{"id":"10.13039\/501100004608","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100010014","name":"Six Talent Peaks Project in Jiangsu Province","doi-asserted-by":"crossref","award":["2015-DZXX-010, 2018-XYDXX-044"],"award-info":[{"award-number":["2015-DZXX-010, 2018-XYDXX-044"]}],"id":[{"id":"10.13039\/501100010014","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2023,2,28]]},"abstract":"<jats:p>Point clouds provide a flexible geometric representation for computer vision research. However, the harsh demands for the number of input points and computer hardware are still significant challenges, which hinder their deployment in real applications. To address these challenges, we design a simple and effective module named cyclic self-attention module (CSAM). Specifically, three attention maps of the same input are obtained by cyclically pairing the feature maps, thus exploring the features sufficiently of the attention space of the original input. CSAM can adequately explore the correlation between points to obtain sufficient feature information despite the multiplicative decrease in inputs. Meanwhile, it can direct the computational power to the more essential features, relieving the burden on the computer hardware. We build a point cloud classification network by simply stacking CSAM called cyclic self-attention network (CSAN). We also propose a novel framework for point cloud semantic segmentation called full cyclic self-attention network (FCSAN). By adaptively fusing the original mapping features and the CSAM extracted features, it can better capture the context information of point clouds. Extensive experiments on several benchmark datasets show that our methods can achieve competitive performance in classification and segmentation tasks.<\/jats:p>","DOI":"10.1145\/3538648","type":"journal-article","created":{"date-parts":[[2022,6,24]],"date-time":"2022-06-24T10:05:24Z","timestamp":1656065124000},"page":"1-19","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["Cyclic Self-attention for Point Cloud Recognition"],"prefix":"10.1145","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4867-7235","authenticated-orcid":false,"given":"Guanyu","family":"Zhu","sequence":"first","affiliation":[{"name":"School of Computer Sciences and Technology, Engineering Research Center of Mine Digitization of Ministry of Education of the People\u2019s Republic of China, China University of Mining and Technology, Xuzhou, Jiangsu Province, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6207-0299","authenticated-orcid":false,"given":"Yong","family":"Zhou","sequence":"additional","affiliation":[{"name":"School of Computer Sciences and Technology, Engineering Research Center of Mine Digitization of Ministry of Education of the People\u2019s Republic of China, China University of Mining and Technology, Xuzhou, Jiangsu Province, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2734-915X","authenticated-orcid":false,"given":"Rui","family":"Yao","sequence":"additional","affiliation":[{"name":"School of Computer Sciences and Technology, Engineering Research Center of Mine Digitization of Ministry of Education of the People\u2019s Republic of China, China University of Mining and Technology, Xuzhou, Jiangsu Province, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5418-9879","authenticated-orcid":false,"given":"Hancheng","family":"Zhu","sequence":"additional","affiliation":[{"name":"School of Computer Sciences and Technology, Engineering Research Center of Mine Digitization of Ministry of Education of the People\u2019s Republic of China, China University of Mining and Technology, Xuzhou, Jiangsu Province, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3564-5090","authenticated-orcid":false,"given":"Jiaqi","family":"Zhao","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Engineering Research Center of Mine Digitization of Ministry of Education of the People\u2019s Republic of China, Innovation Research Center of Disaster Intelligent Prevention and Emergency Rescue, China University of Mining and Technology, China"}]}],"member":"320","published-online":{"date-parts":[[2023,1,23]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.1145\/3377352"},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.170"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2018.2850061"},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.cag.2020.02.005"},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58452-8_13"},{"key":"e_1_3_1_7_2","first-page":"4994","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Chen Chao","year":"2019","unstructured":"Chao Chen, Guanbin Li, Ruijia Xu, Tianshui Chen, Meng Wang, and Liang Lin. 2019. ClusterNet: Deep hierarchical cluster network with rigorously rotation-invariant representation for point cloud analysis. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 4994\u20135002."},{"key":"e_1_3_1_8_2","first-page":"783","volume-title":"Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)","author":"Chen Xuzhan","year":"2017","unstructured":"Xuzhan Chen, Youping Chen, and Homayoun Najjaran. 2017. 3D object classification with point convolution network. In Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 783\u2013788."},{"key":"e_1_3_1_9_2","article-title":"An image is worth 16x16 words: Transformers for image recognition at scale","author":"Dosovitskiy Alexey","year":"2020","unstructured":"Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, et\u00a0al. 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020).","journal-title":"arXiv preprint arXiv:2010.11929"},{"key":"e_1_3_1_10_2","first-page":"0","volume-title":"Proceedings of the European Conference on Computer Vision (ECCV)","author":"Engelmann Francis","year":"2018","unstructured":"Francis Engelmann, Theodora Kontogianni, Jonas Schult, and Bastian Leibe. 2018. Know what your neighbors do: 3D semantic segmentation of point clouds. In Proceedings of the European Conference on Computer Vision (ECCV). 0\u20130."},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2020.107446"},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00035"},{"key":"e_1_3_1_13_2","article-title":"LFT-Net: Local feature transformer network for point clouds analysis","author":"Gao Yongbin","year":"2022","unstructured":"Yongbin Gao, Xuebing Liu, Jun Li, Zhijun Fang, Xiaoyan Jiang, and Kazi Mohammed Saidul Huq. 2022. LFT-Net: Local feature transformer network for point clouds analysis. IEEE Trans. Intell. Transport. Syst. (2022).","journal-title":"IEEE Trans. Intell. Transport. Syst."},{"key":"e_1_3_1_14_2","first-page":"3809","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Goyal Ankit","year":"2021","unstructured":"Ankit Goyal, Hei Law, Bowei Liu, Alejandro Newell, and Jia Deng. 2021. Revisiting point cloud shape classification with a simple and effective baseline. In Proceedings of the International Conference on Machine Learning. PMLR, 3809\u20133820."},{"key":"e_1_3_1_15_2","article-title":"PCT: Point cloud transformer","author":"Guo Meng-Hao","year":"2020","unstructured":"Meng-Hao Guo, Jun-Xiong Cai, Zheng-Ning Liu, Tai-Jiang Mu, Ralph R. Martin, and Shi-Min Hu. 2020. PCT: Point cloud transformer. arXiv preprint arXiv:2012.09688 (2020).","journal-title":"arXiv preprint arXiv:2012.09688"},{"key":"e_1_3_1_16_2","first-page":"10925","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"34","author":"Han Wenkai","year":"2020","unstructured":"Wenkai Han, Chenglu Wen, Cheng Wang, Xin Li, and Qing Li. 2020. Point2Node: Correlation learning of dynamic-node for point cloud feature modeling. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 10925\u201310932."},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1145\/3388861"},{"key":"e_1_3_1_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00278"},{"key":"e_1_3_1_19_2","first-page":"7211","volume-title":"Proceedings of the 25th International Conference on Pattern Recognition (ICPR)","author":"Kaul Chaitanya","year":"2021","unstructured":"Chaitanya Kaul, Nick Pears, and Suresh Manandhar. 2021. FatNet: A feature-attentive network for 3D point cloud processing. In Proceedings of the 25th International Conference on Pattern Recognition (ICPR). IEEE, 7211\u20137218."},{"key":"e_1_3_1_20_2","article-title":"Transformers in vision: A survey","author":"Khan Salman","year":"2021","unstructured":"Salman Khan, Muzammal Naseer, Munawar Hayat, Syed Waqas Zamir, Fahad Shahbaz Khan, and Mubarak Shah. 2021. Transformers in vision: A survey. ACM Comput. Surv. (2021).","journal-title":"ACM Comput. Surv."},{"key":"e_1_3_1_21_2","first-page":"9204","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Le Truc","year":"2018","unstructured":"Truc Le and Ye Duan. 2018. PointGrid: A deep network for 3D shape understanding. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 9204\u20139214."},{"key":"e_1_3_1_22_2","first-page":"820","volume-title":"Proceedings of the Conference on Advances in Neural Information Processing Systems","author":"Li Yangyan","year":"2018","unstructured":"Yangyan Li, Rui Bu, Mingchao Sun, Wei Wu, Xinhan Di, and Baoquan Chen. 2018. PointCNN: Convolution on x-transformed points. In Proceedings of the Conference on Advances in Neural Information Processing Systems. 820\u2013830."},{"key":"e_1_3_1_23_2","first-page":"1954","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Lin Kevin","year":"2021","unstructured":"Kevin Lin, Lijuan Wang, and Zicheng Liu. 2021. End-to-end human pose and mesh reconstruction with transformers. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1954\u20131963."},{"key":"e_1_3_1_24_2","first-page":"7546","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Liu Jinxian","year":"2019","unstructured":"Jinxian Liu, Bingbing Ni, Caiyuan Li, Jiancheng Yang, and Qi Tian. 2019. Dynamic points agglomeration for hierarchical point sets learning. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 7546\u20137555."},{"key":"e_1_3_1_25_2","first-page":"8895","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Liu Yongcheng","year":"2019","unstructured":"Yongcheng Liu, Bin Fan, Shiming Xiang, and Chunhong Pan. 2019. Relation-shape convolutional neural network for point cloud analysis. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8895\u20138904."},{"key":"e_1_3_1_26_2","article-title":"Point-voxel CNN for efficient 3D deep learning","volume":"32","author":"Liu Zhijian","year":"2019","unstructured":"Zhijian Liu, Haotian Tang, Yujun Lin, and Song Han. 2019. Point-voxel CNN for efficient 3D deep learning. Adv. Neural Inf. Process. Syst. 32 (2019).","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"e_1_3_1_27_2","first-page":"11677","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","author":"Liu Zhe","year":"2020","unstructured":"Zhe Liu, Xin Zhao, Tengteng Huang, Ruolan Hu, Yu Zhou, and Xiang Bai. 2020. TANet: Robust 3D object detection from point clouds with triple attention. In Proceedings of the AAAI Conference on Artificial Intelligence. 11677\u201311684."},{"key":"e_1_3_1_28_2","article-title":"Sgdr: Stochastic gradient descent with warm restarts","author":"Loshchilov Ilya","year":"2016","unstructured":"Ilya Loshchilov and Frank Hutter. 2016. Sgdr: Stochastic gradient descent with warm restarts. arXiv preprint arXiv:1608.03983 (2016).","journal-title":"arXiv preprint arXiv:1608.03983"},{"key":"e_1_3_1_29_2","first-page":"1578","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Mao Jiageng","year":"2019","unstructured":"Jiageng Mao, Xiaogang Wang, and Hongsheng Li. 2019. Interpolated convolutional networks for 3D point cloud understanding. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 1578\u20131587."},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2015.7353481"},{"key":"e_1_3_1_31_2","first-page":"2906","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Misra Ishan","year":"2021","unstructured":"Ishan Misra, Rohit Girdhar, and Armand Joulin. 2021. An end-to-end transformer model for 3D object detection. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 2906\u20132917."},{"issue":"4","key":"e_1_3_1_32_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3410439","article-title":"MMFN: Multimodal information fusion networks for 3D model classification and retrieval","volume":"16","author":"Nie Weizhi","year":"2020","unstructured":"Weizhi Nie, Qi Liang, Yixin Wang, Xing Wei, and Yuting Su. 2020. MMFN: Multimodal information fusion networks for 3D model classification and retrieval. ACM Trans. Multim. Comput. Commun. Applic. 16, 4 (2020), 1\u201322.","journal-title":"ACM Trans. Multim. Comput. Commun. Applic."},{"key":"e_1_3_1_33_2","first-page":"652","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Qi Charles R.","year":"2017","unstructured":"Charles R. Qi, Hao Su, Kaichun Mo, and Leonidas J. Guibas. 2017. Pointnet: Deep learning on point sets for 3D classification and segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 652\u2013660."},{"key":"e_1_3_1_34_2","first-page":"5099","article-title":"Pointnet++: Deep hierarchical feature learning on point sets in a metric space","volume":"30","author":"Qi Charles Ruizhongtai","year":"2017","unstructured":"Charles Ruizhongtai Qi, Li Yi, Hao Su, and Leonidas J. Guibas. 2017. Pointnet++: Deep hierarchical feature learning on point sets in a metric space. Adv. Neural Inf. Process. Syst. 30 (2017), 5099\u20135108.","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"e_1_3_1_35_2","first-page":"3813","volume-title":"Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision","author":"Qiu Shi","year":"2021","unstructured":"Shi Qiu, Saeed Anwar, and Nick Barnes. 2021. Dense-resolution network for point cloud classification and segmentation. In Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision. 3813\u20133822."},{"key":"e_1_3_1_36_2","first-page":"452","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Rao Yongming","year":"2019","unstructured":"Yongming Rao, Jiwen Lu, and Jie Zhou. 2019. Spherical fractal convolutional neural networks for point cloud recognition. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 452\u2013460."},{"key":"e_1_3_1_37_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.701"},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.11"},{"key":"e_1_3_1_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.114"},{"key":"e_1_3_1_40_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00651"},{"key":"e_1_3_1_41_2","article-title":"Training data-efficient image transformers & distillation through attention","author":"Touvron Hugo","year":"2020","unstructured":"Hugo Touvron, Matthieu Cord, Matthijs Douze, Francisco Massa, Alexandre Sablayrolles, and Herv\u00e9 J\u00e9gou. 2020. Training data-efficient image transformers & distillation through attention. arXiv preprint arXiv:2012.12877 (2020).","journal-title":"arXiv preprint arXiv:2012.12877"},{"key":"e_1_3_1_42_2","volume-title":"Proceedings of the International Conference on Computer Vision (ICCV)","author":"Uy Mikaela Angelina","year":"2019","unstructured":"Mikaela Angelina Uy, Quang-Hieu Pham, Binh-Son Hua, Duc Thanh Nguyen, and Sai-Kit Yeung. 2019. Revisiting point cloud classification: A new benchmark dataset and classification model on real-world data. In Proceedings of the International Conference on Computer Vision (ICCV)."},{"key":"e_1_3_1_43_2","article-title":"Attention is all you need","volume":"30","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Adv. Neural Inf. Process. Syst. 30 (2017).","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"e_1_3_1_44_2","first-page":"108","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Wang Huiyu","year":"2020","unstructured":"Huiyu Wang, Yukun Zhu, Bradley Green, Hartwig Adam, Alan Yuille, and Liang-Chieh Chen. 2020. Axial-DeepLab: Stand-alone axial-attention for panoptic segmentation. In Proceedings of the European Conference on Computer Vision. Springer, 108\u2013126."},{"key":"e_1_3_1_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.01054"},{"issue":"4","key":"e_1_3_1_46_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3072959.3073608","article-title":"O-CNN: Octree-based convolutional neural networks for 3D shape analysis","volume":"36","author":"Wang Peng-Shuai","year":"2017","unstructured":"Peng-Shuai Wang, Yang Liu, Yu-Xiao Guo, Chun-Yu Sun, and Xin Tong. 2017. O-CNN: Octree-based convolutional neural networks for 3D shape analysis. ACM Comput. Graph. 36, 4 (2017), 1\u201311.","journal-title":"ACM Comput. Graph."},{"issue":"5","key":"e_1_3_1_47_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3326362","article-title":"Dynamic graph CNN for learning on point clouds","volume":"38","author":"Wang Yue","year":"2019","unstructured":"Yue Wang, Yongbin Sun, Ziwei Liu, Sanjay E. Sarma, Michael M. Bronstein, and Justin M. Solomon. 2019. Dynamic graph CNN for learning on point clouds. ACM Comput. Graph. 38, 5 (2019), 1\u201312.","journal-title":"ACM Comput. Graph."},{"key":"e_1_3_1_48_2","first-page":"9621","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Wu Wenxuan","year":"2019","unstructured":"Wenxuan Wu, Zhongang Qi, and Li Fuxin. 2019. PointConv: Deep convolutional networks on 3D point clouds. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 9621\u20139630."},{"key":"e_1_3_1_49_2","first-page":"1912","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Wu Zhirong","year":"2015","unstructured":"Zhirong Wu, Shuran Song, Aditya Khosla, Fisher Yu, Linguang Zhang, Xiaoou Tang, and Jianxiong Xiao. 2015. 3D ShapeNets: A deep representation for volumetric shapes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1912\u20131920."},{"key":"e_1_3_1_50_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00484"},{"key":"e_1_3_1_51_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2020.03.086"},{"key":"e_1_3_1_52_2","first-page":"4589","volume-title":"Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)","author":"Xu Chenfeng","year":"2021","unstructured":"Chenfeng Xu, Bohan Zhai, Bichen Wu, Tian Li, Wei Zhan, Peter Vajda, Kurt Keutzer, and Masayoshi Tomizuka. 2021. You only group once: Efficient point-cloud processing with token representation and relation inference module. In Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 4589\u20134596."},{"key":"e_1_3_1_53_2","first-page":"5661","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Xu Qiangeng","year":"2020","unstructured":"Qiangeng Xu, Xudong Sun, Cho-Ying Wu, Panqu Wang, and Ulrich Neumann. 2020. Grid-GCN for fast and scalable point cloud learning. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 5661\u20135670."},{"key":"e_1_3_1_54_2","first-page":"87","volume-title":"Proceedings of the European Conference on Computer Vision (ECCV)","author":"Xu Yifan","year":"2018","unstructured":"Yifan Xu, Tianqi Fan, Mingye Xu, Long Zeng, and Yu Qiao. 2018. SpiderCNN: Deep learning on point sets with parameterized convolutional filters. In Proceedings of the European Conference on Computer Vision (ECCV). 87\u2013102."},{"key":"e_1_3_1_55_2","first-page":"5589","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Yan Xu","year":"2020","unstructured":"Xu Yan, Chaoda Zheng, Zhen Li, Sheng Wang, and Shuguang Cui. 2020. PointASNL: Robust point clouds processing using nonlocal neural networks with adaptive sampling. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 5589\u20135598."},{"key":"e_1_3_1_56_2","first-page":"7505","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Yang Ze","year":"2019","unstructured":"Ze Yang and Liwei Wang. 2019. Learning relationships for multi-view 3D object recognition. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 7505\u20137514."},{"key":"e_1_3_1_57_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00027"},{"key":"e_1_3_1_58_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00644"},{"key":"e_1_3_1_59_2","volume-title":"Proceedings of the European Conference on Computer Vision (ECCV)","author":"Zeng Wei","year":"2018","unstructured":"Wei Zeng and Theo Gevers. 2018. 3DContextNet: K-d tree guided hierarchical learning of point clouds using local and global contextual cues. In Proceedings of the European Conference on Computer Vision (ECCV)."},{"key":"e_1_3_1_60_2","article-title":"GSIP: Green semantic segmentation of large-scale indoor point clouds","author":"Zhang Min","year":"2021","unstructured":"Min Zhang, Pranav Kadam, Shan Liu, and C.-C. Jay Kuo. 2021. GSIP: Green semantic segmentation of large-scale indoor point clouds. arXiv preprint arXiv:2109.11835 (2021).","journal-title":"arXiv preprint arXiv:2109.11835"},{"key":"e_1_3_1_61_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2019.2963592"},{"key":"e_1_3_1_62_2","first-page":"204","volume-title":"Proceedings of the International Conference on 3D Vision (3DV)","author":"Zhang Zhiyuan","year":"2019","unstructured":"Zhiyuan Zhang, Binh-Son Hua, David W. Rosen, and Sai-Kit Yeung. 2019. Rotation invariant convolutions for 3D point clouds deep learning. In Proceedings of the International Conference on 3D Vision (3DV). IEEE, 204\u2013213."},{"key":"e_1_3_1_63_2","first-page":"1607","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Zhang Zhiyuan","year":"2019","unstructured":"Zhiyuan Zhang, Binh-Son Hua, and Sai-Kit Yeung. 2019. ShellNet: Efficient point cloud convolutional neural networks using concentric shells statistics. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 1607\u20131616."},{"key":"e_1_3_1_64_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2022.108626"},{"key":"e_1_3_1_65_2","first-page":"10076","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Zhao Hengshuang","year":"2020","unstructured":"Hengshuang Zhao, Jiaya Jia, and Vladlen Koltun. 2020. Exploring self-attention for image recognition. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 10076\u201310085."},{"key":"e_1_3_1_66_2","article-title":"Multi point-voxel convolution (MPVConv) for deep learning on point clouds","author":"Zhou Wei","year":"2021","unstructured":"Wei Zhou, Xin Cao, Xiaodan Zhang, Xingxing Hao, Dekui Wang, and Ying He. 2021. Multi point-voxel convolution (MPVConv) for deep learning on point clouds. arXiv preprint arXiv:2107.13152 (2021).","journal-title":"arXiv preprint arXiv:2107.13152"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3538648","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3538648","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:03:03Z","timestamp":1750186983000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3538648"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,1,23]]},"references-count":65,"journal-issue":{"issue":"1s","published-print":{"date-parts":[[2023,2,28]]}},"alternative-id":["10.1145\/3538648"],"URL":"https:\/\/doi.org\/10.1145\/3538648","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"type":"print","value":"1551-6857"},{"type":"electronic","value":"1551-6865"}],"subject":[],"published":{"date-parts":[[2023,1,23]]},"assertion":[{"value":"2021-09-13","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-05-16","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-01-23","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}