{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,7]],"date-time":"2025-11-07T19:25:49Z","timestamp":1762543549603,"version":"3.37.3"},"reference-count":51,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2022,1,4]],"date-time":"2022-01-04T00:00:00Z","timestamp":1641254400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,1,4]],"date-time":"2022-01-04T00:00:00Z","timestamp":1641254400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100021171","name":"Guangdong Basic and Applied Basic Research Foundation","doi-asserted-by":"crossref","award":["2021A1515012637"],"award-info":[{"award-number":["2021A1515012637"]}],"id":[{"id":"10.13039\/501100021171","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["U1713213","61775175"],"award-info":[{"award-number":["U1713213","61775175"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2022,10]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Three-dimensional (3D) semantic segmentation of point clouds is important in many scenarios, such as automatic driving, robotic navigation, while edge computing is indispensable in the devices. Deep learning methods based on point sampling prove to be computation and memory efficient to tackle large-scale point clouds (e.g. millions of points). However, some local features may be abandoned while sampling. In this paper, We present one end-to-end 3D semantic segmentation framework based on dilated nearest neighbor encoding. Instead of down-sampling point cloud directly, we propose a dilated nearest neighbor encoding module to broaden the network\u2019s receptive field to learn more 3D geometric information. Without increase of network parameters, our method is computation and memory efficient for large-scale point clouds. We have evaluated the dilated nearest neighbor encoding in two different networks. The first is the random sampling with local feature aggregation. The second is the Point Transformer. We have evaluated the quality of the semantic segmentation on the benchmark 3D dataset S3DIS, and demonstrate that the proposed dilated nearest neighbor encoding exhibited stable advantages over baseline and competing methods.<\/jats:p>","DOI":"10.1007\/s40747-021-00618-0","type":"journal-article","created":{"date-parts":[[2022,1,4]],"date-time":"2022-01-04T07:03:04Z","timestamp":1641279784000},"page":"3833-3845","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":16,"title":["Semantic segmentation of large-scale point clouds based on dilated nearest neighbors graph"],"prefix":"10.1007","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5990-896X","authenticated-orcid":false,"given":"Lei","family":"Wang","sequence":"first","affiliation":[]},{"given":"Jiaji","family":"Wu","sequence":"additional","affiliation":[]},{"given":"Xunyu","family":"Liu","sequence":"additional","affiliation":[]},{"given":"Xiaoliang","family":"Ma","sequence":"additional","affiliation":[]},{"given":"Jun","family":"Cheng","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,1,4]]},"reference":[{"key":"618_CR1","doi-asserted-by":"crossref","unstructured":"Johnson-Roberson M, Bohg J, Bj\u00f6rkman M, Kragic D (2010) Attention-based active 3D point cloud segmentation. In: 2010 IEEE\/RSJ international conference on intelligent robots and systems, Taipei, Taiwan, pp 1165\u20131170","DOI":"10.1109\/IROS.2010.5649872"},{"issue":"5","key":"618_CR2","doi-asserted-by":"publisher","first-page":"1217","DOI":"10.1109\/TCYB.2015.2430526","volume":"46","author":"M Liu","year":"2016","unstructured":"Liu M (2016) Robotic online path planning on point cloud. IEEE Trans Cybern 46(5):1217\u20131228","journal-title":"IEEE Trans Cybern"},{"issue":"3","key":"618_CR3","doi-asserted-by":"publisher","first-page":"547","DOI":"10.1109\/TRO.2016.2638453","volume":"33","author":"U Asif","year":"2017","unstructured":"Asif U, Bennamoun M, Sohel FA (2017) RGB-D object recognition and grasp detection using hierarchical cascaded forests. IEEE Trans Rob 33(3):547\u2013564","journal-title":"IEEE Trans Rob"},{"issue":"2","key":"618_CR4","doi-asserted-by":"publisher","first-page":"1240","DOI":"10.1109\/LRA.2019.2894915","volume":"4","author":"J Chen","year":"2019","unstructured":"Chen J, Cho YK, Kira Z (2019) Multi-view incremental segmentation of 3-D point clouds for mobile robots. InIEEE Robot Autom Lett 4(2):1240\u20131246","journal-title":"InIEEE Robot Autom Lett"},{"key":"618_CR5","doi-asserted-by":"crossref","unstructured":"Geiger A, Lenz P, Urtasun R (2012) Are we ready for autonomous driving? The KITTI vision benchmark suite. In: 2012 IEEE conference on computer vision and pattern recognition, Providence, RI, USA, pp 3354\u20133361","DOI":"10.1109\/CVPR.2012.6248074"},{"key":"618_CR6","doi-asserted-by":"crossref","unstructured":"Chen X, Ma H, Wan J, Li B, Xia T (2017) Multi-view 3D object detection network for autonomous driving. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 6526\u20136534","DOI":"10.1109\/CVPR.2017.691"},{"key":"618_CR7","doi-asserted-by":"crossref","unstructured":"Yang B, Luo W, Urtasun R (2018) PIXOR: real-time 3D object detection from point clouds. In: 2018 IEEE\/CVF conference on computer vision and pattern recognition, Salt Lake City, UT, USA, pp 7652\u20137660","DOI":"10.1109\/CVPR.2018.00798"},{"key":"618_CR8","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.scs.2019.102002","volume":"54","author":"L Wang","year":"2020","unstructured":"Wang L, Fan X, Chen J, Cheng J, Tan J, Ma X (2020) 3D object detection based on sparse convolution neural network and feature fusion for autonomous driving in smart cities. Sustain Cities Soc 54:1\u201310","journal-title":"Sustain Cities Soc"},{"issue":"4","key":"618_CR9","doi-asserted-by":"publisher","first-page":"3434","DOI":"10.1109\/LRA.2018.2852843","volume":"3","author":"Y Zeng","year":"2018","unstructured":"Zeng Y et al (2018) RT3D: real-time 3-D vehicle detection in LiDAR point cloud for autonomous driving. IEEE Robot Autom Lett 3(4):3434\u20133440","journal-title":"IEEE Robot Autom Lett"},{"issue":"3","key":"618_CR10","doi-asserted-by":"publisher","first-page":"2902","DOI":"10.1109\/LRA.2019.2922582","volume":"4","author":"BH Wang","year":"2019","unstructured":"Wang BH, Chao W, Wang Y, Hariharan B, Weinberger KQ, Campbell M (2019) LDLS: 3-D object segmentation through label diffusion from 2-D images. IEEE Robot Autom Lett 4(3):2902\u20132909","journal-title":"IEEE Robot Autom Lett"},{"key":"618_CR11","doi-asserted-by":"publisher","first-page":"963","DOI":"10.1007\/s00371-011-0610-y","volume":"27","author":"I Sipiran","year":"2011","unstructured":"Sipiran I, Bustos B (2011) Harris 3D: a robust extension of the Harris operator for interest point detection on 3D meshes. Vis Comput 27:963\u2013976","journal-title":"Vis Comput"},{"key":"618_CR12","doi-asserted-by":"crossref","unstructured":"Zhong Y (2009) Intrinsic shape signatures: a shape descriptor for 3D object recognition. In: 2009 IEEE 12th international conference on computer vision workshops, ICCV Workshops, Kyoto, Japan, pp 689\u2013696","DOI":"10.1109\/ICCVW.2009.5457637"},{"key":"618_CR13","doi-asserted-by":"crossref","unstructured":"Rusu RB, Blodow N, Beetz M (2009) Fast point feature histograms (FPFH) for 3D registration. In: 2009 IEEE international conference on robotics and automation, Kobe, Japan, pp 3212\u20133217","DOI":"10.1109\/ROBOT.2009.5152473"},{"key":"618_CR14","doi-asserted-by":"crossref","unstructured":"Rusu RB, Bradski G, Thibaux R, Hsu J (2010) Fast 3D recognition and pose using the viewpoint feature histogram. In: 2010 IEEE\/RSJ international conference on intelligent robots and systems, Taipei, Taiwan, pp 2155\u20132162","DOI":"10.1109\/IROS.2010.5651280"},{"issue":"3","key":"618_CR15","doi-asserted-by":"publisher","first-page":"177","DOI":"10.5194\/isprs-annals-III-3-177-2016","volume":"3","author":"T Hackel","year":"2016","unstructured":"Hackel T, Wegner JD, Schindler K (2016) Fast semantic segmentation of 3D point clouds with strongly varying density. ISPRS Ann Photogramm Remote Sens Spatial Inf Sci 3(3):177\u2013184","journal-title":"ISPRS Ann Photogramm Remote Sens Spatial Inf Sci"},{"key":"618_CR16","doi-asserted-by":"publisher","first-page":"383","DOI":"10.1016\/j.ins.2015.03.031","volume":"320","author":"D Tao","year":"2015","unstructured":"Tao D, Cheng J, Lin X, Yu J (2015) Local structure preserving discriminative projections for RGB-D sensor-based scene classification. Inf Sci 320:383\u2013394","journal-title":"Inf Sci"},{"issue":"2","key":"618_CR17","doi-asserted-by":"publisher","first-page":"865","DOI":"10.1109\/LRA.2018.2792681","volume":"3","author":"D Bobkov","year":"2018","unstructured":"Bobkov D, Chen S, Jian R, Iqbal MZ, Steinbach E (2018) Noise-resistant deep learning for object classification in three-dimensional point clouds using a point pair descriptor. IEEE Robot Autom Lett 3(2):865\u2013872","journal-title":"IEEE Robot Autom Lett"},{"key":"618_CR18","doi-asserted-by":"crossref","unstructured":"Maturana D, Scherer S (2015) VoxNet: a 3D convolutional neural network for real-time object recognition. In: 2015 IEEE\/RSJ international conference on intelligent robots and systems (IROS), Hamburg, Germany, pp 922\u2013928","DOI":"10.1109\/IROS.2015.7353481"},{"key":"618_CR19","doi-asserted-by":"crossref","unstructured":"Zhou Y, Tuzel O (2018) VoxelNet: end-to-end learning for point cloud based 3D object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4490\u20134499","DOI":"10.1109\/CVPR.2018.00472"},{"key":"618_CR20","unstructured":"Qi CR, Su H, Mo K, Guibas LJ (2017) PointNet: deep learning on point sets for 3D classification and segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 652\u2013660"},{"key":"618_CR21","unstructured":"Qi CR, Yi L, Su H, Guibas LJ (2017) PointNet++: deep hierarchical feature learning on point sets in a metric space. In: Advances in neural information processing systems, pp 5099\u20135108"},{"key":"618_CR22","doi-asserted-by":"crossref","unstructured":"Hu Q, Yang B, Xie L, Rosa S, Guo Y, Wang Z, Trigoni N, Markham A (2020) RandLA-Net: efficient semantic segmentation of large-scale point clouds. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (CVPR), pp. 11108\u201311117","DOI":"10.1109\/CVPR42600.2020.01112"},{"key":"618_CR23","unstructured":"Hengshuang Z, Li J, Jiaya J, Philip T, Vladlen K (2020) Point transformer. arXiv:2012.09164 [cs.CV]"},{"key":"618_CR24","unstructured":"Rusu RB, Marton ZC, Blodow N, Beetz M (2008) Persistent point feature histograms for 3D point clouds. In: Proceedings of the International Conference on Intelligent Autonomous Systems, pp. 119\u2013128"},{"key":"618_CR25","doi-asserted-by":"crossref","unstructured":"Behley J, Steinhage V, Cremers AB (2012) Performance of histogram descriptors for the classification of 3D laser range data in urban environments. In: 2012 IEEE international conference on robotics and automation, pp 4391\u20134398","DOI":"10.1109\/ICRA.2012.6225003"},{"key":"618_CR26","unstructured":"Jutzi B, Gross H (2009) Nearest neighbour classification on laser point clouds to gain object structures from buildings. Int Arch Photogramm Remote Sens Spat Inf Sci XXXVIII-1-4-7\/W5"},{"key":"618_CR27","doi-asserted-by":"crossref","unstructured":"Fehr D, Cherian A, Sivalingam R, Nickolay S, Morellas V, Papanikolopoulos N (2012) Compact covariance descriptors in 3D point clouds for object recognition. In: 2012 IEEE international conference on robotics and automation, pp 1793\u20131798","DOI":"10.1109\/ICRA.2012.6224740"},{"key":"618_CR28","doi-asserted-by":"publisher","first-page":"47","DOI":"10.1016\/j.cag.2015.01.006","volume":"49","author":"M Weinmann","year":"2015","unstructured":"Weinmann M, Urban S, Hinz S, Jutzi B, Mallet C (2015) Distinctive 2D and 3D features for automated large-scale scene analysis in urban areas. Comput Graph 49:47\u201357","journal-title":"Comput Graph"},{"key":"618_CR29","unstructured":"Li B, Zhang T, Xia T (2016) Vehicle detection from 3D lidar using fully convolutional network. In: Robotics science and systems"},{"key":"618_CR30","doi-asserted-by":"crossref","unstructured":"Su H, Maji S, Kalogerakis E, Learned-Miller E (2015) Multi-view convolutional neural networks for 3D shape recognition. In: IEEE international conference on computer vision (ICCV), pp 945\u2013953","DOI":"10.1109\/ICCV.2015.114"},{"key":"618_CR31","doi-asserted-by":"crossref","unstructured":"Lang AH, Vora S, Caesar H, Zhou L, Yang J, Beijbom O (2019) PointPillars: fast encoders for object detection from point clouds. In: IEEE\/CVF conference on computer vision and pattern recognition (CVPR), pp 12689\u201312697","DOI":"10.1109\/CVPR.2019.01298"},{"key":"618_CR32","doi-asserted-by":"crossref","unstructured":"Graham B, Engelcke M, Maaten LVD (2018) 3D semantic segmentation with submanifold sparse convolutional networks. In: 2018 IEEE\/CVF conference on computer vision and pattern recognition, Salt Lake City, UT, pp 9224\u20139232","DOI":"10.1109\/CVPR.2018.00961"},{"key":"618_CR33","doi-asserted-by":"crossref","unstructured":"Meng H, Gao L, Lai Y, Manocha D (2019) VV-Net: voxel VAE net with group convolutions for point cloud segmentation. In: 2019 IEEE\/CVF international conference on computer vision (ICCV), Seoul, Korea (South), pp 8499\u20138507","DOI":"10.1109\/ICCV.2019.00859"},{"key":"618_CR34","doi-asserted-by":"crossref","unstructured":"Huang Q, Wang W, Neumann U (2018) Recurrent slice networks for 3D segmentation of point clouds. In: 2018 IEEE\/CVF conference on computer vision and pattern recognition, pp 2626\u20132635","DOI":"10.1109\/CVPR.2018.00278"},{"key":"618_CR35","doi-asserted-by":"crossref","unstructured":"Zhao H, Jiang L, Fu CW, Jia J (2019) Pointweb: enhancing local neighborhood features for point cloud processing. In: 2019 IEEE\/CVF conference on computer vision and pattern recognition (CVPR), pp 5560\u20135568","DOI":"10.1109\/CVPR.2019.00571"},{"key":"618_CR36","doi-asserted-by":"crossref","unstructured":"Wang L, Huang Y, Hou Y, Zhang S, Shan J (2019) Graph attention convolution for point cloud semantic segmentation. In: 2019 IEEE\/CVF conference on computer vision and pattern recognition (CVPR), pp 10288\u201310297","DOI":"10.1109\/CVPR.2019.01054"},{"key":"618_CR37","doi-asserted-by":"crossref","unstructured":"Chen C, Li G, Xu R, Chen T, Wang M, Lin L (2019) Clusternet: deep hierarchical cluster network with rigorously rotation-invariant representation for point cloud analysis. In: 2019 IEEE\/CVF conference on computer vision and pattern recognition (CVPR), pp 4989\u20134997","DOI":"10.1109\/CVPR.2019.00513"},{"key":"618_CR38","doi-asserted-by":"crossref","unstructured":"Jiang L, Zhao H, Liu S, Shen X, Fu CW, Jia J (2019) Hierarchical point-edge interaction network for point cloud semantic segmentation. In: IEEE\/CVF international conference on computer vision (ICCV), Seoul, Korea (South), pp 10432\u201310440","DOI":"10.1109\/ICCV.2019.01053"},{"key":"618_CR39","doi-asserted-by":"crossref","unstructured":"Wu W, Qi Z, Fuxin L (2019) Pointconv: deep convolutional networks on 3D point clouds. In: 2019 IEEE\/CVF conference on computer vision and pattern recognition (CVPR), pp 9613\u20139622","DOI":"10.1109\/CVPR.2019.00985"},{"key":"618_CR40","doi-asserted-by":"crossref","unstructured":"Qi CR, Liu W, Wu C, Su H, Guibas LJ (2018) Frustum pointnets for 3D object detection from RGB-D data, In: 2018 IEEE\/CVF conference on computer vision and pattern recognition (CVPR), pp 918\u2013927","DOI":"10.1109\/CVPR.2018.00102"},{"key":"618_CR41","unstructured":"Li Y, Bu R, Sun M, Wu W, Di X, Chen B (2018) PointCNN: convolution on x-transformed points. In: Advances in neural information processing systems, pp 820\u2013830"},{"key":"618_CR42","doi-asserted-by":"crossref","unstructured":"Shi S, Wang X, Li H (2019) Pointrcnn: 3D object proposal generation and detection from point cloud. In: 2019 IEEE\/CVF conference on computer vision and pattern recognition (CVPR), pp 770\u2013779","DOI":"10.1109\/CVPR.2019.00086"},{"key":"618_CR43","doi-asserted-by":"crossref","unstructured":"Landrieu L, Simonovsky M (2018) Large-scale point cloud semantic segmentation with superpoint graphs. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4558\u20134567","DOI":"10.1109\/CVPR.2018.00479"},{"key":"618_CR44","doi-asserted-by":"crossref","unstructured":"Engelmann F, Kontogianni T, Schult J, Leibe B (2018) Know what your neighbors do: 3D semantic segmentation of point clouds. European conference on computer vision (ECCV), workshops. pp 395\u2013409","DOI":"10.1007\/978-3-030-11015-4_29"},{"key":"618_CR45","doi-asserted-by":"crossref","unstructured":"Wang X, Liu S, Shen X, Shen C, Jia J (2019) Associatively segmenting instances and semantics in point clouds. In: 2019 IEEE\/CVF conference on computer vision and pattern recognition (CVPR), Long Beach, CA, USA, pp 4091\u20134100","DOI":"10.1109\/CVPR.2019.00422"},{"key":"618_CR46","doi-asserted-by":"crossref","unstructured":"Zhao L, Tao W (2020) JSNet: joint instance and semantic segmentation of 3D point clouds. In: The thirty-fourth AAAI conference on artificial intelligence, pp 12951\u201312958","DOI":"10.1609\/aaai.v34i07.6994"},{"issue":"12","key":"618_CR47","doi-asserted-by":"publisher","first-page":"3181","DOI":"10.1109\/TFUZZ.2020.2992611","volume":"28","author":"M Zhong","year":"2020","unstructured":"Zhong M, Li C, Liu L, Wen J, Ma J, Yu X (2020) Fuzzy neighborhood learning for deep 3-D segmentation of point cloud. IEEE Trans Fuzzy Syst 28(12):3181\u20133192","journal-title":"IEEE Trans Fuzzy Syst"},{"key":"618_CR48","doi-asserted-by":"crossref","unstructured":"Lei H, Akhtar N, Mian A (2020) SegGCN: efficient 3D point cloud segmentation with fuzzy spherical kernel. In: 2020 IEEE\/CVF conference on computer vision and pattern recognition (CVPR), Seattle, WA, USA, pp 11608\u201311617","DOI":"10.1109\/CVPR42600.2020.01163"},{"key":"618_CR49","doi-asserted-by":"crossref","unstructured":"Armeni I, Sener O, Zamir AR, Jiang H, Brilakis I, Fischer M, Savarese S (2016) 3D semantic parsing of large-scale indoor spaces. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR), Las Vegas, NV, pp 1534\u20131543","DOI":"10.1109\/CVPR.2016.170"},{"key":"618_CR50","doi-asserted-by":"publisher","first-page":"533","DOI":"10.1016\/j.neunet.2018.09.001","volume":"108","author":"AV Phan","year":"2018","unstructured":"Phan AV, Nguyen ML, Nguyen YLH, Bui LT (2018) DGCNN: a convolutional neural network over large-scale labeled graphs. Neural Netw 108:533\u2013543","journal-title":"Neural Netw"},{"key":"618_CR51","doi-asserted-by":"crossref","unstructured":"Ye X, Li J, Huang H, Du L, Zhang X (2018) 3D recurrent neural networks with context fusion for point cloud semantic segmentation. In: Proceedings of the European conference on computer vision (ECCV), pp 403\u2013417","DOI":"10.1007\/978-3-030-01234-2_25"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-021-00618-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-021-00618-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-021-00618-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,9,27]],"date-time":"2022-09-27T13:46:16Z","timestamp":1664286376000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-021-00618-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,1,4]]},"references-count":51,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,10]]}},"alternative-id":["618"],"URL":"https:\/\/doi.org\/10.1007\/s40747-021-00618-0","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"type":"print","value":"2199-4536"},{"type":"electronic","value":"2198-6053"}],"subject":[],"published":{"date-parts":[[2022,1,4]]},"assertion":[{"value":"21 August 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 December 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 January 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"All authors declare that there are no conflicts of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}