{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T13:11:06Z","timestamp":1775913066133,"version":"3.50.1"},"reference-count":48,"publisher":"MDPI AG","issue":"18","license":[{"start":{"date-parts":[[2024,9,12]],"date-time":"2024-09-12T00:00:00Z","timestamp":1726099200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["42130105"],"award-info":[{"award-number":["42130105"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["L2023G016"],"award-info":[{"award-number":["L2023G016"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["2042024kf0029"],"award-info":[{"award-number":["2042024kf0029"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["2023-2-06"],"award-info":[{"award-number":["2023-2-06"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"China State Railway Group Co., Ltd.","award":["42130105"],"award-info":[{"award-number":["42130105"]}]},{"name":"China State Railway Group Co., Ltd.","award":["L2023G016"],"award-info":[{"award-number":["L2023G016"]}]},{"name":"China State Railway Group Co., Ltd.","award":["2042024kf0029"],"award-info":[{"award-number":["2042024kf0029"]}]},{"name":"China State Railway Group Co., Ltd.","award":["2023-2-06"],"award-info":[{"award-number":["2023-2-06"]}]},{"name":"Fundamental Research Funds for the Central Universities","award":["42130105"],"award-info":[{"award-number":["42130105"]}]},{"name":"Fundamental Research Funds for the Central Universities","award":["L2023G016"],"award-info":[{"award-number":["L2023G016"]}]},{"name":"Fundamental Research Funds for the Central Universities","award":["2042024kf0029"],"award-info":[{"award-number":["2042024kf0029"]}]},{"name":"Fundamental Research Funds for the Central Universities","award":["2023-2-06"],"award-info":[{"award-number":["2023-2-06"]}]},{"name":"Hubei Provincial Geographical National Condition Monitoring Center","award":["42130105"],"award-info":[{"award-number":["42130105"]}]},{"name":"Hubei Provincial Geographical National Condition Monitoring Center","award":["L2023G016"],"award-info":[{"award-number":["L2023G016"]}]},{"name":"Hubei Provincial Geographical National Condition Monitoring Center","award":["2042024kf0029"],"award-info":[{"award-number":["2042024kf0029"]}]},{"name":"Hubei Provincial Geographical National Condition Monitoring Center","award":["2023-2-06"],"award-info":[{"award-number":["2023-2-06"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Effective semantic segmentation of Airborne Laser Scanning (ALS) point clouds is a crucial field of study and influences subsequent point cloud application tasks. Transformer networks have made significant progress in 2D\/3D computer vision tasks, exhibiting superior performance. We propose a multilevel geometric feature embedding transformer network (MGFE-T), which aims to fully utilize the three-dimensional structural information carried by point clouds and enhance transformer performance in ALS point cloud semantic segmentation. In the encoding stage, compute the geometric features surrounding tee sampling points at each layer and embed them into the transformer workflow. To ensure that the receptive field of the self-attention mechanism and the geometric computation domain can maintain a consistent scale at each layer, we propose a fixed-radius dilated KNN (FR-DKNN) search method to address the limitation of traditional KNN search methods in considering domain radius. In the decoding stage, we aggregate prediction deviations at each level into a unified loss value, enabling multilevel supervision to improve the network\u2019s feature learning ability at different levels. The MGFE-T network can predict the class label of each point in an end-to-end manner. Experiments were conducted on three widely used benchmark datasets. The results indicate that the MGFE-T network achieves superior OA and mF1 scores on the LASDU and DFC2019 datasets and performs well on the ISPRS dataset with imbalanced classes.<\/jats:p>","DOI":"10.3390\/rs16183386","type":"journal-article","created":{"date-parts":[[2024,9,12]],"date-time":"2024-09-12T02:30:23Z","timestamp":1726108223000},"page":"3386","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Multilevel Geometric Feature Embedding in Transformer Network for ALS Point Cloud Semantic Segmentation"],"prefix":"10.3390","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0009-0001-3573-5125","authenticated-orcid":false,"given":"Zhuanxin","family":"Liang","sequence":"first","affiliation":[{"name":"School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430070, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4611-820X","authenticated-orcid":false,"given":"Xudong","family":"Lai","sequence":"additional","affiliation":[{"name":"School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430070, China"}]}],"member":"1968","published-online":{"date-parts":[[2024,9,12]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Su, H., Maji, S., Kalogerakis, E., and Learned-Miller, E. (2015, January 7\u201313). Multi-View Convolutional Neural Networks for 3D Shape Recognition. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.114"},{"key":"ref_2","unstructured":"Qi, C.R., Su, H., Niessner, M., Dai, A., Yan, M., and Guibas, L.J. (July, January 26). Volumetric and Multi-View CNNs for Object Classification on 3D Data. Proceedings of the 29th IEEE Conference on Computer Vision and Pattern Recognition, (CVPR), Las Vegas, NV, USA."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Maturana, D., and Scherer, S. (October, January 28). VoxNet: A 3D Convolutional Neural Network for Real-Time Object Recognition. Proceedings of the 2015 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Hamburg, Germany.","DOI":"10.1109\/IROS.2015.7353481"},{"key":"ref_4","unstructured":"Wu, Z., Song, S., Khosla, A., Yu, F., Zhang, L., Tang, X., and Xiao, J. (2015, January 7\u201312). 3D ShapeNets: A Deep Representation for Volumetric Shapes. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Charles, R.Q., Su, H., Kaichun, M., and Guibas, L.J. (2017, January 21\u201326). PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.16"},{"key":"ref_6","unstructured":"Qi, C.R., Yi, L., Su, H., and Guibas, L.J. (2018). PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space. Advances in Neural Information Processing Systems 30, Proceedings of the 31st Annual Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA, 4\u20139 December 2017, Curran Associates, Inc."},{"key":"ref_7","unstructured":"Ma, X., Qin, C., You, H., Ran, H., and Fu, Y. (2022). Rethinking Network Design and Local Geometry in Point Cloud: A Simple Residual MLP Framework. arXiv."},{"key":"ref_8","first-page":"23192","article-title":"PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies","volume":"35","author":"Qian","year":"2022","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_9","unstructured":"Li, Y., Bu, R., Sun, M., Wu, W., Di, X., and Chen, B. (2019). PointCNN: Convolution On X-Transformed Points. Advances in Neural Information Processing Systems 31, Proceedings of the 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montreal, QC, Canada, 3\u20138 December 2018, Curran Associates, Inc."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Jiang, M., Wu, Y., Zhao, T., Zhao, Z., and Lu, C. (2018). PointSIFT: A SIFT-like Network Module for 3D Point Cloud Semantic Segmentation. arXiv.","DOI":"10.1109\/IGARSS.2019.8900102"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Wu, W., Qi, Z., and Fuxin, L. (2019, January 15\u201320). PointConv: Deep Convolutional Networks on 3D Point Clouds. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00985"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Thomas, H., Deschaud, J.-E., Marcotegui, B., Goulette, F., and Guibas, L.J. (November, January 27). KPConv: Flexible and Deformable Convolution for Point Clouds. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00651"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Simonovsky, M., and Komodakis, N. (2017, January 21\u201326). Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.11"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"146","DOI":"10.1145\/3326362","article-title":"Dynamic Graph CNN for Learning on Point Clouds","volume":"38","author":"Wang","year":"2019","journal-title":"ACM Trans. Graph."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Liu, Y., Fan, B., Xiang, S., and Pan, C. (2019, January 15\u201320). Relation-Shape Convolutional Neural Network for Point Cloud Analysis. Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00910"},{"key":"ref_16","unstructured":"Wu, X., Lao, Y., Jiang, L., Liu, X., and Zhao, H. (2023). Point Transformer V2: Grouped Vector Attention and Partition-Based Pooling. Advances in Neural Information Processing Systems 35, 36th Conference on Neural Information Processing Systems (NeurIPS 2022), New Orleans, LA, USA, 28 November\u20139 December 2022, Curran Associates, Inc."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1007\/s41095-021-0229-5","article-title":"PCT: Point Cloud Transformer","volume":"7","author":"Guo","year":"2021","journal-title":"Comput. Vis. Media"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"151","DOI":"10.1016\/j.isprsjprs.2021.04.016","article-title":"Local and Global Encoder Network for Semantic Segmentation of Airborne Laser Scanning Point Clouds","volume":"176","author":"Lin","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1016\/j.isprsjprs.2018.03.018","article-title":"A Multi-Scale Fully Convolutional Network for Semantic Labeling of 3D Point Clouds","volume":"143","author":"Yousefhussien","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_21","first-page":"5704617","article-title":"A Dual Attention Neural Network for Airborne LiDAR Point Cloud Semantic Segmentation","volume":"60","author":"Zhang","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_22","first-page":"116","article-title":"A Building Extraction Method Based on IGA That Fuses Point Cloud and Image Data","volume":"43","author":"Lai","year":"2023","journal-title":"J. Infrared Millim. Waves"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1109\/JSTARS.2022.3223698","article-title":"OFFS-Net: Optimal Feature Fusion-Based Spectral Information Network for Airborne Point Cloud Classification","volume":"16","author":"He","year":"2023","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"104932","DOI":"10.1016\/j.cageo.2021.104932","article-title":"A Hierarchical Deep Neural Network with Iterative Features for Semantic Labeling of Airborne LiDAR Point Clouds","volume":"157","author":"Yang","year":"2021","journal-title":"Comput. Geosci."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"7004405","DOI":"10.1109\/LGRS.2022.3190558","article-title":"STN: Saliency-Guided Transformer Network for Point-Wise Semantic Segmentation of Urban Scenes","volume":"19","author":"Ma","year":"2022","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1016\/j.isprsjprs.2020.03.016","article-title":"A Geometry-Attentional Network for ALS Point Cloud Classification","volume":"164","author":"Li","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"5702615","DOI":"10.1109\/TGRS.2022.3158362","article-title":"Local and Global Structure for Urban ALS Point Cloud Semantic Segmentation With Ground-Aware Attention","volume":"60","author":"Jiang","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"3958","DOI":"10.1109\/JSTARS.2020.3008477","article-title":"A Point-Based Fully Convolutional Neural Network for Airborne LiDAR Ground Point Filtering in Forested Environments","volume":"13","author":"Jin","year":"2020","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.isprsjprs.2021.04.017","article-title":"GraNet: Global Relation-Aware Attentional Network for Semantic Segmentation of ALS Point Clouds","volume":"177","author":"Huang","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1016\/j.isprsjprs.2022.03.019","article-title":"Beyond Single Receptive Field: A Receptive Field Fusion-and-Stratification Network for Airborne Laser Scanning Point Cloud Classification","volume":"188","author":"Mao","year":"2022","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Zhao, H., Jiang, L., Jia, J., Torr, P., and Koltun, V. (2021). Point Transformer. arXiv.","DOI":"10.1109\/ICCV48922.2021.01595"},{"key":"ref_32","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (2017, January 4\u20139). Attention Is All You Need. Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA."},{"key":"ref_33","unstructured":"Devlin, J., Chang, M.-W., Lee, K., and Toutanova, K. (2019, January 2\u20137). BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2019), Minneapolis, MN, USA."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Dai, Z., Yang, Z., Yang, Y., Carbonell, J., Le, Q.V., and Salakhutdinov, R. (2019). Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context. arXiv.","DOI":"10.18653\/v1\/P19-1285"},{"key":"ref_35","unstructured":"Ramachandran, P., Parmar, N., Vaswani, A., Bello, I., Levskaya, A., and Shlens, J. (2020). Stand-Alone Self-Attention in Vision Models. Advances in Neural Information Processing Systems 32, Proceedings of the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019) Vancouver, BC, Canada, 8\u201314 December 2019, Curran Associates, Inc."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Zhao, H., Jia, J., and Koltun, V. (2020, January 13\u201319). Exploring Self-Attention for Image Recognition. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01009"},{"key":"ref_37","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., and Gelly, S. (2020). An Image Is Worth 16x16 Words: Transformers for Image Recognition at Scale. arXiv."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"3833","DOI":"10.1007\/s40747-021-00618-0","article-title":"Semantic Segmentation of Large-Scale Point Clouds Based on Dilated Nearest Neighbors Graph","volume":"8","author":"Wang","year":"2022","journal-title":"Complex Intell. Syst."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Ye, Z., Xu, Y., Huang, R., Tong, X., Li, X., Liu, X., Luan, K., Hoegner, L., and Stilla, U. (2020). LASDU: A Large-Scale Aerial LiDAR Dataset for Semantic Labeling in Dense Urban Areas. ISPRS Int. J. Geo-Inf., 9.","DOI":"10.3390\/ijgi9070450"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1109\/MGRS.2019.2949679","article-title":"2019 IEEE GRSS Data Fusion Contest: Large-Scale Semantic 3D Reconstruction [Technical Committees]","volume":"7","author":"Yokoya","year":"2019","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1016\/j.isprsjprs.2013.11.001","article-title":"Contextual Classification of Lidar Data and Building Object Detection in Urban Areas","volume":"87","author":"Niemeyer","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1016\/j.isprsjprs.2022.01.012","article-title":"VD-LAB: A View-Decoupled Network with Local-Global Aggregation Bridge for Airborne Laser Scanning Point Cloud Classification","volume":"186","author":"Li","year":"2022","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"5702614","DOI":"10.1109\/TGRS.2023.3285207","article-title":"Recurrent Residual Dual Attention Network for Airborne Laser Scanning Point Cloud Semantic Segmentation","volume":"61","author":"Zeng","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"5506605","DOI":"10.1109\/LGRS.2023.3294246","article-title":"Multilevel Context Feature Fusion for Semantic Segmentation of ALS Point Cloud","volume":"20","author":"Zeng","year":"2023","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Zhang, R., Chen, S., Wang, X., and Zhang, Y. (2023). IPCONV: Convolution with Multiple Different Kernels for Point Cloud Semantic Segmentation. Remote Sens., 15.","DOI":"10.3390\/rs15215136"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"1105","DOI":"10.5194\/isprs-archives-XLII-2-W13-1105-2019","article-title":"Classification of aerial laser scanning point clouds using machine learning: A comparison between random forest and tensorflow","volume":"XLII-2-W13","author":"Pirotti","year":"2019","journal-title":"Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Atik, M.E., Duran, Z., and Seker, D.Z. (2021). Machine Learning-Based Supervised Classification of Point Clouds Using Multiscale Geometric Features. ISPRS Int. J. Geo-Inf., 10.","DOI":"10.3390\/ijgi10030187"},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"5036","DOI":"10.1109\/JSTARS.2021.3077568","article-title":"A Hierarchical Approach for Point Cloud Classification With 3D Contextual Features","volume":"14","author":"Feng","year":"2021","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/18\/3386\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T15:54:17Z","timestamp":1760111657000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/18\/3386"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,12]]},"references-count":48,"journal-issue":{"issue":"18","published-online":{"date-parts":[[2024,9]]}},"alternative-id":["rs16183386"],"URL":"https:\/\/doi.org\/10.3390\/rs16183386","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,9,12]]}}}