{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T02:35:00Z","timestamp":1760150100848,"version":"build-2065373602"},"reference-count":28,"publisher":"MDPI AG","issue":"20","license":[{"start":{"date-parts":[[2023,10,12]],"date-time":"2023-10-12T00:00:00Z","timestamp":1697068800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"The Special Funds for Creative Research","award":["2022C61540"],"award-info":[{"award-number":["2022C61540"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Aiming to solve the problem that spatially distributed similar instances cannot be distinguished in 3D point cloud instance segmentation, a 3D point cloud instance segmentation network, considering the global shape contour, was proposed. This research used the global-to-local design idea and added the global shape constraint to solve this problem. A Transformer module (Global Shape Attention, GSA) that can capture the shape contour information of the instance in the scene was designed. This module encoded the shape contour information into the Transformer structure as a Key-Value and extracted the instance fused with the global shape contour features, for instance, segmentation. At the same time, the network directly predicted the instance mask in an end-to-end mode, avoiding heavy post-processing algorithms. Many experiments have been conducted on the S3DIS, ScanNet, and STPL3D datasets, and our experimental results showed that our proposed network can efficiently capture the shape contour information of scene instances and can help to alleviate the problem of the difficulty distinguishing between spatially distributed similar instances in a scene, improving the efficiency and stability of instance segmentation.<\/jats:p>","DOI":"10.3390\/rs15204939","type":"journal-article","created":{"date-parts":[[2023,10,12]],"date-time":"2023-10-12T12:46:13Z","timestamp":1697114773000},"page":"4939","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["3D Point Cloud Instance Segmentation Considering Global Shape Contour Constraints"],"prefix":"10.3390","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7231-1954","authenticated-orcid":false,"given":"Jiabin","family":"Xv","sequence":"first","affiliation":[{"name":"School of Geodesy and Geomatics, Wuhan University, Wuhan 430072, China"}]},{"given":"Fei","family":"Deng","sequence":"additional","affiliation":[{"name":"School of Geodesy and Geomatics, Wuhan University, Wuhan 430072, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,10,12]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Yi, L., Zhao, W., Wang, H., Sung, M., and Guibas, L.J. (2019, January 15\u201320). GSPN: Generative Shape Proposal Network for 3D Instance Segmentation in Point Cloud. Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00407"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Hou, J., Dai, A., and NieBner, M. (2019, January 15\u201320). 3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans. Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00455"},{"key":"ref_3","unstructured":"Yang, B., Wang, J., Clark, R., Hu, Q., Wang, S., Markham, A., and Trigoni, N. (2019). Advances in Neural Information Processing Systems, Curran Associates, Inc."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Wang, W., Yu, R., Huang, Q., and Neumann, U. (2018, January 18\u201322). SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance Segmentation. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00272"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Wang, X., Liu, S., Shen, X., Shen, C., and Jia, J. (2019). Associatively Segmenting Instances and Semantics in Point Clouds. arXiv.","DOI":"10.1109\/CVPR.2019.00422"},{"key":"ref_6","first-page":"48","article-title":"3D-BEVIS: Bird\u2019s-Eye-View Instance Segmentation","volume":"11824","author":"Elich","year":"2019","journal-title":"Pattern Recognit."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., and Zagoruyko, S. (2020). End-to-End Object Detection with Transformers. arXiv.","DOI":"10.1007\/978-3-030-58452-8_13"},{"key":"ref_8","unstructured":"Zhu, X., Su, W., Lu, L., Li, B., Wang, X., and Dai, J. (2021). Deformable DETR: Deformable Transformers for End-to-End Object Detection. arXiv."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Schult, J., Engelmann, F., Hermans, A., Litany, O., Tang, S., and Leibe, B. (2022). Mask3D for 3D Semantic Instance Segmentation. arXiv.","DOI":"10.1109\/ICRA48891.2023.10160590"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1002\/nav.3800020109","article-title":"The Hungarian Method for the Assignment Problem","volume":"2","author":"Kuhn","year":"1955","journal-title":"Nav. Res. Logist. NRL"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"253","DOI":"10.1002\/nav.3800030404","article-title":"Variants of the Hungarian Method for Assignment Problems","volume":"3","author":"Kuhn","year":"1956","journal-title":"Nav. Res. Logist."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Armeni, I., Sener, O., Zamir, A.R., Jiang, H., Brilakis, I., Fischer, M., and Savarese, S. (2016, January 27\u201330). 3D Semantic Parsing of Large-Scale Indoor Spaces. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.170"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Dai, A., Chang, A.X., Savva, M., Halber, M., Funkhouser, T., and Nie\u00dfner, M. (2017, January 21\u201326). ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.261"},{"key":"ref_14","unstructured":"Chen, M., Hu, Q., Hugues, T., Feng, A., Hou, Y., McCullough, K., and Soibelman, L. (2022). STPLS3D: A Large-Scale Synthetic and Real Aerial Photogrammetry 3D Point Cloud Dataset. arXiv."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Ding, Z., Han, X., and Niethammer, M. (2019). VoteNet: A Deep Learning Label Fusion Method for Multi-Atlas Segmentation. arXiv.","DOI":"10.1007\/978-3-030-32248-9_23"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Vu, T., Kim, K., Luu, T.M., Nguyen, X.T., and Yoo, C.D. (2022). SoftGroup for 3D Instance Segmentation on Point Clouds. arXiv.","DOI":"10.1109\/CVPR52688.2022.00273"},{"key":"ref_17","unstructured":"Qi, C.R., Su, H., Mo, K., and Guibas, L.J. (2017). PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. arXiv."},{"key":"ref_18","unstructured":"Qi, C.R., Yi, L., Su, H., and Guibas, L.J. (2017). PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space. arXiv."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Xv, J., Deng, F., and Liu, H. (2022). Point Cloud Convolution Network Based on Spatial Location Correspondence. ISPRS Int. J. Geo-Inf., 11.","DOI":"10.3390\/ijgi11120591"},{"key":"ref_20","unstructured":"Loshchilov, I., and Hutter, F. (2019). Decoupled Weight Decay Regularization. arXiv."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Smith, L.N., and Topin, N. (2018). Super-Convergence: Very Fast Training of Neural Networks Using Large Learning Rates. arXiv.","DOI":"10.1117\/12.2520589"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Jiang, L., Zhao, H., Shi, S., Liu, S., Fu, C.-W., and Jia, J. (2020, January 13\u201319). PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00492"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Zhong, M., Chen, X., Chen, X., Zeng, G., and Wang, Y. (2022). MaskGroup: Hierarchical Point Grouping and Masking for 3D Instance Segmentation. arXiv.","DOI":"10.1109\/ICME52920.2022.9859996"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Liang, Z., Li, Z., Xu, S., Tan, M., and Jia, K. (2021, January 11\u201317). Instance Segmentation in 3D Scenes Using Semantic Superpoint Tree Networks. Proceedings of the 2021 IEEE\/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00278"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Engelmann, F., Bokeloh, M., Fathi, A., Leibe, B., and NieBner, M. (2020, January 13\u201319). 3D-MPA: Multi-Proposal Aggregation for 3D Semantic Instance Segmentation. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00905"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"He, T., Shen, C., and van den Hengel, A. (2021, January 20\u201325). DyCo3D: Robust Instance Segmentation of 3D Point Clouds through Dynamic Convolution. Proceedings of the 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00042"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Han, L., Zheng, T., Xu, L., and Fang, L. (2020, January 13\u201319). OccuSeg: Occupancy-Aware 3D Instance Segmentation. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00301"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Chen, S., Fang, J., Zhang, Q., Liu, W., and Wang, X. (2021, January 11\u201317). Hierarchical Aggregation for 3D Instance Segmentation. Proceedings of the 2021 IEEE\/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.01518"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/20\/4939\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T21:05:54Z","timestamp":1760130354000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/20\/4939"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,12]]},"references-count":28,"journal-issue":{"issue":"20","published-online":{"date-parts":[[2023,10]]}},"alternative-id":["rs15204939"],"URL":"https:\/\/doi.org\/10.3390\/rs15204939","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2023,10,12]]}}}