{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:08:27Z","timestamp":1750219707134,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":33,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,12,6]],"date-time":"2023-12-06T00:00:00Z","timestamp":1701820800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,12,6]]},"DOI":"10.1145\/3595916.3626449","type":"proceedings-article","created":{"date-parts":[[2024,1,1]],"date-time":"2024-01-01T16:34:41Z","timestamp":1704126881000},"page":"1-7","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Multi-Scale Superpoint Network for 3D Point Cloud Semantic Segmentation"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0007-0183-935X","authenticated-orcid":false,"given":"Ft","family":"Zheng","sequence":"first","affiliation":[{"name":"Nanjing University of Science and Technology, CN"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0851-6805","authenticated-orcid":false,"given":"Le","family":"Hui","sequence":"additional","affiliation":[{"name":"Northwestern Polytechnical University, CN"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9157-838X","authenticated-orcid":false,"given":"Jin","family":"Xie","sequence":"additional","affiliation":[{"name":"Nanjing University of Science and Technology, CN"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4039-7618","authenticated-orcid":false,"given":"Haofeng","family":"Zhang","sequence":"additional","affiliation":[{"name":"Nanjing University of Science and Technology, CN"}]}],"member":"320","published-online":{"date-parts":[[2024,1]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.170"},{"key":"e_1_3_2_1_2_1","volume-title":"Token merging: Your vit but faster. arXiv preprint arXiv:2210.09461","author":"Bolya Daniel","year":"2022","unstructured":"Daniel Bolya , Cheng-Yang Fu , Xiaoliang Dai , Peizhao Zhang , Christoph Feichtenhofer , and Judy Hoffman . 2022. Token merging: Your vit but faster. arXiv preprint arXiv:2210.09461 ( 2022 ). Daniel Bolya, Cheng-Yang Fu, Xiaoliang Dai, Peizhao Zhang, Christoph Feichtenhofer, and Judy Hoffman. 2022. Token merging: Your vit but faster. arXiv preprint arXiv:2210.09461 (2022)."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.691"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00319"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.261"},{"key":"e_1_3_2_1_6_1","volume-title":"An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929","author":"Dosovitskiy Alexey","year":"2020","unstructured":"Alexey Dosovitskiy , Lucas Beyer , Alexander Kolesnikov , Dirk Weissenborn , Xiaohua Zhai , Thomas Unterthiner , Mostafa Dehghani , Matthias Minderer , Georg Heigold , Sylvain Gelly , 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 ( 2020 ). Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00961"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/s41095-021-0229-5"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01112"},{"key":"e_1_3_2_1_10_1","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision. 5510\u20135519","author":"Hui Le","year":"2021","unstructured":"Le Hui , Jia Yuan , Mingmei Cheng , Jin Xie , Xiaoya Zhang , and Jian Yang . 2021 . Superpoint network for point cloud oversegmentation . In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 5510\u20135519 . Le Hui, Jia Yuan, Mingmei Cheng, Jin Xie, Xiaoya Zhang, and Jian Yang. 2021. Superpoint network for point cloud oversegmentation. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 5510\u20135519."},{"key":"e_1_3_2_1_11_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8500\u20138509","author":"Lai Xin","year":"2022","unstructured":"Xin Lai , Jianhui Liu , Li Jiang , Liwei Wang , Hengshuang Zhao , Shu Liu , Xiaojuan Qi , and Jiaya Jia . 2022 . Stratified transformer for 3D point cloud segmentation . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8500\u20138509 . Xin Lai, Jianhui Liu, Li Jiang, Liwei Wang, Hengshuang Zhao, Shu Liu, Xiaojuan Qi, and Jiaya Jia. 2022. Stratified transformer for 3D point cloud segmentation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8500\u20138509."},{"key":"e_1_3_2_1_12_1","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition. 4558\u20134567","author":"Landrieu Loic","year":"2018","unstructured":"Loic Landrieu and Martin Simonovsky . 2018 . Large-scale point cloud semantic segmentation with superpoint graphs . In Proceedings of the IEEE conference on computer vision and pattern recognition. 4558\u20134567 . Loic Landrieu and Martin Simonovsky. 2018. Large-scale point cloud semantic segmentation with superpoint graphs. In Proceedings of the IEEE conference on computer vision and pattern recognition. 4558\u20134567."},{"key":"e_1_3_2_1_13_1","volume-title":"Vehicle detection from 3d lidar using fully convolutional network. arXiv preprint arXiv:1608.07916","author":"Li Bo","year":"2016","unstructured":"Bo Li , Tianlei Zhang , and Tian Xia . 2016. Vehicle detection from 3d lidar using fully convolutional network. arXiv preprint arXiv:1608.07916 ( 2016 ). Bo Li, Tianlei Zhang, and Tian Xia. 2016. Vehicle detection from 3d lidar using fully convolutional network. arXiv preprint arXiv:1608.07916 (2016)."},{"key":"e_1_3_2_1_14_1","volume-title":"Pointcnn: Convolution on x-transformed points. Advances in neural information processing systems 31","author":"Li Yangyan","year":"2018","unstructured":"Yangyan Li , Rui Bu , Mingchao Sun , Wei Wu , Xinhan Di , and Baoquan Chen . 2018 . Pointcnn: Convolution on x-transformed points. Advances in neural information processing systems 31 (2018). Yangyan Li, Rui Bu, Mingchao Sun, Wei Wu, Xinhan Di, and Baoquan Chen. 2018. Pointcnn: Convolution on x-transformed points. Advances in neural information processing systems 31 (2018)."},{"key":"e_1_3_2_1_15_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 17682\u201317691","author":"Lin Haojia","year":"2023","unstructured":"Haojia Lin , Xiawu Zheng , Lijiang Li , Fei Chao , Shanshan Wang , Yan Wang , Yonghong Tian , and Rongrong Ji . 2023 . Meta Architecture for Point Cloud Analysis . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 17682\u201317691 . Haojia Lin, Xiawu Zheng, Lijiang Li, Fei Chao, Shanshan Wang, Yan Wang, Yonghong Tian, and Rongrong Ji. 2023. Meta Architecture for Point Cloud Analysis. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 17682\u201317691."},{"key":"e_1_3_2_1_16_1","volume-title":"Toward better boundary preserved supervoxel segmentation for 3D point clouds. ISPRS journal of photogrammetry and remote sensing 143","author":"Lin Yangbin","year":"2018","unstructured":"Yangbin Lin , Cheng Wang , Dawei Zhai , Wei Li , and Jonathan Li. 2018. Toward better boundary preserved supervoxel segmentation for 3D point clouds. ISPRS journal of photogrammetry and remote sensing 143 ( 2018 ), 39\u201347. Yangbin Lin, Cheng Wang, Dawei Zhai, Wei Li, and Jonathan Li. 2018. Toward better boundary preserved supervoxel segmentation for 3D point clouds. ISPRS journal of photogrammetry and remote sensing 143 (2018), 39\u201347."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2015.7353481"},{"key":"e_1_3_2_1_18_1","volume-title":"European conference on computer vision. Springer, 604\u2013621","author":"Pang Yatian","year":"2022","unstructured":"Yatian Pang , Wenxiao Wang , Francis\u00a0 EH Tay , Wei Liu , Yonghong Tian , and Li Yuan . 2022 . Masked autoencoders for point cloud self-supervised learning . In European conference on computer vision. Springer, 604\u2013621 . Yatian Pang, Wenxiao Wang, Francis\u00a0EH Tay, Wei Liu, Yonghong Tian, and Li Yuan. 2022. Masked autoencoders for point cloud self-supervised learning. In European conference on computer vision. Springer, 604\u2013621."},{"key":"e_1_3_2_1_19_1","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition. 2027\u20132034","author":"Papon Jeremie","year":"2013","unstructured":"Jeremie Papon , Alexey Abramov , Markus Schoeler , and Florentin Worgotter . 2013 . Voxel cloud connectivity segmentation-supervoxels for point clouds . In Proceedings of the IEEE conference on computer vision and pattern recognition. 2027\u20132034 . Jeremie Papon, Alexey Abramov, Markus Schoeler, and Florentin Worgotter. 2013. Voxel cloud connectivity segmentation-supervoxels for point clouds. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2027\u20132034."},{"key":"e_1_3_2_1_20_1","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition. 652\u2013660","author":"Qi R","year":"2017","unstructured":"Charles\u00a0 R Qi , Hao Su , Kaichun Mo , and Leonidas\u00a0 J Guibas . 2017 . PointNet: Deep learning on point sets for 3D classification and segmentation . In Proceedings of the IEEE conference on computer vision and pattern recognition. 652\u2013660 . Charles\u00a0R Qi, Hao Su, Kaichun Mo, and Leonidas\u00a0J Guibas. 2017. PointNet: Deep learning on point sets for 3D classification and segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition. 652\u2013660."},{"key":"e_1_3_2_1_21_1","volume-title":"Deep hierarchical feature learning on point sets in a metric space. Advances in neural information processing systems 30","author":"Qi Charles\u00a0Ruizhongtai","year":"2017","unstructured":"Charles\u00a0Ruizhongtai Qi , Li Yi , Hao Su , and Leonidas\u00a0 J Guibas . 2017. PointNet++ : Deep hierarchical feature learning on point sets in a metric space. Advances in neural information processing systems 30 ( 2017 ). Charles\u00a0Ruizhongtai Qi, Li Yi, Hao Su, and Leonidas\u00a0J Guibas. 2017. PointNet++: Deep hierarchical feature learning on point sets in a metric space. Advances in neural information processing systems 30 (2017)."},{"key":"e_1_3_2_1_22_1","first-page":"23192","article-title":"Pointnext: Revisiting pointnet++ with improved training and scaling strategies","volume":"35","author":"Qian Guocheng","year":"2022","unstructured":"Guocheng Qian , Yuchen Li , Houwen Peng , Jinjie Mai , Hasan Hammoud , Mohamed Elhoseiny , and Bernard Ghanem . 2022 . Pointnext: Revisiting pointnet++ with improved training and scaling strategies . Advances in Neural Information Processing Systems 35 (2022), 23192 \u2013 23204 . Guocheng Qian, Yuchen Li, Houwen Peng, Jinjie Mai, Hasan Hammoud, Mohamed Elhoseiny, and Bernard Ghanem. 2022. Pointnext: Revisiting pointnet++ with improved training and scaling strategies. Advances in Neural Information Processing Systems 35 (2022), 23192\u201323204.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_23_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 18942\u201318952","author":"Ran Haoxi","year":"2022","unstructured":"Haoxi Ran , Jun Liu , and Chengjie Wang . 2022 . Surface representation for point clouds . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 18942\u201318952 . Haoxi Ran, Jun Liu, and Chengjie Wang. 2022. Surface representation for point clouds. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 18942\u201318952."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.28"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.114"},{"key":"e_1_3_2_1_26_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8489\u20138499","author":"Tang Liyao","year":"2022","unstructured":"Liyao Tang , Yibing Zhan , Zhe Chen , Baosheng Yu , and Dacheng Tao . 2022 . Contrastive boundary learning for point cloud segmentation . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8489\u20138499 . Liyao Tang, Yibing Zhan, Zhe Chen, Baosheng Yu, and Dacheng Tao. 2022. Contrastive boundary learning for point cloud segmentation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8489\u20138499."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00651"},{"key":"e_1_3_2_1_28_1","volume-title":"Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 10296\u201310305","author":"Wang Lei","year":"2019","unstructured":"Lei Wang , Yuchun Huang , Yaolin Hou , Shenman Zhang , and Jie Shan . 2019 . Graph attention convolution for point cloud semantic segmentation . In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 10296\u201310305 . Lei Wang, Yuchun Huang, Yaolin Hou, Shenman Zhang, and Jie Shan. 2019. Graph attention convolution for point cloud semantic segmentation. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 10296\u201310305."},{"key":"e_1_3_2_1_29_1","volume-title":"Dynamic graph cnn for learning on point clouds. ACM Transactions on Graphics (tog) 38, 5","author":"Wang Yue","year":"2019","unstructured":"Yue Wang , Yongbin Sun , Ziwei Liu , Sanjay\u00a0 E Sarma , Michael\u00a0 M Bronstein , and Justin\u00a0 M Solomon . 2019. Dynamic graph cnn for learning on point clouds. ACM Transactions on Graphics (tog) 38, 5 ( 2019 ), 1\u201312. Yue Wang, Yongbin Sun, Ziwei Liu, Sanjay\u00a0E Sarma, Michael\u00a0M Bronstein, and Justin\u00a0M Solomon. 2019. Dynamic graph cnn for learning on point clouds. ACM Transactions on Graphics (tog) 38, 5 (2019), 1\u201312."},{"key":"e_1_3_2_1_30_1","first-page":"33330","article-title":"Point transformer v2: Grouped vector attention and partition-based pooling","volume":"35","author":"Wu Xiaoyang","year":"2022","unstructured":"Xiaoyang Wu , Yixing Lao , Li Jiang , Xihui Liu , and Hengshuang Zhao . 2022 . Point transformer v2: Grouped vector attention and partition-based pooling . Advances in Neural Information Processing Systems 35 (2022), 33330 \u2013 33342 . Xiaoyang Wu, Yixing Lao, Li Jiang, Xihui Liu, and Hengshuang Zhao. 2022. Point transformer v2: Grouped vector attention and partition-based pooling. Advances in Neural Information Processing Systems 35 (2022), 33330\u201333342.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_31_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 19313\u201319322","author":"Yu Xumin","year":"2022","unstructured":"Xumin Yu , Lulu Tang , Yongming Rao , Tiejun Huang , Jie Zhou , and Jiwen Lu . 2022 . Point-Bert: Pre-training 3D point cloud transformers with masked point modeling . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 19313\u201319322 . Xumin Yu, Lulu Tang, Yongming Rao, Tiejun Huang, Jie Zhou, and Jiwen Lu. 2022. Point-Bert: Pre-training 3D point cloud transformers with masked point modeling. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 19313\u201319322."},{"key":"e_1_3_2_1_32_1","volume-title":"Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 5565\u20135573","author":"Zhao Hengshuang","year":"2019","unstructured":"Hengshuang Zhao , Li Jiang , Chi-Wing Fu , and Jiaya Jia . 2019 . PointWeb: Enhancing local neighborhood features for point cloud processing . In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 5565\u20135573 . Hengshuang Zhao, Li Jiang, Chi-Wing Fu, and Jiaya Jia. 2019. PointWeb: Enhancing local neighborhood features for point cloud processing. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 5565\u20135573."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01595"}],"event":{"name":"MMAsia '23: ACM Multimedia Asia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Tainan Taiwan","acronym":"MMAsia '23"},"container-title":["ACM Multimedia Asia 2023"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3595916.3626449","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3595916.3626449","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:35:56Z","timestamp":1750178156000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3595916.3626449"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,6]]},"references-count":33,"alternative-id":["10.1145\/3595916.3626449","10.1145\/3595916"],"URL":"https:\/\/doi.org\/10.1145\/3595916.3626449","relation":{},"subject":[],"published":{"date-parts":[[2023,12,6]]},"assertion":[{"value":"2024-01-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}