{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,18]],"date-time":"2026-05-18T10:53:48Z","timestamp":1779101628158,"version":"3.51.4"},"reference-count":62,"publisher":"SAGE Publications","issue":"4","license":[{"start":{"date-parts":[[2024,9,20]],"date-time":"2024-09-20T00:00:00Z","timestamp":1726790400000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"},{"start":{"date-parts":[[2024,9,20]],"date-time":"2024-09-20T00:00:00Z","timestamp":1726790400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["The International Journal of Robotics Research"],"published-print":{"date-parts":[[2025,4]]},"abstract":"<jats:p>\n                    Recent progress in semantic scene understanding has primarily been enabled by the availability of semantically annotated bi-modal (camera and LiDAR) datasets in urban environments. However, such annotated datasets are also needed for natural, unstructured environments to enable semantic perception for applications, including conservation, search and rescue, environment monitoring, and agricultural automation. Therefore, we introduce\n                    <jats:italic toggle=\"yes\">WildScenes<\/jats:italic>\n                    , a bi-modal benchmark dataset consisting of multiple large-scale, sequential traversals in natural environments, including semantic annotations in high-resolution 2D images and dense 3D LiDAR point clouds, and accurate 6-DoF pose information. 
The data is (1) trajectory-centric with accurate localization and globally aligned point clouds, (2) calibrated and synchronized to support bi-modal training and inference, and (3) containing different natural environments over 6 months to support research on domain adaptation. Our 3D semantic labels are obtained via an efficient, automated process that transfers the human-annotated 2D labels from multiple views into 3D point cloud sequences, thus circumventing the need for expensive and time-consuming human annotation in 3D. We introduce benchmarks on 2D and 3D semantic segmentation and evaluate a variety of recent deep-learning techniques to demonstrate the challenges in semantic segmentation in natural environments. We propose train-val-test splits for standard benchmarks as well as domain adaptation benchmarks and utilize an automated split generation technique to ensure the balance of class label distributions. The\n                    <jats:italic toggle=\"yes\">WildScenes<\/jats:italic>\n                    benchmark webpage is\n                    <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/csiro-robotics.github.io\/WildScenes\">https:\/\/csiro-robotics.github.io\/WildScenes<\/jats:ext-link>\n                    , and the data is publicly available at\n                    <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/data.csiro.au\/collection\/csiro:61541\">https:\/\/data.csiro.au\/collection\/csiro:61541<\/jats:ext-link>\n                    .\n                  <\/jats:p>","DOI":"10.1177\/02783649241278369","type":"journal-article","created":{"date-parts":[[2024,9,20]],"date-time":"2024-09-20T15:53:39Z","timestamp":1726847619000},"page":"532-549","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":16,"title":["WildScenes: A benchmark for 2D and 3D semantic 
segmentation in large-scale natural environments"],"prefix":"10.1177","volume":"44","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9875-3577","authenticated-orcid":false,"given":"Kavisha","family":"Vidanapathirana","sequence":"first","affiliation":[{"name":"CSIRO Robotics"},{"name":"Data61"},{"name":"CSIRO"},{"name":"Queensland University of Technology"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Joshua","family":"Knights","sequence":"additional","affiliation":[{"name":"CSIRO Robotics"},{"name":"Data61"},{"name":"CSIRO"},{"name":"Queensland University of Technology"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0092-0096","authenticated-orcid":false,"given":"Stephen","family":"Hausler","sequence":"additional","affiliation":[{"name":"CSIRO Robotics"},{"name":"Data61"},{"name":"CSIRO"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8269-7918","authenticated-orcid":false,"given":"Mark","family":"Cox","sequence":"additional","affiliation":[{"name":"CSIRO Robotics"},{"name":"Data61"},{"name":"CSIRO"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Milad","family":"Ramezani","sequence":"additional","affiliation":[{"name":"CSIRO Robotics"},{"name":"Data61"},{"name":"CSIRO"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jason","family":"Jooste","sequence":"additional","affiliation":[{"name":"CSIRO Robotics"},{"name":"Data61"},{"name":"CSIRO"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-4871-1188","authenticated-orcid":false,"given":"Ethan","family":"Griffiths","sequence":"additional","affiliation":[{"name":"CSIRO Robotics"},{"name":"Data61"},{"name":"CSIRO"},{"name":"Queensland University of Technology"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shaheer","family":"Mohamed","sequence":"additional","affiliation":[{"name":"CSIRO 
Robotics"},{"name":"Data61"},{"name":"CSIRO"},{"name":"Queensland University of Technology"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sridha","family":"Sridharan","sequence":"additional","affiliation":[{"name":"Queensland University of Technology"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Clinton","family":"Fookes","sequence":"additional","affiliation":[{"name":"Queensland University of Technology"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8169-3560","authenticated-orcid":false,"given":"Peyman","family":"Moghadam","sequence":"additional","affiliation":[{"name":"CSIRO Robotics"},{"name":"Data61"},{"name":"CSIRO"},{"name":"Queensland University of Technology"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2024,9,20]]},"reference":[{"key":"e_1_3_5_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2023.3290516"},{"key":"e_1_3_5_3_1","doi-asserted-by":"publisher","unstructured":"Almin A Leo L Duong A et al. (2023b) Navya3dseg \u2013 navya 3d semantic segmentation dataset and split generation for autonomous vehicles. DOI:10.48550\/arXiv.2302.08292.","DOI":"10.48550\/arXiv.2302.08292"},{"key":"e_1_3_5_4_1","doi-asserted-by":"crossref","unstructured":"Baghbaderani RK Li Y Wang S et al. (2024) Temporally-consistent video semantic segmentation with bidirectional occlusion-guided feature propagation. In: Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision Waikoloa HI 03\u201308 January 2024 pp. 685\u2013695.","DOI":"10.1109\/WACV57701.2024.00074"},{"key":"e_1_3_5_5_1","doi-asserted-by":"crossref","unstructured":"Behley J Garbade M Milioto A et al. (2019) Semantickitti: a dataset for semantic scene understanding of lidar sequences. In: Proceedings of the IEEE\/CVF International Conference on Computer Vision October 27th to November 2nd 2019 Seoul South Korea pp. 
9297\u20139307.","DOI":"10.1109\/ICCV.2019.00939"},{"key":"e_1_3_5_6_1","doi-asserted-by":"publisher","DOI":"10.1177\/02783649211006735"},{"key":"e_1_3_5_7_1","doi-asserted-by":"publisher","DOI":"10.55417\/fr.2022049"},{"key":"e_1_3_5_8_1","doi-asserted-by":"crossref","unstructured":"Bosse M Zlot R (2009) Continuous 3D scan-matching with a spinning 2D laser. In: 2009 IEEE international conference on robotics and automation Kobe Japan 12\u201317 May 2009 pp. 4312\u20134319. IEEE.","DOI":"10.1109\/ROBOT.2009.5152851"},{"key":"e_1_3_5_9_1","doi-asserted-by":"publisher","DOI":"10.1177\/02783649231160195"},{"key":"e_1_3_5_10_1","article-title":"Rethinking atrous convolution for semantic image segmentation","author":"Chen LC","year":"2019","unstructured":"Chen LC, Papandreou G, Schroff F, et al. (2019) Rethinking atrous convolution for semantic image segmentation. arxiv 2017. arXiv preprint arXiv:1706.05587 2.","journal-title":"arxiv 2017. arXiv preprint arXiv:1706.05587 2"},{"key":"e_1_3_5_11_1","first-page":"17864","article-title":"Per-pixel classification is not all you need for semantic segmentation","volume":"34","author":"Cheng B","year":"2021","unstructured":"Cheng B, Schwing A, Kirillov A (2021) Per-pixel classification is not all you need for semantic segmentation. Advances in Neural Information Processing Systems 34: 17864\u201317875.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_5_12_1","doi-asserted-by":"crossref","unstructured":"Cheng B Misra I Schwing AG et al. (2022) Masked-attention mask transformer for universal image segmentation. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition New Orleans LA USA 18-24 June 2022 pp. 1290\u20131299.","DOI":"10.1109\/CVPR52688.2022.00135"},{"key":"e_1_3_5_13_1","doi-asserted-by":"crossref","unstructured":"Choy C Gwak J Savarese S (2019) 4d spatio-temporal convnets: minkowski convolutional neural networks. 
In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition Long Beach CA USA 15-20 June 2019 pp. 3075\u20133084.","DOI":"10.1109\/CVPR.2019.00319"},{"key":"e_1_3_5_14_1","doi-asserted-by":"crossref","unstructured":"Cordts M Omran M Ramos S et al. (2016) The cityscapes dataset for semantic urban scene understanding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Las Vegas NV 27\u201330 June 2016 pp. 3213\u20133223.","DOI":"10.1109\/CVPR.2016.350"},{"key":"e_1_3_5_15_1","doi-asserted-by":"crossref","unstructured":"Cortinhal T Tzelepis G Erdal Aksoy E (2020) Salsanext: fast uncertainty-aware semantic segmentation of lidar point clouds. In: Advances in Visual Computing: 15th International Symposium ISVC 2020 San Diego CA 5\u20137 October 2020 Proceedings Part II 15 pp. 207\u2013222. Springer.","DOI":"10.1007\/978-3-030-64559-5_16"},{"key":"e_1_3_5_16_1","doi-asserted-by":"crossref","unstructured":"Deng J Dong W Socher R et al. (2009) Imagenet: a large-scale hierarchical image database. In: 2009 IEEE conference on computer vision and pattern recognition Miami FL 20\u201325 June 2009 pp. 248\u2013255. IEEE.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_5_17_1","doi-asserted-by":"crossref","unstructured":"Dokania S Hafez A Subramanian A et al. (2023) Idd-3d: Indian driving dataset for 3d unstructured road scenes. In: Proceedings of the IEEE\/CVF winter conference on applications of computer vision Waikoloa Hawaii USA 3-7 January 2023 pp. 4482\u20134491.","DOI":"10.1109\/WACV56688.2023.00446"},{"key":"e_1_3_5_18_1","doi-asserted-by":"crossref","unstructured":"Droeschel D Behnke S (2018) Efficient continuous-time SLAM for 3D lidar-based online mapping. In: 2018 IEEE international conference on robotics and automation (ICRA) Brisbane Queensland Australia 21-26 May 2018 pp. 
5000\u20135007.","DOI":"10.1109\/ICRA.2018.8461000"},{"key":"e_1_3_5_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2022.3148457"},{"key":"e_1_3_5_20_1","doi-asserted-by":"crossref","unstructured":"Furgale P Barfoot TD Sibley G (2012) Continuous-time batch estimation using temporal basis functions. In: 2012 IEEE international conference on robotics and automation Saint Paul MN 14\u201318 May 2012 2088\u20132095. IEEE.","DOI":"10.1109\/ICRA.2012.6225005"},{"key":"e_1_3_5_21_1","doi-asserted-by":"crossref","unstructured":"Jiang P Saripalli S (2021) Lidarnet: a boundary-aware domain adaptation model for point cloud semantic segmentation. In: 2021 IEEE international conference on robotics and automation (ICRA) Xi\u2019an China 30 May 2021 - 05 June 2021 pp. 2457\u20132464. IEEE.","DOI":"10.1109\/ICRA48506.2021.9561255"},{"key":"e_1_3_5_22_1","doi-asserted-by":"crossref","unstructured":"Jiang P Osteen P Wigness M et al. (2021) Rellis-3d dataset: data benchmarks and analysis. In: 2021 IEEE international conference on robotics and automation (ICRA) Xi\u2019an China 30 May 2021 - 05 June 2021 pp. 1110\u20131116. IEEE.","DOI":"10.1109\/ICRA48506.2021.9561251"},{"key":"e_1_3_5_23_1","doi-asserted-by":"publisher","unstructured":"Katz S Tal A (2015) On the visibility of point clouds. In: 2015 IEEE International Conference on Computer Vision (ICCV) Santiago Chile 07\u201313 December 2015 pp. 1350\u20131358. DOI: 10.1109\/ICCV.2015.159.","DOI":"10.1109\/ICCV.2015.159"},{"key":"e_1_3_5_24_1","doi-asserted-by":"crossref","unstructured":"Knights J Vidanapathirana K Ramezani M et al. (2023) Wild-places: a large-scale dataset for LiDAR place recognition in unstructured natural environments. In: 2023 IEEE International Conference on Robotics and Automation (ICRA) London UK 29 May 2023 - 02 June 2023 pp. 
11322\u201311328.","DOI":"10.1109\/ICRA48891.2023.10160432"},{"key":"e_1_3_5_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2023.3337698"},{"key":"e_1_3_5_26_1","doi-asserted-by":"crossref","unstructured":"Krispel G Opitz M Waltner G et al. (2020) Fuseseg: lidar point cloud segmentation fusing multi-modal data. In: Proceedings of the IEEE\/CVF winter conference on applications of computer vision Snowmass CO 01\u201305 March 2020 pp. 1874\u20131883.","DOI":"10.1109\/WACV45572.2020.9093584"},{"key":"e_1_3_5_27_1","doi-asserted-by":"crossref","unstructured":"Lai X Chen Y Lu F et al. (2023) Spherical transformer for lidar-based 3d recognition. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition Vancouver BC 17\u201324 June 2023 pp. 17545\u201317555.","DOI":"10.1109\/CVPR52729.2023.01683"},{"key":"e_1_3_5_28_1","doi-asserted-by":"crossref","unstructured":"Li L Zhou T Wang W et al. (2022) Deep hierarchical semantic segmentation. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition New Orleans LA 18\u201324 June 2022 pp. 1246\u20131257.","DOI":"10.1109\/CVPR52688.2022.00131"},{"key":"e_1_3_5_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2022.3179507"},{"key":"e_1_3_5_30_1","doi-asserted-by":"crossref","unstructured":"Liu Z Mao H Wu CY et al. (2022) A convnet for the 2020s. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition New Orleans LA 18\u201324 June 2022 pp. 11976\u201311986.","DOI":"10.1109\/CVPR52688.2022.01167"},{"key":"e_1_3_5_31_1","first-page":"027836492312100","article-title":"Magro dataset: a dataset for simultaneous localization and mapping in agricultural environments","volume":"43","author":"Marzoa Tanco M","year":"2023","unstructured":"Marzoa Tanco M, Trinidad Barnech G, Andrade F, et al. (2023) Magro dataset: a dataset for simultaneous localization and mapping in agricultural environments. 
The International Journal of Robotics Research 43: 02783649231210011.","journal-title":"The International Journal of Robotics Research"},{"key":"e_1_3_5_32_1","doi-asserted-by":"crossref","unstructured":"Maturana D Chou PW Uenoyama M et al. (2018) Real-time semantic mapping for autonomous off-road navigation. In: Field and Service Robotics: Results of the 11th International Conference Zurich Switzerland 12-15 September 2017 pp. 335\u2013350. Springer.","DOI":"10.1007\/978-3-319-67361-5_22"},{"key":"e_1_3_5_33_1","doi-asserted-by":"crossref","unstructured":"Metzger KA Mortimer P Wuensche HJ (2021) A fine-grained dataset and its efficient semantic segmentation for unstructured driving scenarios. In: 2020 25th international conference on pattern recognition (ICPR) Milan Italy 10\u201315 January 2021 pp. 7892\u20137899. IEEE.","DOI":"10.1109\/ICPR48806.2021.9411987"},{"key":"e_1_3_5_34_1","doi-asserted-by":"crossref","unstructured":"Min C Jiang W Zhao D et al. (2022) Orfd: a dataset and benchmark for off-road freespace detection. In: 2022 International Conference on Robotics and Automation (ICRA) Philadelphia PA 23\u201327 May 2022 pp. 2532\u20132538. IEEE.","DOI":"10.1109\/ICRA46639.2022.9812139"},{"key":"e_1_3_5_35_1","doi-asserted-by":"crossref","unstructured":"Nunes L Wiesmann L Marcuzzi R et al. (2023) Temporal consistent 3d lidar representation learning for semantic perception in autonomous driving. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Vancouver BC 17\u201324 June 2023 pp. 5217\u20135228.","DOI":"10.1109\/CVPR52729.2023.00505"},{"key":"#cr-split#-e_1_3_5_36_1.1","doi-asserted-by":"crossref","unstructured":"Pan Y Gao B Mei J et al. (2020) Semanticposs: a point cloud dataset with large quantity of dynamic instances. In: 2020 IEEE Intelligent Vehicles Symposium","DOI":"10.1109\/IV47402.2020.9304596"},{"key":"#cr-split#-e_1_3_5_36_1.2","unstructured":"(IV) Las Vegas NV 19 October 2020-13 November 2020 pp. 687-693. 
IEEE."},{"key":"e_1_3_5_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/TRO.2021.3096650"},{"key":"e_1_3_5_38_1","article-title":"Wildcat: online continuous-time 3d lidar-inertial slam","author":"Ramezani M","year":"2022","unstructured":"Ramezani M, Khosoussi K, Catt G, et al. (2022) Wildcat: online continuous-time 3d lidar-inertial slam. arXiv preprint arXiv:2205.12595.","journal-title":"arXiv preprint arXiv:2205.12595"},{"key":"e_1_3_5_39_1","doi-asserted-by":"publisher","DOI":"10.1016\/0377-0427(87)90125-7"},{"key":"e_1_3_5_40_1","doi-asserted-by":"publisher","DOI":"10.1177\/0278364917695640"},{"key":"e_1_3_5_41_1","doi-asserted-by":"crossref","unstructured":"Saltori C Osep A Ricci E et al. (2023) Walking your lidog: a journey through multiple domains for lidar semantic segmentation. In: Proceedings of the IEEE\/CVF international conference on computer vision 196\u2013206.","DOI":"10.1109\/ICCV51070.2023.00025"},{"key":"e_1_3_5_42_1","doi-asserted-by":"crossref","unstructured":"Sanchez J Deschaud JE Goulette F (2023a) Domain generalization of 3d semantic segmentation in autonomous driving. In: Proceedings of the IEEE\/CVF international conference on computer vision Paris France 01\u201306 October 2023 pp. 18077\u201318087.","DOI":"10.1109\/ICCV51070.2023.01657"},{"key":"e_1_3_5_43_1","article-title":"Parisluco3d: a high-quality target dataset for domain generalization of lidar perception","author":"Sanchez J","year":"2023","unstructured":"Sanchez J, Soum-Fontez L, Deschaud JE, et al. (2023b) Parisluco3d: a high-quality target dataset for domain generalization of lidar perception. arXiv preprint arXiv:2310.16542.","journal-title":"arXiv preprint arXiv:2310.16542"},{"key":"e_1_3_5_44_1","doi-asserted-by":"crossref","unstructured":"Silberman N Hoiem D Kohli P et al. (2012) Indoor segmentation and support inference from rgbd images. 
In: Computer Vision\u2013ECCV 2012: 12th European conference on computer vision Florence Italy 7\u201313 October 2012 Proceedings Part V 12 pp. 746\u2013760. Springer.","DOI":"10.1007\/978-3-642-33715-4_54"},{"key":"e_1_3_5_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2023.3256926"},{"key":"e_1_3_5_46_1","doi-asserted-by":"crossref","unstructured":"Strudel R Garcia R Laptev I et al. (2021) Segmenter: transformer for semantic segmentation. In: Proceedings of the IEEE\/CVF international conference on computer vision Montreal QC 10\u201317 October 2021 pp. 7262\u20137272.","DOI":"10.1109\/ICCV48922.2021.00717"},{"key":"e_1_3_5_47_1","doi-asserted-by":"crossref","unstructured":"Sun G Liu Y Ding H et al. (2022) Coarse-to-fine feature mining for video semantic segmentation. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition New Orleans LA 18\u201324 June 2022 pp. 3126\u20133137.","DOI":"10.1109\/CVPR52688.2022.00313"},{"key":"e_1_3_5_48_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58604-1_41"},{"key":"e_1_3_5_49_1","doi-asserted-by":"crossref","unstructured":"Triest S Sivaprakasam M Wang SJ et al. (2022) Tartandrive: a large-scale dataset for learning off-road dynamics models. In: 2022 international conference on robotics and automation (ICRA) Philadelphia PA 23\u201327 May 2022 pp. 2546\u20132552. IEEE.","DOI":"10.1109\/ICRA46639.2022.9811648"},{"key":"e_1_3_5_50_1","doi-asserted-by":"crossref","unstructured":"Valada A Oliveira GL Brox T et al. (2017) Deep multispectral semantic scene understanding of forested environments using multimodal fusion. In: 2016 international symposium on experimental robotics Tokyo Japan 2-3 October 2016 pp. 465\u2013477. Springer.","DOI":"10.1007\/978-3-319-50115-4_41"},{"key":"e_1_3_5_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2018.2854290"},{"key":"e_1_3_5_52_1","doi-asserted-by":"crossref","unstructured":"Wigness M Eum S Rogers JG et al. 
(2019) A rugd dataset for autonomous navigation and visual perception in unstructured outdoor environments. In: 2019 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS) Macau China 03\u201308 November 2019 5000\u20135007. IEEE.","DOI":"10.1109\/IROS40897.2019.8968283"},{"key":"e_1_3_5_53_1","doi-asserted-by":"crossref","unstructured":"Wu Y Zhang T Ke W et al. (2023) Spatiotemporal self-supervised learning for point clouds in the wild. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition Vancouver BC 17\u201324 June 2023 pp. 5251\u20135260.","DOI":"10.1109\/CVPR52729.2023.00508"},{"key":"e_1_3_5_54_1","doi-asserted-by":"crossref","unstructured":"Xiao T Liu Y Zhou B et al. (2018) Unified perceptual parsing for scene understanding. In: Proceedings of the European conference on computer vision (ECCV) Munich Germany September 2018 8418\u201314434.","DOI":"10.1007\/978-3-030-01228-1_26"},{"key":"e_1_3_5_55_1","doi-asserted-by":"crossref","unstructured":"Xiao A Huang J Xuan W et al. (2023) 3d semantic segmentation in the wild: learning generalized models for adverse-condition point clouds. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Vancouver BC 17\u201324 June 2023 pp. 9382\u20139392.","DOI":"10.1109\/CVPR52729.2023.00905"},{"key":"e_1_3_5_56_1","first-page":"12077","article-title":"Segformer: simple and efficient design for semantic segmentation with transformers","volume":"34","author":"Xie E","year":"2021","unstructured":"Xie E, Wang W, Yu Z, et al. (2021) Segformer: simple and efficient design for semantic segmentation with transformers. Advances in Neural Information Processing Systems 34: 12077\u201312090.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_5_57_1","doi-asserted-by":"crossref","unstructured":"Yan X Gao J Zheng C et al. (2022) 2dpass: 2d priors assisted semantic segmentation on lidar point clouds. 
In: European Conference on Computer Vision 677\u2013695. Springer.","DOI":"10.1007\/978-3-031-19815-1_39"},{"key":"e_1_3_5_58_1","doi-asserted-by":"crossref","unstructured":"Ye X Shu M Li H et al. (2022) Rope3d: the roadside perception dataset for autonomous driving and monocular 3d object detection task. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition New Orleans LA 18\u201324 June 2022 pp. 21341\u201321350.","DOI":"10.1109\/CVPR52688.2022.02065"},{"key":"e_1_3_5_59_1","doi-asserted-by":"crossref","unstructured":"Zhou B Zhao H Puig X et al. (2017) Scene parsing through ade20k dataset. In: Proceedings of the IEEE conference on computer vision and pattern recognition Honolulu HI 21\u201326 July 2017 pp. 633\u2013641.","DOI":"10.1109\/CVPR.2017.544"},{"key":"e_1_3_5_60_1","doi-asserted-by":"crossref","unstructured":"Zhu Y Sapra K Reda FA et al. (2019) Improving semantic segmentation via video propagation and label relaxation. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Long Beach CA 15\u201320 June 2019 pp. 8856\u20138865.","DOI":"10.1109\/CVPR.2019.00906"},{"key":"e_1_3_5_61_1","doi-asserted-by":"crossref","unstructured":"Zhu X Zhou H Wang T et al. (2021) Cylindrical and asymmetrical 3d convolution networks for LIDAR segmentation. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition 19-25 June 2021 pp. 9939\u20139948.","DOI":"10.1109\/CVPR46437.2021.00981"},{"key":"e_1_3_5_62_1","doi-asserted-by":"crossref","unstructured":"Zhuang Z Li R Jia K et al. (2021) Perception-aware multi-sensor fusion for 3d lidar semantic segmentation. In: Proceedings of the IEEE\/CVF international conference on computer vision Montreal QC 10\u201317 October 2021 pp. 
16280\u201316290.","DOI":"10.1109\/ICCV48922.2021.01597"}],"container-title":["The International Journal of Robotics Research"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/02783649241278369","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/02783649241278369","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/02783649241278369","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T10:17:34Z","timestamp":1777457854000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/02783649241278369"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,20]]},"references-count":62,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,4]]}},"alternative-id":["10.1177\/02783649241278369"],"URL":"https:\/\/doi.org\/10.1177\/02783649241278369","relation":{},"ISSN":["0278-3649","1741-3176"],"issn-type":[{"value":"0278-3649","type":"print"},{"value":"1741-3176","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,9,20]]}}}