{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T01:41:56Z","timestamp":1760233316421,"version":"build-2065373602"},"reference-count":56,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2022,12,27]],"date-time":"2022-12-27T00:00:00Z","timestamp":1672099200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["52002285","21ZR1467400","22120220593","2021YFB2501104"],"award-info":[{"award-number":["52002285","21ZR1467400","22120220593","2021YFB2501104"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003399","name":"Shanghai Science and Technology Commission","doi-asserted-by":"publisher","award":["52002285","21ZR1467400","22120220593","2021YFB2501104"],"award-info":[{"award-number":["52002285","21ZR1467400","22120220593","2021YFB2501104"]}],"id":[{"id":"10.13039\/501100003399","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Tongji University","award":["52002285","21ZR1467400","22120220593","2021YFB2501104"],"award-info":[{"award-number":["52002285","21ZR1467400","22120220593","2021YFB2501104"]}]},{"DOI":"10.13039\/501100012166","name":"National Key R&amp;D Program of China","doi-asserted-by":"publisher","award":["52002285","21ZR1467400","22120220593","2021YFB2501104"],"award-info":[{"award-number":["52002285","21ZR1467400","22120220593","2021YFB2501104"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Recently, 3D object detection based on multi-modal sensor fusion has been increasingly adopted in automated driving and robotics. For example, the semantic information provided by cameras and the geometric information provided by light detection and ranging (LiDAR) are fused to perceive 3D objects, as single modal sensors are unable to capture enough information from the environment. Many state-of-the-art methods fuse the signals sequentially for simplicity. By sequentially, we mean using the image semantic signals as auxiliary input for LiDAR-based object detectors would make the overall performance heavily rely on the semantic signals. Moreover, the error introduced by these signals may lead to detection errors. To remedy this dilemma, we propose an approach coined supervised-PointRendering to correct the potential errors in the image semantic segmentation results by training auxiliary tasks with fused features of the laser point geometry feature, the image semantic feature and a novel laser visibility feature. The laser visibility feature is obtained through the raycasting algorithm and is adopted to constrain the spatial distribution of fore- and background objects. Furthermore, we build an efficient anchor-free Single Stage Detector (SSD) powered by an advanced global-optimal label assignment to achieve a better time\u2013accuracy balance. The new detection framework is evaluated on the extensively used KITTI and nuScenes datasets, manifesting the highest inference speed and at the same time outperforming most of the existing single-stage detectors with respect to the average precision.<\/jats:p>","DOI":"10.3390\/rs15010161","type":"journal-article","created":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T05:30:27Z","timestamp":1672205427000},"page":"161","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["SPV-SSD: An Anchor-Free 3D Single-Stage Detector with Supervised-PointRendering and Visibility Representation"],"prefix":"10.3390","volume":"15","author":[{"given":"Lingmei","family":"Yin","sequence":"first","affiliation":[{"name":"School of Automotive Studies, Tongji University, Shanghai 201804, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5085-7219","authenticated-orcid":false,"given":"Wei","family":"Tian","sequence":"additional","affiliation":[{"name":"School of Automotive Studies, Tongji University, Shanghai 201804, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ling","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Automotive Studies, Tongji University, Shanghai 201804, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhiang","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Automotive Studies, Tongji University, Shanghai 201804, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhuoping","family":"Yu","sequence":"additional","affiliation":[{"name":"School of Automotive Studies, Tongji University, Shanghai 201804, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2022,12,27]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Wang, B., Lan, J., and Gao, J. (2022). LiDAR filtering in 3D object detection based on improved RANSAC. Remote Sens., 14.","DOI":"10.3390\/rs14092110"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Deng, S., Liang, Z., Sun, L., and Jia, K. (2022, January 19\u201324). VISTA: Boosting 3D Object Detection via Dual Cross-VIew SpaTial Attention. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00826"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"15824","DOI":"10.1109\/TITS.2022.3145588","article-title":"MASS: Multi-attentional semantic segmentation of LiDAR data for dense top-view understanding","volume":"23","author":"Peng","year":"2022","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Vora, S., Lang, A.H., Helou, B., and Beijbom, O. (2020, January 14\u201319). PointPainting: Sequential Fusion for 3D Object Detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00466"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Geiger, A., Lenz, P., and Urtasun, R. (2012, January 16\u201321). Are we ready for autonomous driving? The KITTI vision benchmark suite. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA.","DOI":"10.1109\/CVPR.2012.6248074"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Shi, S., Wang, X., and Li, H. (2019, January 16\u201320). PointRCNN: 3D Object Proposal Generation and Detection From Point Cloud. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00086"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"861","DOI":"10.1007\/s12650-021-00755-1","article-title":"Recent advances and challenges in uncertainty visualization: A survey","volume":"24","author":"Kamal","year":"2021","journal-title":"J. Vis."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1016\/j.cageo.2018.10.006","article-title":"Assessing and visualizing uncertainty of 3D geological surfaces using level sets with stochastic motion","volume":"122","author":"Yang","year":"2019","journal-title":"Comput. Geosci."},{"key":"ref_9","unstructured":"Choi, J., Chun, D., Kim, H., and Lee, H.J. (November, January 27). Gaussian yolov3: An accurate and fast object detector using localization uncertainty for autonomous driving. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Caesar, H., Bankiti, V., Lang, A.H., Vora, S., Liong, V.E., Xu, Q., Krishnan, A., Pan, Y., Baldan, G., and Beijbom, O. (2020, January 14\u201319). nuScenes: A Multimodal Dataset for Autonomous Driving. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01164"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Yan, Y., Mao, Y., and Li, B. (2018). SECOND: Sparsely Embedded Convolutional Detection. Sensors, 18.","DOI":"10.3390\/s18103337"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Lang, A.H., Vora, S., Caesar, H., Zhou, L., Yang, J., and Beijbom, O. (2019, January 16\u201320). PointPillars: Fast Encoders for Object Detection From Point Clouds. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.01298"},{"key":"ref_13","unstructured":"Ge, R., Ding, Z., Hu, Y., Wang, Y., Chen, S., Huang, L., and Li, Y. (2020). AFDet: Anchor Free One Stage 3D Object Detection. arXiv."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Yin, T., Zhou, X., and Kr\u00e4henb\u00fchl, P. (2021, January 19\u201325). Center-based 3D Object Detection and Tracking. Proceedings of the 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01161"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Ge, Z., Liu, S., Li, Z., Yoshie, O., and Sun, J. (2021, January 19\u201325). OTA: Optimal Transport Assignment for Object Detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00037"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1016\/j.cageo.2015.12.020","article-title":"GOSIM: A multi-scale iterative multiple-point statistics algorithm with global optimization","volume":"89","author":"Yang","year":"2016","journal-title":"Comput. Geosci."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Chen, Y., Tai, L., Sun, K., and Li, M. (2020, January 14\u201319). Monopair: Monocular 3d object detection using pairwise spatial relationships. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01211"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018). Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. arXiv.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"3212","DOI":"10.1109\/TNNLS.2018.2876865","article-title":"Object detection with deep learning: A review","volume":"30","author":"Zhao","year":"2019","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_20","unstructured":"Zou, Z., Shi, Z., Guo, Y., and Ye, J. (2019). Object detection in 20 years: A survey. arXiv."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Zhou, Y., and Tuzel, O. (2018, January 18\u201322). VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00472"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Shi, W., and Rajkumar, R.R. (2020, January 14\u201319). Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00178"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"He, C.H., Zeng, H., Huang, J., Hua, X., and Zhang, L. (2020, January 14\u201319). Structure Aware Single-Stage 3D Object Detection From Point Cloud. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01189"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Deng, J., Shi, S., Li, P., Zhou, W., Zhang, Y., and Li, H. (2021, January 2\u20139). Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection. Proceedings of the AAAI, Virtual.","DOI":"10.1609\/aaai.v35i2.16207"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Shi, S., Guo, C., Jiang, L., Wang, Z., Shi, J., Wang, X., and Li, H. (2020, January 14\u201319). PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01054"},{"key":"ref_26","unstructured":"Shi, S., Wang, Z., Wang, X., and Li, H. (2019). Part-A2 Net: 3D Part-Aware and Aggregation Neural Network for Object Detection from Point Cloud. arXiv."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, Faster, Stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Law, H., and Deng, J. (2018, January 8\u201314). CornerNet: Detecting Objects as Paired Keypoints. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01264-9_45"},{"key":"ref_29","unstructured":"Zhou, X., Wang, D., and Kr\u00e4henb\u00fchl, P. (2019). Objects as Points. arXiv."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Chen, X., Ma, H., Wan, J., Li, B., and Xia, T. (2017, January 21\u201326). Multi-view 3D Object Detection Network for Autonomous Driving. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.691"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Ku, J., Mozifian, M., Lee, J., Harakeh, A., and Waslander, S.L. (2018, January 1\u20135). Joint 3D Proposal Generation and Object Detection from View Aggregation. Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain.","DOI":"10.1109\/IROS.2018.8594049"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Liang, M., Yang, B., Wang, S., and Urtasun, R. (2018, January 8\u201314). Deep Continuous Fusion for Multi-sensor 3D Object Detection. Proceedings of the ECCV, Munich, Germany.","DOI":"10.1007\/978-3-030-01270-0_39"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Chen, Z., Li, Z., Zhang, S., Fang, L., Jiang, Q., Zhao, F., Zhou, B., and Zhao, H. (2022). AutoAlign: Pixel-Instance Feature Aggregation for Multi-Modal 3D Object Detection. arXiv.","DOI":"10.24963\/ijcai.2022\/116"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Liu, Z., Tang, H., Amini, A., Yang, X., Mao, H., Rus, D., and Han, S. (2022). BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird\u2019s-Eye View Representation. arXiv.","DOI":"10.1109\/ICRA48891.2023.10160968"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Qi, C., Liu, W., Wu, C., Su, H., and Guibas, L.J. (2018, January 18\u201322). Frustum PointNets for 3D Object Detection from RGB-D Data. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00102"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Wang, Z., and Jia, K. (2019, January 3\u20138). Frustum ConvNet: Sliding Frustums to Aggregate Local Point-Wise Features for Amodal. Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Macau, China.","DOI":"10.1109\/IROS40897.2019.8968513"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"F\u00fcrst, M., Wasenm\u00fcller, O., and Stricker, D. (2020, January 20\u201323). LRPD: Long Range 3D Pedestrian Detection Leveraging Specific Strengths of LiDAR and RGB. Proceedings of the IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC), Rhodes, Greece.","DOI":"10.1109\/ITSC45102.2020.9294537"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Buhmann, J.M., Burgard, W., Cremers, A.B., Fox, D., Hofmann, T., Schneider, F.E., Strikos, J., and Thrun, S. (1995, January 14\u201315). The Mobile Robot Rhino. Proceedings of the SNN Symposium on Neural Networks, Nijmegen, The Netherlands.","DOI":"10.1007\/978-1-4471-3087-1_26"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1007\/s10514-012-9321-0","article-title":"OctoMap: An efficient probabilistic 3D mapping framework based on octrees","volume":"34","author":"Hornung","year":"2013","journal-title":"Auton. Robot."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"102","DOI":"10.1515\/teme-2019-0052","article-title":"Fusion of range measurements and semantic estimates in an evidential framework \/ Fusion von Distanzmessungen und semantischen Gr\u00f6\u00dfen im Rahmen der Evidenztheorie","volume":"86","author":"Richter","year":"2019","journal-title":"TM-Tech. Mess."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Hu, P., Ziglar, J., Held, D., and Ramanan, D. (2020, January 14\u201319). What You See is What You Get: Exploiting Visibility for 3D Object Detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01101"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Zheng, W., Tang, W., Chen, S., Jiang, L., and Fu, C.W. (2021, January 2\u20139). CIA-SSD: Confident IoU-Aware Single-Stage Object Detector From Point Cloud. Proceedings of the AAAI, Virtual.","DOI":"10.1109\/CVPR46437.2021.01426"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Bodla, N., Singh, B., Chellappa, R., and Davis, L.S. (2017, January 22\u201329). Soft-NMS \u2014 Improving Object Detection with One Line of Code. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.593"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., and Ren, D. (2020, January 7\u201312). Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression. Proceedings of the AAAI, New York, NY, USA.","DOI":"10.1609\/aaai.v34i07.6999"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Zhou, D., Fang, J., Song, X., Guan, C., Yin, J., Dai, Y., and Yang, R. (2019, January 16\u201319). IoU Loss for 2D\/3D Object Detection. Proceedings of the International Conference on 3D Vision (3DV), Quebec City, QC, Canada.","DOI":"10.1109\/3DV.2019.00019"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"318","DOI":"10.1109\/TPAMI.2018.2858826","article-title":"Focal Loss for Dense Object Detection","volume":"42","author":"Lin","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Cordts, M., Omran, M., Ramos, S., Rehfeld, T., Enzweiler, M., Benenson, R., Franke, U., Roth, S., and Schiele, B. (2016, January 27\u201330). The Cityscapes Dataset for Semantic Urban Scene Understanding. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.350"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Chen, K., Pang, J., Wang, J., Xiong, Y., Li, X., Sun, S., Feng, W., Liu, Z., Shi, J., and Ouyang, W. (2019, January 15\u201320). Hybrid Task Cascade for Instance Segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00511"},{"key":"ref_49","unstructured":"(2022, December 04). nuImages. Available online: https:\/\/www.nuscenes.org\/nuimages."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Pang, S., Morris, D.D., and Radha, H. (2020, January 25\u201329). CLOCs: Camera-LiDAR Object Candidates Fusion for 3D Object Detection. Proceedings of the 2020 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA.","DOI":"10.1109\/IROS45743.2020.9341791"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Chen, X., Zhang, T., Wang, Y., Wang, Y., and Zhao, H. (2022). Futr3d: A unified sensor fusion framework for 3d detection. arXiv.","DOI":"10.1109\/CVPRW59228.2023.00022"},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Yin, J., Shen, J., Guan, C., Zhou, D., and Yang, R. (2020, January 14\u201319). Lidar-based online 3d video object detection with graph-based message passing and spatiotemporal transformer attention. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01151"},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Yang, Z., Sun, Y., Liu, S., and Jia, J. (2020, January 14\u201319). 3DSSD: Point-Based 3D Single Stage Object Detector. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01105"},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"6807","DOI":"10.1109\/TPAMI.2021.3098789","article-title":"Cylindrical and asymmetrical 3d convolution networks for lidar-based perception","volume":"44","author":"Zhu","year":"2022","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_55","unstructured":"Zhu, B., Jiang, Z., Zhou, X., Li, Z., and Yu, G. (2019). Class-balanced grouping and sampling for point cloud 3D object detection. arXiv."},{"key":"ref_56","first-page":"21224","article-title":"Every view counts: Cross-view consistency in 3D object detection with hybrid-cylindrical-spherical voxelization","volume":"33","author":"Chen","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/1\/161\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:53:12Z","timestamp":1760147592000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/1\/161"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,27]]},"references-count":56,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,1]]}},"alternative-id":["rs15010161"],"URL":"https:\/\/doi.org\/10.3390\/rs15010161","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2022,12,27]]}}}