{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:49:48Z","timestamp":1760147388885,"version":"build-2065373602"},"reference-count":36,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T00:00:00Z","timestamp":1675296000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"French \u201cProgramme d\u2019Investissements d\u2019Avenir\u201d (PIA)","award":["CORAM\/NeVeOS"],"award-info":[{"award-number":["CORAM\/NeVeOS"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Occupancy grid maps are widely used as an environment model that allows the fusion of different range sensor technologies in real-time for robotics applications. In an autonomous vehicle setting, occupancy grid maps are especially useful for their ability to accurately represent the position of surrounding obstacles while being robust to discrepancies between the fused sensors through the use of occupancy probabilities representing uncertainty. In this article, we propose to evaluate the applicability of real-time vehicle detection on occupancy grid maps. State of the art detectors in sensor-specific domains such as YOLOv2\/YOLOv3 for images or PIXOR for LiDAR point clouds are modified to use occupancy grid maps as input and produce oriented bounding boxes enclosing vehicles as output. The five proposed detectors are trained on the Waymo Open automotive dataset and compared regarding the quality of their detections measured in terms of Average Precision (AP) and their real-time capabilities measured in Frames Per Second (FPS). Of the five detectors presented, one inspired from the PIXOR backbone reaches the highest AP0.7 of 0.82 and runs at 20 FPS. Comparatively, two other proposed detectors inspired from YOLOv2 achieve an almost as good, with a AP0.7 of 0.79 while running at 91 FPS. These results validate the feasibility of real-time vehicle detection on occupancy grids.<\/jats:p>","DOI":"10.3390\/s23031613","type":"journal-article","created":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T01:53:54Z","timestamp":1675302834000},"page":"1613","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Vehicle Detection on Occupancy Grid Maps: Comparison of Five Detectors Regarding Real-Time Performance"],"prefix":"10.3390","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8101-3687","authenticated-orcid":false,"given":"Nils","family":"Defauw","sequence":"first","affiliation":[{"name":"Univ. Grenoble Alpes, CEA, List, F-38000 Grenoble, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6448-7650","authenticated-orcid":false,"given":"Marielle","family":"Malfante","sequence":"additional","affiliation":[{"name":"Univ. Grenoble Alpes, CEA, List, F-38000 Grenoble, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6538-8542","authenticated-orcid":false,"given":"Olivier","family":"Antoni","sequence":"additional","affiliation":[{"name":"Univ. Grenoble Alpes, CEA, List, F-38000 Grenoble, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5301-759X","authenticated-orcid":false,"given":"Tiana","family":"Rakotovao","sequence":"additional","affiliation":[{"name":"Univ. Grenoble Alpes, CEA, List, F-38000 Grenoble, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3238-8081","authenticated-orcid":false,"given":"Suzanne","family":"Lesecq","sequence":"additional","affiliation":[{"name":"Univ. Grenoble Alpes, CEA, Leti, F-38000 Grenoble, France"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2023,2,2]]},"reference":[{"key":"ref_1","unstructured":"Moravec, H., and Elfes, A. (1985, January 25). High resolution maps from wide angle sonar. Proceedings of the 1985 IEEE International Conference on Robotics and Automation Proceedings, St. Louis, MO, USA."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"46","DOI":"10.1109\/2.30720","article-title":"Using occupancy grids for mobile robot perception and navigation","volume":"22","author":"Elfes","year":"1989","journal-title":"Computer"},{"key":"ref_3","unstructured":"Thrun, S., Burgard, W., and Fox, D. (2005). Probabilistic Robotics (Intelligent Robotics and Autonomous Agents), The MIT Press."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1016\/S0004-3702(97)00078-7","article-title":"Learning metric-topological maps for indoor mobile robot navigation","volume":"99","author":"Thrun","year":"1998","journal-title":"Artif. Intell."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1023\/A:1008987612352","article-title":"Using Real-Time Stereo Vision for Mobile Robot Navigation","volume":"8","author":"Murray","year":"2000","journal-title":"Auton. Robot."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Kohlbrecher, S., von Stryk, O., Meyer, J., and Klingauf, U. (2011, January 1\u20135). A flexible and scalable SLAM system with full 3D motion estimation. Proceedings of the 2011 IEEE International Symposium on Safety, Security, and Rescue Robotics, Kyoto, Japan.","DOI":"10.1109\/SSRR.2011.6106777"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Goeddel, R., and Olson, E. (2016, January 9\u201314). Learning semantic place labels from occupancy grids using CNNs. Proceedings of the 2016 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Daejeon, Republic of Korea.","DOI":"10.1109\/IROS.2016.7759589"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Hiller, M., Qiu, C., Particke, F., Hofmann, C., and Thielecke, J. (2019, January 3\u20138). Learning Topometric Semantic Maps from Occupancy Grids. Proceedings of the 2019 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), The Venetian Macao, Macau.","DOI":"10.1109\/IROS40897.2019.8968111"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1012","DOI":"10.1109\/TPAMI.2013.185","article-title":"3D Traffic Scene Understanding From Movable Platforms","volume":"36","author":"Geiger","year":"2014","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Park, S.H., Kim, B., Kang, C.M., Chung, C.C., and Choi, J.W. (2018, January 26\u201330). Sequence-to-Sequence Prediction of Vehicle Trajectory via LSTM Encoder-Decoder Architecture. Proceedings of the 2018 IEEE Intelligent Vehicles Symposium (IV), Anchorage, AK, USA.","DOI":"10.1109\/IVS.2018.8500658"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Hoermann, S., Henzler, P., Bach, M., and Dietmayer, K. (2018, January 26\u201330). Object Detection on Dynamic Occupancy Grid Maps Using Deep Learning and Automatic Label Generation. Proceedings of the 2018 IEEE Intelligent Vehicles Symposium (IV), Anchorage, AK, USA.","DOI":"10.1109\/IVS.2018.8500677"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1142\/S2301385019410036","article-title":"End-to-End Learning of Semantic Grid Estimation Deep Neural Network with Occupancy Grids","volume":"7","author":"Erkent","year":"2019","journal-title":"Unmanned Syst."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"113816","DOI":"10.1016\/j.eswa.2020.113816","article-title":"Self-driving cars: A survey","volume":"165","author":"Badue","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"22988","DOI":"10.1109\/TITS.2022.3214079","article-title":"Dynamic Conditional Imitation Learning for Autonomous Driving","volume":"23","author":"Eraqi","year":"2022","journal-title":"Trans. Intell. Transp. Syst."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Wirges, S., Fischer, T., Stiller, C., and Frias, J.B. (2018, January 4\u20137). Object Detection and Classification in Occupancy Grid Maps Using Deep Convolutional Networks. Proceedings of the 2018 21st International Conference on Intelligent Transportation Systems (ITSC), Maui, HI, USA.","DOI":"10.1109\/ITSC.2018.8569433"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, Faster, Stronger. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_17","unstructured":"Redmon, J., and Farhadi, A. (2018). YOLOv3: An Incremental Improvement. arXiv."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Yang, B., Luo, W., and Urtasun, R. (2018, January 18\u201323). PIXOR: Real-time 3D Object Detection from Point Clouds. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00798"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Sun, P., Kretzschmar, H., Dotiwalla, X., Chouard, A., Patnaik, V., Tsui, P., Guo, J., Zhou, Y., Chai, Y., and Caine, B. (2020, January 13\u201319). Scalability in Perception for Autonomous Driving: Waymo Open Dataset. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00252"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast R-CNN. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Leibe, B., Matas, J., Sebe, N., and Welling, M. (2016). Proceedings of the Computer Vision\u2014ECCV 2016, Springer International Publishing. Lecture Notes in Computer Science.","DOI":"10.1007\/978-3-319-46454-1"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_25","unstructured":"Bochkovskiy, A., Wang, C.Y., and Liao, H.Y.M. (2020). YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"9102","DOI":"10.1109\/ACCESS.2020.2964608","article-title":"Real-Time Apple Detection System Using Embedded Systems with Hardware Accelerators: An Edge AI Application","volume":"8","author":"Mazzia","year":"2020","journal-title":"IEEE Access"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Wang, C.Y., Mark Liao, H.Y., Wu, Y.H., Chen, P.Y., Hsieh, J.W., and Yeh, I.H. (2020, January 14\u201319). CSPNet: A New Backbone that can Enhance Learning Capability of CNN. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, USA.","DOI":"10.1109\/CVPRW50498.2020.00203"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Fang, L., Wu, Y., Li, Y., Guo, H., Zhang, H., Wang, X., Xi, R., and Hou, J. (2021). Ginger Seeding Detection and Shoot Orientation Discrimination Using an Improved YOLOv4-LITE Network. Agronomy, 11.","DOI":"10.3390\/agronomy11112328"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Liu, S., Kong, W., Chen, X., Xu, M., Yasir, M., Zhao, L., and Li, J. (2022). Multi-Scale Ship Detection Algorithm Based on a Lightweight Neural Network for Spaceborne SAR Images. Remote Sens., 14.","DOI":"10.3390\/rs14051149"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Lang, A.H., Vora, S., Caesar, H., Zhou, L., Yang, J., and Beijbom, O. (2019, January 15\u201320). PointPillars: Fast Encoders for Object Detection From Point Clouds. Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.01298"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Beltr\u00e1n, J., Guindel, C., Moreno, F.M., Cruzado, D., Garc\u00eda, F., and De La Escalera, A. (2018, January 4\u20137). BirdNet: A 3D Object Detection Framework from LiDAR Information. Proceedings of the 2018 21st International Conference on Intelligent Transportation Systems (ITSC), Maui, HI, USA.","DOI":"10.1109\/ITSC.2018.8569311"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Barrera, A., Guindel, C., Beltr\u00e1n, J., and Garc\u00eda, F. (2020, January 20\u201323). BirdNet+: End-to-End 3D Object Detection in LiDAR Bird\u2019s Eye View. Proceedings of the 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC), Rhodes, Greece.","DOI":"10.1109\/ITSC45102.2020.9294293"},{"key":"ref_33","unstructured":"Kingma, D.P., and Ba, J. (2017). Adam: A Method for Stochastic Optimization. arXiv."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Navab, N., Hornegger, J., Wells, W.M., and Frangi, A.F. (2015). Proceedings of the Medical Image Computing and Computer-Assisted Intervention\u2014MICCAI 2015, Springer International Publishing. Lecture Notes in Computer Science.","DOI":"10.1007\/978-3-319-24571-3"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Rakotovao, T., Mottin, J., Puschini, D., and Laugier, C. (2016, January 16\u201321). Multi-sensor fusion of occupancy grids based on integer arithmetic. Proceedings of the 2016 IEEE International Conference on Robotics and Automation (ICRA), Stockholm, Sweden.","DOI":"10.1109\/ICRA.2016.7487330"},{"key":"ref_36","unstructured":"Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G.S., Davis, A., Dean, J., and Devin, M. (2016). TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. arXiv."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/3\/1613\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T18:21:38Z","timestamp":1760120498000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/3\/1613"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,2]]},"references-count":36,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2023,2]]}},"alternative-id":["s23031613"],"URL":"https:\/\/doi.org\/10.3390\/s23031613","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2023,2,2]]}}}