{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,1]],"date-time":"2025-10-01T16:19:31Z","timestamp":1759335571562,"version":"3.41.2"},"reference-count":26,"publisher":"World Scientific Pub Co Pte Ltd","issue":"14","funder":[{"name":"Key Projects of North China University of Science and Technology","award":["ZD-YG-202317-23"],"award-info":[{"award-number":["ZD-YG-202317-23"]}]},{"DOI":"10.13039\/501100010877","name":"Science, Technology and Innovation Commission of Shenzhen Municipality","doi-asserted-by":"publisher","award":["JCYJ20210324120002007"],"award-info":[{"award-number":["JCYJ20210324120002007"]}],"id":[{"id":"10.13039\/501100010877","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2024,11]]},"abstract":"<jats:p> The fusion of millimeter-wave radar and camera for three-dimensional (3D) object detection represents a pivotal technology for autonomous driving, yet it is not without its inherent challenges. First, the radar point cloud contains clutter, which can result in the generation of impure radar features. Second, the radar point cloud is sparse, which presents a challenge in fully extracting the radar features. This can result in the loss of object information, leading to object misdetection, omission, and a reduction in the robustness. To address these issues, a 3D object detection method based on the semantic information of radar features and camera fusion (Semantics-Fusion) is proposed. Initially, the image features are extracted through the centroid detection network, resulting in the generation of a preliminary 3D bounding box for the objects. Subsequently, the radar point cloud is clustered based on the objects\u2019 position and velocity, thereby eliminating irrelevant point cloud and clutter. The clustered radar point cloud is projected onto the image plane, thereby forming a radar 2D pseudo-image. This is then input to the designed 2D convolution module, which enables the full extraction of the semantic information of the radar features. Ultimately, the radar features are fused with the image features, and secondary regression is employed to achieve robust 3D object detection. The performance of our method was evaluated on the nuScenes dataset, achieving a mean average precision (mAP) of 0.325 and a nuScenes detection score (NDS) of 0.462. <\/jats:p>","DOI":"10.1142\/s0218001424550152","type":"journal-article","created":{"date-parts":[[2024,9,13]],"date-time":"2024-09-13T09:56:27Z","timestamp":1726221387000},"source":"Crossref","is-referenced-by-count":1,"title":["Semantics-Fusion: Radar Semantic Information-Based Radar\u2013Camera Fusion for 3D Object Detection"],"prefix":"10.1142","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0009-0003-5627-6372","authenticated-orcid":false,"given":"Ziran","family":"Tian","sequence":"first","affiliation":[{"name":"Hebei Provincial Key Laboratory of Industrial Intelligent Perception, College of Artificial Intelligence, North China University of Science and Technology, Tangshan, Hebei 063210, P.\u00a0R.\u00a0China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-0577-7657","authenticated-orcid":false,"given":"Xiaohong","family":"Huang","sequence":"additional","affiliation":[{"name":"Hebei Provincial Key Laboratory of Industrial Intelligent Perception, College of Artificial Intelligence, North China University of Science and Technology, Tangshan, Hebei 063210, P.\u00a0R.\u00a0China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-7913-6165","authenticated-orcid":false,"given":"Kunqiang","family":"Xu","sequence":"additional","affiliation":[{"name":"Hebei Provincial Key Laboratory of Industrial Intelligent Perception, College of Artificial Intelligence, North China University of Science and Technology, Tangshan, Hebei 063210, P.\u00a0R.\u00a0China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-3752-0370","authenticated-orcid":false,"given":"Xujie","family":"Sun","sequence":"additional","affiliation":[{"name":"Hebei Provincial Key Laboratory of Industrial Intelligent Perception, College of Artificial Intelligence, North China University of Science and Technology, Tangshan, Hebei 063210, P.\u00a0R.\u00a0China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9054-6853","authenticated-orcid":false,"given":"Zhenmiao","family":"Deng","sequence":"additional","affiliation":[{"name":"College of Electronics and Communication Engineering, Sun Yat-sen University, Guangzhou, Guangdong 510275, P.\u00a0R.\u00a0China"}]}],"member":"219","published-online":{"date-parts":[[2024,11,14]]},"reference":[{"key":"S0218001424550152BIB001","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01164"},{"key":"S0218001424550152BIB002","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.89"},{"key":"S0218001424550152BIB003","first-page":"2366","volume-title":"Advances in Neural Information Processing Systems","volume":"27","author":"Eigen D.","year":"2014"},{"key":"S0218001424550152BIB004","first-page":"226","volume-title":"Proc. Second Int. Conf. Knowledge Discovery and Data Mining","author":"Ester M.","year":"1996"},{"key":"S0218001424550152BIB005","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"S0218001424550152BIB009","doi-asserted-by":"publisher","DOI":"10.1109\/IROS45743.2020.9341177"},{"key":"S0218001424550152BIB010","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v37i1.25198"},{"key":"S0218001424550152BIB012","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV51070.2023.01615"},{"key":"S0218001424550152BIB014","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v37i2.25233"},{"key":"S0218001424550152BIB015","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00281"},{"key":"S0218001424550152BIB016","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/978-3-031-20077-9_1","volume-title":"ECCV 2022: Computer Vision","volume":"13669","author":"Li Z.","year":"2022"},{"key":"S0218001424550152BIB017","doi-asserted-by":"publisher","DOI":"10.1186\/s13634-022-00944-6"},{"key":"S0218001424550152BIB018","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-19812-0_31"},{"key":"S0218001424550152BIB019","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW50498.2020.00506"},{"key":"S0218001424550152BIB020","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v37i2.25270"},{"key":"S0218001424550152BIB021","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.597"},{"key":"S0218001424550152BIB022","doi-asserted-by":"publisher","DOI":"10.1109\/WACV48630.2021.00157"},{"key":"S0218001424550152BIB023","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"crossref","first-page":"483","DOI":"10.1007\/978-3-319-46484-8_29","volume-title":"ECCV 2016: Computer Vision","volume":"9912","author":"Newell A.","year":"2012"},{"key":"S0218001424550152BIB025","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00313"},{"key":"S0218001424550152BIB026","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58568-6_12"},{"key":"S0218001424550152BIB027","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00208"},{"key":"S0218001424550152BIB028","first-page":"180","volume-title":"Proc. 5th Conf. Robot Learning","author":"Wang Y.","year":"2022"},{"key":"S0218001424550152BIB029","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW54120.2021.00107"},{"key":"S0218001424550152BIB030","first-page":"1475","volume-title":"Proc. 5th Conf. Robot Learning","author":"Wang T.","year":"2022"},{"key":"S0218001424550152BIB031","doi-asserted-by":"publisher","DOI":"10.3390\/s22072542"},{"key":"S0218001424550152BIB033","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00255"}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001424550152","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,29]],"date-time":"2024-11-29T09:25:24Z","timestamp":1732872324000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218001424550152"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,11]]},"references-count":26,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2024,11]]}},"alternative-id":["10.1142\/S0218001424550152"],"URL":"https:\/\/doi.org\/10.1142\/s0218001424550152","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"type":"print","value":"0218-0014"},{"type":"electronic","value":"1793-6381"}],"subject":[],"published":{"date-parts":[[2024,11]]},"article-number":"2455015"}}