{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,15]],"date-time":"2025-11-15T10:28:55Z","timestamp":1763202535882,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":34,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,10,17]],"date-time":"2021-10-17T00:00:00Z","timestamp":1634428800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Anhui Provincial Development and Reform Commission 2020 New Energy Vehicle Industry Innovation Development Project ``Key System Research and Vehicle Development for Mass Production Oriented Highly Autonomous Driving'"},{"name":"the National Key Research and Development Program of China","award":["No. 2018AAA0100500"],"award-info":[{"award-number":["No. 2018AAA0100500"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,10,17]]},"DOI":"10.1145\/3474085.3475641","type":"proceedings-article","created":{"date-parts":[[2021,10,18]],"date-time":"2021-10-18T06:21:10Z","timestamp":1634538070000},"page":"5239-5247","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":22,"title":["Neighbor-Vote"],"prefix":"10.1145","author":[{"given":"Xiaomeng","family":"Chu","sequence":"first","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jiajun","family":"Deng","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yao","family":"Li","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhenxun","family":"Yuan","sequence":"additional","affiliation":[{"name":"The University of Sydney, Sydney, NSW, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yanyong","family":"Zhang","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jianmin","family":"Ji","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yu","family":"Zhang","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,10,17]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS45743.2020.9340889"},{"key":"e_1_3_2_2_2_1","volume-title":"Cascade R-CNN: High Quality Object Detection and Instance Segmentation. CoRR","author":"Cai Zhaowei","year":"2019","unstructured":"Zhaowei Cai and Nuno Vasconcelos . 2019. Cascade R-CNN: High Quality Object Detection and Instance Segmentation. CoRR , Vol. abs\/ 1906 .09756 ( 2019 ). arxiv: 1906.09756 Zhaowei Cai and Nuno Vasconcelos. 2019. Cascade R-CNN: High Quality Object Detection and Instance Segmentation. CoRR, Vol. abs\/1906.09756 (2019). arxiv: 1906.09756"},{"key":"e_1_3_2_2_3_1","volume-title":"Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection. CoRR","author":"Deng Jiajun","year":"2020","unstructured":"Jiajun Deng , Shaoshuai Shi , Peiwei Li , Wengang Zhou , Yanyong Zhang , and Houqiang Li. 2020. Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection. CoRR , Vol. abs\/ 2012 .15712 ( 2020 ). arxiv: 2012.15712 Jiajun Deng, Shaoshuai Shi, Peiwei Li, Wengang Zhou, Yanyong Zhang, and Houqiang Li. 2020. Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection. CoRR, Vol. abs\/2012.15712 (2020). arxiv: 2012.15712"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01169"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.5555\/2969033.2969091"},{"key":"e_1_3_2_2_6_1","volume-title":"Deep Ordinal Regression Network for Monocular Depth Estimation. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018","author":"Fu Huan","year":"2018","unstructured":"Huan Fu , Mingming Gong , Chaohui Wang , Kayhan Batmanghelich , and Dacheng Tao . 2018 . Deep Ordinal Regression Network for Monocular Depth Estimation. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018 , Salt Lake City, UT, USA , June 18-22, 2018. IEEE Computer Society, 2002--2011. Huan Fu, Mingming Gong, Chaohui Wang, Kayhan Batmanghelich, and Dacheng Tao. 2018. Deep Ordinal Regression Network for Monocular Depth Estimation. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, June 18-22, 2018. IEEE Computer Society, 2002--2011."},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.5555\/2354409.2354978"},{"key":"e_1_3_2_2_8_1","volume-title":"PointPillars: Fast Encoders for Object Detection From Point Clouds. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019","author":"Lang Alex H.","year":"2019","unstructured":"Alex H. Lang , Sourabh Vora , Holger Caesar , Lubing Zhou , Jiong Yang , and Oscar Beijbom . 2019 . PointPillars: Fast Encoders for Object Detection From Point Clouds. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019 , Long Beach, CA, USA , June 16-20, 2019. Computer Vision Foundation \/ IEEE, 12697--12705. Alex H. Lang, Sourabh Vora, Holger Caesar, Lubing Zhou, Jiong Yang, and Oscar Beijbom. 2019. PointPillars: Fast Encoders for Object Detection From Point Clouds. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019, Long Beach, CA, USA, June 16-20, 2019. Computer Vision Foundation \/ IEEE, 12697--12705."},{"key":"e_1_3_2_2_9_1","volume-title":"UK","volume":"660","author":"Li Peixuan","year":"2020","unstructured":"Peixuan Li , Huaici Zhao , Pengfei Liu , and Feidao Cao . 2020 . RTM3D: Real-Time Monocular 3D Detection from Object Keypoints for Autonomous Driving. In Computer Vision - ECCV 2020 - 16th European Conference, Glasgow , UK , August 23-28, 2020, Proceedings, Part III (Lecture Notes in Computer Science , Vol. 12348), Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm (Eds.). Springer, 644-- 660 . Peixuan Li, Huaici Zhao, Pengfei Liu, and Feidao Cao. 2020. RTM3D: Real-Time Monocular 3D Detection from Object Keypoints for Autonomous Driving. In Computer Vision - ECCV 2020 - 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part III (Lecture Notes in Computer Science, Vol. 12348), Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm (Eds.). Springer, 644--660."},{"key":"e_1_3_2_2_10_1","volume-title":"Focal Loss for Dense Object Detection. In IEEE International Conference on Computer Vision, ICCV 2017","author":"Lin Tsung-Yi","year":"2017","unstructured":"Tsung-Yi Lin , Priya Goyal , Ross B. Girshick , Kaiming He , and Piotr Doll\u00e1 r. 2017 . Focal Loss for Dense Object Detection. In IEEE International Conference on Computer Vision, ICCV 2017 , Venice, Italy , October 22-29, 2017. IEEE Computer Society, 2999--3007. Tsung-Yi Lin, Priya Goyal, Ross B. Girshick, Kaiming He, and Piotr Doll\u00e1 r. 2017. Focal Loss for Dense Object Detection. In IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy, October 22-29, 2017. IEEE Computer Society, 2999--3007."},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW50498.2020.00506"},{"key":"e_1_3_2_2_12_1","volume-title":"UK","volume":"327","author":"Ma Xinzhu","year":"2020","unstructured":"Xinzhu Ma , Shinan Liu , Zhiyi Xia , Hongwen Zhang , Xingyu Zeng , and Wanli Ouyang . 2020 . Rethinking Pseudo-LiDAR Representation. In Computer Vision - ECCV 2020 - 16th European Conference, Glasgow , UK , August 23-28, 2020, Proceedings, Part XIII (Lecture Notes in Computer Science , Vol. 12358), Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm (Eds.). Springer, 311-- 327 . Xinzhu Ma, Shinan Liu, Zhiyi Xia, Hongwen Zhang, Xingyu Zeng, and Wanli Ouyang. 2020. Rethinking Pseudo-LiDAR Representation. In Computer Vision - ECCV 2020 - 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part XIII (Lecture Notes in Computer Science, Vol. 12358), Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm (Eds.). Springer, 311--327."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00695"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00217"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.597"},{"key":"e_1_3_2_2_16_1","volume-title":"2019 IEEE\/CVF International Conference on Computer Vision, ICCV 2019","author":"Qi Charles R.","year":"2019","unstructured":"Charles R. Qi , Or Litany , Kaiming He , and Leonidas J. Guibas . 2019. Deep Hough Voting for 3D Object Detection in Point Clouds . In 2019 IEEE\/CVF International Conference on Computer Vision, ICCV 2019 , Seoul, Korea (South), October 27 - November 2, 2019 . IEEE, 9276--9285. Charles R. Qi, Or Litany, Kaiming He, and Leonidas J. Guibas. 2019. Deep Hough Voting for 3D Object Detection in Point Clouds. In 2019 IEEE\/CVF International Conference on Computer Vision, ICCV 2019, Seoul, Korea (South), October 27 - November 2, 2019. IEEE, 9276--9285."},{"key":"e_1_3_2_2_17_1","volume-title":"2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017","author":"Qi Charles Ruizhongtai","year":"2017","unstructured":"Charles Ruizhongtai Qi , Hao Su , Kaichun Mo , and Leonidas J. Guibas . 2017a. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation . In 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 , Honolulu, HI, USA, July 21--26 , 2017 . IEEE Computer Society, 77--85. Charles Ruizhongtai Qi, Hao Su, Kaichun Mo, and Leonidas J. Guibas. 2017a. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. In 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, July 21--26, 2017. IEEE Computer Society, 77--85."},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.5555\/3295222.3295263"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33018851"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01054"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00086"},{"key":"e_1_3_2_2_22_1","unstructured":"OpenPCDet Development Team. 2020. OpenPCDet: An Open-source Toolbox for 3D Object Detection from Point Clouds. https:\/\/github.com\/open-mmlab\/OpenPCDet.  OpenPCDet Development Team. 2020. OpenPCDet: An Open-source Toolbox for 3D Object Detection from Point Clouds. https:\/\/github.com\/open-mmlab\/OpenPCDet."},{"key":"e_1_3_2_2_23_1","volume-title":"FCOS: Fully Convolutional One-Stage Object Detection. In 2019 IEEE\/CVF International Conference on Computer Vision, ICCV 2019","author":"Tian Zhi","year":"2019","unstructured":"Zhi Tian , Chunhua Shen , Hao Chen , and Tong He . 2019 . FCOS: Fully Convolutional One-Stage Object Detection. In 2019 IEEE\/CVF International Conference on Computer Vision, ICCV 2019 , Seoul, Korea (South), October 27 - November 2, 2019. IEEE, 9626--9635. Zhi Tian, Chunhua Shen, Hao Chen, and Tong He. 2019. FCOS: Fully Convolutional One-Stage Object Detection. In 2019 IEEE\/CVF International Conference on Computer Vision, ICCV 2019, Seoul, Korea (South), October 27 - November 2, 2019. IEEE, 9626--9635."},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.5555\/3295222.3295349"},{"key":"e_1_3_2_2_25_1","volume-title":"Non-Local Neural Networks. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018","author":"Wang Xiaolong","year":"2018","unstructured":"Xiaolong Wang , Ross B. Girshick , Abhinav Gupta , and Kaiming He . 2018 . Non-Local Neural Networks. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018 , Salt Lake City, UT, USA , June 18-22, 2018. IEEE Computer Society, 7794--7803. Xiaolong Wang, Ross B. Girshick, Abhinav Gupta, and Kaiming He. 2018. Non-Local Neural Networks. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, June 18-22, 2018. IEEE Computer Society, 7794--7803."},{"key":"e_1_3_2_2_26_1","volume-title":"IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019","author":"Wang Yan","year":"2019","unstructured":"Yan Wang , Wei-Lun Chao , Divyansh Garg , Bharath Hariharan , Mark E. Campbell , and Kilian Q. Weinberger . 2019. Pseudo-LiDAR From Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving . In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019 , Long Beach, CA, USA , June 16-20, 2019 . Computer Vision Foundation \/ IEEE, 8445--8453. Yan Wang, Wei-Lun Chao, Divyansh Garg, Bharath Hariharan, Mark E. Campbell, and Kilian Q. Weinberger. 2019. Pseudo-LiDAR From Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019, Long Beach, CA, USA, June 16-20, 2019. Computer Vision Foundation \/ IEEE, 8445--8453."},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2019.00114"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01046"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-021-01456-w"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.3012695"},{"key":"e_1_3_2_2_31_1","first-page":"3337","article-title":"SECOND","volume":"18","author":"Yan Yan","year":"2018","unstructured":"Yan Yan , Yuxing Mao , and Bo Li . 2018 . SECOND : Sparsely Embedded Convolutional Detection. Sensors , Vol. 18 , 10 (2018), 3337 . Yan Yan, Yuxing Mao, and Bo Li. 2018. SECOND: Sparsely Embedded Convolutional Detection. Sensors, Vol. 18, 10 (2018), 3337.","journal-title":"Sparsely Embedded Convolutional Detection. Sensors"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00798"},{"key":"e_1_3_2_2_33_1","volume-title":"Objects as Points. CoRR","author":"Zhou Xingyi","year":"2019","unstructured":"Xingyi Zhou , Dequan Wang , and Philipp Kr\u00e4henb\u00fchl . 2019. Objects as Points. CoRR , Vol. abs\/ 1904 .07850 ( 2019 ). arxiv: 1904.07850 Xingyi Zhou, Dequan Wang, and Philipp Kr\u00e4henb\u00fchl. 2019. Objects as Points. CoRR, Vol. abs\/1904.07850 (2019). arxiv: 1904.07850"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00472"}],"event":{"name":"MM '21: ACM Multimedia Conference","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Virtual Event China","acronym":"MM '21"},"container-title":["Proceedings of the 29th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3474085.3475641","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3474085.3475641","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:48:24Z","timestamp":1750193304000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3474085.3475641"}},"subtitle":["Improving Monocular 3D Object Detection through Neighbor Distance Voting"],"short-title":[],"issued":{"date-parts":[[2021,10,17]]},"references-count":34,"alternative-id":["10.1145\/3474085.3475641","10.1145\/3474085"],"URL":"https:\/\/doi.org\/10.1145\/3474085.3475641","relation":{},"subject":[],"published":{"date-parts":[[2021,10,17]]},"assertion":[{"value":"2021-10-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}