{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T13:10:39Z","timestamp":1753881039585,"version":"3.41.2"},"reference-count":56,"publisher":"World Scientific Pub Co Pte Ltd","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["World Sci. Ann. Rev. Artif. Intell."],"published-print":{"date-parts":[[2024,1]]},"abstract":"<jats:p> Exploiting geometric features is a common approach to enhance monocular 3D object detection. However, their performance is limited due to the absence of depth information. To address this limitation, an external depth estimator can be employed to predict depth, but this approach significantly reduces the efficiency and flexibility of the model. Instead of relying on a costly depth estimator, we propose a depth-aware monocular 3D object detector that is trained using augmented training data. Specifically, we utilize reference images and their corresponding depth maps to train an efficient rendering module, which synthesizes a variety of photo-realistic images with different virtual depths. By learning from these images, the detector adapts its features to depth variations. Furthermore, we introduce an auxiliary module that guides the network to learn more informative representations from the depth images. Both modules are removed after training, resulting in no additional computational overhead during the final deployment. <\/jats:p>","DOI":"10.1142\/s2811032324400034","type":"journal-article","created":{"date-parts":[[2024,2,3]],"date-time":"2024-02-03T06:05:12Z","timestamp":1706940312000},"source":"Crossref","is-referenced-by-count":0,"title":["Improving Monocular 3D Object Detection by Synthetic Images with Virtual Depth"],"prefix":"10.1142","volume":"02","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5069-3587","authenticated-orcid":false,"given":"Chenhang","family":"He","sequence":"first","affiliation":[{"name":"Department of Computing, The Hong Kong Polytechnic University, Hong Kong"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2078-4215","authenticated-orcid":false,"given":"Lei","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Computing, The Hong Kong Polytechnic University, Hong Kong"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2024,2,26]]},"reference":[{"doi-asserted-by":"crossref","unstructured":"Y. Zhou and  O. Tuzel,  Proc IEEE Conf Computer Vision and Pattern Recognition, 2018,  pp. 4490\u20134499.","key":"S2811032324400034BIB001","DOI":"10.1109\/CVPR.2018.00472"},{"key":"S2811032324400034BIB002","volume":"18","author":"Yan Y.","year":"2018","journal-title":"Sensors"},{"doi-asserted-by":"crossref","unstructured":"S. Shi,  X. Wang and  H. Li,  Proc IEEE Conf Computer Vision and Pattern Recognition, 2019,  pp. 770\u2013779.","key":"S2811032324400034BIB003","DOI":"10.1109\/CVPR.2019.00086"},{"unstructured":"A. H. Lang,  S. Vora,  H. Caesar,  L. Zhou,  J. Yang and  O. Beijbom,  Proc IEEE Conf Computer Vision and Pattern Recognition, 2019,  pp. 12697\u201312705.","key":"S2811032324400034BIB004"},{"volume-title":"Proc IEEE\/CVF Int Conf Computer Vision","year":"2019","author":"Yang Z.","key":"S2811032324400034BIB005"},{"volume-title":"Proc IEEE Conf Computer Vision and Pattern Recognition","year":"2020","author":"He C.","key":"S2811032324400034BIB006"},{"volume-title":"Proc IEEE Conf Computer Vision and Pattern Recognition","year":"2020","author":"Shi S.","key":"S2811032324400034BIB007"},{"volume-title":"Proc IEEE Conf Computer Vision and Pattern Recognition","year":"2022","author":"He C.","key":"S2811032324400034BIB008"},{"volume-title":"Proc British Machine Vision Conf","year":"2019","author":"Roddick T.","key":"S2811032324400034BIB009"},{"unstructured":"G. Brazil and  X. Liu,  Proc IEEE Int Conf Computer Vision, 2019,  pp. 9287\u20139296.","key":"S2811032324400034BIB010"},{"unstructured":"A. Mousavian,  D. Anguelov,  J. Flynn and  J. Kosecka,  Proc IEEE Conf Computer Vision and Pattern Recognition, 2017,  pp. 7074\u20137082.","key":"S2811032324400034BIB011"},{"doi-asserted-by":"crossref","unstructured":"A. Naiden,  V. Paunescu,  G. Kim,  B. Jeon and  M. Leordeanu,  2019 IEEE Int Conf Image Processing, 2019,  pp. 61\u201365.","key":"S2811032324400034BIB012","DOI":"10.1109\/ICIP.2019.8803397"},{"unstructured":"Z. Liu,  Z. Wu and  R. T\u00f3th,  Proc IEEE\/CVF Conf Computer Vision and Pattern Recognition Workshops, 2020,  pp. 996\u2013997.","key":"S2811032324400034BIB013"},{"doi-asserted-by":"crossref","unstructured":"Z. Qin,  J. Wang and  Y. Lu,  Proc AAAI Conf Artificial Intelligence, 2019,  pp. 8851\u20138858.","key":"S2811032324400034BIB014","DOI":"10.1609\/aaai.v33i01.33018851"},{"unstructured":"X. Zhou,  Y. Peng,  C. Long,  F. Ren and  C. Shi,  Int Conf Machine Learning, 2020,  pp. 11503\u201311512.","key":"S2811032324400034BIB015"},{"unstructured":"Y. Chen,  L. Tai,  K. Sun and  M. Li,  Proc IEEE\/CVF Conf Computer Vision and Pattern Recognition, 2020,  pp. 12093\u201312102.","key":"S2811032324400034BIB016"},{"doi-asserted-by":"crossref","unstructured":"P. Li,  H. Zhao,  P. Liu and  F. Cao,  European Conf Computer Vision, 2020,  pp. 644\u2013660.","key":"S2811032324400034BIB017","DOI":"10.1007\/978-3-030-58580-8_38"},{"unstructured":"Y. Lu,  X. Ma,  L. Yang,  T. Zhang,  Y. Liu,  Q. Chu,  J. Yan and  W. Ouyang,  Proce IEEE\/CVF Int Conf Computer Vision (ICCV), 2021,  pp. 3111\u20133121.","key":"S2811032324400034BIB018"},{"volume-title":"IEEE Conf Computer Vision and Pattern Recognition (CVPR)","year":"2021","author":"Chen H.","key":"S2811032324400034BIB019"},{"volume-title":"European Conf Computer Vision","year":"2022","author":"Li Y.","key":"S2811032324400034BIB020"},{"volume-title":"European Conf Computer Vision","year":"2022","author":"Peng L.","key":"S2811032324400034BIB021"},{"volume-title":"Proc IEEE\/CVF Conf Computer Vision and Pattern Recognition (CVPR)","year":"2022","author":"Gu J.","key":"S2811032324400034BIB022"},{"unstructured":"B. Xu and  Z. Chen,  Proc IEEE Conf Computer Vision and Pattern Recognition, 2018,  pp. 2345\u20132353.","key":"S2811032324400034BIB023"},{"doi-asserted-by":"crossref","unstructured":"H. Chu,  W.C. Ma,  K. Kundu,  R. Urtasun and  S. Fidler,  Proc IEEE Conf Computer Vision and Pattern Recognition, 2018,  pp. 3002\u20133011.","key":"S2811032324400034BIB024","DOI":"10.1109\/CVPR.2018.00317"},{"volume-title":"Proc IEEE\/CVF Conf Computer Vision and Pattern Recognition","year":"2020","author":"Ding M.","key":"S2811032324400034BIB025"},{"unstructured":"Y. Wang,  W.L. Chao,  D. Garg,  B. Hariharan,  M. Campbell and  K. Q. Weinberger,  Proc IEEE Conf Computer Vision and Pattern Recognition, 2019,  pp. 8445\u20138453.","key":"S2811032324400034BIB026"},{"unstructured":"R. Qian,  D. Garg,  Y. Wang,  Y. You,  S. Belongie,  B. Hariharan,  M. Campbell,  K. Q. Weinberger and  W.L. Chao,  Proc IEEE\/CVF Conf Computer Vision and Pattern Recognition, 2020, pp.  5881\u20135890.","key":"S2811032324400034BIB028"},{"volume-title":"Proc European Conf Computer Vision","year":"2020","author":"Ma X.","key":"S2811032324400034BIB029"},{"doi-asserted-by":"publisher","key":"S2811032324400034BIB030","DOI":"10.1109\/TIP.2019.2952201"},{"unstructured":"X. Ma,  Z. Wang,  H. Li,  P. Zhang,  W. Ouyang and  X. Fan,  Proc IEEE Int Conf Computer Vision, 2019,  pp. 6851\u20136860.","key":"S2811032324400034BIB031"},{"unstructured":"J. Ku,  A. D. Pon and  S. L. Waslander,  Proc IEEE Conf Computer Vision and Pattern Recognition, 2019,  pp. 11867\u201311876.","key":"S2811032324400034BIB032"},{"volume-title":"IEEE Int Conf Computer Vision Workshops","year":"2019","author":"Weng X.","key":"S2811032324400034BIB033"},{"unstructured":"O. Wiles,  G. Gkioxari,  R. Szeliski and  J. Johnson,  Proc IEEE\/CVF Conf Computer Vision and Pattern Recognition, 2020,  pp. 7467\u20137477.","key":"S2811032324400034BIB034"},{"unstructured":"X. Chen,  J. Song and  O. Hilliges,  Proc IEEE Int Conf Computer Vision, 2019,  pp. 4090\u20134100.","key":"S2811032324400034BIB035"},{"unstructured":"I. Choi,  O. Gallo,  A. Troccoli,  M. H. Kim and  J. Kautz,  Proc IEEE Int Conf Computer Vision, 2019,  pp. 7781\u20137790.","key":"S2811032324400034BIB036"},{"unstructured":"Y. Zhang,  J. Lu and  J. Zhou,  Proc IEEE\/CVF Conf Computer Vision and Pattern Recognition (CVPR), 2021,  pp. 3289\u20133298.","key":"S2811032324400034BIB038"},{"unstructured":"X. Chen,  Y. Duan,  R. Houthooft,  J. Schulman,  I. Sutskever and  P. Abbeel,  Advances in Neural Information Processing Systems, 2016,  pp. 2172\u20132180.","key":"S2811032324400034BIB039"},{"unstructured":"T. D. Kulkarni,  W. F. Whitney,  P. Kohli and  J. Tenenbaum,  Advances in Neural Information Processing Systems, 2015,  pp. 2539\u20132547.","key":"S2811032324400034BIB040"},{"doi-asserted-by":"crossref","unstructured":"M. Tatarchenko,  A. Dosovitskiy and  T. Brox,  European Conf Computer Vision, 2016,  pp. 322\u2013337.","key":"S2811032324400034BIB041","DOI":"10.1007\/978-3-319-46478-7_20"},{"doi-asserted-by":"crossref","unstructured":"T. Zhou,  S. Tulsiani,  W. Sun,  J. Malik and  A. A. Efros,  European Conf Computer Vision, 2016,  pp. 286\u2013301.","key":"S2811032324400034BIB042","DOI":"10.1007\/978-3-319-46493-0_18"},{"unstructured":"D. E. Worrall,  S. J. Garbin,  D. Turmukhambetov and  G. J. Brostow,  Proc IEEE Int Conf Computer Vision, 2017,  pp. 5726\u20135735.","key":"S2811032324400034BIB043"},{"unstructured":"V. Sitzmann,  M. Zollh\u00f6fer and  G. Wetzstein,  Advances in Neural Information Processing Systems, 2019,  pp. 1121\u20131132.","key":"S2811032324400034BIB044"},{"key":"S2811032324400034BIB045","first-page":"66","volume":"7","author":"Alexander M.","year":"2018","journal-title":"J. Comput. Graph. Tech. (JCGT)"},{"unstructured":"G. Liu,  F. A. Reda,  K. J. Shih,  T.C. Wang,  A. Tao and  B. Catanzaro,  Proc European Conf Computer Vision (ECCV), 2018,  pp. 85\u2013100.","key":"S2811032324400034BIB046"},{"doi-asserted-by":"crossref","unstructured":"J. Yu,  Z. Lin,  J. Yang,  X. Shen,  X. Lu and  T. S. Huang,  Proc IEEE Conf Computer Vision and Pattern Recognition, 2018,  pp. 5505\u20135514.","key":"S2811032324400034BIB047","DOI":"10.1109\/CVPR.2018.00577"},{"volume-title":"Proc IEEE\/CVF Conf Computer Vision and Pattern Recognition (CVPR)","year":"2019","author":"Liang M.","key":"S2811032324400034BIB048"},{"unstructured":"T. Mordan,  N. Thome,  G. Henaff and  M. Cord,  Advances in Neural Information Processing Systems (NeurIPS), 2018,  pp. 1310\u20131322.","key":"S2811032324400034BIB049"},{"doi-asserted-by":"crossref","unstructured":"W. Liu,  D. Anguelov,  D. Erhan,  C. Szegedy,  S. Reed,  C.Y. Fu and  A. C. Berg,  European Conf Computer Vision, 2016,  pp. 21\u201337.","key":"S2811032324400034BIB050","DOI":"10.1007\/978-3-319-46448-0_2"},{"doi-asserted-by":"publisher","key":"S2811032324400034BIB051","DOI":"10.1177\/0278364913491297"},{"unstructured":"X. Chen,  H. Ma,  J. Wan,  B. Li and  T. Xia,  Proc IEEE Conf Computer Vision and Pattern Recognition, 2017,  pp. 1907\u20131915.","key":"S2811032324400034BIB052"},{"unstructured":"X. Chen,  K. Kundu,  Y. Zhu,  A. G. Berneshawi,  H. Ma,  S. Fidler and  R. Urtasun,  Advances in Neural Information Processing Systems, 2015,  pp. 424\u2013432.","key":"S2811032324400034BIB053"},{"doi-asserted-by":"crossref","unstructured":"A. Simonelli,  S. R. Bulo,  L. Porzi,  M. L\u00f3pez-Antequera and  P. Kontschieder,  Proc IEEE Int Conf Computer Vision, 2019,  pp. 1991\u20131999.","key":"S2811032324400034BIB054","DOI":"10.1109\/ICCV.2019.00208"},{"unstructured":"L. Liu,  J. Lu,  C. Xu,  Q. Tian and  J. Zhou,  Proc IEEE Conf Computer Vision and Pattern Recognition, 2019,  pp. 1057\u20131066.","key":"S2811032324400034BIB055"},{"unstructured":"F. Manhardt,  W. Kehl and  A. Gaidon,  Proc IEEE Conf Computer Vision and Pattern Recognition, 2019,  pp. 2069\u20132078.","key":"S2811032324400034BIB056"},{"unstructured":"B. Li,  W. Ouyang,  L. Sheng,  X. Zeng and  X. Wang,  Proc IEEE Conf Computer Vision and Pattern Recognition, 2019,  pp. 1019\u20131028.","key":"S2811032324400034BIB057"},{"volume-title":"European Conf Computer Vision","year":"2020","author":"Brazil G.","key":"S2811032324400034BIB058"}],"container-title":["World Scientific Annual Review of Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S2811032324400034","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,17]],"date-time":"2025-02-17T07:51:43Z","timestamp":1739778703000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S2811032324400034"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1]]},"references-count":56,"alternative-id":["10.1142\/S2811032324400034"],"URL":"https:\/\/doi.org\/10.1142\/s2811032324400034","relation":{},"ISSN":["2811-0323","2811-0331"],"issn-type":[{"type":"print","value":"2811-0323"},{"type":"electronic","value":"2811-0331"}],"subject":[],"published":{"date-parts":[[2024,1]]},"article-number":"2440003"}}