{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,28]],"date-time":"2025-10-28T18:42:16Z","timestamp":1761676936370,"version":"build-2065373602"},"reference-count":31,"publisher":"MDPI AG","issue":"20","license":[{"start":{"date-parts":[[2020,10,21]],"date-time":"2020-10-21T00:00:00Z","timestamp":1603238400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61973311"],"award-info":[{"award-number":["61973311"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Science and Technology on Near-Surface Detection Laboratory","award":["6142414180207"],"award-info":[{"award-number":["6142414180207"]}]},{"name":"the Science and Technology Development Fund","award":["0015\/2019\/AKP"],"award-info":[{"award-number":["0015\/2019\/AKP"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Estimating range to the closest object in front is the core component of the forward collision warning (FCW) system. Previous monocular range estimation methods mostly involve two sequential steps of object detection and range estimation. As a result, they are only effective for objects from specific categories relying on expensive object-level annotation for training, but not for unseen categories. In this paper, we present an end-to-end deep learning architecture to solve the above problems. Specifically, we represent the target range as a weighted sum of a set of potential distances. These potential distances are generated by inverse perspective projection based on intrinsic and extrinsic camera parameters, while a deep neural network predicts the corresponding weights of these distances. The whole architecture is optimized towards the range estimation task directly in an end-to-end manner with only the target range as supervision. As object category is not restricted in the training stage, the proposed method can generalize to objects with unseen categories. Furthermore, camera parameters are explicitly considered in the proposed method, making it able to generalize to images taken with different cameras and novel views. Additionally, the proposed method is not a pure black box, but provides partial interpretability by visualizing the produced weights to see which part of the image dominates the final result. We conduct experiments to verify the above properties of the proposed method on synthetic and real-world collected data.<\/jats:p>","DOI":"10.3390\/s20205941","type":"journal-article","created":{"date-parts":[[2020,10,22]],"date-time":"2020-10-22T20:51:00Z","timestamp":1603399860000},"page":"5941","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["End-to-End Monocular Range Estimation for Forward Collision Warning"],"prefix":"10.3390","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5556-2084","authenticated-orcid":false,"given":"Jie","family":"Tang","sequence":"first","affiliation":[{"name":"College of Intelligence Science, National University of Defense Technology, Changsha 410073, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jian","family":"Li","sequence":"additional","affiliation":[{"name":"College of Intelligence Science, National University of Defense Technology, Changsha 410073, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2020,10,21]]},"reference":[{"key":"ref_1","unstructured":"Van Der Horst, R., and Hogema, J. (1993, January 27\u201329). Time-to-collision and collision avoidance systems. Proceedings of the 6th ICTCT Workshop: Safety Evaluation of Traffic Systems: Traffic Conflicts and Other Measures, Salzburg, Austria."},{"key":"ref_2","unstructured":"Dagan, E., Mano, O., Stein, G.P., and Shashua, A. (2004, January 14\u201317). Forward collision warning with a single camera. Proceedings of the IEEE Intelligent Vehicles Symposium, Parma, Italy."},{"key":"ref_3","unstructured":"Chen, Y.L., Shen, K.Y., and Wang, S.C. (2013, January 19\u201321). Forward collision warning system considering both time-to-collision and safety braking distance. Proceedings of the 2013 IEEE 8th Conference on Industrial Electronics and Applications (ICIEA), Melbourne, Australia."},{"key":"ref_4","unstructured":"Eigen, D., Puhrsch, C., and Fergus, R. (2014, January 8\u201313). Depth map prediction from a single image using a multi-scale deep network. Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_5","unstructured":"Kim, G., and Cho, J.S. (2012, January 17\u201321). Vision-based vehicle detection and inter-vehicle distance estimation. Proceedings of the 2012 12th International Conference on Control, Automation and Systems, JeJu Island, Korea."},{"key":"ref_6","unstructured":"Wu, B.F., Chen, Y.L., Chen, Y.H., and Chen, C.J. (2011). Real-Time Nighttime Vehicle Detection and Recognition System Based on Computer Vision. (No. 7,949,190), U.S. Patent."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"388","DOI":"10.1007\/s10043-012-0063-1","article-title":"Vision-based vehicle detection and inter-vehicle distance estimation for driver alarm system","volume":"19","author":"Kim","year":"2012","journal-title":"Opt. Rev."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Tuohy, S., O\u2019Cualain, D., Jones, E., and Glavin, M. (2010, January 23\u201324). Distance determination for an automobile environment using inverse perspective mapping in OpenCV. Proceedings of the IET Irish Signals and Systems Conference (ISSC 2010), Cork, Ireland.","DOI":"10.1049\/cp.2010.0495"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1007\/BF00201978","article-title":"Inverse perspective mapping simplifies optical flow computation and obstacle detection","volume":"64","author":"Mallot","year":"1991","journal-title":"Biol. Cybern."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"2723","DOI":"10.1109\/TITS.2015.2421482","article-title":"Robust vehicle detection and distance estimation under challenging lighting conditions","volume":"16","author":"Rezaei","year":"2015","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Kim, J.B. (2019). Efficient vehicle detection and distance estimation based on aggregated channel features and inverse perspective mapping from a single camera. Symmetry, 11.","DOI":"10.3390\/sym11101205"},{"key":"ref_12","unstructured":"Stein, G.P., Mano, O., and Shashua, A. (2003, January 9\u201311). Vision-based ACC with a single camera: Bounds on range and range rate accuracy. Proceedings of the IEEE IV2003 Intelligent Vehicles Symposium, Proceedings (Cat. No. 03TH8683), Columbus, OH, USA."},{"key":"ref_13","unstructured":"Zhu, J., and Fang, Y. (November, January 27). Learning object-specific distance from a monocular image. Proceedings of the IEEE International Conference on Computer Vision, Seoul, Korea."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Li, Y., Zhao, M., and Yu, X. (2020, January 6\u201310). A regional regression network for monocular object distance estimation. Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), London, UK.","DOI":"10.1109\/ICMEW46912.2020.9106012"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"4907","DOI":"10.1109\/TVT.2020.2977623","article-title":"Inter-Vehicle distance estimation method based on monocular vision using 3D detection","volume":"69","author":"Zhe","year":"2020","journal-title":"IEEE Trans. Veh. Technol."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Facil, J.M., Ummenhofer, B., Zhou, H., Montesano, L., Brox, T., and Civera, J. (2019, January 16\u201320). CAM-Convs: Camera-aware multi-scale convolutions for single-view depth. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.01210"},{"key":"ref_17","unstructured":"Zhao, Y., Kong, S., and Fowlkes, C. (2020). When perspective comes for free: Improving depth prediction with camera pose encoding. arXiv."},{"key":"ref_18","unstructured":"Dijk, T.V., and Croon, G.D. (November, January 27). How do neural networks see depth in single images?. Proceedings of the IEEE International Conference on Computer Vision, Seoul, Korea."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"1997","DOI":"10.1016\/j.eswa.2013.08.096","article-title":"Automatic on-the-fly extrinsic camera calibration of onboard vehicular cameras","volume":"41","author":"Jung","year":"2014","journal-title":"Expert Syst. Appl."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-net: Convolutional networks for biomedical image segmentation. Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_21","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (July, January 26). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA."},{"key":"ref_22","unstructured":"Ioffe, S., and Szegedy, C. (2015). Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv."},{"key":"ref_23","unstructured":"Nair, V., and Hinton, G.E. (2010, January 21\u201324). Rectified linear units improve restricted boltzmann machines. Proceedings of the ICML, Haifa, Israel."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_25","first-page":"1929","article-title":"Dropout: A simple way to prevent neural networks from overfitting","volume":"15","author":"Srivastava","year":"2014","journal-title":"J. Mach. Learn. Res."},{"key":"ref_26","unstructured":"Ba, J.L., Kiros, J.R., and Hinton, G.E. (2016). Layer normalization. arXiv."},{"key":"ref_27","unstructured":"Dugas, C., Bengio, Y., B\u00e9lisle, F., Nadeau, C., and Garcia, R. (2001, January 3\u20138). Incorporating second-order functional knowledge for better option pricing. Proceedings of the Advances in Neural Information Processing Systems, Vancouver, BC, Canada."},{"key":"ref_28","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1231","DOI":"10.1177\/0278364913491297","article-title":"Vision meets robotics: The KITTI dataset","volume":"32","author":"Geiger","year":"2013","journal-title":"Int. J. Robot. Res."},{"key":"ref_30","unstructured":"Gaidon, A., Wang, Q., Cabon, Y., and Vig, E. (July, January 26). Virtual worlds as proxy for multi-object tracking analysis. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Ren, J., Chen, X., Liu, J., Sun, W., Pang, J., Yan, Q., Tai, Y.W., and Xu, L. (2017, January 21\u201326). Accurate single stage detector using recurrent rolling convolution. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.87"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/20\/20\/5941\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T10:25:06Z","timestamp":1760178306000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/20\/20\/5941"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,21]]},"references-count":31,"journal-issue":{"issue":"20","published-online":{"date-parts":[[2020,10]]}},"alternative-id":["s20205941"],"URL":"https:\/\/doi.org\/10.3390\/s20205941","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2020,10,21]]}}}