{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,20]],"date-time":"2025-10-20T10:27:41Z","timestamp":1760956061241,"version":"build-2065373602"},"reference-count":29,"publisher":"MDPI AG","issue":"18","license":[{"start":{"date-parts":[[2020,9,21]],"date-time":"2020-09-21T00:00:00Z","timestamp":1600646400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Depth estimation of a single image presents a classic problem for computer vision, and is important for the 3D reconstruction of scenes, augmented reality, and object detection. At present, most researchers are beginning to focus on unsupervised monocular depth estimation. This paper proposes solutions to the current depth estimation problem. These solutions include a monocular depth estimation method based on uncertainty analysis, which solves the problem in which a neural network has strong expressive ability but cannot evaluate the reliability of an output result. In addition, this paper proposes a photometric loss function based on the Retinex algorithm, which solves the problem of pulling around pixels due to the presence of moving objects. We objectively compare our method to current mainstream monocular depth estimation methods and obtain satisfactory results.<\/jats:p>","DOI":"10.3390\/s20185389","type":"journal-article","created":{"date-parts":[[2020,9,21]],"date-time":"2020-09-21T08:18:01Z","timestamp":1600676281000},"page":"5389","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Unsupervised Monocular Depth Estimation Method Based on Uncertainty Analysis and Retinex Algorithm"],"prefix":"10.3390","volume":"20","author":[{"given":"Chuanxue","family":"Song","sequence":"first","affiliation":[{"name":"College of Automotive Engineering, Jilin University, Changchun 130022, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chunyang","family":"Qi","sequence":"additional","affiliation":[{"name":"College of Automotive Engineering, Jilin University, Changchun 130022, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shixin","family":"Song","sequence":"additional","affiliation":[{"name":"School of Mechanical and Aerospace Engineering, Jilin University, Changchun 130022, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1847-504X","authenticated-orcid":false,"given":"Feng","family":"Xiao","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Automotive Simulation and Control, Jilin University, Changchun 130022, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2020,9,21]]},"reference":[{"key":"ref_1","unstructured":"Eigen, D., Puhrsch, C., and Fergus, R. (2014, January 8\u201313). Depth Map Prediction from a Single Image Using a Multi-Scale Deep Network. Proceedings of the Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Eigen, D., and Fergus, R. (2016, January 11\u201318). Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.304"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Liu, F., Shen, C., and Lin, G. (2015, January 7\u201312). Deep Convolutional Neural Fields for Depth Estimation from a Single Image. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299152"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"2024","DOI":"10.1109\/TPAMI.2015.2505283","article-title":"Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields","volume":"38","author":"Liu","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_5","unstructured":"Trigueiros, P., Ribeiro, F., and Reis, L.P. (2012, January 20\u201323). A Comparison of Machine Learning Algorithms Applied to Hand Gesture Recognition. Proceedings of the 7th Iberian Conference on Information Systems and Technologies, Mardin, Spain."},{"key":"ref_6","unstructured":"Li, N.B., Shen, N.C., Dai, N.Y., Hengel, A.V.D., and He, N.M. (2015, January 7\u201312). Depth and Surface Normal Estimation from Monocular Images Using Regression on Deep Features and Hierarchical Crfs. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Laina, I., Rupprecht, C., Belagiannis, V., Tombari, F., and Navab, N. (2016, January 25\u201328). Deeper Depth Prediction with Fully Convolutional Residual Networks. Proceedings of the 2016 Fourth International Conference on 3D Vision (3DV), Stanford, CA, USA.","DOI":"10.1109\/3DV.2016.32"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"3174","DOI":"10.1109\/TCSVT.2017.2740321","article-title":"Estimating Depth from Monocular Images as Classification Using Deep Fully Convolutional Residual Networks","volume":"28","author":"Cao","year":"2018","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Xu, D., Elisa, R., and Ouyang, W.L. (2017, January 21\u201326). Multi-scale continuous CRFs as sequential deep networks for monocular depth estimation. Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.25"},{"key":"ref_10","unstructured":"Arsalan, M., Hamed, P., and Jana, K. (2016, January 25\u201328). Joint semantic segmentation and depth estimation with deep convolutional networks. Proceedings of the 4th International Conference on 3D Vision, Stanford, CA, USA."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Zhang, Z.Y., Alexander, G.S., and Sanja, F. (2015, January 7\u201312). Monocular object instance segmentation and depth ordering with CNNs. Proceedings of the 15th International Conference on Computer Vision, Boston, MA, USA.","DOI":"10.1109\/ICCV.2015.300"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Liu, B.Y., Stephen, G., and Stephen, G. (2010, January 13\u201318). Single image depth estimation from predicted semantic labels. Proceedings of the 23th IEEE Conference on Computer Vision and Pattern, San Francisco, CA, USA.","DOI":"10.1109\/CVPR.2010.5539823"},{"key":"ref_13","unstructured":"Wang, P., Shen, X.H., and Lin, Z. (2015, January 7\u201312). Towards unified depth and semantic prediction from a single image. Proceedings of the 28th IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Zhou, T., Brown, M., Snavely, N., and Lowe, D.G. (2017, January 21\u201326). Unsupervised Learning of Depth and Ego-Motion from Video. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.700"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Yin, Z., and Shi, J. (2018, January 18\u201322). Geonet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00212"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Mahjourian, R., Wicke, M., and Angelova, A. (2018, January 18\u201322). Unsupervised Learning of Depth and Ego-Motion from Monocular Video Using 3d Geometric Constraints. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00594"},{"key":"ref_17","unstructured":"Clement, G., Oisin, M.A., and Gabriel, J.B. (2016, January 27\u201330). Unsupervised monocular depth estimation with left-right consistency. Proceedings of the 29th IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Zhan, H., Garg, R., Weerasekera, C.S., Li, K., Agarwal, H., and Reid, I. (2018, January 18\u201322). Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00043"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Garg, R., Bg, V.K., Carneiro, G., and Reid, I. (2016, January 11\u201314). Unsupervised Cnn for Single View Depth Estimation: Geometry to the Rescue. Proceedings of the European Conference on Computer Vision 2016, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46484-8_45"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Godard, C., Mac Aodha, O., and Brostow, G.J. (2017, January 21\u201326). Unsupervised Monocular Depth Estimation with Left-Right Consistency. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.699"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Kuznietsov, Y., Stuckler, J., and Leibe, B. (2017, January 21\u201326). Semi-supervised deep learning for monocular depth map prediction. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.238"},{"key":"ref_22","unstructured":"Kendall, A., and Gal, Y. (2017, January 4\u20139). What uncertainties do we need in bayesian deep learning for computer vision?. Proceedings of the Neural Information Processing Systems 30, Long Beach, CA, USA."},{"key":"ref_23","unstructured":"Kendall, A., Gal, Y., and Cipolla, R. (2018, January 18\u201322). Multi-task learning using uncertainty to weight losses for scene geometry and semantics. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA."},{"key":"ref_24","unstructured":"Jaderberg, M., Simonyan, K., and Zisserman, A. (2015, January 7\u201312). Spatial transformer networks. Proceedings of the Neural Information Processing Systems (NIPS), Montreal, QC, Canada."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1231","DOI":"10.1177\/0278364913491297","article-title":"Vision meets Robotics:The kitti dataset","volume":"32","author":"Geiger","year":"2013","journal-title":"Int. J. Robot. Res. (IJRR)"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"2024","DOI":"10.1109\/TPAMI.2015.2505283","article-title":"Learning depth from single monocular images using deep convolutional neural fields","volume":"38","author":"Liu","year":"2016","journal-title":"IEEE Trans. Pattern Recognit. Mach. Intell. PAMI"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Wang, C., Miguel Buenaposada, J., Zhu, R., and Lucey, S. (2018, January 18\u201322). Learning depth from monocular videos using direct methods. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00216"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Zou, Y., Luo, Z., and Huang, J.B. (2018, January 8\u201314). DF-Net: Unsupervised joint learning of depth and flow using cross-task consistency. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01228-1_3"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Ranjan, A., Jampani, V., Kim, K., Sun, D., Wulff, J., and Black, M.J. (2019, January 16\u201320). Competitive Collaboration: Joint unsupervised learning of depth, camera motion, optical flow and motion segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.01252"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/20\/18\/5389\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T10:11:52Z","timestamp":1760177512000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/20\/18\/5389"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,9,21]]},"references-count":29,"journal-issue":{"issue":"18","published-online":{"date-parts":[[2020,9]]}},"alternative-id":["s20185389"],"URL":"https:\/\/doi.org\/10.3390\/s20185389","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2020,9,21]]}}}