{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,2]],"date-time":"2026-03-02T12:28:58Z","timestamp":1772454538399,"version":"3.50.1"},"reference-count":56,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2024,5,29]],"date-time":"2024-05-29T00:00:00Z","timestamp":1716940800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"},{"start":{"date-parts":[[2024,5,29]],"date-time":"2024-05-29T00:00:00Z","timestamp":1716940800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Mach. Intell. Res."],"published-print":{"date-parts":[[2024,8]]},"DOI":"10.1007\/s11633-024-1494-4","type":"journal-article","created":{"date-parts":[[2024,5,29]],"date-time":"2024-05-29T16:02:27Z","timestamp":1716998547000},"page":"652-669","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Towards Domain-agnostic Depth Completion"],"prefix":"10.1007","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9669-0381","authenticated-orcid":false,"given":"Guangkai","family":"Xu","sequence":"first","affiliation":[]},{"given":"Wei","family":"Yin","sequence":"additional","affiliation":[]},{"given":"Jianming","family":"Zhang","sequence":"additional","affiliation":[]},{"given":"Oliver","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Simon","family":"Niklaus","sequence":"additional","affiliation":[]},{"given":"Simon","family":"Chen","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2046-3363","authenticated-orcid":false,"given":"Jia-Wang","family":"Bian","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,5,29]]},"reference":[{"key":"1494_CR1","doi-asserted-by":"publisher","first-page":"10526","DOI":"10.1109\/CVPR42600.2020.01054","volume-title":"Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, USA","author":"S S Shi","year":"2020","unstructured":"S. S. Shi, C. X. Guo, L. Jiang, Z. Wang, J. P. Shi, X. G. Wang, H. S. Li. PV-RCNN: Point-voxel feature set abstraction for 3D object detection. In Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, USA, pp. 10526\u201310535, 2020. DOI: https:\/\/doi.org\/10.1109\/CVPR42600.2020.01054."},{"key":"1494_CR2","doi-asserted-by":"publisher","first-page":"8437","DOI":"10.1109\/CVPR.2019.00864","volume-title":"Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA","author":"Y Wang","year":"2019","unstructured":"Y. Wang, W. L. Chao, D. Garg, B. Hariharan, M. Campbell, K. Q. Weinberger. Pseudo-LiDAR from visual depth estimation: Bridging the gap in 3D object detection for autonomous driving. In Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA, pp. 8437\u20138455, 2019. DOI: https:\/\/doi.org\/10.1109\/CVPR.2019.00864."},{"key":"1494_CR3","doi-asserted-by":"publisher","first-page":"127","DOI":"10.1109\/IS-MAR.2011.6092378","volume-title":"Proceedings of the 10th IEEE International Symposium on Mixed and Augmented Reality, Basel, Switzerland","author":"R A Newcombe","year":"2011","unstructured":"R. A. Newcombe, S. Izadi, O. Hilliges, D. Molyneaux, D. Kim, A. J. Davison, P. Kohi, J. Shotton, S. Hodges, A. Fitzgibbon. KinectFusion: Real-time dense surface mapping and tracking. In Proceedings of the 10th IEEE International Symposium on Mixed and Augmented Reality, Basel, Switzerland, pp. 127\u2013136, 2011. DOI: https:\/\/doi.org\/10.1109\/IS-MAR.2011.6092378."},{"issue":"5","key":"1494_CR4","doi-asserted-by":"publisher","first-page":"1255","DOI":"10.1109\/TRO.2017.2705103","volume":"33","author":"R Mur-Artal","year":"2017","unstructured":"R. Mur-Artal, J. D. Tard\u00f3s. ORB-SLAM2: An open-source SLAM system for monocular, stereo, and RGB-D cameras. IEEE Transactions on Robotics, vol. vol. 33, no. 5, pp. 1255\u20131262, 2017. DOI: https:\/\/doi.org\/10.1109\/TRO.2017.2705103.","journal-title":"IEEE Transactions on Robotics"},{"key":"1494_CR5","doi-asserted-by":"publisher","first-page":"9276","DOI":"10.1109\/ICCV51070.2023.00854","volume-title":"Proceedings of IEEE\/CVF International Conference on Computer Vision, Paris, France","author":"G K Xu","year":"2023","unstructured":"G. K. Xu, W. Yin, H. Chen, C. H. Shen, K. Cheng, F. Zhao. FrozenRecon: Pose-free 3D scene reconstruction with frozen depth models. In Proceedings of IEEE\/CVF International Conference on Computer Vision, Paris, France, pp. 9276\u20139286, 2023. DOI: https:\/\/doi.org\/10.1109\/ICCV51070.2023.00854."},{"key":"1494_CR6","doi-asserted-by":"publisher","first-page":"2538","DOI":"10.1109\/CVPR.2017.272","volume-title":"Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA","author":"T Sch\u00f6ps","year":"2017","unstructured":"T. Sch\u00f6ps, J. L. Sch\u00f6nberger, S. Galliani, T. Sattler, K. Schindler, M. Pollefeys, A. Geiger. A multi-view stereo benchmark with high-resolution images and multi-camera videos. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, pp. 2538\u20132547, 2017. DOI: https:\/\/doi.org\/10.1109\/CVPR.2017.272."},{"key":"1494_CR7","doi-asserted-by":"publisher","first-page":"1787","DOI":"10.1109\/CVPR42600.2020.00186","volume-title":"Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, USA","author":"Y Yao","year":"2020","unstructured":"Y. Yao, Z. X. Luo, S. W. Li, J. Y. Zhang, Y. F. Ren, L. Zhou, T. Fang, L. Quan. BlendedMVS: A large-scale dataset for generalized multi-view stereo networks. In Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, USA, pp. 1787\u20131796, 2020. DOI: https:\/\/doi.org\/10.1109\/CVPR42600.2020.00186."},{"key":"1494_CR8","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1109\/CVPR.2019.00027","volume-title":"Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA","author":"F H Zhang","year":"2019","unstructured":"F. H. Zhang, V. Prisacariu, R. G. Yang, P. H. S. Torr. GA-Net: Guided aggregation net for end-to-end stereo matching. In Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA, pp. 185\u2013194, 2019. DOI: https:\/\/doi.org\/10.1109\/CVPR.2019.00027."},{"key":"1494_CR9","doi-asserted-by":"publisher","first-page":"175","DOI":"10.1109\/CVPR.2018.00026","volume-title":"Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, USA","author":"Y D Zhang","year":"2018","unstructured":"Y. D. Zhang, T. Funkhouser. Deep depth completion of a single RGB-D image. In Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, USA, pp. 175\u2013185, 2018. DOI: https:\/\/doi.org\/10.1109\/CVPR.2018.00026."},{"key":"1494_CR10","doi-asserted-by":"publisher","first-page":"2181","DOI":"10.1109\/IROS51168.2021.9636870","volume-title":"Proceedings of IEEE\/RSJ International Conference on Intelligent Robots and Systems, Prague, Czech Republic","author":"D Senushkin","year":"2021","unstructured":"D. Senushkin, M. Romanov, I. Belikov, N. Patakin, A. Konushin. Decoder modulation for indoor depth completion. In Proceedings of IEEE\/RSJ International Conference on Intelligent Robots and Systems, Prague, Czech Republic, pp. 2181\u20132188, 2021. DOI: https:\/\/doi.org\/10.1109\/IROS51168.2021.9636870."},{"key":"1494_CR11","doi-asserted-by":"publisher","first-page":"1070","DOI":"10.1109\/ICCVW.2019.00137","volume-title":"Proceedings of IEEE\/CVF International Conference on Computer Vision Workshop, Seoul, Republic of Korea","author":"Y K Huang","year":"2019","unstructured":"Y. K. Huang, T. H. Wu, Y. C. Liu, W. H. Hsu. Indoor depth completion with boundary consistency and self-attention. In Proceedings of IEEE\/CVF International Conference on Computer Vision Workshop, Seoul, Republic of Korea, pp. 1070\u20131078, 2019. DOI: https:\/\/doi.org\/10.1109\/ICCVW.2019.00137."},{"key":"1494_CR12","doi-asserted-by":"publisher","first-page":"10615","DOI":"10.1609\/aaai.v34i07.6635","volume-title":"Proceedings of the 34th AAAI Conference on Artificial Intelligence, New York, USA","author":"X J Cheng","year":"2020","unstructured":"X. J. Cheng, P. Wang, C. Y. Guan, R. G. Yang. CSPN++: Learning context and resource aware convolutional spatial propagation networks for depth completion. In Proceedings of the 34th AAAI Conference on Artificial Intelligence, New York, USA, pp. 10615\u201310622, 2020. DOI: https:\/\/doi.org\/10.1609\/aaai.v34i07.6635."},{"key":"1494_CR13","doi-asserted-by":"publisher","first-page":"120","DOI":"10.1007\/978-3-030-58601-0_8","volume-title":"Proceedings of the 16th European Conference on Computer Vision, Glasgow, UK","author":"J Park","year":"2020","unstructured":"J. Park, K. Joo, Z. Hu, C. K. Liu, I. S. Kweon. Non-local spatial propagation network for depth completion. In Proceedings of the 16th European Conference on Computer Vision, Glasgow, UK, pp. 120\u2013136, 2020. DOI: https:\/\/doi.org\/10.1007\/978-3-030-58601-0_8."},{"key":"1494_CR14","doi-asserted-by":"publisher","first-page":"2811","DOI":"10.1109\/ICCV.2019.00290","volume-title":"Proceedings of IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea","author":"Y Xu","year":"2019","unstructured":"Y. Xu, X. G. Zhu, J. P. Shi, G. F. Zhang, H. J. Bao, H. S. Li, Depth completion from sparse LiDAR data with depth-normal constraints. In Proceedings of IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea, pp. 2811\u20132820, 2019. DOI: https:\/\/doi.org\/10.1109\/ICCV.2019.00290."},{"key":"1494_CR15","doi-asserted-by":"publisher","first-page":"3308","DOI":"10.1109\/CVPR.2019.00343","volume-title":"Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA","author":"J X Qiu","year":"2019","unstructured":"J. X. Qiu, Z. P. Cui, Y. D. Zhang, X. D. Zhang, S. C. Liu, B. Zeng, M. Pollefeys. DeepLiDAR: Deep surface normal guided depth prediction for outdoor scene from sparse LiDAR data and single color image. In Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA, 2019, pp. 3308\u20133317. DOI: https:\/\/doi.org\/10.1109\/CVPR.2019.00343."},{"issue":"10","key":"1494_CR16","doi-asserted-by":"publisher","first-page":"2361","DOI":"10.1109\/TPAMI.2019.2947374","volume":"42","author":"X J Cheng","year":"2020","unstructured":"X. J. Cheng, P. Wang, R. G. Yang. Learning depth with convolutional spatial propagation network. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. vol. 42, no. 10, pp. 2361\u20132379, 2020. DOI: https:\/\/doi.org\/10.1109\/TPAMI.2019.2947374.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"1494_CR17","doi-asserted-by":"publisher","first-page":"746","DOI":"10.1007\/978-3-642-33715-4_54","volume-title":"Proceedings of the 12th European Conference on Computer Vision, Florence, Italy","author":"N Silberman","year":"2012","unstructured":"N. Silberman, D. Hoiem, P. Kohli, R. Fergus. Indoor segmentation and support inference from RGBD images. In Proceedings of the 12th European Conference on Computer Vision, Florence, Italy, pp. 746\u2013760, 2012. DOI: https:\/\/doi.org\/10.1007\/978-3-642-33715-4_54."},{"key":"1494_CR18","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1109\/3DV.2017.00012","volume-title":"Proceedings of International Conference on 3D Vision, Qingdao, China","author":"J Uhrig","year":"2017","unstructured":"J. Uhrig, N. Schneider, L. Schneider, U. Franke, T. Brox, A. Geiger. Sparsity invariant CNNs. In Proceedings of International Conference on 3D Vision, Qingdao, China, pp. 11\u201320, 2017. DOI: https:\/\/doi.org\/10.1109\/3DV.2017.00012."},{"key":"1494_CR19","doi-asserted-by":"publisher","first-page":"108","DOI":"10.1007\/978-3-030-01270-0_7","volume-title":"Proceedings of the 15th European Conference on Computer Vision, Munich, Germany","author":"X J Cheng","year":"2018","unstructured":"X. J. Cheng, P. Wang, R. G. Yang. Depth estimation via affinity learned with convolutional spatial propagation network. In Proceedings of the 15th European Conference on Computer Vision, Munich, Germany, pp. 108\u2013125, 2018. DOI: https:\/\/doi.org\/10.1007\/978-3-030-01270-0_7."},{"key":"1494_CR20","doi-asserted-by":"publisher","first-page":"2583","DOI":"10.1109\/CVPR46437.2021.00261","volume-title":"Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, USA","author":"S Imran","year":"2021","unstructured":"S. Imran, X. M. Liu, D. Morris. Depth completion with twin surface extrapolation at occlusion boundaries. In Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, USA, pp. 2583\u20132592, 2021. DOI: https:\/\/doi.org\/10.1109\/CVPR46437.2021.00261."},{"key":"1494_CR21","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1109\/IROS.2017.8202133","volume-title":"Proceedings of IEEE\/RSJ International Conference on Intelligent Robots and Systems, Vancouver, Canada","author":"J Tobin","year":"2017","unstructured":"J. Tobin, R. Fong, A. Ray, J. Schneider, W. Zaremba, P. Abbeel. Domain randomization for transferring deep neural networks from simulation to the real world. In Proceedings of IEEE\/RSJ International Conference on Intelligent Robots and Systems, Vancouver, Canada, pp. 23\u201330, 2017. DOI: https:\/\/doi.org\/10.1109\/IROS.2017.8202133."},{"key":"1494_CR22","doi-asserted-by":"publisher","first-page":"3482","DOI":"10.1109\/IROS.2018.8593933","volume-title":"Proceedings of IEEE\/RSJ International Conference on Intelligent Robots and Systems, Madrid, Spain","author":"J Tobin","year":"2018","unstructured":"J. Tobin, L. Biewald, R. Duan, M. Andrychowicz, A. Handa, V. Kumar, B. McGrew, A. Ray, J. Schneider, P. Welinder, W. Zaremba, P. Abbeel. Domain randomization and generative models for robotic grasping. In Proceedings of IEEE\/RSJ International Conference on Intelligent Robots and Systems, Madrid, Spain, pp. 3482\u20133489, 2018. DOI: https:\/\/doi.org\/10.1109\/IROS.2018.8593933."},{"key":"1494_CR23","doi-asserted-by":"publisher","first-page":"532","DOI":"10.1109\/ICCV.2019.00062","volume-title":"Proceedings of IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea","author":"S Zakharov","year":"2019","unstructured":"S. Zakharov, W. Kehl, S. Ilic. DeceptionNet: Network-driven domain randomization. In Proceedings of IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea, pp. 532\u2013541, 2019. DOI: https:\/\/doi.org\/10.1109\/ICCV.2019.00062."},{"key":"1494_CR24","doi-asserted-by":"publisher","first-page":"204","DOI":"10.1109\/CVPR46437.2021.00027","volume-title":"Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, USA","author":"W Yin","year":"2021","unstructured":"W. Yin, J. M. Zhang, O. Wang, S. Niklaus, L. Mai, S. M. Chen, C. H. Shen. Learning to recover 3D scene shape from a single image. In Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, USA, pp. 204\u2013213, 2021. DOI: https:\/\/doi.org\/10.1109\/CVPR46437.2021.00027."},{"issue":"3","key":"1494_CR25","doi-asserted-by":"publisher","first-page":"1623","DOI":"10.1109\/TPAMI.2020.3019967","volume":"44","author":"R Ranftl","year":"2022","unstructured":"R. Ranftl, K. Lasinger, D. Hafner, K. Schindler, V. Koltun. Towards robust monocular depth estimation: Mixing datasets for zero-shot cross-dataset transfer. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. vol. 44, no. 3, pp. 1623\u20131637, 2022. DOI: https:\/\/doi.org\/10.1109\/TPAMI.2020.3019967.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"1494_CR26","doi-asserted-by":"publisher","first-page":"667","DOI":"10.1109\/3DV.2017.00081","volume-title":"Proceedings of International Conference on 3D Vision, Qingdao, China","author":"A Chang","year":"2017","unstructured":"A. Chang, A. Dai, T. Funkhouser, M. Halber, M. Niebner, M. Savva, S. R. Song, A. Zeng, Y. D. Zhang. Matter-port3D: Learning from RGB-D data in indoor environments. In Proceedings of International Conference on 3D Vision, Qingdao, China, pp. 667\u2013676, 2017. DOI: https:\/\/doi.org\/10.1109\/3DV.2017.00081."},{"key":"1494_CR27","doi-asserted-by":"publisher","first-page":"2432","DOI":"10.1109\/CVPR.2017.261","volume-title":"Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA","author":"A Dai","year":"2017","unstructured":"A. Dai, A. X. Chang, M. Savva, M. Halber, T. Funkhouser, M. Nie\u00dfner. ScanNet: Richly-annotated 3D reconstructions of indoor scenes. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, 2017, pp. 2432\u20132443. DOI: https:\/\/doi.org\/10.1109\/CVPR.2017.261."},{"key":"1494_CR28","volume-title":"DIODE: A dense indoor and outdoor DEpth dataset","author":"I Vasiljevic","year":"2019","unstructured":"I. Vasiljevic, N. Kolkin, S. Y. Zhang, R. T. Luo, H. C. Wang, F. Z. Dai, A. F. Daniele, M. Mostajabi, S. Basart, M. R. Walter, G. Shakhnarovich. DIODE: A dense indoor and outdoor DEpth dataset, [Online], Available: https:\/\/arxiv.org\/abs\/1908.00463, 2019."},{"key":"1494_CR29","doi-asserted-by":"publisher","unstructured":"J. L. Sch\u00f6nberger, E. L. Zheng, J. M. Frahm, M. Pollefeys. Pixelwise view selection for unstructured multi-view stereo. In Proceedings of the 14th European Conference on Computer Vision, Amsterdam, The Netherlands, pp 501\u2013518, 2016. DOI: https:\/\/doi.org\/10.1007\/978-3-319-46487-9_31.","DOI":"10.1007\/978-3-319-46487-9_31"},{"key":"1494_CR30","doi-asserted-by":"publisher","unstructured":"L. Huynh, P. Nguyen, J. Matas, E. Rahtu, J. Heikkila. Boosting monocular depth estimation with lightweight 3D point fusion. In Proceedings of IEEE\/CVF International Conference on Computer Vision, Montreal, Canada, pp. 12747\u201312756, 2021. DOI: https:\/\/doi.org\/10.1109\/ICCV48922.2021.01253.","DOI":"10.1109\/ICCV48922.2021.01253"},{"key":"1494_CR31","doi-asserted-by":"publisher","first-page":"10022","DOI":"10.1109\/ICCV.2019.01012","volume-title":"Proceedings of IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea","author":"Y Chen","year":"2019","unstructured":"Y. Chen, B. Yang, M. Liang, R. Urtasun. Learning joint 2D-3D representations for depth completion. In Proceedings of IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea, pp. 10022\u201310031, 2019. DOI: https:\/\/doi.org\/10.1109\/ICCV.2019.01012."},{"key":"1494_CR32","doi-asserted-by":"publisher","first-page":"5597","DOI":"10.1109\/CVPR.2019.00575","volume-title":"Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA","author":"W F Chen","year":"2019","unstructured":"W. F. Chen, S. Y. Qian, J. Deng. Learning single-image depth from videos using quality assessment networks. In Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA, pp. 5597\u20135606, 2019. DOI: https:\/\/doi.org\/10.1109\/CVPR.2019.00575."},{"key":"1494_CR33","doi-asserted-by":"publisher","first-page":"3348","DOI":"10.1109\/CVPR.2019.00347","volume-title":"Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA","author":"Y C Yang","year":"2019","unstructured":"Y. C. Yang, A. Wong, S. Soatto. Dense depth posterior (DDP) from single image and sparse range. In Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA, pp. 3348\u20133357, 2019. DOI: https:\/\/doi.org\/10.1109\/CVPR.2019.00347."},{"key":"1494_CR34","doi-asserted-by":"publisher","first-page":"555","DOI":"10.1007\/978-3-642-38886-6_52","volume-title":"Proceedings of Scandinavian Conference on Image Analysis, Espoo, Finland","author":"C D Herrera","year":"2013","unstructured":"C. D. Herrera, J. Kannala, L. Ladick\u00fd, J. Heikkil\u00e4. Depth map inpainting under a second-order smoothness prior. In Proceedings of Scandinavian Conference on Image Analysis, Espoo, Finland, pp. 555\u2013566, 2013. DOI: https:\/\/doi.org\/10.1007\/978-3-642-38886-6_52."},{"key":"1494_CR35","doi-asserted-by":"publisher","first-page":"3574","DOI":"10.1109\/CVPR.2015.7298980","volume-title":"Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Boston, USA","author":"K Matsuo","year":"2015","unstructured":"K. Matsuo, Y. Aoki. Depth image enhancement using local tangent plane approximations. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Boston, USA, pp. 3574\u20133583, 2015. DOI: https:\/\/doi.org\/10.1109\/CVPR.2015.7298980."},{"key":"1494_CR36","doi-asserted-by":"publisher","first-page":"1416","DOI":"10.1109\/BIBM47256.2019.8983266","volume-title":"Proceedings of IEEE International Conference on Bioinformatics and Biomedicine, San Diego, USA","author":"A A Albishri","year":"2019","unstructured":"A. A. Albishri, S. J. H. Shah, Y. Lee. CU-Net: Cascaded u-net model for automated liver and lesion segmentation and summarization. In Proceedings of IEEE International Conference on Bioinformatics and Biomedicine, San Diego, USA, pp. 1416\u20131423, 2019. DOI: https:\/\/doi.org\/10.1109\/BIBM47256.2019.8983266."},{"key":"1494_CR37","volume-title":"Towards 3D scene reconstruction from locally scale-aligned monocular video depth","author":"G K Xu","year":"2023","unstructured":"G. K. Xu, W. Yin, H. Chen, C. H. Shen, K. Cheng, F. Wu, F. Zhao. Towards 3D scene reconstruction from locally scale-aligned monocular video depth, [Online], Available: https:\/\/arxiv.org\/abs\/2202.01470, 2023."},{"key":"1494_CR38","doi-asserted-by":"publisher","first-page":"2366","DOI":"10.5555\/2969033.2969091","volume-title":"Proceedings of the 27th International Conference on Neural Information Processing Systems, Montreal, Canada","author":"D Eigen","year":"2014","unstructured":"D. Eigen, C. Puhrsch, R. Fergus. Depth map prediction from a single image using a multi-scale deep network. In Proceedings of the 27th International Conference on Neural Information Processing Systems, Montreal, Canada, pp. 2366\u20132374, 2014. DOI: https:\/\/doi.org\/10.5555\/2969033.2969091."},{"key":"1494_CR39","doi-asserted-by":"publisher","first-page":"5683","DOI":"10.1109\/ICCV.2019.00578","volume-title":"Proceedings of IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea","author":"W Yin","year":"2019","unstructured":"W. Yin, Y. F. Liu, C. H. Shen, Y. L. Yan. Enforcing geometric constraints of virtual normal for depth prediction. In Proceedings of IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea, pp. 5683\u20135692, 2019. DOI: https:\/\/doi.org\/10.1109\/ICCV.2019.00578."},{"issue":"10","key":"1494_CR40","doi-asserted-by":"publisher","first-page":"2024.2039","DOI":"10.1109\/TPAMI.2015.2505283","volume":"38","author":"F Y Liu","year":"2016","unstructured":"F. Y. Liu, C. H. Shen, G. S. Lin, I. Reid. Learning depth from single monocular images using deep convolutional neural fields. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. vol. 38, no. 10, pp. 2024.2039, 2016. DOI: https:\/\/doi.org\/10.1109\/TPAMI.2015.2505283.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"1494_CR41","doi-asserted-by":"publisher","first-page":"608","DOI":"10.1109\/CVPR42600.2020.00069","volume-title":"Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, USA","author":"K Xian","year":"2020","unstructured":"K. Xian, J. M. Zhang, O. Wang, L. Mai, Z. Lin, Z. G. Cao. Structure-guided ranking loss for single image depth prediction. In Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, USA, pp. 608\u2013617, 2020. DOI: https:\/\/doi.org\/10.1109\/CVPR42600.2020.00069."},{"issue":"9","key":"1494_CR42","doi-asserted-by":"publisher","first-page":"2548.2564","DOI":"10.1007\/s11263-021-01484-6","volume":"129","author":"J W Bian","year":"2021","unstructured":"J. W. Bian, H. Y. Zhan, N. Y. Wang, Z. C. Li, L. Zhang, C. H. Shen, M. M. Cheng, I. Reid. Unsupervised scale-consistent depth learning from video. International Journal of Computer Vision, vol. vol. 129, no. 9, pp. 2548.2564, 2021. DOI: https:\/\/doi.org\/10.1007\/s11263-021-01484-6.","journal-title":"International Journal of Computer Vision"},{"key":"1494_CR43","doi-asserted-by":"publisher","first-page":"3827","DOI":"10.1109\/ICCV.2019.00393","volume-title":"Proceedings of IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea","author":"C Godard","year":"2019","unstructured":"C. Godard, O. Mac Aodha, M. Firman, G. Brostow. Digging into self-supervised monocular depth estimation. In Proceedings of IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea, pp. 3827\u20133837, 2019. DOI: https:\/\/doi.org\/10.1109\/ICCV.2019.00393."},{"key":"1494_CR44","volume-title":"DiverseDepth: Affine-invariant depth prediction using diverse data","author":"W Yin","year":"2020","unstructured":"W. Yin, X. L. Wang, C. H. Shen, Y. F. Liu, Z. Tian, S. C. Xu, C. M. Sun, D. Renyin. DiverseDepth: Affine-invariant depth prediction using diverse data, [Online], Available: https:\/\/arxiv.org\/abs\/2002.00569, 2020."},{"key":"1494_CR45","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1007\/978-3-031-19824-3_9","volume-title":"Proceedings of the 17th European Conference on Computer Vision, Tel Aviv, Israel","author":"J P Wang","year":"2022","unstructured":"J. P. Wang, P. Wang, X. X. Long, C. Theobalt, T. Komura, L. J. Liu, W. P. Wang. NeuRIS: Neural reconstruction of indoor scenes using normal priors. In Proceedings of the 17th European Conference on Computer Vision, Tel Aviv, Israel, pp. 139\u2013155, 2022. DOI: https:\/\/doi.org\/10.1007\/978-3-031-19824-3_9."},{"key":"1494_CR46","doi-asserted-by":"publisher","first-page":"4796","DOI":"10.1109\/ICRA.2018.8460184","volume-title":"Proceedings of IEEE International Conference on Robotics and Automation, Brisbane, Australia","author":"F C Ma","year":"2018","unstructured":"F. C. Ma, S. Karaman. Sparse-to-dense: Depth prediction from sparse depth samples and a single image. In Proceedings of IEEE International Conference on Robotics and Automation, Brisbane, Australia, pp. 4796\u20134803, 2018. DOI: https:\/\/doi.org\/10.1109\/ICRA.2018.8460184."},{"key":"1494_CR47","doi-asserted-by":"publisher","first-page":"430","DOI":"10.1007\/11744023_34","volume-title":"Proceedings of the 9th European Conference on Computer Vision, Graz, Austria","author":"E Rosten","year":"2006","unstructured":"E. Rosten, T. Drummond. Machine learning for high-speed corner detection. In Proceedings of the 9th European Conference on Computer Vision, Graz, Austria, pp. 430\u2013443, 2006. DOI: https:\/\/doi.org\/10.1007\/11744023_34."},{"key":"1494_CR48","doi-asserted-by":"publisher","first-page":"1519","DOI":"10.5555\/3294771.3294916","volume-title":"Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, USA","author":"S F Liu","year":"2017","unstructured":"S. F. Liu, S. De Mello, J. W. Gu, G. Y. Zhong, M. H. Yang, J. Kautz. Learning affinity via spatial propagation networks. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, USA, pp. 1519\u20131529, 2017. DOI: https:\/\/doi.org\/10.5555\/3294771.3294916."},{"key":"1494_CR49","doi-asserted-by":"publisher","first-page":"12438","DOI":"10.1109\/CVPR.2019.01273","volume-title":"Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA","author":"S Imran","year":"2019","unstructured":"S. Imran, Y. F. Long, X. M. Liu, D. Morris. Depth coefficients for depth completion. In Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA, pp. 12438\u201312447, 2019. DOI: https:\/\/doi.org\/10.1109\/CVPR.2019.01273."},{"key":"1494_CR50","doi-asserted-by":"publisher","first-page":"13911","DOI":"10.1109\/CVPR46437.2021.01370","volume-title":"Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, USA","author":"B U Lee","year":"2021","unstructured":"B. U. Lee, K. Lee, I. S. Kweon. Depth completion using plane-residual representation. In Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, USA, pp. 13911\u201313920, 2021. DOI: https:\/\/doi.org\/10.1109\/CVPR46437.2021.01370."},{"key":"1494_CR51","doi-asserted-by":"publisher","first-page":"13525","DOI":"10.1109\/ICRA48506.2021.9561675","volume-title":"Proceedings of IEEE International Conference on Robotics and Automation, Xi\u2019an, China","author":"D Seichter","year":"2020","unstructured":"D. Seichter, M. Kohler, B. Lewandowski, T. Wengefeld, H. M. Gross. Efficient RGB-D semantic segmentation for indoor scene analysis. In Proceedings of IEEE International Conference on Robotics and Automation, Xi\u2019an, China, pp. 13525\u201313531, 2020. DOI: https:\/\/doi.org\/10.1109\/ICRA48506.2021.9561675."},{"key":"1494_CR52","doi-asserted-by":"publisher","first-page":"3712","DOI":"10.1109\/CVPR.2018.00391","volume-title":"Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, USA","author":"A R Zamir","year":"2018","unstructured":"A. R. Zamir, A. Sax, W. Shen, L. Guibas, J. Malik, S. Savarese. Taskonomy: Disentangling task transfer learning. In Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, USA, pp. 3712\u20133722, 2018. DOI: https:\/\/doi.org\/10.1109\/CVPR.2018.00391."},{"issue":"8","key":"1494_CR53","doi-asserted-by":"publisher","first-page":"4131","DOI":"10.1109\/TIP.2018.2836318","volume":"27","author":"Y Kim","year":"2018","unstructured":"Y. Kim, H. Jung, D. Min, K. Sohn. Deep monocular depth estimation via integration of global and local predictions. IEEE Transactions on Image Processing, vol. vol. 27, no. 8, pp. 4131\u20134144, 2018. DOI: https:\/\/doi.org\/10.1109\/TIP.2018.2836318.","journal-title":"IEEE Transactions on Image Processing"},{"key":"1494_CR54","doi-asserted-by":"publisher","first-page":"4909","DOI":"10.1109\/IROS45743.2020.9341801","volume-title":"Proceedings of IEEE\/RSJ International Conference on Intelligent Robots and Systems, Las Vegas, USA","author":"W S Wang","year":"2020","unstructured":"W. S. Wang, D. L. Zhu, X. W. Wang, Y. Y. Hu, Y. H. Qiu, C. Wang, Y. F. Hu, A. Kapoor, S. Scherer. TartanAir: A dataset to push the limits of visual SLAM. In Proceedings of IEEE\/RSJ International Conference on Intelligent Robots and Systems, Las Vegas, USA, pp. 4909\u20134916, 2020. DOI: https:\/\/doi.org\/10.1109\/IROS45743.2020.9341801."},{"key":"1494_CR55","doi-asserted-by":"publisher","first-page":"770","DOI":"10.1109\/CVPR.2016.90","volume-title":"Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA","author":"K M He","year":"2016","unstructured":"K. M. He, X. Y. Zhang, S. Q. Ren, J. Sun. Deep residual learning for image recognition. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, pp. 770\u2013778, 2016. DOI: https:\/\/doi.org\/10.1109\/CVPR.2016.90."},{"key":"1494_CR56","doi-asserted-by":"publisher","first-page":"7627","DOI":"10.1109\/ICCV.2019.00772","volume-title":"Proceedings of IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea","author":"R Garg","year":"2019","unstructured":"R. Garg, N. Wadhwa, S. Ansari, J. Barron. Learning single camera depth estimation using dual-pixels. In Proceedings of IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea, pp. 7627\u20137636, 2019. DOI: https:\/\/doi.org\/10.1109\/ICCV.2019.00772."}],"container-title":["Machine Intelligence Research"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11633-024-1494-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11633-024-1494-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11633-024-1494-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,9,23]],"date-time":"2024-09-23T05:17:29Z","timestamp":1727068649000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11633-024-1494-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,5,29]]},"references-count":56,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,8]]}},"alternative-id":["1494"],"URL":"https:\/\/doi.org\/10.1007\/s11633-024-1494-4","relation":{},"ISSN":["2731-538X","2731-5398"],"issn-type":[{"value":"2731-538X","type":"print"},{"value":"2731-5398","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,5,29]]},"assertion":[{"value":"6 March 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 January 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 May 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declared that they have no conflicts of interest to this work.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations of conflict of interest"}}]}}