{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,24]],"date-time":"2026-07-24T15:15:53Z","timestamp":1784906153211,"version":"3.55.0"},"reference-count":207,"publisher":"Springer Science and Business Media LLC","issue":"7","license":[{"start":{"date-parts":[[2025,2,26]],"date-time":"2025-02-26T00:00:00Z","timestamp":1740528000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,2,26]],"date-time":"2025-02-26T00:00:00Z","timestamp":1740528000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100003407","name":"Ministero dell\u2019Istruzione, dell\u2019Universit\u00e0 e della Ricerca","doi-asserted-by":"publisher","award":["2022MMBA8X_002"],"award-info":[{"award-number":["2022MMBA8X_002"]}],"id":[{"id":"10.13039\/501100003407","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J Comput Vis"],"published-print":{"date-parts":[[2025,7]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Stereo matching is close to hitting a half-century of history, yet witnessed a rapid evolution in the last decade thanks to deep learning. While previous surveys in the late 2010s covered the first stage of this revolution, the last five years of research brought further ground-breaking advancements to the field. This paper aims to fill this gap in a two-fold manner: first, we offer an in-depth examination of the latest developments in deep stereo matching, focusing on the pioneering architectural designs and groundbreaking paradigms that have redefined the field in the 2020s; second, we present a thorough analysis of the critical challenges that have emerged alongside these advances, providing a comprehensive taxonomy of these issues and exploring the state-of-the-art techniques proposed to address them. By reviewing both the architectural innovations and the key challenges, we offer a holistic view of deep stereo matching and highlight the specific areas that require further investigation. To accompany this survey, we maintain a regularly updated project page that catalogs papers on deep stereo matching in our Awesome-Deep-Stereo-Matching repository.<\/jats:p>","DOI":"10.1007\/s11263-024-02331-0","type":"journal-article","created":{"date-parts":[[2025,2,26]],"date-time":"2025-02-26T07:51:37Z","timestamp":1740556297000},"page":"4245-4276","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":56,"title":["A Survey on Deep Stereo Matching in the Twenties"],"prefix":"10.1007","volume":"133","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6276-5282","authenticated-orcid":false,"given":"Fabio","family":"Tosi","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5509-437X","authenticated-orcid":false,"given":"Luca","family":"Bartolomei","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3337-2236","authenticated-orcid":false,"given":"Matteo","family":"Poggi","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2025,2,26]]},"reference":[{"key":"2331_CR1","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1023\/A:1014573219977","volume":"47","author":"D Scharstein","year":"2002","unstructured":"Scharstein, D., & Szeliski, R. (2002). A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. International journal of computer vision, 47, 7\u201342.","journal-title":"International journal of computer vision"},{"issue":"9","key":"2331_CR2","first-page":"5314","volume":"44","author":"M Poggi","year":"2021","unstructured":"Poggi, M., Tosi, F., Batsos, K., Mordohai, P., & Mattoccia, S. (2021). On the synergies between machine learning and binocular stereo for depth estimation from images: a survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(9), 5314\u20135334.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"4","key":"2331_CR3","doi-asserted-by":"publisher","first-page":"1738","DOI":"10.1109\/TPAMI.2020.3032602","volume":"44","author":"H Laga","year":"2020","unstructured":"Laga, H., Jospin, L. V., Boussaid, F., & Bennamoun, M. (2020). A survey on deep learning techniques for stereo-based depth estimation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(4), 1738\u20131764.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2331_CR4","doi-asserted-by":"crossref","unstructured":"Li, Z., Liu, X., Drenkow, N., Ding, A., Creighton, F.X., Taylor, R.H., Unberath, M. (2021). Revisiting stereo depth estimation from a sequence-to-sequence perspective with transformers. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), 6197\u20136206","DOI":"10.1109\/ICCV48922.2021.00614"},{"key":"2331_CR5","doi-asserted-by":"crossref","unstructured":"Lipson, L., Teed, Z., Deng, J.(2021). Raft-stereo: Multilevel recurrent field transforms for stereo matching. In International Conference on 3D Vision (3DV)","DOI":"10.1109\/3DV53792.2021.00032"},{"issue":"9","key":"2331_CR6","first-page":"5293","volume":"44","author":"M Poggi","year":"2021","unstructured":"Poggi, M., Kim, S., Tosi, F., Kim, S., Aleotti, F., Min, D., Sohn, K., & Mattoccia, S. (2021). On the confidence of stereo matching in a deep-learning era: A quantitative evaluation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(9), 5293\u20135313.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2331_CR7","doi-asserted-by":"crossref","unstructured":"Poggi, M., Tosi, F., Mattoccia, S. (2017). Quantitative evaluation of confidence measures in a machine learning world. In Proceedings of the IEEE International Conference on Computer Vision, 5228\u20135237","DOI":"10.1109\/ICCV.2017.559"},{"key":"2331_CR8","doi-asserted-by":"crossref","unstructured":"Xu, H., Zhang, J. (2020) Aanet: Adaptive aggregation network for efficient stereo matching. In proceedings of the ieee\/cvf conference on computer vision and pattern recognition, 1959\u20131968","DOI":"10.1109\/CVPR42600.2020.00203"},{"key":"2331_CR9","doi-asserted-by":"crossref","unstructured":"Yang, M., Wu, F., Li, W. (2020). Waveletstereo: Learning wavelet coefficients of disparity map in stereo matching. In IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR42600.2020.01290"},{"key":"2331_CR10","doi-asserted-by":"crossref","unstructured":"Shen, Z., Dai, Y., Rao, Z.(2021). Cfnet: Cascade and fused cost volume for robust stereo matching. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 13906\u201313915","DOI":"10.1109\/CVPR46437.2021.01369"},{"key":"2331_CR11","doi-asserted-by":"crossref","unstructured":"Mao, Y., Liu, Z., Li, W., Dai, Y., Wang, Q., Kim, Y.-T., Lee, H.-S. (2021). Uasnet: Uncertainty adaptive sampling network for deep stereo matching. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), 6311\u20136319","DOI":"10.1109\/ICCV48922.2021.00625"},{"key":"2331_CR12","doi-asserted-by":"crossref","unstructured":"Shen, Z., Dai, Y., Song, X., Rao, Z., Zhou, D., Zhang, L. (2022). Pcw-net: Pyramid combination and warping cost volume for stereo matching. In European Conference on Computer Vision, 280\u2013297. Springer","DOI":"10.1007\/978-3-031-19824-3_17"},{"key":"2331_CR13","doi-asserted-by":"crossref","unstructured":"Chen, L., Wang, W., Mordohai, P. (2023). Learning the distribution of errors in stereo matching for joint disparity and uncertainty estimation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 17235\u201317244","DOI":"10.1109\/CVPR52729.2023.01653"},{"key":"2331_CR14","unstructured":"Cheng, X., Zhong, Y., Harandi, M., Dai, Y., Chang, X., Li, H., Drummond, T., Ge, Z. (2020). Hierarchical neural architecture search for deep stereo matching. Advances in Neural Information Processing Systems, 33"},{"key":"2331_CR15","doi-asserted-by":"crossref","unstructured":"Wang, Q., Shi, S., Zhao, K., Chu, X. (2022). Easnet: searching elastic and accurate network architecture for stereo matching. In European Conference on Computer Vision, 437\u2013453. Springer","DOI":"10.1007\/978-3-031-19824-3_26"},{"key":"2331_CR16","doi-asserted-by":"crossref","unstructured":"Hu, Y., Wang, W., Yu, H., Zhen, W., Scherer, S. (2021). Orstereo: Occlusion-aware recurrent stereo matching for 4k-resolution images. In 2021 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), 5671\u20135678. IEEE","DOI":"10.1109\/IROS51168.2021.9635869"},{"key":"2331_CR17","unstructured":"Gong, R., Liu, W., Gu, Z., Yang, X., Cheng, J. (2020). Learning intra-view and cross-view geometric knowledge for stereo matching. In IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)"},{"key":"2331_CR18","doi-asserted-by":"crossref","unstructured":"Zhao, H., Zhou, H., Zhang, Y., Chen, J., Yang, Y., Zhao, Y. (2023). High-frequency stereo matching network. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 1327\u20131336","DOI":"10.1109\/CVPR52729.2023.00134"},{"key":"2331_CR19","doi-asserted-by":"crossref","unstructured":"Zhao, H., Zhou, H., Zhang, Y., Zhao, Y., Yang, Y., Ouyang, T. (2022). Eai-stereo: Error aware iterative network for stereo matching. In Proceedings of the Asian Conference on Computer Vision, 315\u2013332","DOI":"10.1007\/978-3-031-26319-4_1"},{"key":"2331_CR20","doi-asserted-by":"crossref","unstructured":"Xu, G., Wang, X., Ding, X., Yang, X. (2023). Iterative geometry encoding volume for stereo matching. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 21919\u201321928","DOI":"10.1109\/CVPR52729.2023.02099"},{"key":"2331_CR21","doi-asserted-by":"crossref","unstructured":"Li, J., Wang, P., Xiong, P., Cai, T., Yan, Z., Yang, L., Liu, J., Fan, H., Liu, S. (2022). Practical stereo matching via cascaded recurrent network with adaptive correlation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 16263\u201316272","DOI":"10.1109\/CVPR52688.2022.01578"},{"key":"2331_CR22","doi-asserted-by":"crossref","unstructured":"Jing, J., Li, J., Xiong, P., Liu, J., Liu, S., Guo, Y., Deng, X., Xu, M., Jiang, L., Sigal, L. (2023). Uncertainty guided adaptive warping for robust and efficient stereo matching. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), 3318\u20133327","DOI":"10.1109\/ICCV51070.2023.00307"},{"key":"2331_CR23","doi-asserted-by":"crossref","unstructured":"Wang, X., Xu, G., Jia, H., Yang, X. (2024). Selective-stereo: Adaptive frequency information selection for stereo matching. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","DOI":"10.1109\/CVPR52733.2024.01863"},{"key":"2331_CR24","doi-asserted-by":"crossref","unstructured":"Feng, M., Cheng, J., Jia, H., Liu, L., Xu, G., Yang, X. (2024). Mc-stereo: Multi-peak lookup and cascade search range for stereo matching","DOI":"10.1109\/3DV62453.2024.00083"},{"key":"2331_CR25","doi-asserted-by":"crossref","unstructured":"Cheng, Z., Yang, J., Li, H. (2024). Stereo matching in time: 100+ fps video stereo matching for extended reality. In Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, 8719\u20138728","DOI":"10.1109\/WACV57701.2024.00852"},{"key":"2331_CR26","doi-asserted-by":"crossref","unstructured":"Chen, Z., Long, W., Yao, H., Zhang, Y., Wang, B., Qin, Y., Wu, J. (2024). Mocha-stereo: Motif channel attention network for stereo matching. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","DOI":"10.1109\/CVPR52733.2024.02623"},{"key":"2331_CR27","doi-asserted-by":"crossref","unstructured":"Guo, W., Li, Z., Yang, Y., Wang, Z., Taylor, R.H., Unberath, M., Yuille, A., Li, Y. (2022). Context-enhanced stereo transformer. In European Conference on Computer Vision, 263\u2013279. Springer","DOI":"10.1007\/978-3-031-19824-3_16"},{"key":"2331_CR28","doi-asserted-by":"crossref","unstructured":"Su, Q., Ji, S. (2022). Chitransformer: Towards reliable stereo from cues. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 1939\u20131949","DOI":"10.1109\/CVPR52688.2022.00198"},{"key":"2331_CR29","doi-asserted-by":"crossref","unstructured":"Xu, H., Zhang, J., Cai, J., Rezatofighi, H., Yu, F., Tao, D., Geiger, A. (2023). Unifying flow, stereo and depth estimation. IEEE Transactions on Pattern Analysis and Machine Intelligence","DOI":"10.1109\/TPAMI.2023.3298645"},{"key":"2331_CR30","doi-asserted-by":"crossref","unstructured":"Weinzaepfel, P., Lucas, T., Leroy, V., Cabon, Y., Arora, V., Br\u00e9gier, R., Csurka, G., Antsfeld, L., Chidlovskii, B., Revaud, J. (2023). CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow. In: ICCV","DOI":"10.1109\/ICCV51070.2023.01647"},{"key":"2331_CR31","doi-asserted-by":"crossref","unstructured":"Lou, J., Liu, W., Chen, Z., Liu, F., Cheng, J. (2023). Elfnet: Evidential local-global fusion for stereo matching. In Proceedings of the IEEE\/CVF International Conference on Computer Vision, 17784\u201317793","DOI":"10.1109\/ICCV51070.2023.01630"},{"key":"2331_CR32","doi-asserted-by":"crossref","unstructured":"Liu, Z., Li, Y., Okutomi, M. (2024). Global occlusion-aware transformer for robust stereo matching. In Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, 3535\u20133544","DOI":"10.1109\/WACV57701.2024.00350"},{"key":"2331_CR33","doi-asserted-by":"crossref","unstructured":"Knobelreiter, P., Sormann, C., Shekhovtsov, A., Fraundorfer, F., Pock, T. (2020). Belief propagation reloaded: Learning bp-layers for labeling problems. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 7900\u20137909","DOI":"10.1109\/CVPR42600.2020.00792"},{"key":"2331_CR34","doi-asserted-by":"crossref","unstructured":"Guan, T., Wang, C., Liu, Y.-H. (2024). Neural Markov Random Field for Stereo Matching","DOI":"10.1109\/CVPR52733.2024.00522"},{"key":"2331_CR35","doi-asserted-by":"crossref","unstructured":"Yee, K., Chakrabarti, A. (2020). Fast deep stereo with 2d convolutional processing of cost signatures. In Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision (WACV)","DOI":"10.1109\/WACV45572.2020.9093273"},{"key":"2331_CR36","doi-asserted-by":"crossref","unstructured":"Yao, C., Jia, Y., Di, H., Li, P., Wu, Y. (2021). A decomposition model for stereo matching. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 6091\u20136100","DOI":"10.1109\/CVPR46437.2021.00603"},{"key":"2331_CR37","doi-asserted-by":"crossref","unstructured":"Xu, G., Cheng, J., Guo, P., Yang, X. (2022). Attention concatenation volume for accurate and efficient stereo matching. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 12981\u201312990","DOI":"10.1109\/CVPR52688.2022.01264"},{"key":"2331_CR38","doi-asserted-by":"crossref","unstructured":"Zeng, J., Yao, C., Yu, L., Wu, Y., Jia, Y. (2023) Parameterized cost volume for stereo matching. In Proceedings of the IEEE\/CVF International Conference on Computer Vision, 18347\u201318357","DOI":"10.1109\/ICCV51070.2023.01682"},{"key":"2331_CR39","doi-asserted-by":"crossref","unstructured":"Badki, A., Troccoli, A., Kim, K., Kautz, J., Sen, P., Gallo, O. (2020). Bi3d: Stereo depth estimation via binary classifications. In IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR42600.2020.00167"},{"issue":"4","key":"2331_CR40","doi-asserted-by":"publisher","first-page":"3225","DOI":"10.1609\/aaai.v38i4.28107","volume":"38","author":"X Li","year":"2024","unstructured":"Li, X., Zhang, C., Su, W., & Tao, W. (2024). Iinet: Implicit intra-inter information fusion for real-time stereo matching. Proceedings of the AAAI Conference on Artificial Intelligence, 38(4), 3225\u20133233. https:\/\/doi.org\/10.1609\/aaai.v38i4.28107","journal-title":"Proceedings of the AAAI Conference on Artificial Intelligence"},{"key":"2331_CR41","doi-asserted-by":"crossref","unstructured":"Pang, J., Sun, W., Ren, J.S., Yang, C., Yan, Q. (2017). Cascade residual learning: A two-stage convolutional neural network for stereo matching. In Proceedings of the IEEE International Conference on Computer Vision Workshops, 887\u2013895","DOI":"10.1109\/ICCVW.2017.108"},{"key":"2331_CR42","doi-asserted-by":"crossref","unstructured":"Xu, B., Xu, Y., Yang, X., Jia, W., Guo, Y. (2021). Bilateral grid learning for stereo matching networks. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 12497\u201312506","DOI":"10.1109\/CVPR46437.2021.01231"},{"key":"2331_CR43","doi-asserted-by":"crossref","unstructured":"Xing, J., Qi, Z., Dong, J., Cai, J., Liu, H. (2020). Mabnet: a lightweight stereo network based on multibranch adjustable bottleneck module. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020, Proceedings, Part XXVIII 16, 340\u2013356. Springer","DOI":"10.1007\/978-3-030-58604-1_21"},{"key":"2331_CR44","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Poggi, M., Mattoccia, S. (2023). Temporalstereo: Efficient spatial-temporal stereo matching network. In: IROS","DOI":"10.1109\/IROS55552.2023.10341598"},{"key":"2331_CR45","doi-asserted-by":"crossref","unstructured":"Chang, Q., Li, X., Xu, X., Liu, X., Li, Y., Miyazaki, J. (2023). Stereovae: A lightweight stereo-matching system using embedded gpus. In 2023 IEEE International Conference on Robotics and Automation (ICRA), 1982\u20131988. IEEE","DOI":"10.1109\/ICRA48891.2023.10160441"},{"key":"2331_CR46","doi-asserted-by":"crossref","unstructured":"Poggi, M., Tosi, F.(2024). Federated online adaptation for deep stereo. In: CVPR","DOI":"10.1109\/CVPR52733.2024.01906"},{"key":"2331_CR47","doi-asserted-by":"crossref","unstructured":"Bangunharcana, A., Cho, J.W., Lee, S., Kweon, I.S., Kim, K.-S., Kim, S.(2021). Correlate-and-excite: Real-time stereo matching via guided cost volume excitation. In IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)","DOI":"10.1109\/IROS51168.2021.9635909"},{"key":"2331_CR48","doi-asserted-by":"crossref","unstructured":"Wang, Q., Shi, S., Zheng, S., Zhao, K., Chu, X.(2020). Fadnet: A fast and accurate network for disparity estimation. In 2020 IEEE International Conference on Robotics and Automation (ICRA), 101\u2013107. IEEE","DOI":"10.1109\/ICRA40945.2020.9197031"},{"key":"2331_CR49","doi-asserted-by":"crossref","unstructured":"Tankovich, V., Hane, C., Zhang, Y., Kowdle, A., Fanello, S., Bouaziz, S. (2021). Hitnet: Hierarchical iterative tile refinement network for real-time stereo matching. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 14362\u201314372","DOI":"10.1109\/CVPR46437.2021.01413"},{"key":"2331_CR50","doi-asserted-by":"crossref","unstructured":"Cai, J., QI, Z., Fu, K., Shi, X., Li, Z., Liu, X., Liu, H. (2022). Pbcstereo: A compressed stereo network with pure binary convolutional operations. In Proceedings of the Asian Conference on Computer Vision (ACCV), 4378\u20134394","DOI":"10.1007\/978-3-031-26313-2_38"},{"key":"2331_CR51","doi-asserted-by":"crossref","unstructured":"Chang, J.-R., Chang, P.-C., Chen, Y.-S. (2020). Attention-aware feature aggregation for real-time stereo matching on edge devices. In Proceedings of the Asian Conference on Computer Vision (ACCV)","DOI":"10.1007\/978-3-030-69525-5_22"},{"key":"2331_CR52","doi-asserted-by":"crossref","unstructured":"Shamsafar, F., Woerz, S., Rahim, R., Zell, A. (2022). Mobilestereonet: Towards lightweight deep networks for stereo matching. In Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision (WACV), 2417\u20132426","DOI":"10.1109\/WACV51458.2022.00075"},{"key":"2331_CR53","doi-asserted-by":"crossref","unstructured":"Dovesi, P.L., Poggi, M., Andraghetti, L., Mart\u00ed, M., Kjellstr\u00f6m, H., Pieropan, A., Mattoccia, S. (2020). Real-time semantic stereo matching. In 2020 IEEE International Conference on Robotics and Automation (ICRA), 10780\u201310787. IEEE","DOI":"10.1109\/ICRA40945.2020.9196784"},{"key":"2331_CR54","doi-asserted-by":"crossref","unstructured":"Chen, S., Xiang, Z., Qiao, C., Chen, Y., Bai, T. (2020). Sgnet: Semantics guided deep stereo matching. In Proceedings of the Asian Conference on Computer Vision","DOI":"10.1007\/978-3-030-69525-5_7"},{"key":"2331_CR55","doi-asserted-by":"crossref","unstructured":"Kusupati, U., Cheng, S., Chen, R., Su, H. (2020). Normal assisted stereo depth estimation. In IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR42600.2020.00226"},{"key":"2331_CR56","doi-asserted-by":"crossref","unstructured":"Aleotti, F., Poggi, M., Tosi, F., Mattoccia, S. (2020). Learning end-to-end scene flow by distilling single tasks knowledge. In Proceedings of the AAAI Conference on Artificial Intelligence, 34, 10435\u201310442","DOI":"10.1609\/aaai.v34i07.6613"},{"key":"2331_CR57","doi-asserted-by":"crossref","unstructured":"Jiao, Y., Tran, T.D., Shi, G. (2021). Effiscene: Efficient per-pixel rigidity inference for unsupervised joint learning of optical flow, depth, camera pose and motion segmentation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 5538\u20135547","DOI":"10.1109\/CVPR46437.2021.00549"},{"key":"2331_CR58","doi-asserted-by":"crossref","unstructured":"Chi, C., Wang, Q., Hao, T., Guo, P., Yang, X. (2021). Feature-level collaboration: Joint unsupervised learning of optical flow, stereo depth and camera motion. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2463\u20132473","DOI":"10.1109\/CVPR46437.2021.00249"},{"key":"2331_CR59","unstructured":"You, Y., Wang, Y., Chao, W.-L., Garg, D., Pleiss, G., Hariharan, B., Campbell, M., Weinberger, K.Q. (2020). Pseudo-lidar++: Accurate depth for 3d object detection in autonomous driving. In: ICLR"},{"key":"2331_CR60","doi-asserted-by":"crossref","unstructured":"Zhang, J., Ramanagopal, M.S., Vasudevan, R., Johnson-Roberson, M. (2020). Listereo: Generate dense depth maps from lidar and stereo imagery. In 2020 IEEE International Conference on Robotics and Automation (ICRA), 7829\u20137836. IEEE","DOI":"10.1109\/ICRA40945.2020.9196628"},{"key":"2331_CR61","doi-asserted-by":"crossref","unstructured":"Huang, Y.-K., Liu, Y.-C., Wu, T.-H., Su, H.-T., Chang, Y.-C., Tsou, T.-L., Wang, Y.-A., Hsu, W.H. (2021). S3: Learnable sparse signal superdensity for guided depth estimation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 16706\u201316716","DOI":"10.1109\/CVPR46437.2021.01643"},{"key":"2331_CR62","doi-asserted-by":"crossref","unstructured":"Yin, H., Deng, L., Chen, Z., Chen, B., Sun, T., Yusen, X., Xiao, J., Fu, Y., Deng, S., Li, X. (2022). Lsmd-net: Lidar-stereo fusion with mixture density network for depth sensing. In Proceedings of the Asian Conference on Computer Vision (ACCV), 552\u2013568","DOI":"10.1007\/978-3-031-26319-4_6"},{"key":"2331_CR63","doi-asserted-by":"crossref","unstructured":"Bartolomei, L., Poggi, M., Tosi, F., Conti, A., Mattoccia, S. (2023). Active stereo without pattern projector. In Proceedings of the IEEE\/CVF International Conference on Computer Vision, 18470\u201318482","DOI":"10.1109\/ICCV51070.2023.01693"},{"key":"2331_CR64","doi-asserted-by":"crossref","unstructured":"Tulyakov, S., Fleuret, F., Kiefel, M., Gehler, P., Hirsch, M. (2019). Learning an event sequence embedding for event-based deep stereo. In Proceedings of the IEEE International Conference on Computer Vision (ICCV). To appear. https:\/\/fleuret.org\/papers\/tulyakov-et-al-iccv2019.pdf","DOI":"10.1109\/ICCV.2019.00161"},{"key":"2331_CR65","doi-asserted-by":"crossref","unstructured":"Nam, Y., Mostafavi, M., Yoon, K.-J., Choi, J. (2022). Stereo depth from events cameras: Concentrate and focus on the future. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 6114\u20136123","DOI":"10.1109\/CVPR52688.2022.00602"},{"key":"2331_CR66","doi-asserted-by":"crossref","unstructured":"Cho, H., Yoon, K.-J. (2022). Selection and cross similarity for event-image deep stereo. In European Conference on Computer Vision, 470\u2013486. Springer","DOI":"10.1007\/978-3-031-19824-3_28"},{"key":"2331_CR67","doi-asserted-by":"crossref","unstructured":"Zhang, K., Che, K., Zhang, J., Cheng, J., Zhang, Z., Guo, Q., Leng, L. (2022). Discrete time convolution for fast event-based stereo. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 8676\u20138686","DOI":"10.1109\/CVPR52688.2022.00848"},{"key":"2331_CR68","doi-asserted-by":"crossref","unstructured":"Cho, H., Cho, J., Yoon, K.-J. (2023). Learning adaptive dense event stereo from the image domain. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 17797\u201317807","DOI":"10.1109\/CVPR52729.2023.01707"},{"key":"2331_CR69","doi-asserted-by":"crossref","unstructured":"Mostafavi, M., Yoon, K.-J., Choi, J. (2021). Eventintensity stereo: Estimating depth by the best of both worlds. In Proceedings of the IEEE\/CVF International Conference on Computer Vision, 4258\u20134267","DOI":"10.1109\/ICCV48922.2021.00422"},{"key":"2331_CR70","doi-asserted-by":"crossref","unstructured":"Cho, H., Yoon, K.-J. (2022). Event-image fusion stereo using cross-modality feature propagation. In Proceedings of the AAAI Conference on Artificial Intelligence, 36, 454\u2013462","DOI":"10.1609\/aaai.v36i1.19923"},{"key":"2331_CR71","doi-asserted-by":"crossref","unstructured":"Chen, X., Weng, W., Zhang, Y., Xiong, Z. (2024). Depth from asymmetric frame-event stereo: A divide-and-conquer approach. In Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision (WACV), 3045\u20133054","DOI":"10.1109\/WACV57701.2024.00302"},{"key":"2331_CR72","doi-asserted-by":"crossref","unstructured":"Walz, S., Bijelic, M., Ramazzina, A., Walia, A., Mannan, F., Heide, F. (2023). Gated stereo: Joint depth estimation from gated and wide-baseline active stereo cues. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 13252\u201313262","DOI":"10.1109\/CVPR52729.2023.01273"},{"key":"2331_CR73","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Khamis, S., Rhemann, C., Valentin, J., Kowdle, A., Tankovich, V., Schoenberg, M., Izadi, S., Funkhouser, T., Fanello, S. (2018). Activestereonet: End-to-end self-supervised learning for active stereo systems. In Proceedings of the European Conference on Computer Vision (ECCV)","DOI":"10.1007\/978-3-030-01237-3_48"},{"key":"2331_CR74","doi-asserted-by":"crossref","unstructured":"Baek, S.-H., Heide, F. (2021). Polka lines: Learning structured illumination and reconstruction for active stereo. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 5757\u20135767","DOI":"10.1109\/CVPR46437.2021.00570"},{"key":"2331_CR75","doi-asserted-by":"crossref","unstructured":"Liu, I., Yang, E., Tao, J., Chen, R., Zhang, X., Ran, Q., Liu, Z., Su, H. (2022). Activezero: Mixed domain learning for active stereovision with zero annotation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 13033\u201313042","DOI":"10.1109\/CVPR52688.2022.01269"},{"key":"2331_CR76","doi-asserted-by":"crossref","unstructured":"Xu, Y., Yang, X., Yu, Y., Jia, W., Chu, Z., Guo, Y. (2022). Depth estimation by combining binocular stereo and monocular structured-light. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 1746\u20131755","DOI":"10.1109\/CVPR52688.2022.00179"},{"key":"2331_CR77","doi-asserted-by":"crossref","unstructured":"Chen, R., Liu, I., Yang, E., Tao, J., Zhang, X., Ran, Q., Liu, Z., Xu, J., Su, H. (2023). Activezero++: Mixed domain learning stereo and confidence-based depth completion with zero annotation. IEEE Transactions on Pattern Analysis and Machine Intelligence","DOI":"10.1109\/TPAMI.2023.3305399"},{"key":"2331_CR78","doi-asserted-by":"crossref","unstructured":"Zhi, T., Pires, B.R., Hebert, M., Narasimhan, S.G. (2018). Deep material-aware cross-spectral stereo matching. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1916\u20131925","DOI":"10.1109\/CVPR.2018.00205"},{"key":"2331_CR79","doi-asserted-by":"crossref","unstructured":"Liang, M., Guo, X., Li, H., Wang, X., Song, Y. (2019). Unsupervised cross-spectral stereo matching by learning to synthesize. In Proceedings of the AAAI Conference on Artificial Intelligence, 33, 8706\u20138713","DOI":"10.1609\/aaai.v33i01.33018706"},{"key":"2331_CR80","doi-asserted-by":"crossref","unstructured":"Walters, C., Mendez, O., Johnson, M., Bowden, R. (2021). There and back again: Self-supervised multispectral correspondence estimation. In 2021 IEEE International Conference on Robotics and Automation (ICRA), 5147\u20135154. IEEE","DOI":"10.1109\/ICRA48506.2021.9561621"},{"key":"2331_CR81","doi-asserted-by":"crossref","unstructured":"Tosi, F., Ramirez, P.Z., Poggi, M., Salti, S., Mattoccia, S., Di\u00a0Stefano, L. (2022). Rgb-multispectral matching: Dataset, learning methodology, evaluation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 15958\u201315968","DOI":"10.1109\/CVPR52688.2022.01549"},{"key":"2331_CR82","doi-asserted-by":"crossref","unstructured":"Tian, C., Pan, W., Wang, Z., Mao, M., Zhang, G., Bao, H., Tan, P., Cui, Z. (2023). Dps-net: Deep polarimetric stereo depth estimation. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), 3569\u20133579","DOI":"10.1109\/ICCV51070.2023.00330"},{"key":"2331_CR83","doi-asserted-by":"crossref","unstructured":"Brucker, S., Walz, S., Bijelic, M., Heide, F. (2024). Cross-spectral gated-rgb stereo depth estimation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR52733.2024.02046"},{"key":"2331_CR84","doi-asserted-by":"crossref","unstructured":"Guo, X., Yang, K., Yang, W., Wang, X., Li, H. (2019). Group-wise correlation stereo network. In: CVPR","DOI":"10.1109\/CVPR.2019.00339"},{"key":"2331_CR85","doi-asserted-by":"crossref","unstructured":"Teed, Z., Deng, J. (2020). Raft: Recurrent all-pairs field transforms for optical flow. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020, Proceedings, Part II 16, 402\u2013419. Springer","DOI":"10.1007\/978-3-030-58536-5_24"},{"issue":"4","key":"2331_CR86","doi-asserted-by":"publisher","first-page":"3333","DOI":"10.1609\/aaai.v38i4.28119","volume":"38","author":"Z Liang","year":"2024","unstructured":"Liang, Z., & Li, C. (2024). Any-stereo: Arbitrary scale disparity estimation for iterative stereo matching. Proceedings of the AAAI Conference on Artificial Intelligence, 38(4), 3333\u20133341. https:\/\/doi.org\/10.1609\/aaai.v38i4.28119","journal-title":"Proceedings of the AAAI Conference on Artificial Intelligence"},{"key":"2331_CR87","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., et al. (2020). An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929"},{"key":"2331_CR88","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30"},{"key":"2331_CR89","doi-asserted-by":"crossref","unstructured":"Karaev, N., Rocco, I., Graham, B., Neverova, N., Vedaldi, A., Rupprecht, C. (2023). Dynamicstereo: Consistent dynamic depth from stereo videos. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 13229\u201313239","DOI":"10.1109\/CVPR52729.2023.01271"},{"key":"2331_CR90","doi-asserted-by":"crossref","unstructured":"Gu, X., Fan, Z., Zhu, S., Dai, Z., Tan, F., Tan, P. (2020). Cascade cost volume for high-resolution multi-view stereo and stereo matching. In IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR42600.2020.00257"},{"key":"2331_CR91","doi-asserted-by":"crossref","unstructured":"Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., Chen, L.-C. (2018). Mobilenetv2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 4510\u20134520","DOI":"10.1109\/CVPR.2018.00474"},{"key":"2331_CR92","doi-asserted-by":"crossref","unstructured":"Tonioni, A., Tosi, F., Poggi, M., Mattoccia, S., Stefano, L.D. (2019). Real-time self-adaptive deep stereo. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR.2019.00028"},{"key":"2331_CR93","doi-asserted-by":"crossref","unstructured":"Zhan, W., Ou, X., Yang, Y., Chen, L. (2019). Dsnet: Joint learning for scene segmentation and disparity estimation. In 2019 International Conference on Robotics and Automation (ICRA), 2946\u20132952. IEEE","DOI":"10.1109\/ICRA.2019.8793573"},{"key":"2331_CR94","doi-asserted-by":"crossref","unstructured":"Yang, G., Zhao, H., Shi, J., Deng, Z., Jia, J. (2018). Segstereo: Exploiting semantic information for disparity estimation. In Proceedings of the European Conference on Computer Vision (ECCV), 636\u2013651","DOI":"10.1007\/978-3-030-01234-2_39"},{"key":"2331_CR95","doi-asserted-by":"crossref","unstructured":"Jiang, H., Sun, D., Jampani, V., Lv, Z., Learned-Miller, E., Kautz, J. (2019). Sense: A shared encoder network for scene-flow estimation. In Proceedings of the IEEE\/CVF International Conference on Computer Vision, 3195\u20133204","DOI":"10.1109\/ICCV.2019.00329"},{"key":"2331_CR96","doi-asserted-by":"crossref","unstructured":"Wu, Z., Wu, X., Zhang, X., Wang, S., Ju, L. (2019). Semantic stereo matching with pyramid cost volumes. In Proceedings of the IEEE\/CVF International Conference on Computer Vision, 7484\u20137493","DOI":"10.1109\/ICCV.2019.00758"},{"key":"2331_CR97","doi-asserted-by":"crossref","unstructured":"Chang, J.-R., Chen, Y.-S. (2018). Pyramid stereo matching network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 5410\u20135418","DOI":"10.1109\/CVPR.2018.00567"},{"key":"2331_CR98","doi-asserted-by":"crossref","unstructured":"Vedula, S., Baker, S., Rander, P., Collins, R., Kanade, T. (1999). Three-dimensional scene flow. In Proceedings of the Seventh IEEE International Conference on Computer Vision, 2, 722\u2013729. IEEE","DOI":"10.1109\/ICCV.1999.790293"},{"key":"2331_CR99","doi-asserted-by":"crossref","unstructured":"Teed, Z., Deng, J. (2021). Raft-3d: Scene flow using rigid-motion embeddings. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR46437.2021.00827"},{"key":"2331_CR100","doi-asserted-by":"crossref","unstructured":"Liu, H., Lu, T., Xu, Y., Liu, J., Li, W., Chen, L. (2022). Camliflow: bidirectional camera-lidar fusion for joint optical flow and scene flow estimation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 5791\u20135801","DOI":"10.1109\/CVPR52688.2022.00570"},{"key":"2331_CR101","doi-asserted-by":"crossref","unstructured":"Li, A., Hu, A., Xi, W., Yu, W., Zou, D. (2024). Stereo-lidar depth estimation with deformable propagation and learned disparity-depth conversion. arXiv preprint arXiv:2404.07545","DOI":"10.1109\/ICRA57147.2024.10611533"},{"issue":"1","key":"2331_CR102","doi-asserted-by":"publisher","first-page":"154","DOI":"10.1109\/TPAMI.2020.3008413","volume":"44","author":"G Gallego","year":"2022","unstructured":"Gallego, G., Delbr\u00fcck, T., Orchard, G., Bartolozzi, C., Taba, B., Censi, A., Leutenegger, S., Davison, A. J., Conradt, J., Daniilidis, K., & Scaramuzza, D. (2022). Event-based vision: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(1), 154\u2013180. https:\/\/doi.org\/10.1109\/TPAMI.2020.3008413","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2331_CR103","doi-asserted-by":"crossref","unstructured":"Kendall, A., Martirosyan, H., Dasgupta, S., Henry, P., Kennedy, R., Bachrach, A., Bry, A. (2017). End-to-end learning of geometry and context for deep stereo regression. In Proceedings of the IEEE International Conference on Computer Vision, 66\u201375","DOI":"10.1109\/ICCV.2017.17"},{"key":"2331_CR104","doi-asserted-by":"crossref","unstructured":"Park, T., Liu, M.-Y., Wang, T.-C., Zhu, J.-Y. (2019). Semantic image synthesis with spatially-adaptive normalization. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 2337\u20132346","DOI":"10.1109\/CVPR.2019.00244"},{"key":"2331_CR105","doi-asserted-by":"crossref","unstructured":"Zhu, A., Wang, Z., Khant, K., Daniilidis, K. (2021). Eventgan: Leveraging large scale image datasets for event cameras. In 2021 IEEE International Conference on Computational Photography (ICCP), 1\u201311","DOI":"10.1109\/ICCP51581.2021.9466265"},{"key":"2331_CR106","unstructured":"Rebecq, H., Ranftl, R., Koltun, V., Scaramuzza, D. (2019). High speed and high dynamic range video with an event camera. IEEE Trans. Pattern Anal. Mach. Intell. (T-PAMI)"},{"issue":"3","key":"2331_CR107","doi-asserted-by":"publisher","first-page":"034301","DOI":"10.1117\/1.2183668","volume":"45","author":"P Andersson","year":"2006","unstructured":"Andersson, P. (2006). Long-range three-dimensional imaging using range-gated laser radar images. Optical Engineering, 45(3), 034301\u2013034301.","journal-title":"Optical Engineering"},{"key":"2331_CR108","doi-asserted-by":"crossref","unstructured":"Zhang, F., Qi, X., Yang, R., Prisacariu, V., Wah, B., Torr, P. (2020). Domain-invariant stereo matching networks. In Europe Conference on Computer Vision (ECCV)","DOI":"10.1007\/978-3-030-58536-5_25"},{"key":"2331_CR109","doi-asserted-by":"crossref","unstructured":"Zhang, J., Wang, X., Bai, X., Wang, C., Huang, L., Chen, Y., Gu, L., Zhou, J., Harada, T., Hancock, E.R. (2022). Revisiting domain generalized stereo matching networks from a feature consistency perspective. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 13001\u201313011","DOI":"10.1109\/CVPR52688.2022.01266"},{"key":"2331_CR110","doi-asserted-by":"crossref","unstructured":"Liu, B., Yu, H., Qi, G. (2022). Graftnet: Towards domain generalized stereo matching with a broad-spectrum and task-oriented feature. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 13012\u201313021","DOI":"10.1109\/CVPR52688.2022.01267"},{"key":"2331_CR111","doi-asserted-by":"crossref","unstructured":"Chuah, W., Tennakoon, R., Hoseinnezhad, R., Bab-Hadiashar, A., Suter, D.(2022). Itsa: An information-theoretic approach to automatic shortcut avoidance and domain generalization in stereo matching networks. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 13022\u201313032","DOI":"10.1109\/CVPR52688.2022.01268"},{"key":"2331_CR112","doi-asserted-by":"crossref","unstructured":"Chang, T., Yang, X., Zhang, T., Wang, M. (2023). Domain generalized stereo matching via hierarchical visual transformation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 9559\u20139568","DOI":"10.1109\/CVPR52729.2023.00922"},{"key":"2331_CR113","doi-asserted-by":"crossref","unstructured":"Rao, Z., Xiong, B., He, M., Dai, Y., He, R., Shen, Z., Li, X. (2023). Masked representation learning for domain generalized stereo matching. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 5435\u20135444","DOI":"10.1109\/CVPR52729.2023.00526"},{"key":"2331_CR114","doi-asserted-by":"publisher","unstructured":"Cai, C., Poggi, M., Mattoccia, S., Mordohai, P. (2020). Matching-space stereo networks for cross-domain generalization. In 2020 International Conference on 3D Vision (3DV), 364\u2013373. https:\/\/doi.org\/10.1109\/3DV50981.2020.00046","DOI":"10.1109\/3DV50981.2020.00046"},{"key":"2331_CR115","first-page":"16305","volume":"35","author":"K Cheng","year":"2022","unstructured":"Cheng, K., Wu, T., & Healey, C. (2022). Revisiting non-parametric matching cost volumes for robust and generalizable stereo matching. Advances in Neural Information Processing Systems, 35, 16305\u201316318.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2331_CR116","doi-asserted-by":"crossref","unstructured":"Aleotti, F., Tosi, F., Ramirez, P.Z., Poggi, M., Salti, S., Mattoccia, S., Di\u00a0Stefano, L. (2021). Neural disparity refinement for arbitrary resolution stereo. In 2021 International Conference on 3D Vision (3DV), 207\u2013217. IEEE","DOI":"10.1109\/3DV53792.2021.00031"},{"key":"2331_CR117","doi-asserted-by":"crossref","unstructured":"Tosi, F., Aleotti, F., Ramirez, P.Z., Poggi, M., Salti, S., Mattoccia, S., Di\u00a0Stefano, L. (2024). Neural disparity refinement. IEEE Transactions on Pattern Analysis and Machine Intelligence","DOI":"10.1109\/TPAMI.2024.3411292"},{"key":"2331_CR118","doi-asserted-by":"crossref","unstructured":"Pilzer, A., Hou, Y., Loppi, N., Solin, A., Kannala, J.(2023). Expansion of visual hints for improved generalization in stereo matching. In Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision (WACV), 5840\u20135849","DOI":"10.1109\/WACV56688.2023.00579"},{"key":"2331_CR119","doi-asserted-by":"crossref","unstructured":"Watson, J., Aodha, O.M., Turmukhambetov, D., Brostow, G.J., Firman, M. (2020). Learning stereo from single images. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020, Proceedings, Part I 16, 722\u2013740. Springer","DOI":"10.1007\/978-3-030-58452-8_42"},{"key":"2331_CR120","doi-asserted-by":"crossref","unstructured":"Tosi, F., Tonioni, A., De\u00a0Gregorio, D., Poggi, M. (2023). Nerf-supervised deep stereo. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 855\u2013866","DOI":"10.1109\/CVPR52729.2023.00089"},{"key":"2331_CR121","doi-asserted-by":"crossref","unstructured":"Zhang, J., Li, J., Huang, L., Yu, X., Gu, L., Zheng, J., Bai, X. (2024). Robust synthetic-to-real transfer for stereo matching. arXiv preprint arXiv:2403.07705","DOI":"10.1109\/CVPR52733.2024.01914"},{"key":"2331_CR122","doi-asserted-by":"crossref","unstructured":"Liu, P., King, I., Lyu, M.R., Xu, J. (2020). Flow2stereo: Effective self-supervised learning of optical flow and stereo matching. In IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR42600.2020.00668"},{"key":"2331_CR123","doi-asserted-by":"crossref","unstructured":"Aleotti, F., Tosi, F., Zhang, L., Poggi, M., Mattoccia, S. (2020). Reversing the cycle: self-supervised deep stereo through enhanced monocular distillation. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020, Proceedings, Part XI 16, 614\u2013632. Springer","DOI":"10.1007\/978-3-030-58621-8_36"},{"key":"2331_CR124","doi-asserted-by":"crossref","unstructured":"Chen, Z., Ye, X., Yang, W., Xu, Z., Tan, X., Zou, Z., Ding, E., Zhang, X., Huang, L. (2021). Revealing the reciprocal relations between self-supervised stereo and monocular depth estimation. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), 15529\u201315538","DOI":"10.1109\/ICCV48922.2021.01524"},{"key":"2331_CR125","doi-asserted-by":"crossref","unstructured":"Yuan, W., Zhang, Y., Wu, B., Zhu, S., Tan, P., Wang, M.Y., Chen, Q.(2021). Stereo matching by self-supervision of multiscopic vision. In 2021 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), 5702\u20135709. IEEE","DOI":"10.1109\/IROS51168.2021.9636616"},{"key":"2331_CR126","doi-asserted-by":"crossref","unstructured":"Shen, Z., Song, X., Dai, Y., Zhou, D., Rao, Z., Zhang, L.(2023). Digging into uncertainty-based pseudo-label for robust stereo matching. IEEE Transactions on Pattern Analysis and Machine Intelligence","DOI":"10.1109\/TPAMI.2023.3300976"},{"key":"2331_CR127","doi-asserted-by":"crossref","unstructured":"Liu, R., Yang, C., Sun, W., Wang, X., Li, H.(2020). Stereogan: Bridging synthetic-to-real domain gap by joint optimization of domain translation and stereo matching. In IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR42600.2020.01277"},{"key":"2331_CR128","doi-asserted-by":"crossref","unstructured":"Song, X., Yang, G., Zhu, X., Zhou, H., Wang, Z., Shi, J. (2021). Adastereo: A simple and efficient approach for adaptive stereo matching. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 10328\u201310337","DOI":"10.1109\/CVPR46437.2021.01019"},{"key":"2331_CR129","doi-asserted-by":"crossref","unstructured":"Zhang, C., Tian, K., Fan, B., Meng, G., Zhang, Z., Pan, C. (2022). Continual stereo matching of continuous driving scenes with growing architecture. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 18901\u201318910","DOI":"10.1109\/CVPR52688.2022.01833"},{"key":"2331_CR130","doi-asserted-by":"crossref","unstructured":"Wang, H., Wang, X., Song, J., Lei, J., Song, M.(2020). Faster self-adaptive deep stereo. In Proceedings of the Asian Conference on Computer Vision","DOI":"10.1007\/978-3-030-69525-5_11"},{"issue":"9","key":"2331_CR131","doi-asserted-by":"publisher","first-page":"4713","DOI":"10.1109\/TPAMI.2021.3075815","volume":"44","author":"M Poggi","year":"2021","unstructured":"Poggi, M., Tonioni, A., Tosi, F., Mattoccia, S., & Di Stefano, L. (2021). Continual adaptation for deep stereo. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(9), 4713\u20134729.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2331_CR132","doi-asserted-by":"crossref","unstructured":"Kim, K., Park, J., Lee, J., Min, D., Sohn, K. (2022). Pointfix: Learning to fix domain bias for robust online stereo adaptation. In European Conference on Computer Vision, 568\u2013585. Springer","DOI":"10.1007\/978-3-031-19839-7_33"},{"key":"2331_CR133","doi-asserted-by":"crossref","unstructured":"Chen, C., Chen, X., Cheng, H. (2019). On the over-smoothing problem of cnn based disparity estimation. In Proceedings of the IEEE\/CVF International Conference on Computer Vision, 8997\u20139005","DOI":"10.1109\/ICCV.2019.00909"},{"key":"2331_CR134","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Chen, Y., Bai, X., Yu, S., Yu, K., Li, Z., Yang, K. (2020). Adaptive unimodal cost volume filtering for deep stereo matching. In Proceedings of the AAAI Conference on Artificial Intelligence, 34, 12926\u201312934","DOI":"10.1609\/aaai.v34i07.6991"},{"key":"2331_CR135","first-page":"22517","volume":"33","author":"D Garg","year":"2020","unstructured":"Garg, D., Wang, Y., Hariharan, B., Campbell, M., Weinberger, K. Q., & Chao, W.-L. (2020). Wasserstein distances for stereo disparity estimation. Advances in Neural Information Processing Systems, 33, 22517\u201322529.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2331_CR136","doi-asserted-by":"crossref","unstructured":"Liu, B., Yu, H., Long, Y. (2022). Local similarity pattern and cost self-reassembling for deep stereo matching networks. In Proceedings of the AAAI Conference on Artificial Intelligence, 36, 1647\u20131655","DOI":"10.1609\/aaai.v36i2.20056"},{"key":"2331_CR137","doi-asserted-by":"crossref","unstructured":"Tosi, F., Liao, Y., Schmitt, C., Geiger, A. (2021). Smd-nets: Stereo mixture density networks. In Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR46437.2021.00883"},{"key":"2331_CR138","doi-asserted-by":"crossref","unstructured":"Xu, P., Xiang, Z., Qiao, C., Fu, J., Zhao, X. (2024). Adaptive multi-modal cross-entropy loss for stereo matching. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","DOI":"10.1109\/CVPR52733.2024.00491"},{"key":"2331_CR139","doi-asserted-by":"crossref","unstructured":"Chai, C.-Y., Wu, Y.-P., Tsao, S.-L. (2020). Deep depth fusion for black, transparent, reflective and texture-less objects. In 2020 IEEE International Conference on Robotics and Automation (ICRA), 6766\u20136772. IEEE","DOI":"10.1109\/ICRA40945.2020.9196894"},{"key":"2331_CR140","doi-asserted-by":"crossref","unstructured":"Wu, Z., Su, S., Chen, Q., Fan, R. (2023). Transparent objects: A corner case in stereo matching. In 2023 IEEE International Conference on Robotics and Automation (ICRA), 12353\u201312359. IEEE","DOI":"10.1109\/ICRA48891.2023.10161385"},{"key":"2331_CR141","doi-asserted-by":"crossref","unstructured":"Costanzino, A., Ramirez, P.Z., Poggi, M., Tosi, F., Mattoccia, S., Di\u00a0Stefano, L.(2023). Learning depth estimation for transparent and mirror surfaces. In Proceedings of the IEEE\/CVF International Conference on Computer Vision, 9244\u20139255","DOI":"10.1109\/ICCV51070.2023.00848"},{"key":"2331_CR142","doi-asserted-by":"crossref","unstructured":"Liu, Y., Ren, J., Zhang, J., Liu, J., Lin, M.(2020). Visually imbalanced stereo matching. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 2029\u20132038","DOI":"10.1109\/CVPR42600.2020.00210"},{"key":"2331_CR143","doi-asserted-by":"crossref","unstructured":"Chen, X., Xiong, Z., Cheng, Z., Peng, J., Zhang, Y., Zha, Z.-J. (2022). Degradation-agnostic correspondence from resolution-asymmetric stereo. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 12962\u201312971","DOI":"10.1109\/CVPR52688.2022.01262"},{"key":"2331_CR144","doi-asserted-by":"crossref","unstructured":"Song, T., Kim, S., Sohn, K.(2023). Unsupervised deep asymmetric stereo matching with spatially-adaptive self-similarity. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 13672\u201313680","DOI":"10.1109\/CVPR52729.2023.01314"},{"key":"2331_CR145","doi-asserted-by":"crossref","unstructured":"Butler, D.J., Wulff, J., Stanley, G.B., Black, M.J. (2012). A naturalistic open source movie for optical flow evaluation. In Computer Vision\u2013ECCV 2012: 12th European Conference on Computer Vision, Florence, Italy, October 7-13, 2012, Proceedings, Part VI 12, 611\u2013625. Springer","DOI":"10.1007\/978-3-642-33783-3_44"},{"key":"2331_CR146","doi-asserted-by":"crossref","unstructured":"Scharstein, D., Hirschm\u00fcller, H., Kitajima, Y., Krathwohl, G., Ne\u0161i\u0107, N., Wang, X., Westling, P.(2014). High-resolution stereo datasets with subpixel-accurate ground truth. In Pattern Recognition: 36th German Conference, GCPR 2014, M\u00fcnster, Germany, September 2-5, 2014, Proceedings 36, 31\u201342. Springer","DOI":"10.1007\/978-3-319-11752-2_3"},{"key":"2331_CR147","doi-asserted-by":"crossref","unstructured":"Duggal, S., Wang, S., Ma, W.-C., Hu, R., Urtasun, R.(2019). Deeppruner: Learning efficient stereo matching via differentiable patchmatch. In Proceedings of the IEEE\/CVF International Conference on Computer Vision, 4384\u20134393","DOI":"10.1109\/ICCV.2019.00448"},{"key":"2331_CR148","doi-asserted-by":"crossref","unstructured":"Ranftl, R., Lasinger, K., Hafner, D., Schindler, K., Koltun, V.(2022). Towards robust monocular depth estimation: Mixing datasets for zero-shot cross-dataset transfer. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(3)","DOI":"10.1109\/TPAMI.2020.3019967"},{"issue":"4","key":"2331_CR149","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3528223.3530127","volume":"41","author":"T M\u00fcller","year":"2022","unstructured":"M\u00fcller, T., Evans, A., Schied, C., & Keller, A. (2022). Instant neural graphics primitives with a multiresolution hash encoding. ACM Transactions on Graphics (TOG), 41(4), 1\u201315.","journal-title":"ACM Transactions on Graphics (TOG)"},{"key":"2331_CR150","doi-asserted-by":"crossref","unstructured":"Godard, C., Mac Aodha, O., Brostow, G.J.(2017). Unsupervised monocular depth estimation with left-right consistency. In: CVPR","DOI":"10.1109\/CVPR.2017.699"},{"key":"2331_CR151","doi-asserted-by":"crossref","unstructured":"Zhou, Z., Dong, Q.(2023). Two-in-one depth: Bridging the gap between monocular and binocular self-supervised depth estimation. In Proceedings of the IEEE\/CVF International Conference on Computer Vision, 9411\u20139421","DOI":"10.1109\/ICCV51070.2023.00863"},{"key":"2331_CR152","doi-asserted-by":"crossref","unstructured":"Mayer, N., Ilg, E., Hausser, P., Fischer, P., Cremers, D., Dosovitskiy, A., Brox, T. (2016). A large dataset to train convolutional networks for disparity, optical flow, and scene flow estimation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 4040\u20134048","DOI":"10.1109\/CVPR.2016.438"},{"issue":"10","key":"2331_CR153","doi-asserted-by":"publisher","first-page":"2361","DOI":"10.1109\/TPAMI.2019.2947374","volume":"42","author":"X Cheng","year":"2019","unstructured":"Cheng, X., Wang, P., & Yang, R. (2019). Learning depth with convolutional spatial propagation network. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(10), 2361\u20132379.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2331_CR154","doi-asserted-by":"crossref","unstructured":"Zhang, F., Prisacariu, V., Yang, R., Torr, P.H.(2019). Ga-net: Guided aggregation net for end-to-end stereo matching. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 185\u2013194","DOI":"10.1109\/CVPR.2019.00027"},{"key":"2331_CR155","doi-asserted-by":"crossref","unstructured":"Yang, G., Manela, J., Happold, M., Ramanan, D.(2019). Hierarchical deep stereo matching on high-resolution images. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 5515\u20135524","DOI":"10.1109\/CVPR.2019.00566"},{"issue":"65","key":"2331_CR156","first-page":"1","volume":"17","author":"J \u017dbontar","year":"2016","unstructured":"\u017dbontar, J., & LeCun, Y. (2016). Stereo matching by training a convolutional neural network to compare image patches. Journal of Machine Learning Research, 17(65), 1\u201332.","journal-title":"Journal of Machine Learning Research"},{"key":"2331_CR157","doi-asserted-by":"crossref","unstructured":"Liang, Z., Feng, Y., Guo, Y., Liu, H., Chen, W., Qiao, L., Zhou, L., Zhang, J. (2018). Learning for disparity estimation through feature constancy. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2811\u20132820","DOI":"10.1109\/CVPR.2018.00297"},{"issue":"2","key":"2331_CR158","doi-asserted-by":"publisher","first-page":"328","DOI":"10.1109\/TPAMI.2007.1166","volume":"30","author":"H Hirschmuller","year":"2007","unstructured":"Hirschmuller, H. (2007). Stereo processing by semiglobal matching and mutual information. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(2), 328\u2013341.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2331_CR159","unstructured":"Jiang, H., Xu, R., Jiang, W.(2022). An improved raftstereo trained with a mixed dataset for the robust vision challenge 2022. arXiv preprint arXiv:2210.12785"},{"key":"2331_CR160","doi-asserted-by":"crossref","unstructured":"Rao, Z., Dai, Y., Shen, Z., He, R. (2022). Rethinking training strategy in stereo matching. IEEE Transactions on Neural Networks and Learning Systems","DOI":"10.1109\/TNNLS.2022.3146306"},{"key":"2331_CR161","doi-asserted-by":"crossref","unstructured":"Mehltretter, M., Heipke, C.(2019). Cnn-based cost volume analysis as confidence measure for dense matching. In Proceedings of the IEEE\/CVF International Conference on Computer Vision Workshops, 0\u20130","DOI":"10.1109\/ICCVW.2019.00262"},{"key":"2331_CR162","doi-asserted-by":"crossref","unstructured":"Ilg, E., Saikia, T., Keuper, M., Brox, T.(2018). Occlusions, motion and depth boundaries with a generic network for disparity, optical flow or scene flow estimation. In Proceedings of the European Conference on Computer Vision (ECCV), 614\u2013630","DOI":"10.1007\/978-3-030-01258-8_38"},{"key":"2331_CR163","doi-asserted-by":"crossref","unstructured":"Batsos, K., Cai, C., Mordohai, P. (2018). Cbmv: A coalesced bidirectional matching volume for disparity estimation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2060\u20132069","DOI":"10.1109\/CVPR.2018.00220"},{"key":"2331_CR164","doi-asserted-by":"crossref","unstructured":"Geiger, A., Roser, M., Urtasun, R.(2010). Efficient large-scale stereo matching. In Asian Conference on Computer Vision, 25\u201338. Springer","DOI":"10.1007\/978-3-642-19315-6_3"},{"key":"2331_CR165","doi-asserted-by":"crossref","unstructured":"Yin, Z., Darrell, T., Yu, F. (2019). Hierarchical discrete distribution decomposition for match density estimation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 6044\u20136053","DOI":"10.1109\/CVPR.2019.00620"},{"issue":"2","key":"2331_CR166","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1007\/s00355-010-0475-4","volume":"36","author":"M Schulze","year":"2011","unstructured":"Schulze, M. (2011). A new monotonic, clone-independent, reversal symmetric, and condorcet-consistent single-winner election method. Social Choice and Welfare, 36(2), 267\u2013303.","journal-title":"Social Choice and Welfare"},{"key":"2331_CR167","doi-asserted-by":"crossref","unstructured":"Ramirez, P.Z., Tosi, F., Poggi, M., Salti, S., Mattoccia, S., Di\u00a0Stefano, L. (2022). Open challenges in deep stereo: The booster dataset. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 21168\u201321178","DOI":"10.1109\/CVPR52688.2022.02049"},{"key":"2331_CR168","doi-asserted-by":"crossref","unstructured":"Ramirez, P.Z., Costanzino, A., Tosi, F., Poggi, M., Salti, S., Mattoccia, S., Di\u00a0Stefano, L. (2023). Booster: a benchmark for depth from images of specular and transparent surfaces. IEEE Transactions on Pattern Analysis and Machine Intelligence","DOI":"10.1109\/CVPRW59228.2023.00143"},{"key":"2331_CR169","doi-asserted-by":"crossref","unstructured":"Yang, L., Kang, B., Huang, Z., Xu, X., Feng, J., Zhao, H. (2024). Depth anything: Unleashing the power of large-scale unlabeled data. In: CVPR","DOI":"10.1109\/CVPR52733.2024.00987"},{"key":"2331_CR170","unstructured":"Yang, L., Kang, B., Huang, Z., Zhao, Z., Xu, X., Feng, J., Zhao, H. (2024). Depth anything v2. arXiv:2406.09414"},{"key":"2331_CR171","doi-asserted-by":"crossref","unstructured":"Wang, S., Leroy, V., Cabon, Y., Chidlovskii, B., Revaud, J. (2024). Dust3r: Geometric 3d vision made easy. In: CVPR","DOI":"10.1109\/CVPR52733.2024.01956"},{"key":"2331_CR172","doi-asserted-by":"crossref","unstructured":"Leroy, V., Cabon, Y., Revaud, J. (2024). Grounding image matching in 3d with mast3r. In European Conference on Computer Vision, 71\u201391. Springer","DOI":"10.1007\/978-3-031-73220-1_5"},{"key":"2331_CR173","unstructured":"Zhang, J., Herrmann, C., Hur, J., Jampani, V., Darrell, T., Cole, F., Sun, D., Yang, M.-H.(2024). MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion. https:\/\/arxiv.org\/abs\/2410.03825"},{"key":"2331_CR174","doi-asserted-by":"crossref","unstructured":"Shin, U., Park, J., Kweon, I.S. (2023). Deep depth estimation from thermal image. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 1043\u20131053","DOI":"10.1109\/CVPR52729.2023.00107"},{"key":"2331_CR175","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s11432-019-2803-x","volume":"63","author":"W Bao","year":"2020","unstructured":"Bao, W., Wang, W., Xu, Y., Guo, Y., Hong, S., & Zhang, X. (2020). Instereo2k: A large real dataset for stereo matching in indoor scenes. Science China Information Sciences, 63, 1\u201311.","journal-title":"Science China Information Sciences"},{"key":"2331_CR176","doi-asserted-by":"crossref","unstructured":"Chaney, K., Cladera, F., Wang, Z., Bisulco, A., Hsieh, M.A., Korpela, C., Kumar, V., Taylor, C.J., Daniilidis, K. (2023). M3ed: Multi-robot, multi-sensor, multi-environment event dataset. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 4015\u20134022","DOI":"10.1109\/CVPRW59228.2023.00419"},{"issue":"3","key":"2331_CR177","doi-asserted-by":"publisher","first-page":"4947","DOI":"10.1109\/LRA.2021.3068942","volume":"6","author":"M Gehrig","year":"2021","unstructured":"Gehrig, M., Aarents, W., Gehrig, D., & Scaramuzza, D. (2021). Dsec: A stereo event camera dataset for driving scenarios. IEEE Robotics and Automation Letters, 6(3), 4947\u20134954.","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2331_CR178","doi-asserted-by":"crossref","unstructured":"Zabih, R., Woodfill, J. (1994). Non-parametric local transforms for computing visual correspondence. In Computer Vision-ECCV\u201994: Third European Conference on Computer Vision Stockholm, Sweden, May 2\u20136 1994 Proceedings, Volume II 3, 151\u2013158. Springer","DOI":"10.1007\/BFb0028345"},{"key":"2331_CR179","doi-asserted-by":"crossref","unstructured":"Han, X., Leung, T., Jia, Y., Sukthankar, R., Berg, A.C.(2015). Matchnet: Unifying feature and metric learning for patch-based matching. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 3279\u20133286","DOI":"10.1109\/CVPR.2015.7298948"},{"issue":"65","key":"2331_CR180","first-page":"1","volume":"17","author":"J \u017dbontar","year":"2016","unstructured":"\u017dbontar, J., & LeCun, Y. (2016). Stereo matching by training a convolutional neural network to compare image patches. Journal of Machine Learning Research, 17(65), 1\u201332.","journal-title":"Journal of Machine Learning Research"},{"key":"2331_CR181","doi-asserted-by":"crossref","unstructured":"Chen, Z., Sun, X., Wang, L., Yu, Y., Huang, C. (2015). A deep visual correspondence embedding model for stereo matching costs. In Proceedings of the IEEE International Conference on Computer Vision, 972\u2013980","DOI":"10.1109\/ICCV.2015.117"},{"key":"2331_CR182","doi-asserted-by":"crossref","unstructured":"Luo, W., Schwing, A.G., Urtasun, R. (2016). Efficient deep learning for stereo matching. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 5695\u20135703","DOI":"10.1109\/CVPR.2016.614"},{"key":"2331_CR183","doi-asserted-by":"crossref","unstructured":"Park, M.-G., Yoon, K.-J. (2015). Leveraging stereo matching with learning-based confidence measures. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 101\u2013109","DOI":"10.1109\/CVPR.2015.7298605"},{"key":"2331_CR184","doi-asserted-by":"crossref","unstructured":"Spyropoulos, A., Komodakis, N., Mordohai, P. (2014). Learning to detect ground control points for improving the accuracy of stereo matching. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1621\u20131628","DOI":"10.1109\/CVPR.2014.210"},{"key":"#cr-split#-2331_CR185.1","unstructured":"Poggi, M., Mattoccia, S. (2016). Learning a general-purpose confidence measure based on o"},{"key":"#cr-split#-2331_CR185.2","unstructured":"(1) features and a smarter aggregation strategy for semi global matching. In 2016 Fourth International Conference on 3D Vision (3DV), 509-518. IEEE"},{"key":"2331_CR186","doi-asserted-by":"crossref","unstructured":"Shaked, A., Wolf, L. (2017) Improved stereo matching with constant highway networks and reflective confidence learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 4641\u20134650","DOI":"10.1109\/CVPR.2017.730"},{"key":"2331_CR187","doi-asserted-by":"crossref","unstructured":"Gidaris, S., Komodakis, N. (2017). Detect, replace, refine: Deep structured prediction for pixel wise labeling. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 5248\u20135257","DOI":"10.1109\/CVPR.2017.760"},{"key":"2331_CR188","doi-asserted-by":"crossref","unstructured":"Batsos, K., Mordohai, P. (2018). Recresnet: A recurrent residual cnn architecture for disparity map enhancement. In 2018 International Conference on 3D Vision (3DV), 238\u2013247. IEEE","DOI":"10.1109\/3DV.2018.00036"},{"key":"2331_CR189","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., Brox, T. (2015). U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-assisted intervention\u2013MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, 234\u2013241. Springer","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"2331_CR190","doi-asserted-by":"crossref","unstructured":"Khamis, S., Fanello, S., Rhemann, C., Kowdle, A., Valentin, J., Izadi, S. (2018). Stereonet: Guided hierarchical refinement for real-time edge-aware depth prediction. In Proceedings of the European Conference on Computer Vision (ECCV), 573\u2013590","DOI":"10.1007\/978-3-030-01267-0_35"},{"key":"2331_CR191","doi-asserted-by":"crossref","unstructured":"Wang, Y., Lai, Z., Huang, G., Wang, B.H., Van Der\u00a0Maaten, L., Campbell, M., Weinberger, K.Q. (2019). Anytime stereo image depth estimation on mobile devices. In 2019 International Conference on Robotics and Automation (ICRA), 5893\u20135900. IEEE","DOI":"10.1109\/ICRA.2019.8794003"},{"key":"2331_CR192","doi-asserted-by":"crossref","unstructured":"Song, X., Zhao, X., Hu, H., Fang, L. (2019). Edgestereo: A context integrated residual pyramid network for stereo matching. In Computer Vision\u2013ACCV 2018: 14th Asian Conference on Computer Vision, Perth, Australia, December 2\u20136, 2018, Revised Selected Papers, Part V 14, 20\u201335. Springer","DOI":"10.1007\/978-3-030-20873-8_2"},{"key":"2331_CR193","unstructured":"Cabon, Y., Murray, N., Humenberger, M. (2020). Virtual kitti 2. arXiv preprint arXiv:2001.10773"},{"key":"2331_CR194","doi-asserted-by":"crossref","unstructured":"Wang, Q., Zheng, S., Yan, Q., Deng, F., Zhao, K., Chu, X. (2021). Irs: A large naturalistic indoor robotics stereo dataset to train deep models for disparity and surface normal estimation. In 2021 IEEE International Conference on Multimedia and Expo (ICME), 1\u20136. IEEE","DOI":"10.1109\/ICME51207.2021.9428423"},{"key":"2331_CR195","doi-asserted-by":"crossref","unstructured":"Mehl, L., Schmalfuss, J., Jahedi, A., Nalivayko, Y., Bruhn, A.(2023). Spring: A high-resolution high-detail dataset and benchmark for scene flow, optical flow and stereo. In Proc. IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR52729.2023.00482"},{"key":"2331_CR196","unstructured":"Hua, Y., Kohli, P., Uplavikar, P., Ravi, A., Gunaseelan, S., Orozco, J., Li, E. (2020). Holopix50k: A large-scale in-the-wild stereo image dataset. arXiv preprint arXiv:2003.11172"},{"key":"2331_CR197","doi-asserted-by":"crossref","unstructured":"Treible, W., Saponaro, P., Sorensen, S., Kolagunda, A., O\u2019Neal, M., Phelan, B., Sherbondy, K., Kambhamettu, C. (2017). Cats: A color and thermal stereo benchmark. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR.2017.22"},{"key":"2331_CR198","doi-asserted-by":"publisher","unstructured":"Zhu, A. Z., Thakur, D., \u00d6zaslan, T., Pfrommer, B., Kumar, V., & Daniilidis, K. (2018). The multivehicle stereo event camera dataset: An event camera dataset for 3d perception. IEEE Robotics and Automation Letters, 3(3), 2032\u20132039. https:\/\/doi.org\/10.1109\/LRA.2018.2800793","DOI":"10.1109\/LRA.2018.2800793"},{"key":"2331_CR199","doi-asserted-by":"crossref","unstructured":"Zhang, J., Singh, S. (2014). Loam: Lidar odometry and mapping in real-time. In: Robotics: Science and Systems, 2, 1\u20139 Berkeley, CA","DOI":"10.15607\/RSS.2014.X.007"},{"issue":"2","key":"2331_CR200","doi-asserted-by":"publisher","first-page":"4861","DOI":"10.1109\/LRA.2022.3152830","volume":"7","author":"C Bai","year":"2022","unstructured":"Bai, C., Xiao, T., Chen, Y., Wang, H., Zhang, F., & Gao, X. (2022). Faster-lio: Lightweight tightly coupled lidar-inertial odometry using parallel sparse incremental voxels. IEEE Robotics and Automation Letters, 7(2), 4861\u20134868. https:\/\/doi.org\/10.1109\/LRA.2022.3152830","journal-title":"IEEE Robotics and Automation Letters"},{"key":"2331_CR201","doi-asserted-by":"crossref","unstructured":"Yang, G., Manela, J., Happold, M., Ramanan, D. (2019). Hierarchical deep stereo matching on high-resolution images. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR.2019.00566"},{"key":"2331_CR202","unstructured":"Dosovitskiy, A., Ros, G., Codevilla, F., Lopez, A., Koltun, V. (2017). Carla: An open urban driving simulator. In Conference on Robot Learning, 1\u201316. PMLR"},{"key":"2331_CR203","doi-asserted-by":"crossref","unstructured":"Gaidon, A., Wang, Q., Cabon, Y., Vig, E. (2016). Virtual worlds as proxy for multi-object tracking analysis. In: CVPR","DOI":"10.1109\/CVPR.2016.470"},{"key":"2331_CR204","doi-asserted-by":"crossref","unstructured":"Wang, W., Zhu, D., Wang, X., Hu, Y., Qiu, Y., Wang, C., Hu, Y., Kapoor, A., Scherer, S. (2020). Tartanair: A dataset to push the limits of visual slam. In 2020 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), 4909\u20134916. IEEE","DOI":"10.1109\/IROS45743.2020.9341801"},{"key":"2331_CR205","unstructured":"Jospin, L., Antony, A., Xu, L., Laga, H., Boussaid, F., & Bennamoun, M. (2022). Active-passive simstereo-benchmarking the cross-generalization capabilities of deep learning-based stereo methods. Advances in Neural Information Processing Systems, 35, 29235\u201329247."},{"key":"2331_CR206","doi-asserted-by":"crossref","unstructured":"Wu, C.-Y., Wang, J., Hall, M., Neumann, U., Su, S. (2022). Toward practical monocular indoor depth estimation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 3814\u20133824","DOI":"10.1109\/CVPR52688.2022.00379"}],"container-title":["International Journal of Computer Vision"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-024-02331-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11263-024-02331-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-024-02331-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,7]],"date-time":"2025-06-07T06:05:20Z","timestamp":1749276320000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11263-024-02331-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,2,26]]},"references-count":207,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2025,7]]}},"alternative-id":["2331"],"URL":"https:\/\/doi.org\/10.1007\/s11263-024-02331-0","relation":{},"ISSN":["0920-5691","1573-1405"],"issn-type":[{"value":"0920-5691","type":"print"},{"value":"1573-1405","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,2,26]]},"assertion":[{"value":"1 August 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 December 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 February 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}