{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T18:12:32Z","timestamp":1775326352938,"version":"3.50.1"},"reference-count":35,"publisher":"MDPI AG","issue":"23","license":[{"start":{"date-parts":[[2024,12,5]],"date-time":"2024-12-05T00:00:00Z","timestamp":1733356800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Natural Science Foundation of China","award":["62401310"],"award-info":[{"award-number":["62401310"]}]},{"name":"National Natural Science Foundation of China","award":["42406216"],"award-info":[{"award-number":["42406216"]}]},{"name":"National Natural Science Foundation of China","award":["ZR2024QD031"],"award-info":[{"award-number":["ZR2024QD031"]}]},{"name":"National Natural Science Foundation of China","award":["ZR2021QF028"],"award-info":[{"award-number":["ZR2021QF028"]}]},{"name":"National Natural Science Foundation of China","award":["KF2024SD003"],"award-info":[{"award-number":["KF2024SD003"]}]},{"name":"National Natural Science Foundation of China","award":["KF2024SD009"],"award-info":[{"award-number":["KF2024SD009"]}]},{"name":"Natural Science Foundation of Shandong Province","award":["62401310"],"award-info":[{"award-number":["62401310"]}]},{"name":"Natural Science Foundation of Shandong Province","award":["42406216"],"award-info":[{"award-number":["42406216"]}]},{"name":"Natural Science Foundation of Shandong Province","award":["ZR2024QD031"],"award-info":[{"award-number":["ZR2024QD031"]}]},{"name":"Natural Science Foundation of Shandong Province","award":["ZR2021QF028"],"award-info":[{"award-number":["ZR2021QF028"]}]},{"name":"Natural Science Foundation of Shandong Province","award":["KF2024SD003"],"award-info":[{"award-number":["KF2024SD003"]}]},{"name":"Natural Science Foundation of Shandong Province","award":["KF2024SD009"],"award-info":[{"award-number":["KF2024SD009"]}]},{"name":"opening project of Shandong Province Engineering Research Centre (Qingdao University of Science and Technology)","award":["62401310"],"award-info":[{"award-number":["62401310"]}]},{"name":"opening project of Shandong Province Engineering Research Centre (Qingdao University of Science and Technology)","award":["42406216"],"award-info":[{"award-number":["42406216"]}]},{"name":"opening project of Shandong Province Engineering Research Centre (Qingdao University of Science and Technology)","award":["ZR2024QD031"],"award-info":[{"award-number":["ZR2024QD031"]}]},{"name":"opening project of Shandong Province Engineering Research Centre (Qingdao University of Science and Technology)","award":["ZR2021QF028"],"award-info":[{"award-number":["ZR2021QF028"]}]},{"name":"opening project of Shandong Province Engineering Research Centre (Qingdao University of Science and Technology)","award":["KF2024SD003"],"award-info":[{"award-number":["KF2024SD003"]}]},{"name":"opening project of Shandong Province Engineering Research Centre (Qingdao University of Science and Technology)","award":["KF2024SD009"],"award-info":[{"award-number":["KF2024SD009"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Stereo matching plays a vital role in underwater environments, where accurate depth estimation is crucial for applications such as robotics and marine exploration. However, underwater imaging presents significant challenges, including noise, blurriness, and optical distortions that hinder effective stereo matching. This study develops two specialized stereo matching networks: UWNet and its lightweight counterpart, Fast-UWNet. UWNet utilizes self- and cross-attention mechanisms alongside an adaptive 1D-2D cross-search to enhance cost volume representation and refine disparity estimation through a cascaded update module, effectively addressing underwater imaging challenges. Due to the need for timely responses in underwater operations by robots and other devices, real-time processing speed is critical for task completion. Fast-UWNet addresses this challenge by prioritizing efficiency, eliminating the reliance on the time-consuming recurrent updates commonly used in traditional methods. Instead, it directly converts the cost volume into a set of disparity candidates and their associated confidence scores. Adaptive interpolation, guided by content and confidence information, refines the cost volume to produce the final accurate disparity. This streamlined approach achieves an impressive inference speed of 0.02 s per image. Comprehensive tests conducted in diverse underwater settings demonstrate the effectiveness of both networks, showcasing their ability to achieve reliable depth perception.<\/jats:p>","DOI":"10.3390\/rs16234570","type":"journal-article","created":{"date-parts":[[2024,12,5]],"date-time":"2024-12-05T11:27:53Z","timestamp":1733398073000},"page":"4570","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Reliable and Effective Stereo Matching for Underwater Scenes"],"prefix":"10.3390","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0009-0000-4664-6323","authenticated-orcid":false,"given":"Lvwei","family":"Zhu","sequence":"first","affiliation":[{"name":"School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3024-6963","authenticated-orcid":false,"given":"Ying","family":"Gao","sequence":"additional","affiliation":[{"name":"School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jiankai","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1507-1429","authenticated-orcid":false,"given":"Yongqing","family":"Li","sequence":"additional","affiliation":[{"name":"School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9950-6084","authenticated-orcid":false,"given":"Xueying","family":"Li","sequence":"additional","affiliation":[{"name":"School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2024,12,5]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Rodionov, A.Y., Dubrovin, F., Unru, P., and Kulik, S.Y. (2017, January 29\u201331). Experimental research of distance estimation accuracy using underwater acoustic modems to provide navigation of underwater objects. Proceedings of the 2017 24th Saint Petersburg International Conference on Integrated Navigation Systems (ICINS), Saint Petersburg, Russia.","DOI":"10.23919\/ICINS.2017.7995618"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Tan, C.S., Mohd-Mokhtar, R., and Arshad, M.R. (2018, January 1\u20133). Fast fourier transform overlap approach for underwater acoustic positioning system. Proceedings of the 2018 IEEE 8th International Conference on Underwater System Technology: Theory and Applications (USYS), Wuhan, China.","DOI":"10.1109\/USYS.2018.8779009"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Negre, P.L., Bonin-Font, F., and Oliver, G. (2016, January 16\u201321). Cluster-based loop closing detection for underwater slam in feature-poor regions. Proceedings of the 2016 IEEE international conference on robotics and automation (ICRA), Stockholm, Sweden.","DOI":"10.1109\/ICRA.2016.7487416"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Digumarti, S.T., Chaurasia, G., Taneja, A., Siegwart, R., Thomas, A., and Beardsley, P. (2016, January 7\u201310). Underwater 3d capture using a low-cost commercial depth camera. Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Placid, NY, USA.","DOI":"10.1109\/WACV.2016.7477644"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Akkaynak, D., Treibitz, T., Shlesinger, T., Loya, Y., Tamir, R., and Iluz, D. (2017, January 21\u201326). What is the space of attenuation coefficients in underwater computer vision?. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.68"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"891","DOI":"10.1007\/s11760-021-02052-8","article-title":"Underwater stereo-matching algorithm based on belief propagation","volume":"17","author":"Xu","year":"2023","journal-title":"Signal Image Video Process."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"5089","DOI":"10.1109\/TCSVT.2023.3249223","article-title":"Underwater depth estimation via stereo adaptation networks","volume":"33","author":"Ye","year":"2023","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Xu, B., Xu, Y., Yang, X., Jia, W., and Guo, Y. (2021, January 20\u201325). Bilateral grid learning for stereo matching networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01231"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"3995","DOI":"10.1109\/TCSVT.2019.2958950","article-title":"Deep joint depth estimation and color correction from monocular underwater images based on unsupervised adaptation networks","volume":"30","author":"Ye","year":"2019","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Lipson, L., Teed, Z., and Deng, J. (2021, January 1\u20133). Raft-stereo: Multilevel recurrent field transforms for stereo matching. Proceedings of the 2021 International Conference on 3D Vision (3DV), London, UK.","DOI":"10.1109\/3DV53792.2021.00032"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Xu, G., Wang, X., Ding, X., and Yang, X. (2023, January 17\u201324). Iterative geometry encoding volume for stereo matching. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.02099"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Zhao, H., Zhou, H., Zhang, Y., Chen, J., Yang, Y., and Zhao, Y. (2023, January 17\u201324). High-frequency stereo matching network. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.00134"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Li, J., Wang, P., Xiong, P., Cai, T., Yan, Z., Yang, L., Liu, J., Fan, H., and Liu, S. (2022, January 18\u201324). Practical stereo matching via cascaded recurrent network with adaptive correlation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.01578"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Chen, Z., Long, W., Yao, H., Zhang, Y., Wang, B., Qin, Y., and Wu, J. (2024, January 16\u201322). Mocha-stereo: Motif channel attention network for stereo matching. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR52733.2024.02623"},{"key":"ref_15","unstructured":"Shi, Y. (2024). Rethinking iterative stereo matching from diffusion bridge model perspective. arXiv."},{"key":"ref_16","unstructured":"Xiao, W., and Zhao, W. (2024). Rectified iterative disparity for stereo matching. arXiv."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"74605","DOI":"10.1109\/ACCESS.2022.3185753","article-title":"Intelligent underwater stereo camera design for fish metric estimation using reliable object matching","volume":"10","author":"Ubina","year":"2022","journal-title":"IEEE Access"},{"key":"ref_18","first-page":"2822","article-title":"Underwater single image color restoration using haze-lines and a new quantitative dataset","volume":"43","author":"Berman","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Mayer, N., Ilg, E., Hausser, P., Fischer, P., Cremers, D., Dosovitskiy, A., and Brox, T. (2016, January 27\u201330). A large dataset to train convolutional networks for disparity, optical flow, and scene flow estimation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.438"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"443","DOI":"10.1109\/JOE.2022.3226202","article-title":"A reinforcement learning paradigm of configuring visual enhancement for object detection in underwater scenes","volume":"48","author":"Wang","year":"2023","journal-title":"IEEE J. Ocean. Eng."},{"key":"ref_21","first-page":"5618319","article-title":"Metalantis: A comprehensive underwater image enhancement framework","volume":"62","author":"Wang","year":"2024","journal-title":"IEEE Trans. Geosci. Remote. Sens."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"108411","DOI":"10.1016\/j.engappai.2024.108411","article-title":"Inspiration: A reinforcement learning-based human visual perception-driven image enhancement paradigm for underwater scenes","volume":"133","author":"Wang","year":"2024","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.isprsjprs.2024.06.019","article-title":"Self-organized underwater image enhancement","volume":"215","author":"Wang","year":"2024","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Huang, X., Liu, M.-Y., Belongie, S., and Kautz, J. (2018, January 8\u201314). Multimodal unsupervised image-to-image translation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01219-9_11"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Lee, H.-Y., Tseng, H.-Y., Huang, J.-B., Singh, M., and Yang, M.-H. (2018, January 8\u201314). Diverse image-to-image translation via disentangled representations. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01246-5_3"},{"key":"ref_26","first-page":"5519016","article-title":"Hyperspectral image super-resolution with convlstm skip-connections","volume":"62","author":"Xu","year":"2024","journal-title":"IEEE Trans. Geosci. Remote. Sens."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Huang, X., and Belongie, S. (2017, January 22\u201329). Arbitrary style transfer in real-time with adaptive instance normalization. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.167"},{"key":"ref_28","unstructured":"Liu, M.-Y., Huang, X., Mallya, A., Karras, T., Aila, T., Lehtinen, J., and Kautz, J. (November, January 27). Few-shot unsupervised image-to-image translation. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Ulyanov, D., Vedaldi, A., and Lempitsky, V. (2017, January 21\u201326). Improved texture networks: Maximizing quality and diversity in feed-forward stylization and texture synthesis. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.437"},{"key":"ref_30","unstructured":"Simonyan, K. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei-Fei, L. (2009, January 20\u201325). Imagenet: A large-scale hierarchical image database. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Sun, J., Shen, Z., Wang, Y., Bao, H., and Zhou, X. (2021, January 20\u201325). Loftr: Detector-free local feature matching with transformers. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00881"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"2461","DOI":"10.1109\/TPAMI.2023.3335480","article-title":"Accurate and efficient stereo matching via attention concatenation volume","volume":"46","author":"Xu","year":"2023","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Liu, B., Yu, H., and Long, Y. (March, January 22). Local similarity pattern and cost self-reassembling for deep stereo matching networks. Proceedings of the 36th AAAI Conference on Artificial Intelligence, Online.","DOI":"10.1609\/aaai.v36i2.20056"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Gan, Y., Xu, X., Sun, W., and Lin, L. (2018, January 8\u201314). Monocular depth estimation with affinity, vertical pooling, and label enhancement. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01219-9_14"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/23\/4570\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T16:48:04Z","timestamp":1760114884000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/23\/4570"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,12,5]]},"references-count":35,"journal-issue":{"issue":"23","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["rs16234570"],"URL":"https:\/\/doi.org\/10.3390\/rs16234570","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,12,5]]}}}