{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,2]],"date-time":"2026-06-02T03:07:35Z","timestamp":1780369655915,"version":"3.54.1"},"reference-count":49,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2026,1,14]],"date-time":"2026-01-14T00:00:00Z","timestamp":1768348800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Imaging"],"abstract":"<jats:p>Underwater optical images are the primary carriers of underwater scene information, playing a crucial role in marine resource exploration, underwater environmental monitoring, and engineering inspection. However, wavelength-dependent absorption and scattering severely deteriorate underwater images, leading to reduced contrast, chromatic distortions, and loss of structural details. To address these issues, we propose a U-shaped underwater image enhancement framework that integrates Swin-Transformer blocks with lightweight attention and residual modules. A Dual-Window Multi-Head Self-Attention (DWMSA) in the bottleneck models long-range context while preserving fine local structure. A Global-Aware Attention Map (GAMP) adaptively re-weights channels and spatial locations to focus on severely degraded regions. A Feature-Augmentation Residual Network (FARN) stabilizes deep training and emphasizes texture and color fidelity. Trained with a combination of Charbonnier, perceptual, and edge losses, our method achieves state-of-the-art results in PSNR and SSIM, the lowest LPIPS, and improvements in UIQM and UCIQE on the UFO-120 and EUVP datasets, with average metrics of PSNR 29.5 dB, SSIM 0.94, LPIPS 0.17, UIQM 3.62, and UCIQE 0.59. Qualitative results show reduced color cast, restored contrast, and sharper details. Code, weights, and evaluation scripts will be released to support reproducibility.<\/jats:p>","DOI":"10.3390\/jimaging12010044","type":"journal-article","created":{"date-parts":[[2026,1,14]],"date-time":"2026-01-14T12:04:22Z","timestamp":1768392262000},"page":"44","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["A Deep Feature Fusion Underwater Image Enhancement Model Based on Perceptual Vision Swin Transformer"],"prefix":"10.3390","volume":"12","author":[{"ORCID":"https:\/\/orcid.org\/0009-0009-7611-6513","authenticated-orcid":false,"given":"Shasha","family":"Tian","sequence":"first","affiliation":[{"name":"Faculty of Engineering, Rajamangala University of Technology Krungthep, Bangkok 10120, Thailand"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-5537-7995","authenticated-orcid":false,"given":"Adisorn","family":"Sirikham","sequence":"additional","affiliation":[{"name":"Faculty of Engineering, Rajamangala University of Technology Krungthep, Bangkok 10120, Thailand"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7199-6402","authenticated-orcid":false,"given":"Jessada","family":"Konpang","sequence":"additional","affiliation":[{"name":"Faculty of Engineering, Rajamangala University of Technology Krungthep, Bangkok 10120, Thailand"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Chuyang","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Information Engineering, Jiangsu College of Finance and Accounting, Lianyungang 222061, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2026,1,14]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"851","DOI":"10.1002\/rob.21837","article-title":"Understanding human motion and gestures for underwater human\u2013robot collaboration","volume":"36","author":"Islam","year":"2019","journal-title":"J. Field Robot."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Kennedy, B.R., and Rotjan, R.D. (2023). Mind the gap: Comparing exploration effort with global biodiversity patterns and climate projections to determine ocean areas with greatest exploration needs. Front. Mar. Sci., 10.","DOI":"10.3389\/fmars.2023.1219799"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Liu, X., Chen, Z., Xu, Z., Zheng, Z., Ma, F., and Wang, Y. (2024). Enhancement of underwater images through parallel fusion of transformer and CNN. J. Mar. Sci. Eng., 12.","DOI":"10.20944\/preprints202407.1575.v1"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"109408","DOI":"10.1016\/j.sigpro.2024.109408","article-title":"Algorithms for improving the quality of underwater optical images: A comprehensive review","volume":"219","author":"Shuang","year":"2024","journal-title":"Signal Process."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1016\/j.neunet.2023.11.008","article-title":"Robust underwater image enhancement with cascaded multi-level sub-networks and triple attention mechanism","volume":"169","author":"Zhang","year":"2024","journal-title":"Neural Netw."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Zhao, X., Wang, Z., Deng, Z., and Qin, H. (2024). G-net: An efficient convolutional network for underwater object detection. J. Mar. Sci. Eng., 12.","DOI":"10.3390\/jmse12010116"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"2340","DOI":"10.1109\/TIP.2021.3051462","article-title":"Enlightengan: Deep light enhancement without paired supervision","volume":"30","author":"Jiang","year":"2021","journal-title":"IEEE Trans. Image Process."},{"key":"ref_8","first-page":"2341","article-title":"Single image haze removal using dark channel prior","volume":"33","author":"He","year":"2010","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"5664","DOI":"10.1109\/TIP.2016.2612882","article-title":"Underwater image enhancement by dehazing with minimum information loss and histogram distribution prior","volume":"25","author":"Li","year":"2016","journal-title":"IEEE Trans. Image Process."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1579","DOI":"10.1109\/TIP.2017.2663846","article-title":"Underwater image restoration based on image blurriness and light absorption","volume":"26","author":"Peng","year":"2017","journal-title":"IEEE Trans. Image Process."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"She, M., Seegr\u00e4ber, F., Nakath, D., and K\u00f6ser, K. (2024, January 14\u201318). Refractive COLMAP: Refractive structure-from-motion revisited. Proceedings of the 2024 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Abu Dhabi, United Arab Emirates.","DOI":"10.1109\/IROS58592.2024.10802043"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Iqbal, K., Odetayo, M., James, A., Salam, R.A., and Talib, A.Z.H. (2010, January 10\u201313). Enhancing the low quality images using unsupervised colour correction method. Proceedings of the 2010 IEEE International Conference on Systems, Man and Cybernetics, Istanbul, Turkey.","DOI":"10.1109\/ICSMC.2010.5642311"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"332","DOI":"10.1016\/j.asoc.2015.08.033","article-title":"Enhancement of low quality underwater image through integrated global and local contrast correction","volume":"37","author":"Ghani","year":"2015","journal-title":"Appl. Soft Comput."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Akkaynak, D., and Treibitz, T. (2018, January 18\u201323). A revised underwater image formation model. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00703"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Fu, X., Zhuang, P., Huang, Y., Liao, Y., Zhang, X.-P., and Ding, X. (2014, January 27\u201330). A retinex-based enhancing approach for single underwater image. Proceedings of the 2014 IEEE International Conference on Image Processing (ICIP), Paris, France.","DOI":"10.1109\/ICIP.2014.7025927"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"7838","DOI":"10.1109\/TMM.2024.3372400","article-title":"A pixel distribution remapping and multi-prior retinex variational model for underwater image enhancement","volume":"26","author":"Zhou","year":"2024","journal-title":"IEEE Trans. Multimed."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1016\/j.isprsjprs.2024.02.004","article-title":"Advanced underwater image restoration in complex illumination conditions","volume":"209","author":"Song","year":"2024","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Wang, S., Lu, Q., Peng, B., Nie, Y., and Tao, Q. (2024). DPEC: Dual-path error compensation method for enhanced low-light image clarity. arXiv.","DOI":"10.2139\/ssrn.5012613"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1117\/12.958279","article-title":"A computer model for underwater camera systems","volume":"208","author":"McGlamery","year":"1980","journal-title":"Ocean Opt. VI"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"746052","DOI":"10.1155\/2010\/746052","article-title":"Underwater image processing: State of the art of restoration and image enhancement methods","volume":"2010","author":"Schettini","year":"2010","journal-title":"EURASIP J. Adv. Signal Process."},{"key":"ref_21","first-page":"1","article-title":"A survey on underwater computer vision","volume":"55","year":"2023","journal-title":"ACM Comput. Surv."},{"key":"ref_22","unstructured":"Cong, X., Zhao, Y., Gui, J., Hou, J., and Tao, D. (2024). A comprehensive survey on underwater image enhancement based on deep learning. arXiv."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"38371","DOI":"10.1007\/s11042-023-15156-9","article-title":"A systematic review of the methodologies for the processing and enhancement of the underwater images","volume":"82","author":"Singh","year":"2023","journal-title":"Multimed. Tools Appl."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"3997","DOI":"10.1109\/TIP.2022.3177129","article-title":"Underwater image enhancement via minimal color loss and locally adaptive contrast enhancement","volume":"31","author":"Zhang","year":"2022","journal-title":"IEEE Trans. Image Process."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Chen, K., Li, Z., Zhou, F., and Yu, Z. (2025). CASF-Net: Underwater image enhancement with color correction and spatial fusion. Sensors, 25.","DOI":"10.3390\/s25082574"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"3613","DOI":"10.1049\/iet-ipr.2020.0003","article-title":"UCT-GAN: Underwater image colour transfer generative adversarial network","volume":"14","author":"Deng","year":"2020","journal-title":"IET Image Proc."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Guan, F., Lu, S., Lai, H., and Du, X. (2023). AUIE\u2013GAN: Adaptive underwater image enhancement based on generative adversarial networks. J. Mar. Sci. Eng., 11.","DOI":"10.3390\/jmse11071476"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1149","DOI":"10.1049\/ipr2.12702","article-title":"Underwater image enhancement using a mixed generative adversarial network","volume":"17","author":"Mu","year":"2023","journal-title":"IET Image Proc."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"218838","DOI":"10.1109\/ACCESS.2020.3041280","article-title":"Underwater image enhancement based on a spiral generative adversarial framework","volume":"8","author":"Han","year":"2020","journal-title":"IEEE Access."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"1024","DOI":"10.1109\/TAI.2024.3508667","article-title":"Unformer: A transformer-based approach for adaptive multi-scale feature aggregation in underwater image enhancement","volume":"6","author":"Qing","year":"2024","journal-title":"IEEE Trans. Artif. Intell."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1016\/j.cag.2023.01.009","article-title":"UDAformer: Underwater image enhancement based on dual attention transformer","volume":"111","author":"Shen","year":"2023","journal-title":"Comput. Graph."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1181","DOI":"10.1007\/s11760-022-02325-w","article-title":"MLCA2F: Multi-level context attentional feature fusion for COVID-19 lesion segmentation from CT scans","volume":"17","author":"Bakkouri","year":"2023","journal-title":"Signal Image Video Process."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"410","DOI":"10.1007\/s12145-025-01913-x","article-title":"U-TWGAN: Underwater image enhancement via wavelet-transformer and sparse multilayer perceptrons generative adversarial network","volume":"18","author":"Zhang","year":"2025","journal-title":"Earth Sci. Inf."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., and Guo, B. (2021, January 11\u201317). Swin transformer: Hierarchical vision transformer using shifted windows. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"He, K., Chen, X., Xie, S., Li, Y., Doll\u00e1r, P., and Girshick, R. (2022, January 19\u201324). Masked autoencoders are scalable vision learners. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.01553"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Fan, C.M., Liu, T.J., and Liu, K.H. (June, January 27). SUNet: Swin transformer UNet for image denoising. Proceedings of the 2022 IEEE International Symposium on Circuits and Systems (ISCAS), Austin, TX, USA.","DOI":"10.1109\/ISCAS48785.2022.9937486"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Tian, S., Sirikham, A., Konpang, J., and Wang, C. (2025). High-Dimensional attention generative adversarial network framework for underwater image enhancement. Electronics, 14.","DOI":"10.3390\/electronics14061203"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"628","DOI":"10.1109\/TIE.2014.2319216","article-title":"WPD-PCA-based laser welding process monitoring and defects diagnosis by using FNN and SVM","volume":"62","author":"You","year":"2015","journal-title":"IEEE Trans. Ind. Electron."},{"key":"ref_39","unstructured":"Islam, M.J., Luo, P., and Sattar, J. (2020). Simultaneous enhancement and super-resolution of underwater imagery for improved visual perception. arXiv."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"3227","DOI":"10.1109\/LRA.2020.2974710","article-title":"Fast underwater image enhancement for improved visual perception","volume":"5","author":"Islam","year":"2020","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"600","DOI":"10.1109\/TIP.2003.819861","article-title":"Image quality assessment: From error visibility to structural similarity","volume":"13","author":"Wang","year":"2004","journal-title":"IEEE Trans. Image Process."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"541","DOI":"10.1109\/JOE.2015.2469915","article-title":"Human-Visual-System-Inspired underwater image quality measures","volume":"41","author":"Panetta","year":"2016","journal-title":"IEEE J. Ocean. Eng."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"6062","DOI":"10.1109\/TIP.2015.2491020","article-title":"An underwater color image quality evaluation metric","volume":"24","author":"Yang","year":"2015","journal-title":"IEEE Trans. Image Process."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Lai, W.S., Huang, J.B., Ahuja, N., and Yang, M.H. (2017, January 21\u201326). Deep laplacian pyramid networks for fast and accurate super-resolution. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.618"},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1145\/3422622","article-title":"Generative adversarial networks","volume":"63","author":"Goodfellow","year":"2020","journal-title":"Commun. ACM"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"107038","DOI":"10.1016\/j.patcog.2019.107038","article-title":"Underwater scene prior inspired deep underwater image and video enhancement","volume":"98","author":"Li","year":"2019","journal-title":"Pattern Recognit."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Fabbri, C., Islam, M.J., and Sattar, J. (2018, January 21\u201325). Enhancing underwater imagery using generative adversarial networks. Proceedings of the 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, Australia.","DOI":"10.1109\/ICRA.2018.8460552"},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"862","DOI":"10.1109\/JOE.2019.2911447","article-title":"Underwater image enhancement using a multiscale dense generative adversarial network","volume":"45","author":"Guo","year":"2019","journal-title":"IEEE J. Ocean. Eng."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"4209616","DOI":"10.1109\/TGRS.2022.3205061","article-title":"Reinforced swin-convs transformer for simultaneous underwater sensing scene image enhancement and super-resolution","volume":"60","author":"Ren","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."}],"container-title":["Journal of Imaging"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2313-433X\/12\/1\/44\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,14]],"date-time":"2026-01-14T12:11:24Z","timestamp":1768392684000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2313-433X\/12\/1\/44"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,1,14]]},"references-count":49,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2026,1]]}},"alternative-id":["jimaging12010044"],"URL":"https:\/\/doi.org\/10.3390\/jimaging12010044","relation":{},"ISSN":["2313-433X"],"issn-type":[{"value":"2313-433X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,1,14]]}}}