{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,21]],"date-time":"2026-01-21T08:26:12Z","timestamp":1768983972300,"version":"3.49.0"},"reference-count":44,"publisher":"MDPI AG","issue":"21","license":[{"start":{"date-parts":[[2021,10,29]],"date-time":"2021-10-29T00:00:00Z","timestamp":1635465600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Key Research and Development Program of Zhejiang Province","award":["2020C03098"],"award-info":[{"award-number":["2020C03098"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Underwater vision-based detection plays an increasingly important role in underwater security, ocean exploration and other fields. Due to the absorption and scattering effects of water on light, as well as the movement of the carrier, underwater images generally have problems such as noise pollution, color cast and motion blur, which seriously affect the performance of underwater vision-based detection. To address these problems, this study proposes an end-to-end marine organism detection framework that can jointly optimize the image enhancement and object detection. The framework uses a two-stage detection network with dynamic intersection over union (IoU) threshold as the backbone and adds an underwater image enhancement module (UIEM) composed of denoising, color correction and deblurring sub-modules to greatly improve the framework\u2019s ability to deal with severely degraded underwater images. Meanwhile, a self-built dataset is introduced to pre-train the UIEM, so that the training of the entire framework can be performed end-to-end. The experimental results show that compared with the existing end-to-end models applied to marine organism detection, the detection precision of the proposed framework can improve by at least 6%, and the detection speed has not been significantly reduced, so that it can complete the high-precision real-time detection of marine organisms.<\/jats:p>","DOI":"10.3390\/s21217205","type":"journal-article","created":{"date-parts":[[2021,11,1]],"date-time":"2021-11-01T22:24:22Z","timestamp":1635805462000},"page":"7205","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":28,"title":["A Marine Organism Detection Framework Based on the Joint Optimization of Image Enhancement and Object Detection"],"prefix":"10.3390","volume":"21","author":[{"given":"Xueting","family":"Zhang","sequence":"first","affiliation":[{"name":"School of Mechanical Engineering, Hangzhou Dianzi University, Hangzhou 310018, China"},{"name":"Ocean Technology and Equipment Research Center, Hangzhou Dianzi University, Hangzhou 310018, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1322-1366","authenticated-orcid":false,"given":"Xiaohai","family":"Fang","sequence":"additional","affiliation":[{"name":"School of Electronics and Information, Hangzhou Dianzi University, Hangzhou 310018, China"}]},{"given":"Mian","family":"Pan","sequence":"additional","affiliation":[{"name":"School of Electronics and Information, Hangzhou Dianzi University, Hangzhou 310018, China"},{"name":"Ocean College, Zhejiang University, Zhoushan 316021, China"}]},{"given":"Luhua","family":"Yuan","sequence":"additional","affiliation":[{"name":"Shandong Radio Monitoring Station, Jinan 250013, China"}]},{"given":"Yaxin","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Electronics and Information, Hangzhou Dianzi University, Hangzhou 310018, China"}]},{"given":"Mengyi","family":"Yuan","sequence":"additional","affiliation":[{"name":"School of Electronics and Information, Hangzhou Dianzi University, Hangzhou 310018, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9822-9959","authenticated-orcid":false,"given":"Shuaishuai","family":"Lv","sequence":"additional","affiliation":[{"name":"School of Electronics and Information, Hangzhou Dianzi University, Hangzhou 310018, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0900-5276","authenticated-orcid":false,"given":"Haibin","family":"Yu","sequence":"additional","affiliation":[{"name":"School of Electronics and Information, Hangzhou Dianzi University, Hangzhou 310018, China"}]}],"member":"1968","published-online":{"date-parts":[[2021,10,29]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Teng, B., and Zhao, H. (2020). Underwater target recognition methods based on the framework of deep learning: A survey. Int. J. Adv. Robot. Syst., 17.","DOI":"10.1177\/1729881420976307"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Dobeck, J.G., Hyland, J.C., and Smedley, L. (1997, January 21\u201322). Automated detection and classification of sea mines in sonar imagery. Proceedings of the SPIE\u2014The International Society for Optical Engineering, Orlando, FL, USA.","DOI":"10.1117\/12.280846"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"441","DOI":"10.1177\/02783649030227002","article-title":"Path planning for robotic demining: Robust sensor-based coverage of unstructured environments and probabilistic methods","volume":"22","author":"Acar","year":"2003","journal-title":"Int. J. Robot. Res."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Cao, X., Zhang, X., Yu, Y., and Niu, L. (2016, January 16\u201318). Deep Learning-Based Recognition of Underwater Target. Proceedings of the IEEE International Conference on Digital Signal Processing (DSP), Beijing, China.","DOI":"10.1109\/ICDSP.2016.7868522"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"5470","DOI":"10.1109\/JSTARS.2021.3080919","article-title":"An Unmixing-Based Network for Underwater Target Detection from Hyperspectral Imagery","volume":"14","author":"Qi","year":"2021","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"39273","DOI":"10.1109\/ACCESS.2020.2976121","article-title":"Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning","volume":"8","author":"Cai","year":"2020","journal-title":"IEEE Access"},{"key":"ref_7","unstructured":"Rova, A., Mori, G., and Dill, L.M. (2007, January 16\u201318). One fish, two fish, butterfish, trumpeter: Recognizing fish in underwater video. Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2007), Tokyo, Japan."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"61535","DOI":"10.1109\/ACCESS.2018.2876336","article-title":"A Biological Sensor System Using Computer Vision for Water Quality Monitoring","volume":"6","author":"Yuan","year":"2018","journal-title":"IEEE Access"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"480","DOI":"10.1890\/0012-9658(2002)083[0480:FAATSP]2.0.CO;2","article-title":"Food availability and tiger shark predation risk influence bottlenose dolphin habitat use","volume":"83","author":"Heithaus","year":"2002","journal-title":"Ecology"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"126691","DOI":"10.1016\/j.optcom.2020.126691","article-title":"Underwater image recovery method based on hyperspectral polarization imaging","volume":"484","author":"Qian","year":"2021","journal-title":"Opt. Commun."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Mathur, M., and Goel, N. (2021). Enhancement algorithm for high visibility of underwater images. IET Image Process., 1\u201316.","DOI":"10.1049\/ipr2.12210"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Han, Y., Huang, L., Hong, Z., Cao, S., Zhang, Y., and Wang, J. (2021). Deep Supervised Residual Dense Network for Underwater Image Enhancement. Sensors, 21.","DOI":"10.3390\/s21093289"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"2591","DOI":"10.1049\/ipr2.12247","article-title":"Underwater image enhancement based on colour correction and fusion","volume":"15","author":"Zhu","year":"2021","journal-title":"IET Image Process."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s11042-021-11095-5","article-title":"A new underwater image enhancement algorithm based on adaptive feedback and Retinex algorithm","volume":"80","author":"Tang","year":"2021","journal-title":"Multimedia Tools Appl."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"116174","DOI":"10.1016\/j.image.2021.116174","article-title":"Color correction and restoration based on multi-scale recursive network for underwater optical image","volume":"93","author":"Huang","year":"2021","journal-title":"Signal Process. Image Commun."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Shanmugasundaram, M., Sukumaran, S., and Shanmugavadivu, N. (2013, January 22\u201325). Fusion based denoise-engine for underwater images using curvelet transform. Proceedings of the 2013 International Conference on Advances in Computing, Communications and Informatics (ICACCI), Mysore, India.","DOI":"10.1109\/ICACCI.2013.6637303"},{"key":"ref_17","unstructured":"Wang, Z., Zheng, B., and Tian, W. (2013, January 23\u201327). New approach for underwater image denoise combining inhomogeneous illumination and dark channel prior. Proceedings of the MTS\/IEEE Oceans Conference, San Diego, CA, USA."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"266","DOI":"10.1109\/83.902291","article-title":"Active Contours without Edges","volume":"10","author":"Chan","year":"2001","journal-title":"IEEE Trans. Image Process."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Deng, J., and Dong, W. (2009, January 20\u201325). ImageNet: A Large-Scale Hierarchical Image Database. Proceedings of the IEEE-Computer-Society Conference on Computer Vision and Pattern Recognition Workshops, Miami Beach, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Patterson, G., and Hays, J. (2016, January 11\u201314). COCO Attributes: Attributes for People, Animals, and Objects. Proceedings of the 14th European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46466-4_6"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1239","DOI":"10.1109\/TPAMI.2019.2950923","article-title":"Effects of Image Degradation and Degradation Removal to CNN-Based Image Classification","volume":"43","author":"Pei","year":"2021","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_22","unstructured":"Bazeille, S., Quidu, I., and Jaulin, L. (2007, January 18\u201319). Identification of Underwater Man-Made Object Using a Colour Criterion. Proceedings of the Conference on detection and classification of underwater targets, Edinburgh, UK."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1007\/s11263-005-4635-4","article-title":"A statistical approach to texture classification from single images","volume":"62","author":"Varma","year":"2005","journal-title":"Int. J. Comput. Vis."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/j.ecoinf.2013.10.002","article-title":"Real-world underwater fish recognition and identification, using sparse representation","volume":"23","author":"Hsiao","year":"2014","journal-title":"Ecol. Inform."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Beijbom, O., Edmunds, P.J., Kline, D.I., Mitchell, B.G., and Kriegman, D. (2012, January 16\u201321). Automated Annotation of Coral Reef Survey Images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Providence, RI, USA.","DOI":"10.1109\/CVPR.2012.6247798"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"105339","DOI":"10.1016\/j.compag.2020.105339","article-title":"Real-time robust detector for underwater live crabs based on deep learning","volume":"172","author":"Cao","year":"2020","journal-title":"Comput. Electron. Agric."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Sung, M., Yu, S.-C., and Girdhar, Y. (2017, January 16\u201321). Vision based Real-time Fish Detection Using Convolutional Neural Network. Proceedings of the Oceans Aberdeen Conference, Aberdeen, UK.","DOI":"10.1109\/OCEANSE.2017.8084889"},{"key":"ref_28","first-page":"1","article-title":"Marine Organism Detection and Classification from Underwater Vision Based on the Deep CNN Method","volume":"2020","author":"Han","year":"2020","journal-title":"Math. Probl. Eng."},{"key":"ref_29","first-page":"1","article-title":"Underwater Image Processing and Object Detection Based on Deep CNN Method","volume":"2020","author":"Han","year":"2020","journal-title":"J. Sensors"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Madhan, E.S., Kannan, K.S., Rani, P.S., Rani, J.V., and Anguraj, D.K. (2021). A distributed submerged object detection and classification enhancement with deep learning. Distrib. Parallel Databases, 1\u201317.","DOI":"10.1007\/s10619-021-07342-1"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Li, X., Yang, Z., Shang, M., and Hao, J. (2016, January 15\u201316). Underwater Image Enhancement via Dark Channel Prior and Luminance Adjustment. Proceedings of the Oceans Conference, Shanghai, China.","DOI":"10.1109\/OCEANSAP.2016.7485625"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Fan, B., Chen, W., Cong, Y., and Tian, J. (2020, January 23\u201328). Dual refinement underwater object detection network. Proceedings of the Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK. Part 20.","DOI":"10.1007\/978-3-030-58565-5_17"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Zhou, H., Huang, H., Yang, X., Zhang, L., and Qi, L. (2017, January 27\u201329). Faster R-CNN for Marine Organism Detection and Recognition Using Data Augmentation. Proceedings of the International Conference on Video and Image Processing (ICVIP), Singapore.","DOI":"10.1145\/3177404.3177433"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"He, K., and Sun, J. (2015, January 7\u201312). Convolutional Neural Networks at Constrained Time Cost. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299173"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Dollar, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature Pyramid Networks for Object Detection. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_36","unstructured":"Srivastava, K.R., Greff, K., and Schmidhuber, J. (2015). Highway networks. arXiv."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Liu, S., Qi, L., Qin, H., Shi, J., and Jia, J. (2018, January 18\u201322). Path Aggregation Network for Instance Segmentation. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Johnson, J., Alahi, A., and Li, F.-F. (2016, January 11\u201314). Perceptual Losses for Real-Time Style Transfer and Super-Resolution. Proceedings of the 14th European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46475-6_43"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Nah, S., Hyun Kim, T., and Mu Lee, K. (2016, January 21\u201326). Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring. Proceedings of the 30th IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.35"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Su, S., Delbracio, M., Wang, J., Sapiro, G., Heidrich, W., and Wang, O. (2017, January 21\u201326). Deep video deblurring for hand-held cameras. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.33"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 11\u201314). SSD: Single Shot MultiBox Detector. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_42","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv."},{"key":"ref_43","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015, January 7\u201312). Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. Proceedings of the 29th Annual Conference on Neural Information Processing Systems (NIPS), Montreal, QC, Canada."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Cai, Z., and Vasconcelos, N. (2018, January 18\u201322). Cascade R-CNN: Delving into High Quality Object Detection. Proceedings of the 31st IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00644"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/21\/7205\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:22:58Z","timestamp":1760167378000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/21\/7205"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,29]]},"references-count":44,"journal-issue":{"issue":"21","published-online":{"date-parts":[[2021,11]]}},"alternative-id":["s21217205"],"URL":"https:\/\/doi.org\/10.3390\/s21217205","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,10,29]]}}}