{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T18:01:34Z","timestamp":1774980094564,"version":"3.50.1"},"reference-count":92,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2023,1,11]],"date-time":"2023-01-11T00:00:00Z","timestamp":1673395200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["42101449"],"award-info":[{"award-number":["42101449"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["2022CFB773"],"award-info":[{"award-number":["2022CFB773"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["Y211006"],"award-info":[{"award-number":["Y211006"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Natural Science Foundation of Hubei Province, China","award":["42101449"],"award-info":[{"award-number":["42101449"]}]},{"name":"Natural Science Foundation of Hubei Province, China","award":["2022CFB773"],"award-info":[{"award-number":["2022CFB773"]}]},{"name":"Natural Science Foundation of Hubei Province, China","award":["Y211006"],"award-info":[{"award-number":["Y211006"]}]},{"name":"Key Research and Development Project of Jinzhong City, China","award":["42101449"],"award-info":[{"award-number":["42101449"]}]},{"name":"Key Research and Development Project of Jinzhong City, China","award":["2022CFB773"],"award-info":[{"award-number":["2022CFB773"]}]},{"name":"Key Research and Development Project of Jinzhong City, China","award":["Y211006"],"award-info":[{"award-number":["Y211006"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Fast and high-accuracy detection of underwater targets based on side scan sonar images has great potential for marine fisheries, underwater security, marine mapping, underwater engineering and other applications. The following problems, however, must be addressed when using low-resolution side scan sonar images for underwater target detection: (1) the detection performance is limited due to the restriction on the input of multi-scale images; (2) the widely used deep learning algorithms have a low detection effect due to their complex convolution layer structures; (3) the detection performance is limited due to insufficient model complexity in training process; and (4) the number of samples is not enough because of the bad dataset preprocessing methods. To solve these problems, an improved neural network for underwater target detection\u2014which is based on side scan sonar images and fully utilizes spatial pyramid pooling and online dataset preprocessing based on the You Look Only Once version three (YOLO V3) algorithm\u2014is proposed. The methodology of the proposed approach is as follows: (1) the AlexNet, GoogleNet, VGGNet and the ResNet networks and an adopted YOLO V3 algorithm were the backbone networks. The structure of the YOLO V3 model is more mature and compact and has higher target detection accuracy and better detection efficiency than the other models; (2) spatial pyramid pooling was added at the end of the convolution layer to improve detection performance. Spatial pyramid pooling breaks the scale restrictions when inputting images to improve feature extraction because spatial pyramid pooling enables the backbone network to learn faster at high accuracy; and (3) online dataset preprocessing based on YOLO V3 with spatial pyramid pooling increases the number of samples and improves the complexity of the model to further improve detection process performance. Three-side scan imagery datasets were used for training and were tested in experiments. The quantitative evaluation using Accuracy, Recall, Precision, mAP and F1-Score metrics indicates that: for the AlexNet, GoogleNet, VGGNet and ResNet algorithms, when spatial pyramid pooling is added to their backbone networks, the average detection accuracy of the three sets of data was improved by 2%, 4%, 2% and 2%, respectively, as compared to their original formulations. Compared with the original YOLO V3 model, the proposed ODP+YOLO V3+SPP underwater target detection algorithm model has improved detection performance through the mAP qualitative evaluation index has increased by 6%, the Precision qualitative evaluation index has increased by 13%, and the detection efficiency has increased by 9.34%. These demonstrate that adding spatial pyramid pooling and online dataset preprocessing can improve the target detection accuracy of these commonly used algorithms. The proposed, improved neural network with spatial pyramid pooling and online dataset preprocessing based on the YOLO V3 method achieves the highest scores for underwater target detection results for sunken ships, fish flocks and seafloor topography, with mAP scores of 98%, 91% and 96% for the above three kinds of datasets, respectively.<\/jats:p>","DOI":"10.3390\/rs15020440","type":"journal-article","created":{"date-parts":[[2023,1,12]],"date-time":"2023-01-12T03:11:02Z","timestamp":1673493062000},"page":"440","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":47,"title":["Improved Neural Network with Spatial Pyramid Pooling and Online Datasets Preprocessing for Underwater Target Detection Based on Side Scan Sonar Imagery"],"prefix":"10.3390","volume":"15","author":[{"given":"Jinrui","family":"Li","sequence":"first","affiliation":[{"name":"College of Resources and Environment, Yangtze University, Wuhan 430100, China"},{"name":"State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430072, China"}]},{"given":"Libin","family":"Chen","sequence":"additional","affiliation":[{"name":"College of Resources and Environment, Yangtze University, Wuhan 430100, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5561-6983","authenticated-orcid":false,"given":"Jian","family":"Shen","sequence":"additional","affiliation":[{"name":"School of Computer Science, Wuhan University, Wuhan 430072, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3035-7727","authenticated-orcid":false,"given":"Xiongwu","family":"Xiao","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430072, China"}]},{"given":"Xiaosong","family":"Liu","sequence":"additional","affiliation":[{"name":"Coal Geological Geophysical Exploration Surveying & Mapping Institute of Shanxi Province, Jinzhong 030621, China"}]},{"given":"Xin","family":"Sun","sequence":"additional","affiliation":[{"name":"Coal Geological Geophysical Exploration Surveying & Mapping Institute of Shanxi Province, Jinzhong 030621, China"}]},{"given":"Xiao","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Marine Technology and Geomatics, Jiangsu Ocean University, Lianyungang 222000, China"}]},{"given":"Deren","family":"Li","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430072, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,1,11]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"190","DOI":"10.1016\/j.isprsjprs.2021.10.010","article-title":"Balance learning for ship detection from synthetic aperture radar remote sensing imagery","volume":"182","author":"Zhang","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Siradjuddin, I.A., and Muntasa, A. (2021, January 24\u201325). Faster Region-based Convolutional Neural Network for Mask Face Detection. Proceedings of the 2021 5th International Conference on Informatics and Computational Sciences (ICICoS), Semarang, Indonesia.","DOI":"10.1109\/ICICoS53627.2021.9651744"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Zhang, T., Zhang, X., and Ke, X. (2021). Quad-FPN: A novel quad feature pyramid network for SAR ship detection. Remote Sens., 13.","DOI":"10.3390\/rs13142771"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Zhang, T., and Zhang, X. (2022). HTC+ for SAR Ship Instance Segmentation. Remote Sens., 14.","DOI":"10.3390\/rs14102395"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1049\/el:20000633","article-title":"High-speed focusing algorithm for circular synthetic aperture radar (C-SAR)","volume":"36","author":"Bara","year":"2000","journal-title":"Electron. Lett."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"12455","DOI":"10.1109\/JSTARS.2021.3131162","article-title":"H2Det: A high-speed and high-accurate ship detector in SAR images","volume":"14","author":"Zhu","year":"2021","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1016\/j.isprsjprs.2020.05.016","article-title":"HyperLi-Net: A hyper-light deep learning network for high-accurate and high-speed ship detection from synthetic aperture radar imagery","volume":"167","author":"Zhang","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_8","first-page":"1","article-title":"Balance scene learning mechanism for offshore and inshore ship detection in SAR images","volume":"19","author":"Zhang","year":"2020","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"2886","DOI":"10.1109\/JSTARS.2021.3061747","article-title":"Automatic overlapping area determination and segmentation for multiple side scan sonar images mosaic","volume":"14","author":"Shang","year":"2021","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"29416","DOI":"10.1109\/ACCESS.2021.3052206","article-title":"Optimizing the sediment classification of small side-scan sonar images based on deep learning","volume":"9","author":"Qin","year":"2021","journal-title":"IEEE Access"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Zhu, M., Song, Y., Guo, J., Feng, C., Li, G., Yan, T., and He, B. (2017, January 21\u201324). PCA and kernel-based extreme learning machine for side-scan sonar image classification. Proceedings of the 2017 IEEE Underwater Technology (UT), Busan, Republic of Korea.","DOI":"10.1109\/UT.2017.7890275"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Fallon, M.F., Kaess, M., Johannsson, H., and Leonard, J.J. (2011, January 9\u201313). Efficient AUV navigation fusing acoustic ranging and side-scan sonar. Proceedings of the 2011 IEEE International Conference on Robotics and Automation, Shanghai, China.","DOI":"10.1109\/ICRA.2011.5980302"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Wu, Z., Yang, F., and Tang, Y. (2021). Side-scan sonar and sub-bottom profiler surveying. High-Resolution Seafloor Survey and Applications, Springer.","DOI":"10.1007\/978-981-15-9750-3"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"52843","DOI":"10.1109\/ACCESS.2018.2869577","article-title":"Deep Learning Approach Combining Sparse Autoencoder With SVM for Network Intrusion Detection","volume":"6","author":"Lasheng","year":"2018","journal-title":"IEEE Access"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Bertasius, G., Shi, J., and Torresani, L. (2015, January 7\u201312). Deepedge: A multi-scale bifurcated deep network for top-down contour detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299067"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"517","DOI":"10.1016\/j.ndteint.2008.05.004","article-title":"Detection of line weld defects based on multiple thresholds and support vector machine","volume":"41","author":"Wang","year":"2008","journal-title":"NDT E Int."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1109\/TFUZZ.2013.2249072","article-title":"Fuzzy clustering with a modified MRF energy function for change detection in synthetic aperture radar images","volume":"22","author":"Gong","year":"2013","journal-title":"IEEE Trans. Fuzzy Syst."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Dzieciuch, I., Gebhardt, D., Barngrover, C., and Parikh, K. (2017, January 21\u201325). Non-Linear Convolutional Neural Network for Automatic Detection of Mine-Like Objects in Sonar Imagery. Proceedings of the International Conference on Applications in Nonlinear Dynamics, Rome, Italy.","DOI":"10.1007\/978-3-319-52621-8_27"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Rhinelander, J. (2016, January 6\u20139). Feature extraction and target classification of side-scan sonar images. Proceedings of the 2016 IEEE Symposium Series on Computational Intelligence (SSCI), Athens, Greece.","DOI":"10.1109\/SSCI.2016.7850074"},{"key":"ref_20","unstructured":"Song, Y., Zhu, Y., Li, G., Feng, C., He, B., and Yan, T. (2017, January 18\u201321). Side scan sonar segmentation using deep convolutional neural network. Proceedings of the OCEANS 2017-Anchorage, Anchorage, AK, USA."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Zhu, P., Isaacs, J., Fu, B., and Ferrari, S. (2017, January 12\u201315). Deep learning feature extraction for target recognition and classification in underwater sonar images. Proceedings of the 2017 IEEE 56th Annual Conference on Decision and Control (CDC), Melbourne, Australia.","DOI":"10.1109\/CDC.2017.8264055"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Einsidler, D., Dhanak, M., and Beaujean, P.P. (2018, January 22\u201325). A deep learning approach to target recognition in side scan sonar imagery. Proceedings of the OCEANS 2018 MTS\/IEEE Charleston, Charleston, SC, USA.","DOI":"10.1109\/OCEANS.2018.8604879"},{"key":"ref_23","first-page":"118","article-title":"The application of convolutional neural networks for automatic detection of underwater object in side scan sonar images","volume":"37","author":"Kim","year":"2018","journal-title":"J. Acoust. Soc. Korea"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Wu, M., Wang, Q., Rigall, E., Li, K., Zhu, W., and He, B. (2019). ECNet: Efficient convolutional networks for side scan sonar image segmentation. Sensors, 19.","DOI":"10.3390\/s19092009"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Wang, Q., Wu, M., Yu, F., Feng, C., Li, K., Zhu, Y., Rigall, E., and He, B. (2019). Rt-seg: A real-time semantic segmentation network for side scan sonar images. Sensors, 19.","DOI":"10.3390\/s19091985"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Yu, F., He, B., Li, K., Yan, T., Shen, Y., Wang, Q., and Wu, M. (2021). Side scan sonar images segmentation for AUV with recurrent residual convolutional neural network module and self-guidance module. Appl. Ocean Res., 113.","DOI":"10.1016\/j.apor.2021.102608"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Wang, Y., Liu, J., Yu, S., Wang, K., Han, Z., and Tang, Y. (2021, January 15\u201317). Underwater Object Detection based on YOLO-v3 network. Proceedings of the 2021 IEEE International Conference on Unmanned Systems (ICUS), Beijing, China.","DOI":"10.1109\/ICUS52573.2021.9641489"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Li, J.W., and Cao, X. (2022, January 25\u201327). Target Recognition and Detection in Side scan Sonar Images based on YOLO v3 Model. Proceedings of the 2022 41st Chinese Control Conference (CCC), Hefei, China.","DOI":"10.23919\/CCC55666.2022.9902742"},{"key":"ref_29","first-page":"349","article-title":"A seabed sediment classification model based on PSO-AlexNet","volume":"Volume 12246","author":"Jin","year":"2022","journal-title":"Proceedings of the 2nd International Conference on Signal Image Processing and Communication (ICSIPC 2022)"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Jia, X., Wei, X., Cao, X., and Foroosh, H. (2019, January 15\u201320). Comdefend: An efficient image compression model to defend adversarial examples. Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00624"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Yang, X., and Yan, J. (2020, January 23\u201328). Arbitrary-oriented object detection with circular smooth label. Proceedings of the European Conference on Computer Vision, Glasgow, UK.","DOI":"10.1007\/978-3-030-58598-3_40"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"5478","DOI":"10.1364\/OE.26.005478","article-title":"Distortion optimization for wide-angle computational cameras","volume":"26","author":"Sahin","year":"2018","journal-title":"Optics Express"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Zhang, X., Karaman, S., and Chang, S.F. (2019, January 9\u201312). Detecting and simulating artifacts in gan fake images. Proceedings of the 2019 IEEE International Workshop on Information Forensics and Security (WIFS), Delft, The Netherlands.","DOI":"10.1109\/WIFS47025.2019.9035107"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"430","DOI":"10.1109\/TIP.2005.859378","article-title":"Image information and visual quality","volume":"15","author":"Sheikh","year":"2006","journal-title":"IEEE Trans. Image Process."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1109\/TCSVT.2018.2886771","article-title":"Blind image quality assessment using a deep bilinear convolutional neural network","volume":"30","author":"Zhang","year":"2018","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_36","unstructured":"Canziani, A., Paszke, A., and Culurciello, E. (2016). An analysis of deep neural network models for practical applications. arXiv."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Zhou, Z., Cui, Z., Zang, Z., Meng, X., Cao, Z., and Yang, J. (2022). UltraHi-PrNet: An Ultra-High Precision Deep Learning Network for Dense Multi-Scale Target Detection in SAR Images. Remote Sens., 14.","DOI":"10.3390\/rs14215596"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Han, B., Hu, Z., Su, Z., Bai, X., Yin, S., Luo, J., and Zhao, Y. (2022). Mask_LaC R-CNN for measuring morphological features of fish. Measurement, 203.","DOI":"10.1016\/j.measurement.2022.111859"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Wang, H., Shi, Y., Yue, Y., and Zhao, H. (2020, January 24\u201326). Study on freshwater fish image recognition integrating SPP and DenseNet network. Proceedings of the 2020 IEEE International Conference on Mechatronics and Automation (ICMA), Osaka, Japan.","DOI":"10.1109\/ICMA49215.2020.9233696"},{"key":"ref_40","first-page":"105","article-title":"Dock detection method in remote sensing images based on improved YOLOv4","volume":"Volume 12342","author":"Guo","year":"2022","journal-title":"Proceedings of the Fourteenth International Conference on Digital Image Processing (ICDIP 2022)"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"1847","DOI":"10.1007\/s40747-021-00322-z","article-title":"Remote sensing image building detection method based on Mask R-CNN","volume":"8","author":"Han","year":"2022","journal-title":"Complex Intell. Syst."},{"key":"ref_42","first-page":"3","article-title":"Tiny imagenet visual recognition challenge","volume":"7","author":"Le","year":"2015","journal-title":"CS 231n"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Li, Y., Zhang, X., and Shen, Z. (2022). YOLO-Submarine Cable: An Improved YOLO-V3 Network for Object Detection on Submarine Cable Images. J. Mar. Sci. Eng., 10.","DOI":"10.3390\/jmse10081143"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Ju, M., Luo, H., Wang, Z., Hui, B., and Chang, Z. (2019). The application of improved YOLO V3 in multi-scale target detection. Appl. Sci., 9.","DOI":"10.3390\/app9183775"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Xu, D., and Wu, Y. (2020). Improved YOLO-V3 with DenseNet for multi-scale remote sensing target detection. Sensors, 20.","DOI":"10.3390\/s20154276"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"2309","DOI":"10.1080\/01431160600606890","article-title":"A comparison study on fusion methods using evaluation indicators","volume":"28","author":"Karathanassi","year":"2007","journal-title":"Int. J. Remote Sens."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"2834","DOI":"10.1109\/JSTARS.2018.2836909","article-title":"SAR targets classification based on deep memory convolution neural networks and transfer parameters","volume":"11","author":"Shang","year":"2018","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Pang, Y., Zhao, X., Zhang, L., and Lu, H. (2020, January 13\u201319). Multi-scale interactive network for salient object detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00943"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Sriram, S., Vinayakumar, R., Sowmya, V., Aamoun, A., and Soman, K. (2020, January 6\u20139). Multi-scale learning based malware variant detection using spatial pyramid pooling network. Proceedings of the IEEE INFOCOM 2020-IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Toronto, ON, Canada.","DOI":"10.1109\/INFOCOMWKSHPS50562.2020.9162661"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Li, S., Yuan, S., Liu, S., Wen, J., Huang, Q., and Zhang, Z. (2021). Characteristics of Low-Frequency Acoustic Wave Propagation in Ice-Covered Shallow Water Environment. Appl. Sci., 11.","DOI":"10.3390\/app11177815"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/S0022-0981(96)02636-6","article-title":"Effects of water flow and branch spacing on particle capture by the reef coral Madracis mirabilis (Duchassaing and Michelotti)","volume":"211","author":"Sebens","year":"1997","journal-title":"J. Exp. Mar. Biol. Ecol."},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Wang, X., Wang, L., Li, G., and Xie, X. (2021). A Robust and Fast Method for Sidescan Sonar Image Segmentation Based on Region Growing. Sensors, 21.","DOI":"10.3390\/s21216960"},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Ying, X. (2019). An overview of overfitting and its solutions. J. Phys. Conf. Ser., 1168.","DOI":"10.1088\/1742-6596\/1168\/2\/022022"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Pecorelli, F., Di Nucci, D., De Roover, C., and De Lucia, A. (2020). A large empirical assessment of the role of data balancing in machine-learning-based code smell detection. J. Syst. Softw., 169.","DOI":"10.1016\/j.jss.2020.110693"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Osetsky, Y., Barashev, A.V., and Zhang, Y. (2021). Sluggish, chemical bias and percolation phenomena in atomic transport by vacancy and interstitial diffusion in NiFe alloys. Curr. Opin. Solid State Mater. Sci., 25.","DOI":"10.1016\/j.cossms.2021.100961"},{"key":"ref_56","unstructured":"Paszke, A., Gross, S., Massa, F., Lerer, A., and Chintala, S. (2019). Pytorch: An Imperative Style, High-Performance Deep Learning Library. Advances in Neural Information Processing Systems, MIT Press."},{"key":"ref_57","unstructured":"Moura, P., Crocker, P., and Nunes, P. (2008, January 7\u20138). High-level multi-threading programming in logtalk. Proceedings of the International Symposium on Practical Aspects of Declarative Languages, San Francisco, CA, USA."},{"key":"ref_58","unstructured":"He, K., Girshick, R., and Doll\u00e1r, P. (November, January 27). Rethinking imagenet pre-training. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Jaikrishnan, S.V.J., Chantarakasemchit, O., and Meesad, P. (2019, January 10\u201311). A breakup machine learning approach for breast cancer prediction. Proceedings of the 2019 11th International Conference on Information Technology and Electrical Engineering (ICITEE), Pattaya, Thailand.","DOI":"10.1109\/ICITEED.2019.8929977"},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Chu, H., Xiong, X., Gao, Y.J., Luo, J., and Jing, H. (2021). Diffuse reflection and reciprocity-protected transmission via a random-flip metasurface. Sci. Adv., 7.","DOI":"10.1126\/sciadv.abj0935"},{"key":"ref_61","doi-asserted-by":"crossref","unstructured":"Li, Z., Peng, C., Yu, G., Zhang, X., Deng, Y., and Sun, J. (2018). Detnet: A backbone network for object detection. arXiv.","DOI":"10.1007\/978-3-030-01240-3_21"},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Albawi, S., Mohammed, T.A., and Al-Zawi, S. (2017, January 21\u201323). Understanding of a convolutional neural network. Proceedings of the 2017 International Conference on Engineering and Technology (ICET), Antalya, Turkey.","DOI":"10.1109\/ICEngTechnol.2017.8308186"},{"key":"ref_63","doi-asserted-by":"crossref","first-page":"1396","DOI":"10.1109\/TVCG.2020.3030418","article-title":"CNN explainer: Learning convolutional neural networks with interactive visualization","volume":"27","author":"Wang","year":"2020","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_64","unstructured":"O\u2019Shea, K., and Nash, R. (2015). An introduction to convolutional neural networks. arXiv."},{"key":"ref_65","first-page":"1","article-title":"HOG-ShipCLSNet: A novel deep learning network with hog feature fusion for SAR ship classification","volume":"60","author":"Zhang","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_66","doi-asserted-by":"crossref","unstructured":"Wang, Y., Wang, C., and Zhang, H. (2018). Ship classification in high-resolution SAR images using deep learning of small datasets. Sensors, 18.","DOI":"10.3390\/s18092929"},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Nguyen, D.P.T., Matsuo, Y., and Ishizuka, M. (2007, January 6\u201312). Exploiting syntactic and semantic information for relation extraction from wikipedia. Proceedings of the IJCAI Workshop on Text-Mining & Link-Analysis (TextLink 2007), Hyderabad, India.","DOI":"10.3115\/1614108.1614140"},{"key":"ref_68","unstructured":"Sacramento, J., Ponte Costa, R., Bengio, Y., and Senn, W. (2018). Dendritic cortical microcircuits approximate the backpropagation algorithm. Advances in Neural Information Processing Systems, MIT Press."},{"key":"ref_69","unstructured":"Alom, M.Z., Taha, T.M., Yakopcic, C., Westberg, S., and Asari, V.K. (2018). The history began from alexnet: A comprehensive survey on deep learning approaches. arXiv."},{"key":"ref_70","doi-asserted-by":"crossref","unstructured":"Alippi, C., Disabato, S., and Roveri, M. (2018, January 11\u201313). Moving convolutional neural networks to embedded systems: The alexnet and VGG-16 case. Proceedings of the 2018 17th ACM\/IEEE International Conference on Information Processing in Sensor Networks (IPSN), Porto, Portugal.","DOI":"10.1109\/IPSN.2018.00049"},{"key":"ref_71","first-page":"65","article-title":"Feature extraction and image retrieval based on AlexNet","volume":"Volume 10033","author":"Yuan","year":"2016","journal-title":"Proceedings of the Eighth International Conference on Digital Image Processing (ICDIP 2016)"},{"key":"ref_72","doi-asserted-by":"crossref","unstructured":"Ballester, P., and Araujo, R.M. (2016, January 12\u201317). On the performance of GoogLeNet and AlexNet applied to sketches. Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, Phoenix, AZ, USA.","DOI":"10.1609\/aaai.v30i1.10171"},{"key":"ref_73","doi-asserted-by":"crossref","unstructured":"Anand, R., Shanthi, T., Nithish, M.S., and Lakshman, S. (2020). Face recognition and classification using GoogleNET architecture. Soft Computing for Problem Solving, Springer.","DOI":"10.1007\/978-981-15-0035-0_20"},{"key":"ref_74","doi-asserted-by":"crossref","unstructured":"Salavati, P., and Mohammadi, H.M. (2018, January 25\u201326). Obstacle detection using GoogleNet. Proceedings of the 2018 8th International Conference on Computer and Knowledge Engineering (ICCKE), Mashhad, Iran.","DOI":"10.1109\/ICCKE.2018.8566315"},{"key":"ref_75","doi-asserted-by":"crossref","unstructured":"Muhammad, U., Wang, W., Chattha, S.P., and Ali, S. (2018, January 20\u201324). Pre-trained VGGNet architecture for remote-sensing image scene classification. Proceedings of the 2018 24th International Conference on Pattern Recognition (ICPR), Beijing, China.","DOI":"10.1109\/ICPR.2018.8545591"},{"key":"ref_76","doi-asserted-by":"crossref","unstructured":"Sathish, K., Ramasubbareddy, S., and Govinda, K. (2020). Detection and localization of multiple objects using VGGNet and single shot detection. Emerging Research in Data Engineering Systems and Computer Communications, Springer.","DOI":"10.1007\/978-981-15-0135-7_40"},{"key":"ref_77","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/j.patcog.2019.01.006","article-title":"Wider or deeper: Revisiting the ResNet model for visual recognition","volume":"90","author":"Wu","year":"2019","journal-title":"Pattern Recognit."},{"key":"ref_78","unstructured":"Li, S., Jiao, J., Han, Y., and Weissman, T. (2016). Demystifying ResNet. arXiv."},{"key":"ref_79","doi-asserted-by":"crossref","unstructured":"Chen, Z., Xie, Z., Zhang, W., and Xu, X. (2017, January 20\u201324). ResNet and Model Fusion for Automatic Spoofing Detection. Proceedings of the Interspeech, Stockholm, Sweden.","DOI":"10.21437\/Interspeech.2017-1085"},{"key":"ref_80","doi-asserted-by":"crossref","unstructured":"Khan, R.U., Zhang, X., Kumar, R., Aboagye, E.O., and Kumar, R. (2018, January 12\u201314). Evaluating the performance of ResNet model based on image recognition. Proceedings of the 2018 International Conference on Computing and Artificial Intelligence, Chengdu, China.","DOI":"10.1145\/3194452.3194461"},{"key":"ref_81","doi-asserted-by":"crossref","first-page":"5349","DOI":"10.1109\/TNNLS.2020.2966319","article-title":"Why ResNet works? Residuals generalize","volume":"31","author":"He","year":"2020","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_82","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv."},{"key":"ref_83","doi-asserted-by":"crossref","unstructured":"Zhao, L., and Li, S. (2020). Object detection algorithm based on improved YOLOV3. Electronics, 9.","DOI":"10.3390\/electronics9030537"},{"key":"ref_84","doi-asserted-by":"crossref","unstructured":"Won, J.H., Lee, D.H., Lee, K.M., and Lin, C.H. (2019, January 23\u201326). An improved YOLOv3-based neural network for de-identification technology. Proceedings of the 2019 34th International Technical Conference on Circuits\/Systems, Computers and Communications (ITC-CSCC), Jeju, Republic of Korea.","DOI":"10.1109\/ITC-CSCC.2019.8793382"},{"key":"ref_85","unstructured":"Lang, P., Fu, X., Martorella, M., Dong, J., and Xie, M. (2020). A comprehensive survey of machine learning applied to radar signal processing. arXiv."},{"key":"ref_86","first-page":"85","article-title":"Comparison of CNN and YOLO for Object Detection","volume":"19","author":"Lee","year":"2020","journal-title":"J. Semicond. Disp. Technol."},{"key":"ref_87","doi-asserted-by":"crossref","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","article-title":"Spatial pyramid pooling in deep convolutional networks for visual recognition","volume":"37","author":"He","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_88","doi-asserted-by":"crossref","first-page":"875","DOI":"10.1080\/2150704X.2016.1193793","article-title":"A deep learning framework for hyperspectral image classification using spatial pyramid pooling","volume":"7","author":"Yue","year":"2016","journal-title":"Remote Sens. Lett."},{"key":"ref_89","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1016\/j.ins.2020.02.067","article-title":"DC-SPP-YOLO: Dense connection and spatial pyramid pooling based YOLO for object detection","volume":"522","author":"Huang","year":"2020","journal-title":"Inf. Sci."},{"key":"ref_90","doi-asserted-by":"crossref","first-page":"128774","DOI":"10.1109\/ACCESS.2019.2940527","article-title":"Automatic building extraction on high-resolution remote sensing imagery using deep convolutional encoder-decoder with spatial pyramid pooling","volume":"7","author":"Liu","year":"2019","journal-title":"IEEE Access"},{"key":"ref_91","doi-asserted-by":"crossref","first-page":"142","DOI":"10.1080\/21642583.2020.1824132","article-title":"An improved YOLOv3 model based on skipping connections and spatial pyramid pooling","volume":"9","author":"Zhang","year":"2021","journal-title":"Syst. Sci. Control Eng."},{"key":"ref_92","doi-asserted-by":"crossref","first-page":"14881","DOI":"10.1007\/s00521-022-07264-8","article-title":"Refined marine object detector with attention-based spatial pyramid pooling networks and bidirectional feature fusion strategy","volume":"34","author":"Xu","year":"2022","journal-title":"Neural Comput. Appl."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/2\/440\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T18:03:19Z","timestamp":1760119399000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/2\/440"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,1,11]]},"references-count":92,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2023,1]]}},"alternative-id":["rs15020440"],"URL":"https:\/\/doi.org\/10.3390\/rs15020440","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,1,11]]}}}