{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T06:03:30Z","timestamp":1775282610037,"version":"3.50.1"},"reference-count":35,"publisher":"MDPI AG","issue":"14","license":[{"start":{"date-parts":[[2023,7,14]],"date-time":"2023-07-14T00:00:00Z","timestamp":1689292800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Natural Science Foundation of China","award":["52106080"],"award-info":[{"award-number":["52106080"]}]},{"name":"National Natural Science Foundation of China","award":["JJKH20230135KJ"],"award-info":[{"award-number":["JJKH20230135KJ"]}]},{"name":"National Natural Science Foundation of China","award":["BSJXM-2021116"],"award-info":[{"award-number":["BSJXM-2021116"]}]},{"name":"Jilin Provincial Department of Education Science and Technology Research Project","award":["52106080"],"award-info":[{"award-number":["52106080"]}]},{"name":"Jilin Provincial Department of Education Science and Technology Research Project","award":["JJKH20230135KJ"],"award-info":[{"award-number":["JJKH20230135KJ"]}]},{"name":"Jilin Provincial Department of Education Science and Technology Research Project","award":["BSJXM-2021116"],"award-info":[{"award-number":["BSJXM-2021116"]}]},{"DOI":"10.13039\/501100015752","name":"Northeast Electric Power University","doi-asserted-by":"publisher","award":["52106080"],"award-info":[{"award-number":["52106080"]}],"id":[{"id":"10.13039\/501100015752","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100015752","name":"Northeast Electric Power University","doi-asserted-by":"publisher","award":["JJKH20230135KJ"],"award-info":[{"award-number":["JJKH20230135KJ"]}],"id":[{"id":"10.13039\/501100015752","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100015752","name":"Northeast Electric Power University","doi-asserted-by":"publisher","award":["BSJXM-2021116"],"award-info":[{"award-number":["BSJXM-2021116"]}],"id":[{"id":"10.13039\/501100015752","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>With the continuous development of deep learning technology, object detection has received extensive attention across various computer fields as a fundamental task of computational vision. Effective detection of objects in remote sensing images is a key challenge, owing to their small size and low resolution. In this study, a remote sensing image detection (RSI-YOLO) approach based on the YOLOv5 target detection algorithm is proposed, which has been proven to be one of the most representative and effective algorithms for this task. The channel attention and spatial attention mechanisms are used to strengthen the features fused by the neural network. The multi-scale feature fusion structure of the original network based on a PANet structure is improved to a weighted bidirectional feature pyramid structure to achieve more efficient and richer feature fusion. In addition, a small object detection layer is added, and the loss function is modified to optimise the network model. The experimental results from four remote sensing image datasets, such as DOTA and NWPU-VHR 10, indicate that RSI-YOLO outperforms the original YOLO in terms of detection performance. The proposed RSI-YOLO algorithm demonstrated superior detection performance compared to other classical object detection algorithms, thus validating the effectiveness of the improvements introduced into the YOLOv5 algorithm.<\/jats:p>","DOI":"10.3390\/s23146414","type":"journal-article","created":{"date-parts":[[2023,7,17]],"date-time":"2023-07-17T01:06:36Z","timestamp":1689555996000},"page":"6414","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":48,"title":["RSI-YOLO: Object Detection Method for Remote Sensing Images Based on Improved YOLO"],"prefix":"10.3390","volume":"23","author":[{"given":"Zhuang","family":"Li","sequence":"first","affiliation":[{"name":"School of Computer Science, Northeast Electric Power University, Jilin 132012, China"}]},{"given":"Jianhui","family":"Yuan","sequence":"additional","affiliation":[{"name":"School of Computer Science, Northeast Electric Power University, Jilin 132012, China"}]},{"given":"Guixiang","family":"Li","sequence":"additional","affiliation":[{"name":"School of Computer Science, Northeast Electric Power University, Jilin 132012, China"}]},{"given":"Hao","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Computer Science, Northeast Electric Power University, Jilin 132012, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6436-4952","authenticated-orcid":false,"given":"Xingcan","family":"Li","sequence":"additional","affiliation":[{"name":"School of Energy and Power Engineering, Northeast Electric Power University, Jilin 132012, China"}]},{"given":"Dan","family":"Li","sequence":"additional","affiliation":[{"name":"School of Computer Science, Northeast Electric Power University, Jilin 132012, China"}]},{"given":"Xinhua","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Computer Science, Northeast Electric Power University, Jilin 132012, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,7,14]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Mao, M., Zhao, H., Tang, G., and Ren, J. (2023). In-Season Crop Type Detection by Combing Sentinel-1A and Sentinel-2 Imagery Based on the CNN Model. Agronomy, 13.","DOI":"10.3390\/agronomy13071723"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Cardama, F.J., Heras, D.B., and Arg\u00fcello, F. (2023). Consensus Techniques for Unsupervised Binary Change Detection Using Multi-Scale Segmentation Detectors for Land Cover Vegetation Images. Remote Sens., 15.","DOI":"10.3390\/rs15112889"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Suleymanov, A., Suleymanov, R., Kulagin, A., and Yurkevich, M. (2023). Mercury Prediction in Urban Soils by Remote Sensing and Relief Data Using Machine Learning Techniques. Remote. Sens., 15.","DOI":"10.3390\/rs15123158"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"137","DOI":"10.3390\/geomatics3010007","article-title":"Remote Sensing Image Scene Classification: Advances and Open Challenges","volume":"3","author":"Tombe","year":"2023","journal-title":"Geomatics"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"924","DOI":"10.1109\/TGRS.2018.2863224","article-title":"Learning Spectral-Spatial-Temporal Features via a Recurrent Convolutional Neural Network for Change Detection in Multispectral Imagery","volume":"2","author":"Mou","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"2335","DOI":"10.1007\/s12145-022-00885-6","article-title":"FCD-R2U-net: Forest change detection in bi-temporal satellite images using the recurrent residual-based U-net","volume":"15","author":"Khankeshizadeh","year":"2022","journal-title":"Earth Sci. Inform."},{"key":"ref_7","unstructured":"Purkait, P., Zhao, C., and Zach, C. (2017). SPP-Net: Deep absolute pose regression with synthetic views. arXiv."},{"key":"ref_8","unstructured":"Gkioxari, G., Hariharan, B., Girshick, R., and Malik, J. (2014). R-cnns for pose estimation and action detection. arXiv."},{"key":"ref_9","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015). Faster R-CNN: Towards real-time object detection with region proposal networks. Adv. Neural Inf. Process. Syst., 28."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 11\u201314). SSD: Single shot multibox detector. Proceedings of the Computer Vision\u2013ECCV 2016: 14th European Conference, Amsterdam, The Netherlands. Proceedings, Part I 14.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.Y., and Kweon, I.S. (2018, January 8\u201314). CBAM: Convolutional block attention module. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Tan, M., Pang, R., and Le, Q.V. (2020, January 14\u201319). Efficientdet: Scalable and efficient object detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01079"},{"key":"ref_13","first-page":"12993","article-title":"Distance-IoU loss: Faster and better learning for bounding box regression","volume":"34","author":"Zheng","year":"2020","journal-title":"AAAI Conf. Artif. Intell."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"146","DOI":"10.1016\/j.neucom.2022.07.042","article-title":"Focal and efficient IOU loss for accurate bounding box regression","volume":"506","author":"Zhang","year":"2022","journal-title":"Neurocomputing"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Adegun, A.A., Dombeu, J.V.F., Viriri, S., and Odindi, J. (2023). State-of-the-Art Deep Learning Methods for Objects Detection in Remote Sensing Satellite Images. Sensors, 23.","DOI":"10.3390\/s23135849"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Ma, J., Hu, Z., Shao, Q., Wang, Y., Zhou, Y., Liu, J., and Liu, S. (2022). Detection of Large Herbivores in UAV Images: A New Method for Small Target Recognition in Large-Scale Images. Diversity, 14.","DOI":"10.3390\/d14080624"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Wang, W., Shi, Y., Zhang, J., Hu, L., Li, S., He, D., and Liu, F. (2023). Traditional Village Building Extraction Based on Improved Mask R-CNN: A Case Study of Beijing, China. Remote Sens., 15.","DOI":"10.3390\/rs15102616"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Wu, Q., Feng, D., Cao, C., Zeng, X., Feng, Z., Wu, J., and Huang, Z. (2021). Improved Mask R-CNN for Aircraft Detection in Remote Sensing Images. Sensors, 21.","DOI":"10.3390\/s21082618"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Huo, B., Li, C., Zhang, J., Xue, Y., and Lin, Z. (2023). SAFF-SSD: Self-Attention Combined Feature Fusion-Based SSD for Small Object Detection in Remote Sensing. Remote Sens., 15.","DOI":"10.3390\/rs15123027"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Betti, A., and Tucci, M. (2023). YOLO-S: A Lightweight and Accurate YOLO-like Network for Small Target Detection in Aerial Imagery. Sensors, 23.","DOI":"10.3390\/s23041865"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Lai, H., Chen, L., Liu, W., Yan, Z., and Ye, S. (2023). STC-YOLO: Small Object Detection Network for Traffic Signs in Complex Environ-ments. Sensors, 23.","DOI":"10.3390\/s23115307"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Wang, J., Zhang, F., Zhang, Y., Liu, Y., and Cheng, T. (2023). Lightweight Object Detection Algorithm for UAV Aerial Imagery. Sensors, 23.","DOI":"10.3390\/s23135786"},{"key":"ref_23","first-page":"100","article-title":"Algorithm AS 136: A k-means clustering algorithm","volume":"28","author":"Hartigan","year":"1979","journal-title":"J. R. Stat. Soc. Ser. C"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","article-title":"Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition","volume":"37","author":"He","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_25","unstructured":"Ioffe, S., and Szegedy, C. (2015, January 6\u201311). Batch normalization: Accelerating deep network training by reducing internal covariate shift. Proceedings of the International Conference on Machine Learning, Lille, France."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_27","unstructured":"Wang, W., Xie, E., Song, X., Zang, Y., Wang, W., Lu, T., Yu, G., and Shen, C. (November, January 27). Efficient and accurate arbitrary-shaped text detection with pixel aggregation network. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_28","unstructured":"Zhang, Z., and Sabuncu, M. (2018). Generalized cross entropy loss for training deep neural networks with noisy labels. Adv. Neural Inf. Process. Syst., 31."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Doll\u00e1r, P., and Zitnick, C.L. (2014, January 6\u201312). Microsoft coco: Common objects in context. Proceedings of the Computer Vision\u2013ECCV 2014: 13th European Conference, Zurich, Switzerland. Proceedings, Part V 13.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"ref_30","unstructured":"Bahdanau, D., Cho, K., and Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. arXiv."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1038\/323533a0","article-title":"Learning representations by back-propagating errors","volume":"323","author":"Rumelhart","year":"1986","journal-title":"Nature"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Xia, G.-S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., Datcu, M., Pelillo, M., and Zhang, L. (2018, January 18\u201323). DOTA: A large-scale dataset for object detection in aerial images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00418"},{"key":"ref_33","first-page":"1","article-title":"Anchor-Free Oriented Proposal Generator for Object Detection","volume":"60","author":"Cheng","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"7405","DOI":"10.1109\/TGRS.2016.2601622","article-title":"Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images","volume":"54","author":"Cheng","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_35","first-page":"2671","article-title":"MAR20: A Benchmark for Military Aircraft Recognition in Remote Sensing Images","volume":"12","author":"Yu","year":"2022","journal-title":"Natl. Remote Sens. Bull."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/14\/6414\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T20:12:19Z","timestamp":1760127139000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/14\/6414"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,14]]},"references-count":35,"journal-issue":{"issue":"14","published-online":{"date-parts":[[2023,7]]}},"alternative-id":["s23146414"],"URL":"https:\/\/doi.org\/10.3390\/s23146414","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,7,14]]}}}