{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T20:36:19Z","timestamp":1771014979518,"version":"3.50.1"},"reference-count":42,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2025,8,23]],"date-time":"2025-08-23T00:00:00Z","timestamp":1755907200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Key R &amp; D and achievement transformation in Inner Mongolia Autonomous Region","award":["2023YFDZ0060"],"award-info":[{"award-number":["2023YFDZ0060"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>(1) Objective: This study is aiming at the key problems, such as insufficient detection accuracy of small targets and complex background interference in remote-sensing image target detection; (2) Methods: by optimizing the YOLOv10x model architecture, the YOLO-KRM model is proposed. Firstly, a new backbone network structure is constructed. By replacing the C2f of the third layer of the backbone network with the Kolmogorov\u2013Arnold network, the approximation ability of the model to complete complex nonlinear functions in high-dimensional space is improved. Then, the C2f of the fifth layer of the backbone network is replaced by the receptive field attention convolution, which enhances the model\u2019s ability to capture the global context information of the features. In addition, the C2f and C2fCIB structures in the upsampling operation in the neck network are replaced by the hybrid local channel attention mechanism module, which significantly improves the feature representation ability of the model. Results: In order to validate the effectiveness of the YOLO-KRM model, detailed experiments were conducted on two remote-sensing datasets, RSOD and NWPU VHR-10. The experimental results show that, compared with the original model YOLOv10x, the mAP@50 of the YOLO-KRM model on the two datasets is increased by 1.77% and 2.75%, respectively, and the mAP @ 50:95 index is increased by 3.82% and 5.23%, respectively; (3) Results: by improving the model, the accuracy of target detection in remote-sensing images is successfully enhanced. The experimental results verify the effectiveness of the model in dealing with complex backgrounds and small targets, especially in high-resolution remote-sensing images.<\/jats:p>","DOI":"10.3390\/a18090537","type":"journal-article","created":{"date-parts":[[2025,8,25]],"date-time":"2025-08-25T00:09:53Z","timestamp":1756080593000},"page":"537","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Optimization of Object Detection Network Architecture for High-Resolution Remote Sensing"],"prefix":"10.3390","volume":"18","author":[{"given":"Hongyan","family":"Shi","sequence":"first","affiliation":[{"name":"College of Engineering, Inner Mongolia Minzu University, Tongliao 028000, China"}]},{"given":"Xiaofeng","family":"Bai","sequence":"additional","affiliation":[{"name":"School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China"}]},{"given":"Chenshuai","family":"Bai","sequence":"additional","affiliation":[{"name":"School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China"}]}],"member":"1968","published-online":{"date-parts":[[2025,8,23]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Gui, S., Song, S., Qin, R., and Tang, Y. (2024). Remote sensing object detection in the deep learning era\u2014A review. Remote Sens., 16.","DOI":"10.3390\/rs16020327"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Wang, L., Zhang, M., Gao, X., and Shi, W. (2024). Advances and challenges in deep learning-based change detection for remote sensing images: A review through various learning paradigms. Remote Sens., 16.","DOI":"10.3390\/rs16050804"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"6283","DOI":"10.1007\/s00521-024-09422-6","article-title":"A review of small object detection based on deep learning","volume":"36","author":"Wei","year":"2024","journal-title":"Neural Comput. Appl."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1199","DOI":"10.1016\/j.eswa.2023.119960","article-title":"A comprehensive survey of oriented object detection in remote sensing images","volume":"224","author":"Wen","year":"2023","journal-title":"Expert Syst. Appl."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s10462-025-11150-9","article-title":"A survey of small object detection based on deep learning in aerial images","volume":"58","author":"Hua","year":"2025","journal-title":"Artif. Intell. Rev."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Wang, X., Wang, A., Yi, J., Song, Y., and Chehri, A. (2023). Small object detection based on deep learning for remote sensing: A comprehensive review. Remote Sens., 15.","DOI":"10.3390\/rs15133265"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"116793","DOI":"10.1016\/j.eswa.2022.116793","article-title":"Remote sensing image super-resolution and object detection: Benchmark and state of the art","volume":"197","author":"Wang","year":"2022","journal-title":"Expert Syst. Appl."},{"key":"ref_8","first-page":"723","article-title":"Imagenet classification with deep convolutional neural networks","volume":"25","author":"Krizhevsky","year":"2012","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_9","first-page":"235","article-title":"SOD-YOLOv10: Small Object Detection in Remote Sensing Images Based on YOLOv10","volume":"18","author":"Sun","year":"2025","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_10","first-page":"1","article-title":"A DINO remote sensing target detection algorithm combining efficient hybrid encoder and structural reparameterization","volume":"3","author":"Zhang","year":"2025","journal-title":"J. Beijing Univ. Aeronaut. Astronaut."},{"key":"ref_11","first-page":"233","article-title":"A Context-Aware Multiple Receptive Field Fusion Network for Oriented Object Detection in Remote Sensing Images","volume":"47","author":"Yao","year":"2025","journal-title":"J. Electron. Inf. Technol."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Liu, X., Gong, W., Shang, L., Li, X., and Gong, Z. (2023). Remote Sensing Image Target Detection and Recognition Based on YOLOv5. Remote Sens., 15.","DOI":"10.3390\/rs15184459"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Sohan, M., Sai Ram, T., and Rami Reddy, C.V. (2024). A Review on yolov8 and Its Advancements. International Conference on Data Intelligence and Cognitive Informatics, Springer.","DOI":"10.1007\/978-981-99-7962-2_39"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Wang, C.Y., Yeh, I.H., and Mark Liao, H.Y. (2024). Yolov9: Learning what you want to learn using programmable gradient information. European Conference on Computer Vision, Springer Nature.","DOI":"10.1007\/978-3-031-72751-1_1"},{"key":"ref_15","unstructured":"Wang, A., Chen, H., Liu, L., Chen, K., Lin, Z., and Han, J. (2024). Yolov10: Real-time end-to-end object detection. arXiv."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"100049","DOI":"10.1016\/j.geomat.2025.100049","article-title":"LW-UAV\u2013YOLOv10: A lightweight model for small UAV detection on infrared data based on YOLOv10","volume":"77","author":"Nguyen","year":"2025","journal-title":"Geomatica"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"155654","DOI":"10.1109\/ACCESS.2024.3479320","article-title":"YOLO-remote: An object detection algorithm for remote sensing targets","volume":"12","author":"Fan","year":"2024","journal-title":"IEEE Access"},{"key":"ref_18","first-page":"3045","article-title":"Progress of research on deep learning algorithms for object detection in optical remote sensing images","volume":"28","author":"Xu","year":"2024","journal-title":"Natl. Remote Sens. Bull."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"353","DOI":"10.1016\/j.isprsjprs.2022.12.004","article-title":"Generalized few-shot object detection in remote sensing images","volume":"195","author":"Zhang","year":"2023","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Jiang, H., Peng, M., Zhong, Y., Xie, H., Hap, Z., Lin, J., and Ma, X. (2022). A survey on deep learning-based change detection from high-resolution remote sensing images. Remote Sens., 14.","DOI":"10.3390\/rs14071552"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Qu, J., Tang, Z., Zhang, L., Zhang, Y., and Zhang, Z. (2023). Remote sensing small object detection network based on attention mechanism and multi-scale feature fusion. Remote Sens., 15.","DOI":"10.3390\/rs15112728"},{"key":"ref_22","first-page":"105054","article-title":"Starting from the structure: A review of small object detection based on deep learning","volume":"10","author":"Zheng","year":"2024","journal-title":"Image Vis. Comput."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"5521","DOI":"10.1007\/s12145-024-01447-8","article-title":"MSCANet: A multi-scale context-aware network for remote sensing object detection","volume":"17","author":"Zhou","year":"2024","journal-title":"Earth Sci. Inform."},{"key":"ref_24","first-page":"10324","article-title":"MFCANet: Multiscale Feature Context Aggregation Network for Oriented Object Detection in Remote-Sensing Images","volume":"32","author":"Jiang","year":"2024","journal-title":"IEEE Access"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TGRS.2024.3510781","article-title":"Multistage enhancement network for tiny object detection in remote sensing images","volume":"62","author":"Zhang","year":"2024","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"15622","DOI":"10.1109\/ACCESS.2024.3356716","article-title":"A deep cnn-based detection method for multi-scale fine-grained objects in remote sensing images","volume":"12","author":"Xie","year":"2024","journal-title":"IEEE Access"},{"key":"ref_27","first-page":"1","article-title":"FANet: An arbitrary direction remote sensing object detection network based on feature fusion and angle classification","volume":"61","author":"Zhang","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_28","first-page":"103270","article-title":"A remote sensing image rotation object detection approach for real-time environmental monitoring","volume":"57","author":"Tang","year":"2023","journal-title":"Sustain. Energy Technol. Assess."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"805","DOI":"10.1108\/IJICC-01-2024-0020","article-title":"MFLD: Lightweight object detection with multi-receptive field and long-range dependency in remote sensing images","volume":"17","author":"Wang","year":"2024","journal-title":"Int. J. Intell. Comput. Cybern."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"55508","DOI":"10.1109\/ACCESS.2022.3172504","article-title":"Lightweight spatial sliced-concatenate-multireceptive-field enhance and joint channel attention mechanism for infrared object detection","volume":"10","author":"Pan","year":"2022","journal-title":"IEEE Access"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TGRS.2023.3335418","article-title":"Receptive-field and direction induced attention network for infrared dim small target detection with a large-scale dataset IRDST","volume":"61","author":"Sun","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"129866","DOI":"10.1016\/j.neucom.2025.129866","article-title":"SCSA: Exploring the synergistic effects between spatial and channel attention","volume":"634","author":"Si","year":"2025","journal-title":"Neurocomputing"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"106442","DOI":"10.1016\/j.engappai.2023.106442","article-title":"Mixed local channel attention for object detection","volume":"123","author":"Wan","year":"2023","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"108630","DOI":"10.1016\/j.compag.2024.108630","article-title":"Detection of Fusarium head blight in wheat using UAV remote sensing based on parallel channel space attention","volume":"217","author":"Bao","year":"2024","journal-title":"Comput. Electron. Agric."},{"key":"ref_35","first-page":"1371","article-title":"Spatial-Channel Attention Transformer with Pseudo Regions for Remote Sensing Image-Text Retrieval","volume":"261","author":"Wu","year":"2024","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"2353290","DOI":"10.1080\/22797254.2024.2353290","article-title":"A multi-scale multi-channel CNN introducing a channel-spatial attention mechanism hyperspectral remote sensing image classification method","volume":"57","author":"Zhao","year":"2024","journal-title":"Eur. J. Remote Sens."},{"key":"ref_37","first-page":"630","article-title":"Object detection by channel and spatial exchange for multimodal remote sensing imagery","volume":"72","author":"Nan","year":"2024","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_38","first-page":"331","article-title":"A survey on Kolmogorov-Arnold network","volume":"68","author":"Somvanshi","year":"2024","journal-title":"ACM Comput. Surv."},{"key":"ref_39","unstructured":"Zhang, X., Liu, C., Yang, D., Song, T., Ye, Y., Li, K., and Soong, Y. (2023). RFAConv: Innovating spatial attention and standard convolutional operation. arXiv."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., and Hu, Q. (2020, January 13\u201319). ECA-Net: Efficient channel attention for deep convolutional neural networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01155"}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/18\/9\/537\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T18:34:45Z","timestamp":1760034885000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/18\/9\/537"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,23]]},"references-count":42,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2025,9]]}},"alternative-id":["a18090537"],"URL":"https:\/\/doi.org\/10.3390\/a18090537","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,8,23]]}}}