{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,15]],"date-time":"2025-12-15T13:41:29Z","timestamp":1765806089983,"version":"3.41.2"},"reference-count":39,"publisher":"World Scientific Pub Co Pte Ltd","issue":"05","funder":[{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"crossref","award":["2022ZD0119900"],"award-info":[{"award-number":["2022ZD0119900"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Shanghai Science and Technology program","award":["22015810300"],"award-info":[{"award-number":["22015810300"]}]},{"name":"Hainan Province Science and Technology Special Fund","award":["ZDYF2021GXJS041"],"award-info":[{"award-number":["ZDYF2021GXJS041"]}]},{"name":"Scientific Research Fund of Hainan University","award":["KYQD(ZR)23025"],"award-info":[{"award-number":["KYQD(ZR)23025"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["U2141234"],"award-info":[{"award-number":["U2141234"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2024,4]]},"abstract":"<jats:p> Object detection on unmanned aerial vehicle (UAV) images is an important branch of object detection, belonging to small object detection in a broad sense. Detecting objects in UAV images poses a greater challenge due to the predominance of small objects and dense occlusion caused by UAV capturing images from varying heights and angles. To solve the above problems, we propose Residual Spatial Reduced Transformer based on YOLOv5 (RSRT-YOLOv5). Specifically, Slice Aided Enhancement Module (SAEM) is introduced to enhance the feature quality of small objects. Secondly, a Global attention-based Bi-directional Feature Fusion (GBFF) module is proposed. In the Neck architecture, an efficient Residual Spatial Reduced Transformer (RSRT) module is integrated in order to achieve more efficient feature representation and richer global contextual associations. Finally, our method is evaluated on the Visdrone2019 dataset, and the experimental results show that RSRT-YOLOv5 outperforms the baseline model (yolov5) and successfully improves the detection performance of UAV images. <\/jats:p>","DOI":"10.1142\/s0218001424500071","type":"journal-article","created":{"date-parts":[[2024,4,11]],"date-time":"2024-04-11T12:01:52Z","timestamp":1712836912000},"source":"Crossref","is-referenced-by-count":1,"title":["Residual Spatial Reduced Transformer Based on YOLOv5 for UAV Images Object Detection"],"prefix":"10.1142","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0009-0004-5305-3250","authenticated-orcid":false,"given":"Li","family":"Chen","sequence":"first","affiliation":[{"name":"School of Information and Communication Engineering, Hainan University, No. 58 People\u2019s Road, Haikou 570228, P.\u00a0R.\u00a0China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2024-3897","authenticated-orcid":false,"given":"Naimeng","family":"Cang","sequence":"additional","affiliation":[{"name":"School of Information and Communication Engineering, Hainan University, No. 58 People\u2019s Road, Haikou 570228, P.\u00a0R.\u00a0China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-2549-2549","authenticated-orcid":false,"given":"Wenbo","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Information and Communication Engineering, Hainan University, No. 58 People\u2019s Road, Haikou 570228, P.\u00a0R.\u00a0China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-7932-0115","authenticated-orcid":false,"given":"Chan","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Information and Communication Engineering, Hainan University, No. 58 People\u2019s Road, Haikou 570228, P.\u00a0R.\u00a0China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4700-1276","authenticated-orcid":false,"given":"Weidong","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Information and Communication Engineering, Hainan University, No. 58 People\u2019s Road, Haikou 570228, P.\u00a0R.\u00a0China"},{"name":"Department of Automation, Shanghai Jiaotong University, No. 800 Dongchuan Road, Shanghai 200240, P.\u00a0R.\u00a0China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1700-1996","authenticated-orcid":false,"given":"Dongsheng","family":"Guo","sequence":"additional","affiliation":[{"name":"School of Information and Communication Engineering, Hainan University, No. 58 People\u2019s Road, Haikou 570228, P.\u00a0R.\u00a0China"}]}],"member":"219","published-online":{"date-parts":[[2024,5,31]]},"reference":[{"key":"S0218001424500071BIB001","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP46576.2022.9897990"},{"key":"S0218001424500071BIB002","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2022.108548"},{"key":"S0218001424500071BIB003","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2021.3074273"},{"key":"S0218001424500071BIB005","doi-asserted-by":"publisher","DOI":"10.1109\/AEECA55500.2022.9918861"},{"key":"S0218001424500071BIB006","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"S0218001424500071BIB008","doi-asserted-by":"publisher","DOI":"10.1109\/ICOSNIKOM56551.2022.10034913"},{"key":"S0218001424500071BIB009","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01264-9_45"},{"key":"S0218001424500071BIB010","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00714"},{"key":"S0218001424500071BIB012","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.106"},{"key":"S0218001424500071BIB013","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2021.104197"},{"key":"S0218001424500071BIB014","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00913"},{"key":"S0218001424500071BIB016","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"S0218001424500071BIB017","doi-asserted-by":"publisher","DOI":"10.1109\/ICME51207.2021.9428241"},{"key":"S0218001424500071BIB018","doi-asserted-by":"publisher","DOI":"10.1155\/2018\/2497471"},{"key":"S0218001424500071BIB019","doi-asserted-by":"publisher","DOI":"10.1177\/0165551516677911"},{"key":"S0218001424500071BIB020","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2945911"},{"key":"S0218001424500071BIB021","doi-asserted-by":"publisher","DOI":"10.1002\/cae.22179"},{"key":"S0218001424500071BIB022","doi-asserted-by":"publisher","DOI":"10.1002\/cae.22253"},{"key":"S0218001424500071BIB023","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.5909"},{"key":"S0218001424500071BIB024","doi-asserted-by":"publisher","DOI":"10.1016\/j.jksuci.2022.02.025"},{"key":"S0218001424500071BIB025","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2023.120908"},{"issue":"7","key":"S0218001424500071BIB026","first-page":"101610","volume":"35","author":"Onan A.","year":"2023","journal-title":"J. King Saud Univ. Comput. Inf. Sci."},{"issue":"7","key":"S0218001424500071BIB027","first-page":"101611","volume":"35","author":"Onan A.","year":"2023","journal-title":"J. King Saud Univ. Comput. Inf. Sci."},{"key":"S0218001424500071BIB028","first-page":"5901087","volume":"2019","author":"Onan A.","year":"2019","journal-title":"Sci. Program."},{"key":"S0218001424500071BIB029","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2016.03.045"},{"key":"S0218001424500071BIB030","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2016.06.005"},{"key":"S0218001424500071BIB031","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2017.02.008"},{"key":"S0218001424500071BIB032","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2021.3049734"},{"key":"S0218001424500071BIB033","doi-asserted-by":"publisher","DOI":"10.3390\/rs11131594"},{"key":"S0218001424500071BIB035","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01196"},{"key":"S0218001424500071BIB037","doi-asserted-by":"publisher","DOI":"10.1109\/JBHI.2020.3019505"},{"key":"S0218001424500071BIB038","doi-asserted-by":"publisher","DOI":"10.7717\/peerj-cs.1441"},{"key":"S0218001424500071BIB039","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01079"},{"key":"S0218001424500071BIB040","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00061"},{"key":"S0218001424500071BIB041","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"S0218001424500071BIB042","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-20077-9_31"},{"key":"S0218001424500071BIB043","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00832"},{"key":"S0218001424500071BIB044","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52729.2023.00995"},{"key":"S0218001424500071BIB045","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58583-9_34"}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001424500071","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,20]],"date-time":"2024-06-20T08:49:53Z","timestamp":1718873393000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218001424500071"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4]]},"references-count":39,"journal-issue":{"issue":"05","published-print":{"date-parts":[[2024,4]]}},"alternative-id":["10.1142\/S0218001424500071"],"URL":"https:\/\/doi.org\/10.1142\/s0218001424500071","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"type":"print","value":"0218-0014"},{"type":"electronic","value":"1793-6381"}],"subject":[],"published":{"date-parts":[[2024,4]]},"article-number":"2450007"}}