{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,11]],"date-time":"2026-01-11T01:59:13Z","timestamp":1768096753728,"version":"3.49.0"},"reference-count":38,"publisher":"World Scientific Pub Co Pte Ltd","issue":"01","funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61973075"],"award-info":[{"award-number":["61973075"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Top six talents in Jiangsu","award":["RJFW-001"],"award-info":[{"award-number":["RJFW-001"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2022,1]]},"abstract":"<jats:p> Difficult object detection and class imbalance in object detection are the two main challenges faced by aerial image object detection. Difficult objects include small objects, objects of scale variation and objects with serious background interference. Class imbalances come from the number of different classes of objects and sampling of positive and negative samples. Due to these challenges, conventional object detection models usually cannot effectively detect objects in aerial images, especially in the balance between network speed and accuracy. In this paper, the YOLOv3 network structure was improved and an object detection method under the aerial visual scene (AVS-YOLO) was proposed. By introducing a type of densely connected feature pyramid strategy, a scale-aware attention module was constructed, considering both residual dense network blocks and the median-frequency-balancing mechanism. On this basis, an algorithm with ideal speed and accuracy for object detection is obtained. To verify the effectiveness of the algorithm, AVS-YOLO and YOLOv3 were both used to test the VisDrone-DET2019 and UAVDT. The experimental results show that the AP of AVS-YOLO increases by 6.22% and 5.09% on the VisDrone2019 and UAVDT datasets, respectively, compared with YOLOv3. In addition, the AP of AVS-YOLO is 1.82% higher than that of YOLOv4 on the VisDrone2019 dataset. In terms of detection speed, AVS-YOLO can process 31.8 frames per second on a single Nvidia GTX 2080Ti GPU, compared with 44.1 frames per second for YOLOv3. Compared with the other one-stage network in the field of object detection, AVS-YOLO currently achieves the state-of-the-art performance with similar calculation amount on this dataset. <\/jats:p>","DOI":"10.1142\/s0218001422500045","type":"journal-article","created":{"date-parts":[[2022,1,20]],"date-time":"2022-01-20T14:57:18Z","timestamp":1642690638000},"source":"Crossref","is-referenced-by-count":13,"title":["AVS-YOLO: Object Detection in Aerial Visual Scene"],"prefix":"10.1142","volume":"36","author":[{"given":"You","family":"Ma","sequence":"first","affiliation":[{"name":"Key Laboratory of Measurement and Control of Complex, Systems of Engineering and School of Automation, Southeast University, Nanjing 210096, P.\u00a0R.\u00a0China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5802-1133","authenticated-orcid":false,"given":"Lin","family":"Chai","sequence":"additional","affiliation":[{"name":"Key Laboratory of Measurement and Control of Complex, Systems of Engineering and School of Automation, Southeast University, Nanjing 210096, P.\u00a0R.\u00a0China"}]},{"given":"Lizuo","family":"Jin","sequence":"additional","affiliation":[{"name":"Key Laboratory of Measurement and Control of Complex, Systems of Engineering and School of Automation, Southeast University, Nanjing 210096, P.\u00a0R.\u00a0China"}]},{"given":"Yafeng","family":"Yu","sequence":"additional","affiliation":[{"name":"Key Laboratory of Measurement and Control of Complex, Systems of Engineering and School of Automation, Southeast University, Nanjing 210096, P.\u00a0R.\u00a0China"}]},{"given":"Jun","family":"Yan","sequence":"additional","affiliation":[{"name":"Department of Geriatric Neurology, Affiliated Brain Hospital of Nanjing Medical University, Nanjing, P.\u00a0R.\u00a0China"}]}],"member":"219","published-online":{"date-parts":[[2022,1,19]]},"reference":[{"key":"S0218001422500045BIB001","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2644615"},{"key":"S0218001422500045BIB002","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01261-8_13"},{"key":"S0218001422500045BIB003","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2019.11.068"},{"key":"S0218001422500045BIB005","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00644"},{"issue":"5","key":"S0218001422500045BIB006","first-page":"2050","volume":"50","author":"Chen S.","year":"2018","journal-title":"IEEE Trans. Syst., Man, Cybern."},{"key":"S0218001422500045BIB007","doi-asserted-by":"publisher","DOI":"10.1007\/BF00994018"},{"key":"S0218001422500045BIB008","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2005.177"},{"key":"S0218001422500045BIB009","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01249-6_23"},{"key":"S0218001422500045BIB010","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2019.00030"},{"issue":"5","key":"S0218001422500045BIB011","first-page":"740","volume":"13","author":"Evo I.","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"S0218001422500045BIB012","doi-asserted-by":"publisher","DOI":"10.1109\/CompComm.2018.8780579"},{"key":"S0218001422500045BIB013","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.2993998"},{"key":"S0218001422500045BIB014","doi-asserted-by":"publisher","DOI":"10.1006\/jcss.1997.1504"},{"key":"S0218001422500045BIB015","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.169"},{"key":"S0218001422500045BIB016","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.81"},{"key":"S0218001422500045BIB017","doi-asserted-by":"publisher","DOI":"10.2307\/2346830"},{"key":"S0218001422500045BIB018","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00065"},{"key":"S0218001422500045BIB019","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.2979260"},{"key":"S0218001422500045BIB020","doi-asserted-by":"publisher","DOI":"10.1109\/LGRS.2019.2936173"},{"key":"S0218001422500045BIB021","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2015.09.183"},{"key":"S0218001422500045BIB022","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2019.2905881"},{"key":"S0218001422500045BIB023","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.106"},{"key":"S0218001422500045BIB024","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2018.2858826"},{"key":"S0218001422500045BIB025","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2020.06.011"},{"key":"S0218001422500045BIB026","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"S0218001422500045BIB027","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2939488"},{"key":"S0218001422500045BIB028","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000029664.99615.94"},{"key":"S0218001422500045BIB029","doi-asserted-by":"publisher","DOI":"10.1109\/MLBDBI48998.2019.00032"},{"key":"S0218001422500045BIB030","doi-asserted-by":"publisher","DOI":"10.3390\/rs9020100"},{"key":"S0218001422500045BIB031","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.91"},{"key":"S0218001422500045BIB032","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2577031"},{"key":"S0218001422500045BIB033","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00075"},{"key":"S0218001422500045BIB034","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00377"},{"key":"S0218001422500045BIB035","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2018.2875449"},{"key":"S0218001422500045BIB036","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2019.06.076"},{"issue":"1","key":"S0218001422500045BIB037","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-017-02088-w","volume":"9","author":"Zhang M.","year":"2018","journal-title":"Nat. Commun."},{"key":"S0218001422500045BIB038","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2019.00020"},{"key":"S0218001422500045BIB039","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2019.07.073"}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001422500045","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,3,3]],"date-time":"2022-03-03T07:59:27Z","timestamp":1646294367000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0218001422500045"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,1]]},"references-count":38,"journal-issue":{"issue":"01","published-print":{"date-parts":[[2022,1]]}},"alternative-id":["10.1142\/S0218001422500045"],"URL":"https:\/\/doi.org\/10.1142\/s0218001422500045","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"value":"0218-0014","type":"print"},{"value":"1793-6381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,1]]},"article-number":"2250004"}}