{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,9]],"date-time":"2026-06-09T17:12:18Z","timestamp":1781025138741,"version":"3.54.1"},"reference-count":25,"publisher":"MDPI AG","issue":"12","license":[{"start":{"date-parts":[[2020,6,21]],"date-time":"2020-06-21T00:00:00Z","timestamp":1592697600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Haitao Jia","award":["2018GZDZX0034"],"award-info":[{"award-number":["2018GZDZX0034"]}]},{"name":"Weimin Hou","award":["19255901D"],"award-info":[{"award-number":["19255901D"]}]},{"name":"Weimin Hou","award":["6142A010301"],"award-info":[{"award-number":["6142A010301"]}]},{"name":"Wenbo Xu","award":["2018JY0516"],"award-info":[{"award-number":["2018JY0516"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Vehicle targets in unmanned aerial vehicle (UAV) images are generally small, so a significant amount of detailed information on targets may be lost after neural computing, which leads to the poor performances of the existing recognition algorithms. Based on convolutional neural networks that utilize the YOLOv3 algorithm, this article focuses on the development of a quick automatic vehicle detection method for UAV images. First, a vehicle dataset for target recognition is constructed. Then, a novel YOLOv3 vehicle detection framework is proposed according to the following characteristics: The vehicle targets in the UAV image are relatively small and dense. The average precision (AP) increased by 5.48%, from 92.01% to 97.49%, which still remains the rather high processing speed of the YOLO network. Finally, the proposed framework is tested using three datasets: COWC, VEDAI, and CAR. The experimental results demonstrate that our method had a better detection capability.<\/jats:p>","DOI":"10.3390\/rs12121994","type":"journal-article","created":{"date-parts":[[2020,6,23]],"date-time":"2020-06-23T09:05:33Z","timestamp":1592903133000},"page":"1994","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":40,"title":["Fast Automatic Vehicle Detection in UAV Images Using Convolutional Neural Networks"],"prefix":"10.3390","volume":"12","author":[{"given":"Xin","family":"Luo","sequence":"first","affiliation":[{"name":"Spatial Information and Digital Technology, School of Resources and Environment, University of Electronic Science and Technology of China, Chengdu 611731, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaoyue","family":"Tian","sequence":"additional","affiliation":[{"name":"Spatial Information and Digital Technology, School of Resources and Environment, University of Electronic Science and Technology of China, Chengdu 611731, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Huijie","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Communication Engineering, School of Automation and Electrical Engineering, Chengdu Technological University, Chengdu 611730, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Weimin","family":"Hou","sequence":"additional","affiliation":[{"name":"Department of Communication Engineering, School of Information Science and Engineering, Hebei University of Science and Technology, Shijiazhuang 050018, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Geng","family":"Leng","sequence":"additional","affiliation":[{"name":"Spatial Information and Digital Technology, School of Resources and Environment, University of Electronic Science and Technology of China, Chengdu 611731, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Wenbo","family":"Xu","sequence":"additional","affiliation":[{"name":"Spatial Information and Digital Technology, School of Resources and Environment, University of Electronic Science and Technology of China, Chengdu 611731, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Haitao","family":"Jia","sequence":"additional","affiliation":[{"name":"Spatial Information and Digital Technology, School of Resources and Environment, University of Electronic Science and Technology of China, Chengdu 611731, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xixu","family":"He","sequence":"additional","affiliation":[{"name":"Spatial Information and Digital Technology, School of Resources and Environment, University of Electronic Science and Technology of China, Chengdu 611731, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Meng","family":"Wang","sequence":"additional","affiliation":[{"name":"Spatial Information and Digital Technology, School of Resources and Environment, University of Electronic Science and Technology of China, Chengdu 611731, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jian","family":"Zhang","sequence":"additional","affiliation":[{"name":"Spatial Information and Digital Technology, School of Resources and Environment, University of Electronic Science and Technology of China, Chengdu 611731, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2020,6,21]]},"reference":[{"key":"ref_1","unstructured":"Luo, P., Liu, F., Liu, X., and Yang, Y. (2012, January 26\u201328). Stationary vehicle detection in aerial surveillance with a UAV. Proceedings of the 2012 8th International Conference on Information Science and Digital Content Technology (ICIDT2012), Jeju, Korea."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Wei, H., Zhou, G., Zheng, Z., Li, X., Liu, Y., Zhang, Y., Li, S., and Yue, T. (2013, January 21\u201326). Vehicle detection from parking lot aerial images. Proceedings of the 2013 IEEE International Geoscience and Remote Sensing Symposium\u2014IGARSS, Melbourne, VIC, Australia.","DOI":"10.1109\/IGARSS.2013.6723710"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Ray, S. (2019, January 14\u201316). A Quick Review of Machine Learning Algorithms. Proceedings of the 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon), Faridabad, India.","DOI":"10.1109\/COMITCon.2019.8862451"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Bellary, J., Peyakunta, B., and Konetigari, S. (2010, January 9\u201311). Hybrid Machine Learning Approach in Data Mining. Proceedings of the 2010 Second International Conference on Machine Learning and Computing, Bangalore, India.","DOI":"10.1109\/ICMLC.2010.57"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Yang, Y., Wang, J., and Yang, Y. (October, January 30). Improving SVM classifier with prior knowledge in microcalcification detection1. Proceedings of the 2012 19th IEEE International Conference on Image Processing, Orlando, FL, USA.","DOI":"10.1109\/ICIP.2012.6467490"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Cunhe, L., and Chenggang, W. (2010, January 21\u201323). A new semi-supervised support vector machine learning algorithm based on active learning. Proceedings of the 2010 2nd International Conference on Future Computer and Communication, Wuhan, China.","DOI":"10.1109\/ICFCC.2010.5497471"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"687","DOI":"10.1109\/LSP.2014.2313570","article-title":"Parameterized AdaBoost: Introducing a Parameter to Speed Up the Training of Real AdaBoost","volume":"21","author":"Wu","year":"2014","journal-title":"IEEE Signal Process. Lett."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"2767","DOI":"10.1016\/j.ijleo.2012.08.040","article-title":"Facial expression recognition based on fusion feature of PCA and LBP with SVM","volume":"124","author":"Luo","year":"2013","journal-title":"Opt. Int. J. Light Electron Opt."},{"key":"ref_9","first-page":"84","article-title":"ImageNet Classification with Deep Convolutional Neural Networks","volume":"60","author":"Krizhevsky","year":"2017","journal-title":"NIPS Curran Assoc. Inc."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 24\u201327). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_11","first-page":"91","article-title":"Faster RCNN: Towards real-time object detection with region proposal networks","volume":"39","author":"Ren","year":"2015","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Kourris, A., Kyrkou, C., and Bouganis, C. (2019, January 3\u20138). Informed Region Selection for Efficient UAV-based Object Detectors: Altitude-aware Vehicle Detection with CyCar Dataset. Proceedings of the 2019 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Macau, China.","DOI":"10.1109\/IROS40897.2019.8967722"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Kyrkou, C., Plastiras, G., Theocharides, T., Venieris, S.I., and Bouganis, C.S. (2018, January 19\u201323). DroNet: Efficient convolutional neural network detector for real-time UAV applications. Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition (DATE), Dresden, Germany.","DOI":"10.23919\/DATE.2018.8342149"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","article-title":"Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition","volume":"37","author":"He","year":"2014","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 13\u201316). Fast RCNN. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (July, January 26). You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, Faster, Stronger. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_18","unstructured":"Redmon, J., and Farhadi, A. (2018). YOLO V3: An Incremental Improvement. arXiv, 1\u201322."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Shah, S., and Singh, M. (2012, January 11\u201312). Comparison of a Time Efficient Modified K-mean Algorithm with K-Mean and K-Medoid Algorithm. Proceedings of the 2012 International Conference on Communication Systems and Network Technologies, Rajkot, India.","DOI":"10.1109\/CSNT.2012.100"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Bodla, N., Singh, B., Chellappa, R., and Davis, L.S. (2017, January 22\u201329). Soft-NMS\u2014Improving Object Detection with One Line of Code. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.593"},{"key":"ref_21","first-page":"1","article-title":"YOLOv3 Network Based on Improved Loss Function","volume":"28","author":"Shuo","year":"2019","journal-title":"Comput. Syst. Appl."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Keskar, N.S., and Saon, G. (2015, January 19\u201324). A nonmonotone learning rate strategy for SGD training of deep neural networks. Proceedings of the 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, QLD, Australia.","DOI":"10.1109\/ICASSP.2015.7178917"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Sokolov, R.I. (2016, January 19\u201320). Theoretical investigation of Gaussian and non-Gaussian noise masking properties. Proceedings of the 2016 2nd International Conference on Industrial Engineering, Applications and Manufacturing (ICIEAM), Chelyabinsk, Russia.","DOI":"10.1109\/ICIEAM.2016.7911554"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Wang, M., Luo, X., and Tian, X. (2020, January 19\u201324). Research on Vehicle Detection Based on Faster R-CNN for UAV Images. Proceedings of the IGARSS 2020, Waikoloa, HA, USA.","DOI":"10.1109\/IGARSS39084.2020.9323323"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1016\/j.jvcir.2015.11.002","article-title":"Vehicle Detection in Aerial Imagery: A small target detection benchmark","volume":"34","author":"Razakarivony","year":"2016","journal-title":"J. Vis. Commun. Image Represent."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/12\/12\/1994\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T09:41:28Z","timestamp":1760175688000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/12\/12\/1994"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,21]]},"references-count":25,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2020,6]]}},"alternative-id":["rs12121994"],"URL":"https:\/\/doi.org\/10.3390\/rs12121994","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,6,21]]}}}