{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,15]],"date-time":"2025-10-15T00:42:04Z","timestamp":1760488924507,"version":"build-2065373602"},"reference-count":54,"publisher":"MDPI AG","issue":"18","license":[{"start":{"date-parts":[[2019,9,13]],"date-time":"2019-09-13T00:00:00Z","timestamp":1568332800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Vehicle detection is an important research area that provides background information for the diversity of unmanned-aerial-vehicle (UAV) applications. In this paper, we propose a vehicle-detection method using a convolutional-neural-network (CNN)-based object detector. We design our method, DRFBNet300, with a Deeper Receptive Field Block (DRFB) module that enhances the expressiveness of feature maps to detect small objects in the UAV imagery. We also propose the UAV-cars dataset that includes the composition and angular distortion of vehicles in UAV imagery to train our DRFBNet300. Lastly, we propose a Split Image Processing (SIP) method to improve the accuracy of the detection model. Our DRFBNet300 achieves 21 mAP with 45 FPS in the MS COCO metric, which is the highest score compared to other lightweight single-stage methods running in real time. In addition, DRFBNet300, trained on the UAV-cars dataset, obtains the highest AP score at altitudes of 20\u201350 m. The gap of accuracy improvement by applying the SIP method became larger when the altitude increases. The DRFBNet300 trained on the UAV-cars dataset with SIP method operates at 33 FPS, enabling real-time vehicle detection.<\/jats:p>","DOI":"10.3390\/s19183958","type":"journal-article","created":{"date-parts":[[2019,9,13]],"date-time":"2019-09-13T10:32:41Z","timestamp":1568370761000},"page":"3958","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":20,"title":["Real-Time Vehicle-Detection Method in Bird-View Unmanned-Aerial-Vehicle Imagery"],"prefix":"10.3390","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0971-3376","authenticated-orcid":false,"given":"Seongkyun","family":"Han","sequence":"first","affiliation":[{"name":"Department of Electrical Engineering, Kwangwoon University, 20 Kwangwoon-ro, Nowon-gu, Seoul 01897, Korea"}]},{"given":"Jisang","family":"Yoo","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering, Kwangwoon University, 20 Kwangwoon-ro, Nowon-gu, Seoul 01897, Korea"}]},{"given":"Soonchul","family":"Kwon","sequence":"additional","affiliation":[{"name":"Department of Smart Convergence, Kwangwoon University, 20 Kwangwoon-ro, Nowon-gu, Seoul 01897, Korea"}]}],"member":"1968","published-online":{"date-parts":[[2019,9,13]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Zacharie, M., Fuji, S., and Minori, S. (2018, January 21\u201324). Rapid Human Body Detection in Disaster Sites Using Image Processing from Unmanned Aerial Vehicle (UAV) Cameras. Proceedings of the 2018 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS), Bangkok, Thailand.","DOI":"10.1109\/ICIIBMS.2018.8549955"},{"key":"ref_2","unstructured":"Doherty, P., and Rudol, P. (2007, January 2\u20136). A uav search and rescue scenario with human body detection and geolocalization. Proceedings of the Australasian Joint Conference on Artificial Intelligence, Gold Coast, Australia."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Ma\u2019sum, M.A., Arrofi, M.K., Jati, G., Arifin, F., Kurniawan, M.N., Mursanto, P., and Jatmiko, W. (2013, January 28\u201329). Simulation of intelligent Unmanned Aerial Vehicle (UAV) For military surveillance. Proceedings of the International Conference on Advanced Computer Science and Information Systems (ICACSIS), Bali, Indonesia.","DOI":"10.1109\/ICACSIS.2013.6761569"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Xiaozhu, X., and Cheng, H. (2017, January 21\u201323). Object detection of armored vehicles based on deep learning in battlefield environment. Proceedings of the 2017 4th International Conference on Information Science and Control Engineering (ICISCE), Changsha, China.","DOI":"10.1109\/ICISCE.2017.327"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"4968","DOI":"10.1109\/JSTARS.2018.2879368","article-title":"Urban Traffic Density Estimation Based on Ultrahigh-Resolution UAV Video and Deep Neural Network","volume":"12","author":"Zhu","year":"2018","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1109\/TITS.2018.2797697","article-title":"Real-time traffic flow parameter estimation from UAV video based on ensemble classifier and optical flow","volume":"20","author":"Ke","year":"2018","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1016\/j.imavis.2017.09.008","article-title":"Vehicle Detection in Intelligent Transportation Systems and its Applications under Varying Environments: A Review","volume":"69","author":"Yang","year":"2017","journal-title":"Image Vis. Comput."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"693","DOI":"10.1016\/S0262-8856(03)00064-7","article-title":"Car detection in low resolution aerial images","volume":"21","author":"Zhao","year":"2003","journal-title":"Image Vis. Comput."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Yang, Y., Liu, F., Wang, P., Luo, P., and Liu, X. (2012, January 24\u201327). Vehicle detection methods from an unmanned aerial vehicle platform. Proceedings of 2012 IEEE International Conference on Vehicular Electronics and Safety (ICVES 2012), Istanbul, Turkey.","DOI":"10.1109\/ICVES.2012.6294294"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1845","DOI":"10.1109\/TITS.2016.2617202","article-title":"An enhanced Viola-Jones vehicle detection method from unmanned aerial vehicles imagery","volume":"18","author":"Xu","year":"2017","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_11","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3\u20138). ImageNet classification with deep convolutional neural networks. Proceedings of the Advances in Neural Information Processing Systems, Lake Tahoe, NV, USA."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1007\/s11263-015-0816-y","article-title":"Imagenet Large Scale Visual Recognition Challenge","volume":"115","author":"Russakovsky","year":"2015","journal-title":"Int. J. Comput. Vis. (IJCV)"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 8\u201316). SSD: Single Shot MultiBox Detector. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Liu, S., and Huang, D. (2018, January 8\u201314). Receptive field block net for accurate and fast object detection. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01252-6_24"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast R-CNN. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_18","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015, January 7\u201312). Faster R-CNN: Towards real-time object detection with region proposal networks. Proceedings of the International Conference on Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Radovic, M., Adarkwa, O., and Wang, Q. (2017). Object recognition in aerial images using convolutional neural networks. J. Imaging, 3.","DOI":"10.3390\/jimaging3020021"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, faster, stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Fan, Q., Brown, L., and Smith, J. (2016, January 19\u201322). A closer look at Faster R-CNN for vehicle detection. Proceedings of the 2016 Intelligent Vehicles Symposium (IV), Gothenburg, Sweden.","DOI":"10.1109\/IVS.2016.7535375"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Xu, Z., Shi, H., Li, N., Xiang, C., and Zhou, H. (2018, January 10\u201312). Vehicle Detection Under UAV Based on Optimal Dense YOLO Method. Proceedings of the 2018 5th International Conference on Systems and Informatics (ICSAI), Nanjing, China.","DOI":"10.1109\/ICSAI.2018.8599403"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Doll\u00e1r, P., and Zitnick, C.L. (2014, January 6\u201312). Microsoft coco: Common objects in context. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The PASCAL visual object classes (VOC) challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis. IJCV"},{"key":"ref_25","unstructured":"Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017). MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv."},{"key":"ref_26","unstructured":"Yu, F., and Koltun, V. (2016, January 2\u20134). Multi-scale context aggregation by dilated convolutions. Proceedings of the International Conference on Learning Representations (ICLR 2016), San Juan, PR, USA."},{"key":"ref_27","unstructured":"Redmon, J., and Farhadi, A. (2018). YOLOv3: An Incremental Improvement. arXiv."},{"key":"ref_28","unstructured":"Li, Z., and Zhou, F. (2017). FSSD: Feature Fusion Single Shot Multibox Detector. arXiv."},{"key":"ref_29","unstructured":"Breckon, T.P., Barnes, S.E., Eichner, M.L., and Wahren, K. (April, January 30). Autonomous real-time vehicle detection from a medium-level UAV. Proceedings of the 24th International Conference on Unmanned Air Vehicle Systems, Bristol, UK."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Gaszczak, A., Breckon, T.P., and Han, J. (2011, January 24). Real-time people and vehicle detection from UAV imagery. Proceedings of the Intelligent Robots and Computer Vision XXVIII: Algorithms and Techniques, San Francisco, CA, USA.","DOI":"10.1117\/12.876663"},{"key":"ref_31","first-page":"4379","article-title":"Car detection from high-resolution aerial imagery using multiple features","volume":"53","author":"Shao","year":"2012","journal-title":"IEEE Int. Geosci. Remote Sens. Symp."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"6356","DOI":"10.1109\/TGRS.2013.2296351","article-title":"Detecting Cars in UAV Images With a Catalog-Based Approach","volume":"52","author":"Moranduzzo","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Tang, T., Deng, Z., Zhou, S., Lei, L., and Zou, H. (2017, January 18\u201321). Fast vehicle detection in UAV images. Proceedings of the 2017 International Workshop on Remote Sensing with Intelligent Processing (RSIP), Shanghai, China.","DOI":"10.1109\/RSIP.2017.7958795"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Xie, X., Yang, W., Cao, G., Yang, J., Zhao, Z., Chen, S., Liao, Q., and Shi, G. (2018, January 13\u201316). Real-time Vehicle Detection from UAV Imagery. Proceedings of the Fourth IEEE International Conference on Multimedia Big Data (BigMM), Xi\u2019an, China.","DOI":"10.1109\/BigMM.2018.8499466"},{"key":"ref_35","unstructured":"Viola, P., and Jones, M. (2001, January 8\u201314). Rapid object detection using a boosted cascade of simple features. Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Kauai, HI, USA."},{"key":"ref_36","unstructured":"Dala, N., and Triggs, B. (2005, January 20\u201326). Histograms of Oriented Gradients for Human Detection. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Diego, CA, USA."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"509","DOI":"10.1109\/TGRS.1990.572934","article-title":"Texture Unit, Texture Spectrum, In addition, Texture Analysis","volume":"28","author":"He","year":"1990","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Lowe, D.G. (1999, January 20\u201327). Object recognition from local scale-invariant features. Proceedings of the Seventh IEEE International Conference on Computer Vision, Kerkyra, Greek.","DOI":"10.1109\/ICCV.1999.790410"},{"key":"ref_39","unstructured":"Suhr, J.K. (2009). Kanade-lucas-tomasi (klt) feature tracker. Computer Vision (EEE6503), Yonsei University."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Zhang, S., Wen, L., Bian, X., Lei, Z., and Li, S.Z. (2018, January 18\u201322). Single-Shot Refinement Neural Network for Object Detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00442"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Huang, J., Rathod, V., Sun, C., Zhu, M., Korattikara, A., Fathi, A., Fischer, I., Wojna, Z., Song, Y., and Murphy, K. (2017, January 21\u201326). Speed\/Accuracy Trade-Offs for Modern Convolutional Object Detectors. Proceedings of the IEEE 2017 Conference on Computer Vision and Pattern Recognition (CVPR) 2017, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.351"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Washington, DC, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_43","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"349","DOI":"10.1016\/j.tics.2015.03.009","article-title":"Computational neuroimaging and population receptive fields","volume":"19","author":"Wandell","year":"2015","journal-title":"Trends Cognit. Sci."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., and Wojna, Z. (2015, January 7\u201312). Rethinking the Inception Architecture for Computer Vision. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2016.308"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Ioffe, S., Vanhoucke, V., and Alemi, A. (2017, January 4\u20139). Inception-v4, inception-resnet and the impact of residual connections on learning. Proceedings of the Thirty-First, AAAI Conference on Artificial Intelligence, San Francisco, CA, USA.","DOI":"10.1609\/aaai.v31i1.11231"},{"key":"ref_48","unstructured":"Chen, L.C., Papandreou, G., Schroff, F., and Adam, H. (2017). Rethinking Atrous Convolution for Semantic Image Segmentation. arXiv."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Dai, J., Qi, H., Xiong, Y., Li, Y., Zhang, G., Hu, H., and Wei, Y. (2017, January 22\u201329). Deformable Convolutional Networks. Proceedings of the International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.89"},{"key":"ref_50","unstructured":"Mehta, R., and Ozturk, C. (, January 8\u201314). Object detection at 200 frames per second. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Li, F.-F. (2009, January 20\u201325). Imagenet: A large-scale hierarchical image database. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_52","unstructured":"Tzutalin (2019, July 30). LabelImg. Available online: https:\/\/github.com\/tzutalin\/labelImg."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2015, January 7\u201313). Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.123"},{"key":"ref_54","unstructured":"Goyal, P., Doll\u00e1r, P., Girshick, R., Noordhuis, P., Wesolowski, L., Kyrola, A., Tulloch, A., Jia, Y., and He, K. (2017). Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour. arXiv."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/19\/18\/3958\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:19:53Z","timestamp":1760188793000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/19\/18\/3958"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,9,13]]},"references-count":54,"journal-issue":{"issue":"18","published-online":{"date-parts":[[2019,9]]}},"alternative-id":["s19183958"],"URL":"https:\/\/doi.org\/10.3390\/s19183958","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2019,9,13]]}}}