{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T08:02:04Z","timestamp":1770969724972,"version":"3.50.1"},"reference-count":33,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2017,2,10]],"date-time":"2017-02-10T00:00:00Z","timestamp":1486684800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Detecting vehicles in aerial imagery plays an important role in a wide range of applications. The current vehicle detection methods are mostly based on sliding-window search and handcrafted or shallow-learning-based features, having limited description capability and heavy computational costs. Recently, due to the powerful feature representations, region convolutional neural networks (CNN) based detection methods have achieved state-of-the-art performance in computer vision, especially Faster R-CNN. However, directly using it for vehicle detection in aerial images has many limitations: (1) region proposal network (RPN) in Faster R-CNN has poor performance for accurately locating small-sized vehicles, due to the relatively coarse feature maps; and (2) the classifier after RPN cannot distinguish vehicles and complex backgrounds well. In this study, an improved detection method based on Faster R-CNN is proposed in order to accomplish the two challenges mentioned above. Firstly, to improve the recall, we employ a hyper region proposal network (HRPN) to extract vehicle-like targets with a combination of hierarchical feature maps. Then, we replace the classifier after RPN by a cascade of boosted classifiers to verify the candidate regions, aiming at reducing false detection by negative example mining. We evaluate our method on the Munich vehicle dataset and the collected vehicle dataset, with improvements in accuracy and robustness compared to existing methods.<\/jats:p>","DOI":"10.3390\/s17020336","type":"journal-article","created":{"date-parts":[[2017,2,10]],"date-time":"2017-02-10T05:27:09Z","timestamp":1486704429000},"page":"336","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":233,"title":["Vehicle Detection in Aerial Images Based on Region Convolutional Neural Networks and Hard Negative Example Mining"],"prefix":"10.3390","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9986-1681","authenticated-orcid":false,"given":"Tianyu","family":"Tang","sequence":"first","affiliation":[{"name":"College of Electronic Science and Engineering, National University of Defense Technology, Changsha 410073, China"}]},{"given":"Shilin","family":"Zhou","sequence":"additional","affiliation":[{"name":"College of Electronic Science and Engineering, National University of Defense Technology, Changsha 410073, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8582-2040","authenticated-orcid":false,"given":"Zhipeng","family":"Deng","sequence":"additional","affiliation":[{"name":"College of Electronic Science and Engineering, National University of Defense Technology, Changsha 410073, China"}]},{"given":"Huanxin","family":"Zou","sequence":"additional","affiliation":[{"name":"College of Electronic Science and Engineering, National University of Defense Technology, Changsha 410073, China"}]},{"given":"Lin","family":"Lei","sequence":"additional","affiliation":[{"name":"College of Electronic Science and Engineering, National University of Defense Technology, Changsha 410073, China"}]}],"member":"1968","published-online":{"date-parts":[[2017,2,10]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"11315","DOI":"10.3390\/rs61111315","article-title":"An Operational System for Estimating Road Traffic Information from Aerial Images","volume":"6","author":"Leitloff","year":"2014","journal-title":"Remote Sens."},{"key":"ref_2","first-page":"1","article-title":"Fast Multiclass Vehicle Detection on Aerial Images","volume":"12","author":"Liu","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1635","DOI":"10.1109\/TGRS.2013.2253108","article-title":"Automatic Car Counting Method for Unmanned Aerial Vehicle Images","volume":"52","author":"Moranduzzo","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"6356","DOI":"10.1109\/TGRS.2013.2296351","article-title":"Detecting Cars in UAV Images With a Catalog-Based Approach","volume":"52","author":"Moranduzzo","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"2296","DOI":"10.1109\/TITS.2016.2517826","article-title":"Vehicle Detection in High-Resolution Aerial Images Based on Fast Sparse Representation Classification and Multiorder Feature","volume":"17","author":"Chen","year":"2016","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"2152","DOI":"10.1109\/TIP.2011.2172798","article-title":"Vehicle detection in aerial surveillance using dynamic Bayesian networks","volume":"21","author":"Cheng","year":"2012","journal-title":"IEEE Trans. Image Process."},{"key":"ref_7","first-page":"4379","article-title":"Car detection from high-resolution aerial imagery using multiple features","volume":"53","author":"Shao","year":"2012","journal-title":"IEEE Int. Geosci. Remote Sens. Symp."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Kluckner, S., Pacher, G., Grabner, H., and Bischof, H. (2007, January 14\u201321). A 3D Teacher for Car Detection in Aerial Images. Proceedings of the IEEE 11th International Conference on Computer Vision, Rio de Janeiro, Brazil.","DOI":"10.1109\/ICCV.2007.4408834"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1250","DOI":"10.1109\/TPAMI.2010.182","article-title":"Vehicle detection using partial least squares","volume":"33","author":"Kembhavi","year":"2010","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_10","first-page":"1","article-title":"Vehicle Detection in High-Resolution Aerial Images via Sparse Representation and Superpixels","volume":"54","author":"Chen","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1797","DOI":"10.1109\/LGRS.2014.2309695","article-title":"Vehicle Detection in Satellite Images by Hybrid Deep Convolutional Neural Networks","volume":"11","author":"Chen","year":"2014","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"7405","DOI":"10.1109\/TGRS.2016.2601622","article-title":"Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images","volume":"54","author":"Cheng","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (arXiv, 2016). Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, arXiv.","DOI":"10.1109\/TPAMI.2016.2577031"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Van de Sande, K.E.A., Uijlings, J.R.R., Gevers, T., and Smeulders, A.W.M. (2011, January 6\u201313). Segmentation as selective search for object recognition. Proceedings of the International Conference on Computer Vision, Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126456"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Alexe, B., Deselaers, T., and Ferrari, V. (2010, January 13\u201318). What is an object?. Proceedings of the 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), San Francisco, CA, USA.","DOI":"10.1109\/CVPR.2010.5540226"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1007\/s11263-013-0620-5","article-title":"Selective Search for Object Recognition","volume":"104","author":"Uijlings","year":"2013","journal-title":"Int. J. Comput. Vis."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Kuo, W., Hariharan, B., and Malik, J. (2015, January 7\u201313). DeepBox: Learning Objectness with Convolutional Networks. Proceedings of the IEEE International Conference on Computer Vision, Los Alamitos, CA, USA.","DOI":"10.1109\/ICCV.2015.285"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"2327","DOI":"10.1109\/JSTARS.2013.2242846","article-title":"Airborne Vehicle Detection in Dense Urban Areas Using HoG Features and Disparity Maps","volume":"6","author":"Tuermer","year":"2013","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_19","first-page":"1","article-title":"Region-Based Convolutional Networks for Accurate Object Detection and Segmentation","volume":"38","author":"Girshick","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast R-CNN. Proceedings of the IEEE International Conference on Computer Vision, Los Alamitos, CA, USA.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Zeiler, M.D., and Fergus, R. (2013). Visualizing and Understanding Convolutional Networks, Springer.","DOI":"10.1007\/978-3-319-10590-1_53"},{"key":"ref_22","unstructured":"Simonyan, K., and Zisserman, A. (arXiv, 2014). Very Deep Convolutional Networks for Large-Scale Image Recognition, arXiv."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Ghodrati, A., Diba, A., Pedersoli, M., Tuytelaars, T., and Gool, L.V. (2015, January 7\u201313). DeepProposal: Hunting Objects by Cascading Deep Convolutional Layers. Proceedings of the IEEE International Conference on Computer Vision, Los Alamitos, CA, USA.","DOI":"10.1109\/ICCV.2015.296"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 21\u201326). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_25","unstructured":"Shaoqing, R. Faster _ rcnn. Available online: https:\/\/github.com\/ShaoqingRen\/faster_rcnn."},{"key":"ref_26","unstructured":"Lab, S.V. ImageNet. Available online: http:\/\/www.image-net.org\/."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"374","DOI":"10.1214\/aos\/1016218223","article-title":"Additive logistic regression: A statistical view of boosting (With discussion and a rejoinder by the authors)","volume":"28","author":"Friedman","year":"2000","journal-title":"Ann. Stat."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1532","DOI":"10.1109\/TPAMI.2014.2300479","article-title":"Fast Feature Pyramids for Object Detection","volume":"36","author":"Dollar","year":"2014","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_29","unstructured":"Turchenko, V., and Luczak, A. (arXiv, 2014). Caffe: Convolutional Architecture for Fast Feature Embedding, arXiv."},{"key":"ref_30","unstructured":"Liu, K., and Mattyus, G. DLR 3k Munich Vehicle Aerial Image Dataset. Available online: http:\/\/pba-freesoftware.eoc.dlr.de\/3K_VehicleDetection_dataset.zip."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/j.isprsjprs.2014.10.002","article-title":"Multi-class geospatial object detection and geographic image classification based on collection of part detectors","volume":"98","author":"Cheng","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.isprsjprs.2016.03.014","article-title":"A Survey on Object Detection in Optical Remote Sensing Images","volume":"117","author":"Cheng","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1127\/1432-8364\/2010\/0041","article-title":"The DGPF-Test on Digital Airborne Camera Evaluation\u2014Overview and Test Design","volume":"2010","author":"Cramer","year":"2010","journal-title":"Photogramm. Fernerkund. Geoinform."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/17\/2\/336\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T18:27:54Z","timestamp":1760207274000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/17\/2\/336"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,2,10]]},"references-count":33,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2017,2]]}},"alternative-id":["s17020336"],"URL":"https:\/\/doi.org\/10.3390\/s17020336","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,2,10]]}}}