{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,16]],"date-time":"2026-03-16T18:04:41Z","timestamp":1773684281312,"version":"3.50.1"},"reference-count":38,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2017,3,27]],"date-time":"2017-03-27T00:00:00Z","timestamp":1490572800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>This paper presents an automatic solution to the problem of detecting and counting cars in unmanned aerial vehicle (UAV) images. This is a challenging task given the very high spatial resolution of UAV images (on the order of a few centimetres) and the extremely high level of detail, which require suitable automatic analysis methods. Our proposed method begins by segmenting the input image into small homogeneous regions, which can be used as candidate locations for car detection. Next, a window is extracted around each region, and deep learning is used to mine highly descriptive features from these windows. We use a deep convolutional neural network (CNN) system that is already pre-trained on huge auxiliary data as a feature extraction tool, combined with a linear support vector machine (SVM) classifier to classify regions into \u201ccar\u201d and \u201cno-car\u201d classes. The final step is devoted to a fine-tuning procedure which performs morphological dilation to smooth the detected regions and fill any holes. In addition, small isolated regions are analysed further using a few sliding rectangular windows to locate cars more accurately and remove false positives. To evaluate our method, experiments were conducted on a challenging set of real UAV images acquired over an urban area. The experimental results have proven that the proposed method outperforms the state-of-the-art methods, both in terms of accuracy and computational time.<\/jats:p>","DOI":"10.3390\/rs9040312","type":"journal-article","created":{"date-parts":[[2017,3,27]],"date-time":"2017-03-27T10:49:10Z","timestamp":1490611750000},"page":"312","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":265,"title":["Deep Learning Approach for Car Detection in UAV Imagery"],"prefix":"10.3390","volume":"9","author":[{"given":"Nassim","family":"Ammour","sequence":"first","affiliation":[{"name":"Computer Engineering Department, College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2164-043X","authenticated-orcid":false,"given":"Haikel","family":"Alhichri","sequence":"additional","affiliation":[{"name":"Computer Engineering Department, College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9287-0596","authenticated-orcid":false,"given":"Yakoub","family":"Bazi","sequence":"additional","affiliation":[{"name":"Computer Engineering Department, College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia"}]},{"given":"Bilel","family":"Benjdira","sequence":"additional","affiliation":[{"name":"Computer Engineering Department, College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia"}]},{"given":"Naif","family":"Alajlan","sequence":"additional","affiliation":[{"name":"Computer Engineering Department, College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia"}]},{"given":"Mansour","family":"Zuair","sequence":"additional","affiliation":[{"name":"Computer Engineering Department, College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia"}]}],"member":"1968","published-online":{"date-parts":[[2017,3,27]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1016\/j.isprsjprs.2014.02.013","article-title":"Unmanned aerial systems for photogrammetry and remote sensing: A review","volume":"92","author":"Colomina","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"722","DOI":"10.1109\/TGRS.2008.2010457","article-title":"Thermal and Narrowband Multispectral Remote Sensing for Vegetation Monitoring From an Unmanned Aerial Vehicle","volume":"47","author":"Berni","year":"2009","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"851","DOI":"10.1109\/JSTARS.2013.2250921","article-title":"Characterization of Rice Paddies by a UAV-Mounted Miniature Hyperspectral Sensor System","volume":"6","author":"Uto","year":"2013","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_4","first-page":"93","article-title":"A 3D Model of Castle Landenberg (CH) from Combined Photogrammetric Processing of Terrestrial and UAV-based Images","volume":"37","author":"Sauerbier","year":"2008","journal-title":"Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Moranduzzo, T., Mekhalfi, M.L., and Melgani, F. (2015, January 26\u201331). LBP-based multiclass classification method for UAV imagery. Proceedings of the 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Milan, Italy.","DOI":"10.1109\/IGARSS.2015.7326283"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"870","DOI":"10.1109\/JSTARS.2011.2143696","article-title":"Combining GeoEye-1 Satellite Remote Sensing, UAV Aerial Imaging, and Geophysical Surveys in Anomaly Detection Applied to Archaeology","volume":"4","author":"Lin","year":"2011","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Moranduzzo, T., and Melgani, F. (2012, January 22\u201327). A SIFT-SVM method for detecting cars in UAV images. Proceedings of the 2012 IEEE International Geoscience and Remote Sensing Symposium, Munich, Germany.","DOI":"10.1109\/IGARSS.2012.6352585"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1635","DOI":"10.1109\/TGRS.2013.2253108","article-title":"Automatic Car Counting Method for Unmanned Aerial Vehicle Images","volume":"52","author":"Moranduzzo","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"6356","DOI":"10.1109\/TGRS.2013.2296351","article-title":"Detecting Cars in UAV Images With a Catalog-Based Approach","volume":"52","author":"Moranduzzo","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1080\/01431161.2015.1043760","article-title":"A fast object detector based on high-order gradients and Gaussian process regression for UAV images","volume":"36","author":"Moranduzzo","year":"2015","journal-title":"Int. J. Remote Sens."},{"key":"ref_11","unstructured":"Zhao, T., and Nevatia, R. (2001, January 7\u201314). Car detection in low resolution aerial image. Proceedings of the Eighth IEEE International Conference on Computer Vision (ICCV 2001), Washington, WA, USA."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/S0262-8856(01)00059-2","article-title":"Performance analysis of a simple vehicle detection algorithm","volume":"20","author":"Moon","year":"2002","journal-title":"Image Vis. Comput."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Kluckner, S., Pacher, G., Grabner, H., Bischof, H., and Bauer, J. (2007, January 14\u201321). A 3D Teacher for Car Detection in Aerial Images. Proceedings of the 2007 IEEE 11th International Conference on Computer Vision, Rio de Janeiro, Brazil.","DOI":"10.1109\/ICCV.2007.4408834"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"871","DOI":"10.14358\/PERS.75.7.871","article-title":"Object-based Detection and Classification of Vehicles from Highresolution Aerial Photography. Photogrammetric Engineering and Remote","volume":"75","author":"Holt","year":"2009","journal-title":"Photogramm. Eng. Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Shao, W., Yang, W., Liu, G., and Liu, J. (2012, January 22\u201327). Car detection from high-resolution aerial imagery using multiple features. Proceedings of the 2012 IEEE International Geoscience and Remote Sensing Symposium, Munich, Germany.","DOI":"10.1109\/IGARSS.2012.6350403"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1797","DOI":"10.1109\/LGRS.2014.2309695","article-title":"Vehicle Detection in Satellite Images by Hybrid Deep Convolutional Neural Networks","volume":"11","author":"Chen","year":"2014","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"11315","DOI":"10.3390\/rs61111315","article-title":"An Operational System for Estimating Road Traffic Information from Aerial Images","volume":"6","author":"Leitloff","year":"2014","journal-title":"Remote Sens."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Xu, Y., Yu, G., Wang, Y., Wu, X., and Ma, Y. (2016). A Hybrid Vehicle Detection Method Based on Viola-Jones and HOG + SVM from UAV Images. Sensors, 16.","DOI":"10.3390\/s16081325"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"1527","DOI":"10.1162\/neco.2006.18.7.1527","article-title":"A Fast Learning Algorithm for Deep Belief Nets","volume":"18","author":"Hinton","year":"2006","journal-title":"Neural Comput."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Vincent, P., Larochelle, H., Bengio, Y., and Manzagol, P.-A. (2008, January 5\u20139). Extracting and Composing Robust Features with Denoising Autoencoders. Proceedings of the 25th International Conference on Machine Learning, Helsinki, Finland.","DOI":"10.1145\/1390156.1390294"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1120","DOI":"10.1109\/LSP.2014.2325781","article-title":"Convolutional Neural Networks for Distant Speech Recognition","volume":"21","author":"Swietojanski","year":"2014","journal-title":"IEEE Signal Process. Lett."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Lampert, C.H., Blaschko, M.B., and Hofmann, T. (2008, January 23\u201328). Beyond sliding windows: Object localization by efficient subwindow search. Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, AK, USA.","DOI":"10.1109\/CVPR.2008.4587586"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Vedaldi, A., Gulshan, V., Varma, M., and Zisserman, A. (October, January 27). Multiple kernels for object detection. Proceedings of the 2009 IEEE 12th International Conference on Computer Vision, Kyoto, Japan.","DOI":"10.1109\/ICCV.2009.5459183"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 24\u201327). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"2071","DOI":"10.1109\/TPAMI.2015.2389830","article-title":"Regionlets for Generic Object Detection","volume":"37","author":"Wang","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1007\/s11263-013-0620-5","article-title":"Selective Search for Object Recognition","volume":"104","author":"Uijlings","year":"2013","journal-title":"Int. J. Comput. Vis."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2016). Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell.","DOI":"10.1109\/TPAMI.2016.2577031"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"790","DOI":"10.1109\/34.400568","article-title":"Mean shift, mode seeking, and clustering","volume":"17","author":"Cheng","year":"1995","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1109\/34.1000236","article-title":"Mean shift: A robust approach toward feature space analysis","volume":"24","author":"Comaniciu","year":"2002","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1007\/s11263-006-7934-5","article-title":"Graph Cuts and Efficient N-D Image Segmentation","volume":"70","author":"Boykov","year":"2006","journal-title":"Int. J. Comput. Vis."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"888","DOI":"10.1109\/34.868688","article-title":"Normalized cuts and image segmentation","volume":"22","author":"Malik","year":"2000","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"705","DOI":"10.1007\/978-3-540-88693-8_52","article-title":"Quick Shift and Kernel Methods for Mode Seeking","volume":"Volume 5305","author":"Forsyth","year":"2008","journal-title":"Computer Vision\u2014ECCV 2008"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1109\/34.87344","article-title":"Watersheds in digital spaces: An efficient algorithm based on immersion simulations","volume":"13","author":"Vincent","year":"1991","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"2290","DOI":"10.1109\/TPAMI.2009.96","article-title":"TurboPixels: Fast superpixels using geometric flows","volume":"31","author":"Levinshtein","year":"2009","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"1915","DOI":"10.1109\/TPAMI.2012.231","article-title":"Learning Hierarchical Features for Scene Labeling","volume":"35","author":"Farabet","year":"2013","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1162\/NECO_a_00682","article-title":"Efficient Training of Convolutional Deep Belief Networks in the Frequency Domain for Application to High-resolution 2D and 3D Images","volume":"27","author":"Brosch","year":"2015","journal-title":"Neural Comput."},{"key":"ref_37","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012). Advances in Neural Information Processing Systems, MIT Press."},{"key":"ref_38","unstructured":"Simonyan, K., and Zisserman, A. (2015, January 7\u20139). Very Deep Convolutional Networks for Large-Scale Image Recognition. Proceedings of the 3rd International Conference on Learning Representations, San Diego, CA, USA."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/9\/4\/312\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T18:31:22Z","timestamp":1760207482000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/9\/4\/312"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,3,27]]},"references-count":38,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2017,4]]}},"alternative-id":["rs9040312"],"URL":"https:\/\/doi.org\/10.3390\/rs9040312","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,3,27]]}}}