{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,17]],"date-time":"2026-01-17T21:41:40Z","timestamp":1768686100176,"version":"3.49.0"},"reference-count":43,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2019,9,4]],"date-time":"2019-09-04T00:00:00Z","timestamp":1567555200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program of China","doi-asserted-by":"publisher","award":["2016YFB0501501"],"award-info":[{"award-number":["2016YFB0501501"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41706201"],"award-info":[{"award-number":["41706201"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"International Research Cooperation Seed Fund of Beijing University of Technology","award":["2018-B1"],"award-info":[{"award-number":["2018-B1"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IJGI"],"abstract":"<jats:p>Vehicle detection based on very high-resolution (VHR) remote sensing images is beneficial in many fields such as military surveillance, traffic control, and social\/economic studies. However, intricate details about the vehicle and the surrounding background provided by VHR images require sophisticated analysis based on massive data samples, though the number of reliable labeled training data is limited. In practice, data augmentation is often leveraged to solve this conflict. The traditional data augmentation strategy uses a combination of rotation, scaling, and flipping transformations, etc., and has limited capabilities in capturing the essence of feature distribution and proving data diversity. In this study, we propose a learning method named Vehicle Synthesis Generative Adversarial Networks (VS-GANs) to generate annotated vehicles from remote sensing images. The proposed framework has one generator and two discriminators, which try to synthesize realistic vehicles and learn the background context simultaneously. The method can quickly generate high-quality annotated vehicle data samples and greatly helps in the training of vehicle detectors. Experimental results show that the proposed framework can synthesize vehicles and their background images with variations and different levels of details. Compared with traditional data augmentation methods, the proposed method significantly improves the generalization capability of vehicle detectors. Finally, the contribution of VS-GANs to vehicle detection in VHR remote sensing images was proved in experiments conducted on UCAS-AOD and NWPU VHR-10 datasets using up-to-date target detection frameworks.<\/jats:p>","DOI":"10.3390\/ijgi8090390","type":"journal-article","created":{"date-parts":[[2019,9,5]],"date-time":"2019-09-05T03:22:36Z","timestamp":1567653756000},"page":"390","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":47,"title":["Using Vehicle Synthesis Generative Adversarial Networks to Improve Vehicle Detection in Remote Sensing Images"],"prefix":"10.3390","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8966-1184","authenticated-orcid":false,"given":"Kun","family":"Zheng","sequence":"first","affiliation":[{"name":"Faculty of Information Technology, Beijing University of Technology, No.100, Pingleyuan Road, Beijing 100124, China"}]},{"given":"Mengfei","family":"Wei","sequence":"additional","affiliation":[{"name":"Faculty of Information Technology, Beijing University of Technology, No.100, Pingleyuan Road, Beijing 100124, China"}]},{"given":"Guangmin","family":"Sun","sequence":"additional","affiliation":[{"name":"Faculty of Information Technology, Beijing University of Technology, No.100, Pingleyuan Road, Beijing 100124, China"}]},{"given":"Bilal","family":"Anas","sequence":"additional","affiliation":[{"name":"Faculty of Information Technology, Beijing University of Technology, No.100, Pingleyuan Road, Beijing 100124, China"}]},{"given":"Yu","family":"Li","sequence":"additional","affiliation":[{"name":"Faculty of Information Technology, Beijing University of Technology, No.100, Pingleyuan Road, Beijing 100124, China"}]}],"member":"1968","published-online":{"date-parts":[[2019,9,4]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Shao, W., Yang, W., Liu, G., and Liu, J. (2012, January 22\u201327). Car detection from high-resolution aerial imagery using multiple features. Proceedings of the Geoscience and Remote Sensing Symposium, Munich, Germany.","DOI":"10.1109\/IGARSS.2012.6350403"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Kluckner, S., Pacher, G., Grabner, H., Bischof, H., and Bauer, J. (2007, January 14\u201321). A 3D teacher for car detection in aerial images. Proceedings of the International Conference on Computer Vision, Rio de Janeiro, Brazil.","DOI":"10.1109\/ICCV.2007.4408834"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"2327","DOI":"10.1109\/JSTARS.2013.2242846","article-title":"Airborne vehicle detection in dense urban areas using HoG features and disparity maps","volume":"6","author":"Tuermer","year":"2013","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Tianyu, T., Shilin, Z., Zhipeng, D., Huanxin, Z., and Lin, L. (2017). Vehicle detection in aerial images based on region convolutional neural networks and hard negative example mining. Sensors, 17.","DOI":"10.3390\/s17020336"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Yang, M.Y., Liao, W., Li, X., and Rosenhahn, B. (2018, January 7\u201310). Deep Learning for Vehicle Detection in Aerial Images. Proceedings of the 2018 25th IEEE International Conference on Image Processing (ICIP), Athens, Greece.","DOI":"10.1109\/ICIP.2018.8451454"},{"key":"ref_6","unstructured":"Van Etten, A. You Only Look Twice: Rapid Multi-Scale Object Detection in Satellite Imagery. Available online: https:\/\/arxiv.org\/abs\/1805.09512."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"693","DOI":"10.1016\/S0262-8856(03)00064-7","article-title":"Car detection in low resolution aerial images","volume":"21","author":"Zhao","year":"2003","journal-title":"Image Vis. Comput."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1016\/j.isprsjprs.2008.09.005","article-title":"Classification-based vehicle detection in high-resolution satellite images","volume":"64","author":"Line","year":"2009","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Zheng, H., Pan, L., and Li, L. (2006, January 3\u20136). A morphological neural network approach for vehicle detection from high resolution satellite imagery. Proceedings of the International Conference on Neural Information Processing, Hong Kong, China.","DOI":"10.1007\/11893257_11"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"21651","DOI":"10.1007\/s11042-016-4043-5","article-title":"Vehicle detection from high-resolution aerial images using spatial pyramid pooling-based deep convolutional neural networks","volume":"76","author":"Qu","year":"2016","journal-title":"Multimed. Tools Appl."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1080\/2150704X.2016.1258127","article-title":"Vehicle detection in remote sensing images using denoising-based convolutional neural networks","volume":"8","author":"Li","year":"2017","journal-title":"Remote Sens. Lett."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Chauhan, R., Ghanshala, K.K., and Joshi, R.C. (2018, January 15\u201317). Convolutional Neural Network (CNN) for Image Detection and Recognition. Proceedings of the 2018 First International Conference on Secure Cyber Computing and Communication (ICSCCC), Jalandhar, India.","DOI":"10.1109\/ICSCCC.2018.8703316"},{"key":"ref_13","unstructured":"Zhai, S., Cheng, Y., Feris, R., and Zhang, Z. (2019, August 20). Generative Adversarial Networks as Variational Training of Energy Based Models. Available online: https:\/\/arxiv.org\/abs\/1611.01799."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1010","DOI":"10.1109\/TITS.2018.2838132","article-title":"SINet: A Scale-Insensitive Convolutional Neural Network for Fast Vehicle Detection","volume":"20","author":"Hu","year":"2019","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_15","first-page":"2812","article-title":"Vehicle Detection in Remote Sensing Images of Dense Areas Based on Deformable Convolution Neural Network","volume":"40","author":"Gao","year":"2018","journal-title":"J. Electron. Inf. Technol."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"43607","DOI":"10.1109\/ACCESS.2019.2908016","article-title":"Multi-Scale Image Block-Level F-CNN for Remote Sensing Images Object Detection","volume":"7","author":"Zhao","year":"2019","journal-title":"IEEE Access"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"182","DOI":"10.1016\/j.isprsjprs.2018.09.014","article-title":"Deep networks under scene-level supervision for multi-class geospatial object detection from remote sensing images","volume":"146","author":"Li","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Montserrat, D.M., Lin, Q., Allebach, J., and Delp, E. (2017). Training object detection and recognition cnn models using data augmentation. Electron. Imaging, 27\u201336.","DOI":"10.2352\/ISSN.2470-1173.2017.10.IMAWM-163"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Oliveira, \u00cd., Medeiros, J., and de Sousa, V. (2016, January 4\u20137). A Data Augmentation Methodology to Improve Age Estimation using Convolutional Neural Networks. Proceedings of the SIBGRAPI\u2014Conference on Graphics Patterns and Images, Sao Paulo, Brazil.","DOI":"10.1109\/SIBGRAPI.2016.021"},{"key":"ref_20","unstructured":"Goodfellow, I.J., Pouget-Abadie, J., Mirza, M., Bing, X., Warde-Farley, D., Ozair, S., Courville, A., and Bengio, Y. (2014, January 3\u20136). Generative adversarial nets. Proceedings of the International Conference on Neural Information Processing Systems, Kuching, Malaysia."},{"key":"ref_21","unstructured":"Toan, T., Pham, T., Carneiro, G., Palmer, L., and Reid, L. (2019, August 20). A Bayesian Data Augmentation Approach for Learning Deep Model. Available online: https:\/\/arxiv.org\/pdf\/1710.10564."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Mao, X., Li, Q., Xie, H., Lau, R.Y.K., and Wang, Z. (2017, January 22\u201329). Least squares generative adversarial networks. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.304"},{"key":"ref_23","unstructured":"Denton, E., Chintala, S., Szlam, A., and Fergus, R. (2019, August 20). Deep Generative Image Models Using a Laplacian Pyramid of Adversarial Networks. Available online: https:\/\/arxiv.org\/abs\/1506.05751."},{"key":"ref_24","unstructured":"Arjovsky, M., Chintala, S., and Bottou, L. (2019, August 20). Wasserstein GAN. Available online: https:\/\/arxiv.org\/abs\/1701.07875."},{"key":"ref_25","unstructured":"Radford, A., Metz, L., and Chintala, S. (2019, August 20). Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks. Available online: https:\/\/arxiv.org\/abs\/1511.06434."},{"key":"ref_26","unstructured":"Zhu, J.Y., Park, T., Isola, P., and Efros, A.A. (2019, August 20). Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. Available online: https:\/\/arxiv.org\/abs\/1703.10593."},{"key":"ref_27","unstructured":"Liu, M.Y., Breuel, T., and Kautz, J. (2017, January 4\u20139). Unsupervised image-to-image translation networks. Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Isola, P., Zhu, J.Y., Zhou, T., and Efros, A.A. (2017, January 21\u201326). Image-to-image translation with conditional adversarial networks. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.632"},{"key":"ref_29","unstructured":"Taigman, Y., Polyak, A., and Wolf, L. (2019, August 20). Unsupervised Cross-Domain Image Generation. Available online: https:\/\/arxiv.org\/abs\/1611.02200."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Choi, Y., Choi, M., Kim, M., Ha, J.W., Kim, S., and Choo, J. (2018, January 18\u201323). Stargan: Unified generative adversarial networks for multi-domain image-to-image translation. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00916"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Ledig, C., Theis, L., Huszar, F., Caballero, J., Cunningham, A., Acosta, A., Aitken, A., Tejani, A., Totz, J., and Wang, Z. (2017, January 21\u201326). Photo-realistic single image super-resolution using a generative adversarial network. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.19"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Yeh, R.A., Chen, C., Lim, T.Y., Schwing, A.G., Hasegawa-Johnson, M., and Do, M.N. (2017, January 21\u201326). Semantic image inpainting with deep generative models. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.728"},{"key":"ref_33","unstructured":"Denton, E., Gross, S., and Fergus, R. (2019, August 20). Semi-Supervised Learning with Context-Conditional Generative Adversarial Networks. Available online: https:\/\/arxiv.org\/pdf\/1611.06430."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Pathak, D., Krahenbuhl, P., Donahue, J., Darrell, T., and Efros, A.A. (2016, January 27\u201330). Context encoders: Feature learning by inpainting. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.278"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Liu, W., Liu, X., Ma, H., and Cheng, P. (2017, January 23\u201327). Beyond Human-level License Plate Super-resolution with Progressive Vehicle Search and Domain Priori GAN. Proceedings of the 25th ACM international conference on Multimedia, Mountain View, CA, USA.","DOI":"10.1145\/3123266.3123422"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Inoue, N., Furuta, R., Yamasaki, T., and Aizawa, K. (2018, January 18\u201323). Cross-domain weakly-supervised object detection through progressive domain adaptation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00525"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Li, J., Liang, X., Wei, Y., Xu, T., Feng, J., and Yan, S. (2017, January 21\u201326). Perceptual generative adversarial networks for small object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.211"},{"key":"ref_38","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2019, August 20). Deep Residual Learning for Image Recognition. Available online: https:\/\/arxiv.org\/abs\/1512.03385."},{"key":"ref_39","unstructured":"Gulrajani, I., Ahmed, F., Arjovsky, M., Dumoulin, V., and Courville, A. (2019, August 20). Improved Training of Wasserstein Gans. Available online: https:\/\/arxiv.org\/abs\/1704.00028."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Zhu, H., Chen, X., Dai, W., Fu, K., Ye, Q., and Jiao, J. (2015, January 27\u201330). Orientation robust object detection in aerial images using deep convolutional neural network. Proceedings of the 2015 IEEE International Conference on Image Processing (ICIP), Quebec City, QC, Canada.","DOI":"10.1109\/ICIP.2015.7351502"},{"key":"ref_41","unstructured":"Redmon, J., and Farhadi, A. (2019, August 20). Yolov3: An Incremental Improvement. Available online: https:\/\/arxiv.org\/pdf\/1804.02767."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., and Piotr, D. (2017). Focal loss for dense object detection. IEEE Trans. Pattern Anal. Mach. Intell., 2999\u20133007.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Xia, G.S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., Datcu, M., Pelillo, M., and Zhang, L. (2018, January 18\u201323). Dota: A large-scale dataset for object detection in aerial images. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00418"}],"container-title":["ISPRS International Journal of Geo-Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2220-9964\/8\/9\/390\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:16:40Z","timestamp":1760188600000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2220-9964\/8\/9\/390"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,9,4]]},"references-count":43,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2019,9]]}},"alternative-id":["ijgi8090390"],"URL":"https:\/\/doi.org\/10.3390\/ijgi8090390","relation":{},"ISSN":["2220-9964"],"issn-type":[{"value":"2220-9964","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,9,4]]}}}