{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T06:55:45Z","timestamp":1777704945702,"version":"3.51.4"},"reference-count":35,"publisher":"MDPI AG","issue":"23","license":[{"start":{"date-parts":[[2019,11,30]],"date-time":"2019-11-30T00:00:00Z","timestamp":1575072000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61663010,61563014"],"award-info":[{"award-number":["61663010,61563014"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Key Research and Development Project of Jiangxi Province, China","award":["20171BBH80024"],"award-info":[{"award-number":["20171BBH80024"]}]},{"name":"Science and Technology Research Foundation of Jiangxi Provincial Education Department, China","award":["GJJ180314"],"award-info":[{"award-number":["GJJ180314"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>In recent years, the research on optical remote sensing images has received greater and greater attention. Object detection, as one of the most challenging tasks in the area of remote sensing, has been remarkably promoted by convolutional neural network (CNN)-based methods like You Only Look Once (YOLO) and Faster R-CNN. However, due to the complexity of backgrounds and the distinctive object distribution, directly applying these general object detection methods to the remote sensing object detection usually renders poor performance. To tackle this problem, a highly efficient and robust framework based on YOLO is proposed. We devise and integrate VaryBlock to the architecture which effectively offsets some of the information loss caused by downsampling. In addition, some techniques are utilized to facilitate the performance and to avoid overfitting. Experimental results show that our proposed method can enormously improve the mean average precision by a large margin on the NWPU VHR-10 dataset.<\/jats:p>","DOI":"10.3390\/s19235284","type":"journal-article","created":{"date-parts":[[2019,12,2]],"date-time":"2019-12-02T10:50:45Z","timestamp":1575283845000},"page":"5284","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":12,"title":["VaryBlock: A Novel Approach for Object Detection in Remote Sensed Images"],"prefix":"10.3390","volume":"19","author":[{"given":"Heng","family":"Zhang","sequence":"first","affiliation":[{"name":"School of Information Engineering, East China Jiaotong University, Nanchang 330013, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jiayu","family":"Wu","sequence":"additional","affiliation":[{"name":"School of Information Engineering, East China Jiaotong University, Nanchang 330013, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yanli","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Information Engineering, East China Jiaotong University, Nanchang 330013, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jia","family":"Yu","sequence":"additional","affiliation":[{"name":"School of Information Engineering, East China Jiaotong University, Nanchang 330013, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2019,11,30]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1007\/s11036-017-0932-8","article-title":"Brain Intelligence: Go beyond Artificial Intelligence","volume":"23","author":"Lu","year":"2018","journal-title":"Mobile Netw. Appl."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, Washington, DC, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"142","DOI":"10.1109\/TPAMI.2015.2437384","article-title":"Region-Based Convolutional Networks for Accurate Object Detection and Segmentation","volume":"38","author":"Girshick","year":"2016","journal-title":"IEEE Trans. Pattern Analy. Mach. Intell."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"14680","DOI":"10.3390\/rs71114680","article-title":"Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery","volume":"7","author":"Hu","year":"2015","journal-title":"Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"7405","DOI":"10.1109\/TGRS.2016.2601622","article-title":"Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images","volume":"54","author":"Cheng","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Radovic, M., Adarkwa, O., and Wang, Q. (2017). Object Recognition in Aerial Images Using Convolutional Neural Networks. J. Imaging, 3.","DOI":"10.3390\/jimaging3020021"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Chen, F., Ren, R., Van de Voorde, T., Xu, W., Zhou, G., and Zhou, Y. (2018). Fast Automatic Airport Detection in Remote Sensing Images Using Convolutional Neural Networks. Remote Sens., 10.","DOI":"10.3390\/rs10030443"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Sakai, Y., Lu, H., Tan, J.K., and Kim, H. (2018, January 22\u201324). Environment Recognition for Electric Wheelchair Based on YOLOv2. Proceedings of the 3rd International Conference on Biomedical Signal and Image Processing, New York, NY, USA.","DOI":"10.1145\/3278229.3278231"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1016\/j.future.2018.02.036","article-title":"Automatic road detection system for an air-land amphibious car drone","volume":"85","author":"Li","year":"2018","journal-title":"Future Gener. Comput. Syst."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"2486","DOI":"10.1109\/TGRS.2016.2645610","article-title":"Accurate Object Localization in Remote Sensing Images Based on Convolutional Neural Networks","volume":"55","author":"Long","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Yang, X., Sun, H., Fu, K., Yang, J., Sun, X., Yan, M., and Guo, Z. (2018). Automatic Ship Detection in Remote Sensing Images from Google Earth of Complex Scenes Based on Multiscale Rotation Dense Feature Pyramid Networks. Remote Sens., 10.","DOI":"10.3390\/rs10010132"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Chen, Z., Zhang, T., and Ouyang, C. (2018). End-to-End Airplane Detection Using Transfer Learning in Remote Sensing Images. Remote Sens., 10.","DOI":"10.3390\/rs10010139"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Xu, Y., Zhu, M., Li, S., Feng, H., Ma, S., and Che, J. (2018). End-to-End Airport Detection in Remote Sensing Images Combining Cascade Region Proposal Networks and Multi-Threshold Detection Networks. Remote Sens., 10.","DOI":"10.3390\/rs10101516"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Shen, Z., Liu, Z., Li, J., Jiang, Y., Chen, Y., and Xue, X. (2017, January 22\u201329). DSOD: Learning Deeply Supervised Object Detectors from Scratch. Proceedings of the 2017 IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.212"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_16","unstructured":"Redmon, J., and Farhadi, A. (2018). YOLOv3: An Incremental Improvement. arXiv."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C., and Berg, A.C. (2016, January 11\u201314). SSD: Single Shot MultiBox Detector. Proceedings of the 14th European Conference, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Zhang, S., Wen, L., Bian, X., Lei, Z., and Li, S.Z. (2018, January 18\u201323). Single-Shot Refinement Neural Network for Object Detection. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00442"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Law, H., and Deng, J. (2019). CornerNet: Detecting Objects as Paired Keypoints. Int. J. Comput. Vis.","DOI":"10.1007\/s11263-019-01204-1"},{"key":"ref_20","unstructured":"Everingham, M., Zisserman, A., Williams, C., and Gool, L.V. (2019, November 28). The PASCAL Visual Object Classes Challenge 2006 Results. Available online: http:\/\/www.pascal-network.org\/challenges\/VOC\/voc2006\/results.pdf."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The Pascal Visual Object Classes (VOC) Challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis."},{"key":"ref_22","unstructured":"Fu, C.Y., Liu, W., Ranga, A., Tyagi, A., and Berg, A.C. (2017). DSSD: Deconvolutional Single Shot Detector. arXiv."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, Faster, Stronger. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2014, January 6\u201312). Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. Proceedings of the 13th European Conference on Computer Vision Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10578-9_23"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast R-CNN. Proceedings of the 2015 IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Tang, C., Ling, Y., Yang, X., Jin, W., and Zheng, C. (2018). Multi-View Object Detection Based on Deep Learning. Appl. Sci., 8.","DOI":"10.3390\/app8091423"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Yang, G., Yang, J., Sheng, W., Junior, F.E.F., and Li, S. (2018). Convolutional Neural Network-Based Embarrassing Situation Detection under Camera for Social Robot in Smart Homes. Sensors, 18.","DOI":"10.3390\/s18051530"},{"key":"ref_29","unstructured":"Lin, M., Chen, Q., and Yan, S. (2013). Network In Network. arXiv."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Zhu, J., Park, T., Isola, P., and Efros, A.A. (2017, January 22\u201329). Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. Proceedings of the 2017 IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.244"},{"key":"ref_31","unstructured":"Arthur, D., and Vassilvitskii, S. (2007, January 7\u20139). K-means++: The Advantages of Careful Seeding. Proceedings of the 18th Annual ACM-SIAM Aymposium on Discrete Algorithms, Philadelphia, PA, USA."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.isprsjprs.2016.03.014","article-title":"A survey on object detection in optical remote sensing images","volume":"117","author":"Cheng","year":"2016","journal-title":"ISPRS J. Photogram. Remote Sens."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"508","DOI":"10.1109\/TCSVT.2014.2358031","article-title":"Efficient Feature Selection and Classification for Vehicle Detection","volume":"25","author":"Wen","year":"2015","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"2337","DOI":"10.1109\/TGRS.2017.2778300","article-title":"Rotation-Insensitive and Context-Augmented Object Detection in Remote Sensing Images","volume":"56","author":"Li","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Han, X., Zhong, Y., and Zhang, L. (2017). An Efficient and Robust Integrated Geospatial Object Detection Framework for High Spatial Resolution Remote Sensing Imagery. Remote Sens., 9.","DOI":"10.3390\/rs9070666"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/19\/23\/5284\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:38:58Z","timestamp":1760189938000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/19\/23\/5284"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,11,30]]},"references-count":35,"journal-issue":{"issue":"23","published-online":{"date-parts":[[2019,12]]}},"alternative-id":["s19235284"],"URL":"https:\/\/doi.org\/10.3390\/s19235284","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,11,30]]}}}