{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,27]],"date-time":"2026-05-27T12:01:16Z","timestamp":1779883276677,"version":"3.53.1"},"reference-count":74,"publisher":"MDPI AG","issue":"18","license":[{"start":{"date-parts":[[2021,9,10]],"date-time":"2021-09-10T00:00:00Z","timestamp":1631232000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Detecting small objects (e.g., manhole covers, license plates, and roadside milestones) in urban images is a long-standing challenge mainly due to the scale of small object and background clutter. Although convolution neural network (CNN)-based methods have made significant progress and achieved impressive results in generic object detection, the problem of small object detection remains unsolved. To address this challenge, in this study we developed an end-to-end network architecture that has three significant characteristics compared to previous works. First, we designed a backbone network module, namely Reduced Downsampling Network (RD-Net), to extract informative feature representations with high spatial resolutions and preserve local information for small objects. Second, we introduced an Adjustable Sample Selection (ADSS) module which frees the Intersection-over-Union (IoU) threshold hyperparameters and defines positive and negative training samples based on statistical characteristics between generated anchors and ground reference bounding boxes. Third, we incorporated the generalized Intersection-over-Union (GIoU) loss for bounding box regression, which efficiently bridges the gap between distance-based optimization loss and area-based evaluation metrics. We demonstrated the effectiveness of our method by performing extensive experiments on the public Urban Element Detection (UED) dataset acquired by Mobile Mapping Systems (MMS). The Average Precision (AP) of the proposed method was 81.71%, representing an improvement of 1.2% compared with the popular detection framework Faster R-CNN.<\/jats:p>","DOI":"10.3390\/rs13183608","type":"journal-article","created":{"date-parts":[[2021,9,12]],"date-time":"2021-09-12T21:48:01Z","timestamp":1631483281000},"page":"3608","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":13,"title":["Learning Adjustable Reduced Downsampling Network for Small Object Detection in Urban Environments"],"prefix":"10.3390","volume":"13","author":[{"given":"Huijie","family":"Zhang","sequence":"first","affiliation":[{"name":"Department of Geography, University of California, Santa Barbara, CA 93106, USA"},{"name":"Department of Geography, San Diego State University, San Diego, CA 92182, USA"},{"name":"Center for Complex Human-Environment Systems, San Diego State University, San Diego, CA 92182, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Li","family":"An","sequence":"additional","affiliation":[{"name":"Department of Geography, San Diego State University, San Diego, CA 92182, USA"},{"name":"Center for Complex Human-Environment Systems, San Diego State University, San Diego, CA 92182, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Vena W.","family":"Chu","sequence":"additional","affiliation":[{"name":"Department of Geography, University of California, Santa Barbara, CA 93106, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Douglas A.","family":"Stow","sequence":"additional","affiliation":[{"name":"Department of Geography, San Diego State University, San Diego, CA 92182, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaobai","family":"Liu","sequence":"additional","affiliation":[{"name":"Department of Computer Science, San Diego State University, San Diego, CA 92182, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Qinghua","family":"Ding","sequence":"additional","affiliation":[{"name":"Department of Geography, University of California, Santa Barbara, CA 93106, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2021,9,10]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1016\/j.isprsjprs.2013.10.008","article-title":"An algorithm for automatic detection of pole-like street furniture objects from Mobile Laser Scanner point clouds","volume":"87","author":"Cabo","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"292","DOI":"10.1109\/TITS.2016.2565698","article-title":"Rapid Localization and Extraction of Street Light Poles in Mobile LiDAR Point Clouds: A Supervoxel-Based Approach","volume":"18","author":"Wu","year":"2016","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"12680","DOI":"10.3390\/rs71012680","article-title":"Automatic Detection and Classification of Pole-Like Objects in Urban Point Cloud Data Using an Anomaly Detection Algorithm","volume":"7","author":"Alonso","year":"2015","journal-title":"Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1016\/j.isprsjprs.2016.07.009","article-title":"A dual growing method for the automatic extraction of individual trees from mobile laser scanning data","volume":"120","author":"Li","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"996","DOI":"10.1109\/TGRS.2016.2617819","article-title":"Road curb extraction from mobile LiDAR point clouds","volume":"55","author":"Xu","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.isprsjprs.2018.11.012","article-title":"Efficient and robust lane marking extraction from mobile lidar point clouds","volume":"147","author":"Jung","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1572","DOI":"10.1109\/JSTARS.2019.2904514","article-title":"Generation of Horizontally Curved Driving Lines in HD Maps Using Mobile Laser Scanning Point Clouds","volume":"12","author":"Ma","year":"2019","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"766","DOI":"10.1109\/LGRS.2012.2222342","article-title":"Semiautomated Building Facade Footprint Extraction From Mobile LiDAR Point Clouds","volume":"10","author":"Yang","year":"2012","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"453","DOI":"10.1016\/j.isprsjprs.2018.08.009","article-title":"Extraction of residential building instances in suburban areas from mobile LiDAR data","volume":"144","author":"Xia","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_10","unstructured":"Shen, X. (2019). A survey of Object Classification and Detection based on 2D\/3D data. arXiv."},{"key":"ref_11","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015, January 7\u201312). Faster r-cnn: Towards real-time object detection with region proposal networks. Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, faster, stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_15","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., and Berg, A.C. (2016, January 8\u201316). Ssd: Single shot multibox detector. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_17","first-page":"1097","article-title":"Imagenet classification with deep convolutional neural networks","volume":"25","author":"Krizhevsky","year":"2012","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_18","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Xie, S., Girshick, R., Doll\u00e1r, P., Tu, Z., and He, K. (2017, January 21\u201326). Aggregated residual transformations for deep neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.634"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"8445","DOI":"10.1109\/TGRS.2019.2921111","article-title":"Detecting Small Objects in Urban Settings Using SlimNet Model","volume":"57","author":"Yang","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"3258","DOI":"10.1109\/TITS.2015.2413812","article-title":"Automated detection of urban road manhole covers using mobile laser scanning data","volume":"16","author":"Yu","year":"2015","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1042","DOI":"10.1080\/2150704X.2014.994716","article-title":"Automated extraction of manhole covers using mobile LiDAR data","volume":"5","author":"Guan","year":"2014","journal-title":"Remote Sens. Lett."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"2076","DOI":"10.1109\/TITS.2017.2728680","article-title":"Automatic pavement object detection using superpixel segmentation combined with conditional random field","volume":"19","author":"Sultani","year":"2017","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_27","unstructured":"Niigaki, H., Shimamura, J., and Morimoto, M. (2012, January 11\u201315). Circular object detection based on separability and uniformity of feature distributions using Bhattacharyya coefficient. Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), Tsukuba, Japan."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1802","DOI":"10.1109\/JSTARS.2015.2504401","article-title":"Detection of manhole covers in high-resolution aerial images of urban areas by combining two methods","volume":"9","author":"Pasquet","year":"2016","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Chong, Z., and Yang, L. (2016, January 8\u20139). An Algorithm for Automatic Recognition of Manhole Covers Based on MMS Images. Proceedings of the Chinese Conference on Image and Graphics Technologies, Beijing, China.","DOI":"10.1007\/978-981-10-2260-9_4"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Wei, Z., Yang, M., Wang, L., Ma, H., Chen, X., and Zhong, R. (2019). Customized mobile LiDAR system for manhole cover detection and identification. Sensors, 19.","DOI":"10.3390\/s19102422"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"11203","DOI":"10.1109\/ACCESS.2020.3047929","article-title":"Automated License Plate Recognition: A Survey on Methods and Techniques","volume":"9","author":"Shashirangana","year":"2020","journal-title":"IEEE Access"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1109\/TCSVT.2012.2203741","article-title":"Automatic license plate recognition (ALPR): A state-of-the-art review","volume":"23","author":"Du","year":"2012","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_33","unstructured":"Hongliang, B., and Changping, L. (2004, January 26). A hybrid license plate extraction method based on edge statistics and morphology. Proceedings of the 17th International Conference on Pattern Recognition, Cambridge, UK."},{"key":"ref_34","unstructured":"Jia, W., Zhang, H., He, X., and Piccardi, M. (2005, January 16). Mean shift for accurate license plate localization. Proceedings of the 2005 IEEE Intelligent Transportation Systems, Vienna, Austria."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Deb, K., and Jo, K.-H. (2008, January 14\u201317). HSI color based vehicle license plate detection. Proceedings of the 2008 International Conference on Control, Automation and Systems, Seoul, Korea.","DOI":"10.1109\/ICCAS.2008.4694589"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"552","DOI":"10.1109\/TVT.2012.2226218","article-title":"Application-oriented license plate recognition","volume":"62","author":"Hsu","year":"2012","journal-title":"IEEE Trans. Veh. Technol."},{"key":"ref_37","first-page":"7","article-title":"Study on positioning technology of mileage piles based on multi-sensor information fusion","volume":"10","author":"Ma","year":"2016","journal-title":"J. Highw. Transp. Res. Dev."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","article-title":"Spatial pyramid pooling in deep convolutional networks for visual recognition","volume":"37","author":"He","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast r-cnn. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_41","unstructured":"Dai, J., Li, Y., He, K., and Sun, J. (2016). R-fcn: Object detection via region-based fully convolutional networks. arXiv."},{"key":"ref_42","unstructured":"Li, Z., Peng, C., Yu, G., Zhang, X., Deng, Y., and Sun, J. (2017). Light-head r-cnn: In defense of two-stage object detector. arXiv."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Dai, J., Qi, H., Xiong, Y., Li, Y., Zhang, G., Hu, H., and Wei, Y. (2017, January 22\u201329). Deformable convolutional networks. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.89"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Doll\u00e1r, P., and Girshick, R. (2017, January 22\u201329). Mask r-cnn. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.322"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Cai, Z., and Vasconcelos, N. (2018, January 18\u201323). Cascade r-cnn: Delving into high quality object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00644"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Goyal, P., Girshick, R., He, K., and Doll\u00e1r, P. (2017, January 22\u201329). Focal loss for dense object detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_47","unstructured":"Fu, C.-Y., Liu, W., Ranga, A., Tyagi, A., and Berg, A.C. (2017). Dssd: Deconvolutional single shot detector. arXiv."},{"key":"ref_48","unstructured":"Li, Y., Chen, Y., Wang, N., and Zhang, Z. (November, January 27). Scale-aware trident networks for object detection. Proceedings of the IEEE International Conference on Computer Vision, Seoul, Korea."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"103910","DOI":"10.1016\/j.imavis.2020.103910","article-title":"Recent advances in small object detection based on deep learning: A review","volume":"97","author":"Tong","year":"2020","journal-title":"Image Vis. Comput."},{"key":"ref_50","unstructured":"Zhu, Y., Urtasun, R., Salakhutdinov, R., and Fidler, S. (2015, January 7\u201312). segdeepm: Exploiting segmentation and context in deep neural networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Bell, S., Zitnick, C.L., Bala, K., and Girshick, R. (2016, January 27\u201330). Inside-outside net: Detecting objects in context with skip pooling and recurrent neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.314"},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Zagoruyko, S., Lerer, A., Lin, T.-Y., Pinheiro, P.O., Gross, S., Chintala, S., and Doll\u00e1r, P. (2016). A multipath network for object detection. arXiv.","DOI":"10.5244\/C.30.15"},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Liu, Y., Wang, R., Shan, S., and Chen, X. (2018, January 18\u201323). Structure inference net: Object detection using scene-level context and instance-level relationships. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00730"},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The pascal visual object classes (voc) challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis."},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Doll\u00e1r, P., and Zitnick, C.L. (2014, January 6\u201312). Microsoft coco: Common objects in context. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"480","DOI":"10.1080\/1573062X.2019.1687743","article-title":"Automated localization of urban drainage infrastructure from public-access street-level images","volume":"16","author":"Boller","year":"2019","journal-title":"Urban Water J."},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Hebbalaguppe, R., Garg, G., Hassan, E., Ghosh, H., and Verma, A. (2017, January 24\u201331). Telecom Inventory management via object recognition and localisation on Google Street View Images. Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), Santa Rosa, CA, USA.","DOI":"10.1109\/WACV.2017.86"},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Liu, W., Cheng, D., Yin, P., Yang, M., Li, E., Xie, M., and Zhang, L. (2019). Small manhole cover detection in remote sensing imagery with deep convolutional neural networks. ISPRS Int. J. Geo-Inform., 8.","DOI":"10.3390\/ijgi8010049"},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1016\/j.imavis.2018.02.002","article-title":"Reading car license plates using deep neural networks","volume":"72","author":"Li","year":"2018","journal-title":"Image Vis. Comput."},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Montazzolli, S., and Jung, C. (2017, January 17\u201320). Real-time brazilian license plate detection and recognition using deep convolutional neural networks. Proceedings of the 2017 30th SIBGRAPI conference on graphics, patterns and images (SIBGRAPI), Niteroi, Brazil.","DOI":"10.1109\/SIBGRAPI.2017.14"},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"507","DOI":"10.1109\/TITS.2017.2784093","article-title":"A new CNN-based method for multi-directional car license plate detection","volume":"19","author":"Xie","year":"2018","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Laroca, R., Severo, E., Zanlorensi, L.A., Oliveira, L.S., Gon\u00e7alves, G.R., Schwartz, W.R., and Menotti, D. (2018, January 8\u201313). A robust real-time automatic license plate recognition based on the YOLO detector. Proceedings of the 2018 International Joint Conference on Neural Networks (IJCNN), Rio de Janeiro, Brazil.","DOI":"10.1109\/IJCNN.2018.8489629"},{"key":"ref_63","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1016\/j.imavis.2019.04.007","article-title":"Automatic License Plate Recognition via sliding-window darknet-YOLO deep learning","volume":"87","author":"Hendry","year":"2019","journal-title":"Image Vis. Comput."},{"key":"ref_64","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1016\/j.eswa.2019.06.036","article-title":"A two-stage deep neural network for multi-norm license plate detection and recognition","volume":"136","author":"Kessentini","year":"2019","journal-title":"Expert Syst. Appl."},{"key":"ref_65","doi-asserted-by":"crossref","first-page":"3377","DOI":"10.1109\/TGRS.2019.2954328","article-title":"FMSSD: Feature-merged single-shot detection for multiscale objects in large-scale remote sensing imagery","volume":"58","author":"Wang","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_66","doi-asserted-by":"crossref","unstructured":"Li, Z., Peng, C., Yu, G., Zhang, X., Deng, Y., and Sun, J. (2018, January 8\u201314). Detnet: Design backbone for object detection. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01240-3_21"},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Zhu, R., Zhang, S., Wang, X., Wen, L., Shi, H., Bo, L., and Mei, T. (2019, January 15\u201320). ScratchDet: Training single-shot object detectors from scratch. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00237"},{"key":"ref_68","doi-asserted-by":"crossref","unstructured":"Zhang, S., Chi, C., Yao, Y., Lei, Z., and Li, S.Z. (2020, January 13\u201319). Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00978"},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., and Savarese, S. (2019, January 15\u201320). Generalized intersection over union: A metric and a loss for bounding box regression. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00075"},{"key":"ref_70","unstructured":"Wu, Y., Kirillov, A., Massa, F., Lo, W.-Y., and Girshick, R. (2020, May 06). Detectron2. Available online: https:\/\/research.fb.com\/wp-content\/uploads\/2019\/12\/4.-detectron2.pdf."},{"key":"ref_71","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Li, F.-F. (2019, January 15\u201320). Imagenet: A large-scale hierarchical image database. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA."},{"key":"ref_72","doi-asserted-by":"crossref","unstructured":"Mhaskar, H., Liao, Q., and Poggio, T. (2017, January 4\u20139). When and why are deep networks better than shallow ones?. Proceedings of the AAAI Conference on Artificial Intelligence, San Francisco, CA, USA.","DOI":"10.1609\/aaai.v31i1.10913"},{"key":"ref_73","doi-asserted-by":"crossref","first-page":"2104","DOI":"10.1109\/TGRS.2019.2953119","article-title":"Object Detection in High Resolution Remote Sensing Imagery Based on Convolutional Neural Networks with Suitable Object Scale Features","volume":"58","author":"Dong","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_74","doi-asserted-by":"crossref","unstructured":"Ren, Y., Zhu, C., and Xiao, S. (2018). Small object detection in optical remote sensing images via modified faster R-CNN. Appl. Sci., 8.","DOI":"10.3390\/app8050813"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/18\/3608\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:00:14Z","timestamp":1760166014000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/18\/3608"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,9,10]]},"references-count":74,"journal-issue":{"issue":"18","published-online":{"date-parts":[[2021,9]]}},"alternative-id":["rs13183608"],"URL":"https:\/\/doi.org\/10.3390\/rs13183608","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,9,10]]}}}