{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,28]],"date-time":"2025-11-28T12:38:04Z","timestamp":1764333484502,"version":"build-2065373602"},"reference-count":71,"publisher":"MDPI AG","issue":"21","license":[{"start":{"date-parts":[[2024,10,31]],"date-time":"2024-10-31T00:00:00Z","timestamp":1730332800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Alberta Major Innovation Fund"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Object detection architectures struggle to detect small objects across applications including remote sensing and autonomous vehicles. Specifically, for unmanned aerial vehicles, poor detection of small objects directly limits this technology\u2019s applicability. Objects both appear smaller than they are in large-scale images captured in aerial imagery and are represented by reduced information in high-altitude imagery. This paper presents a new architecture, CR-CNN, which predicts independent regions of interest from two unique prediction branches within the first stage of the network: a conventional R-CNN convolutional backbone and an hourglass backbone. Utilizing two independent sources within the first stage, our approach leads to an increase in successful predictions of regions that contain smaller objects. Anchor-based methods such as R-CNNs also utilize less than half the number of small objects compared to larger ones during training due to the poor intersection over union (IoU) scores between the generated anchors and the groundtruth\u2014further reducing their performance on small objects. Therefore, we also propose artificially inflating the IoU of smaller objects during training using a simple, size-based Gaussian multiplier\u2014leading to an increase in the quantity of small objects seen per training cycle based on an increase in the number of anchor\u2013object pairs during training. This architecture and training strategy led to improved detection overall on two challenging aerial-based datasets heavily composed of small objects while predicting fewer false positives compared to Mask R-CNN. These results suggest that while new and unique architectures will continue to play a part in advancing the field of object detection, the training methodologies and strategies used will also play a valuable role.<\/jats:p>","DOI":"10.3390\/rs16214065","type":"journal-article","created":{"date-parts":[[2024,10,31]],"date-time":"2024-10-31T09:57:36Z","timestamp":1730368656000},"page":"4065","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["A Heatmap-Supplemented R-CNN Trained Using an Inflated IoU for Small Object Detection"],"prefix":"10.3390","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6283-1449","authenticated-orcid":false,"given":"Justin","family":"Butler","sequence":"first","affiliation":[{"name":"Department of Electrical and Software Engineering, University of Calgary, Calgary, AB T2N 1N4, Canada"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5984-107X","authenticated-orcid":false,"given":"Henry","family":"Leung","sequence":"additional","affiliation":[{"name":"Department of Electrical and Software Engineering, University of Calgary, Calgary, AB T2N 1N4, Canada"}]}],"member":"1968","published-online":{"date-parts":[[2024,10,31]]},"reference":[{"key":"ref_1","unstructured":"Redmon, J., and Farhadi, A. (2018). YOLOv3: An Incremental Improvement. arXiv."},{"key":"ref_2","unstructured":"Bochkovskiy, A., Wang, C.Y., and Liao, H.Y.M. (2020). YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 11\u201314). SSD: Single Shot MultiBox Detector. Proceedings of the Computer Vision\u2014ECCV 2016, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"386","DOI":"10.1109\/TPAMI.2018.2844175","article-title":"Mask R-CNN","volume":"42","author":"He","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"114602","DOI":"10.1016\/j.eswa.2021.114602","article-title":"A survey and performance evaluation of deep learning methods for small object detection","volume":"172","author":"Liu","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"104471","DOI":"10.1016\/j.imavis.2022.104471","article-title":"Deep learning-based detection from the perspective of small or tiny objects: A survey","volume":"123","author":"Tong","year":"2022","journal-title":"Image Vis. Comput."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"4341","DOI":"10.1109\/TIP.2023.3297408","article-title":"Centralized Feature Pyramid for Object Detection","volume":"32","author":"Quan","year":"2023","journal-title":"IEEE Trans. Image Process."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Ren, Y., Zhu, C., and Xiao, S. (2018). Small Object Detection in Optical Remote Sensing Images via Modified Faster R-CNN. Appl. Sci., 8.","DOI":"10.3390\/app8050813"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"5512","DOI":"10.1109\/TGRS.2019.2899955","article-title":"R2 -CNN: Fast Tiny Object Detection in Large-Scale Remote Sensing Images","volume":"57","author":"Pang","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Hamaguchi, R., Fujita, A., Nemoto, K., Imaizumi, T., and Hikosaka, S. (2018, January 12\u201315). Effective Use of Dilated Convolutions for Segmenting Small Object Instances in Remote Sensing Imagery. Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Tahoe, NV, USA.","DOI":"10.1109\/WACV.2018.00162"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Zhu, Z., Liang, D., Zhang, S., Huang, X., Li, B., and Hu, S. (2016, January 27\u201330). Traffic-Sign Detection and Classification in the Wild. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.232"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Song, T., Sun, L., Xie, D., Sun, H., and Pu, S. (2018, January 8\u201314). Small-scale Pedestrian Detection Based on Topological Line Localization and Temporal Feature Aggregation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_33"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"3046","DOI":"10.1109\/TITS.2019.2923752","article-title":"Small-Scale Pedestrian Detection Based on Deep Neural Network","volume":"21","author":"Han","year":"2020","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Doll\u00e1r, P., and Zitnick, C.L. (2014, January 6\u201312). Microsoft COCO: Common Objects in Context. Proceedings of the Computer Vision\u2014ECCV 2014, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Kellenberger, B., Volpi, M., and Tuia, D. (2017, January 23\u201328). Fast animal detection in UAV images using convolutional neural networks. Proceedings of the 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, TX, USA.","DOI":"10.1109\/IGARSS.2017.8127090"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"352","DOI":"10.1108\/00022660510617077","article-title":"Monitoring of gas pipelines\u2014A civil UAV application","volume":"77","author":"Hausamann","year":"2005","journal-title":"Aircr. Eng. Aerosp. Technol."},{"key":"ref_17","unstructured":"Mejias, L., Corke, P., and Roberts, J. (2015). Outdoor Flight Testing of a Pole Inspection UAV Incorporating High-speed Vision. Field and Service Robotics: Results of the 9th International Conference, Springer International Publishing."},{"key":"ref_18","unstructured":"Doherty, P., and Rudol, P. (2007, January 10\u201313). A UAV Search and Rescue Scenario with Human Body Detection and Geolocalization. Proceedings of the AI 2007: Advances in Artificial Intelligence, Osnabr\u00fcck, Germany."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"103910","DOI":"10.1016\/j.imavis.2020.103910","article-title":"Recent advances in small object detection based on deep learning: A review","volume":"97","author":"Tong","year":"2020","journal-title":"Image Vis. Comput."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"238","DOI":"10.1109\/TAI.2021.3081057","article-title":"Robust Vehicle Detection in High-Resolution Aerial Images with Imbalanced Data","volume":"2","author":"Li","year":"2021","journal-title":"IEEE Trans. Artif. Intell."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Rabbi, J., Ray, N., Schubert, M., Chowdhury, S., and Chao, D. (2020). Small-Object Detection in Remote Sensing Images with End-to-End Edge-Enhanced GAN and Object Detector Network. Remote Sens., 12.","DOI":"10.20944\/preprints202003.0313.v2"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Wang, X., Wang, A., Yi, J., Song, Y., and Chehri, A. (2023). Small Object Detection Based on Deep Learning for Remote Sensing: A Comprehensive Review. Remote Sens., 15.","DOI":"10.3390\/rs15133265"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Courtrai, L., Pham, M.T., and Lef\u00e8vre, S. (2020). Small Object Detection in Remote Sensing Images Based on Super-Resolution with Auxiliary Generative Adversarial Networks. Remote Sens., 12.","DOI":"10.3390\/rs12193152"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Fu, Y., Zheng, C., Yuan, L., Chen, H., and Nie, J. (2021, January 13\u201315). Small Object Detection in Complex Large Scale Spatial Image by Concatenating SRGAN and Multi-Task WGAN. Proceedings of the 2021 7th International Conference on Big Data Computing and Communications (BigCom), Deqing, China.","DOI":"10.1109\/BigCom53800.2021.00017"},{"key":"ref_25","first-page":"106151E","article-title":"Feature-fused SSD: Fast detection for small objects","volume":"Volume 10615","author":"Yu","year":"2018","journal-title":"Proceedings of the Ninth International Conference on Graphic and Image Processing (ICGIP 2017), Qingdao, China, 14\u201316 October 2017"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Liang, Z., Shao, J., Zhang, D., and Gao, L. (2018, January 21\u201322). Small Object Detection Using Deep Feature Pyramid Networks. Proceedings of the Advances in Multimedia Information Processing\u2014PCM 2018, Hefei, China.","DOI":"10.1007\/978-3-030-00764-5_51"},{"key":"ref_27","unstructured":"Fu, C.Y., Liu, W., Ranga, A., Tyagi, A., and Berg, A.C. (2017). DSSD: Deconvolutional Single Shot Detector. arXiv."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"946","DOI":"10.1109\/TAI.2022.3178062","article-title":"Towards Disturbance Rejection in Feature Pyramid Network","volume":"4","author":"Hu","year":"2023","journal-title":"IEEE Trans. Artif. Intell."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Huang, W., Li, G., Chen, Q., Ju, M., and Qu, J. (2021). CF2PN: A Cross-Scale Feature Fusion Pyramid Network Based Remote Sensing Target Detection. Remote Sens., 13.","DOI":"10.3390\/rs13050847"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Singh, B., and Davis, L.S. (2018, January 18\u201322). An Analysis of Scale Invariance in Object Detection\u2014SNIP. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00377"},{"key":"ref_31","unstructured":"Bengio, S., Wallach, H., Larochelle, H., Grauman, K., Cesa-Bianchi, N., and Garnett, R. (2018). SNIPER: Efficient Multi-Scale Training. Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada, 8\u201313 December 2014, Curran Associates, Inc."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Liu, S., Qi, L., Qin, H., Shi, J., and Jia, J. (2018, January 18\u201322). Path Aggregation Network for Instance Segmentation. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"318","DOI":"10.1109\/TPAMI.2018.2858826","article-title":"Focal Loss for Dense Object Detection","volume":"42","author":"Lin","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Eggert, C., Brehm, S., Winschel, A., Zecha, D., and Lienhart, R. (2017, January 10\u201314). A closer look: Small object detection in faster R-CNN. Proceedings of the 2017 IEEE International Conference on Multimedia and Expo (ICME), Hong Kong, China.","DOI":"10.1109\/ICME.2017.8019550"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Kisantal, M., Wojna, Z., Murawski, J., Naruniec, J., and Cho, K. (2019). Augmentation for small object detection. arXiv.","DOI":"10.5121\/csit.2019.91713"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"305","DOI":"10.1016\/j.neucom.2019.07.073","article-title":"Scale adaptive image cropping for UAV object detection","volume":"366","author":"Zhou","year":"2019","journal-title":"Neurocomputing"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"7380","DOI":"10.1109\/TPAMI.2021.3119563","article-title":"Detection and Tracking Meet Drones Challenge","volume":"44","author":"Zhu","year":"2022","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Terrail, J.O.D., and Jurie, F. (2017, January 17\u201320). On the use of deep neural networks for the detection of small vehicles in ortho-images. Proceedings of the 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China.","DOI":"10.1109\/ICIP.2017.8297076"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Plastiras, G., Siddiqui, S., Kyrkou, C., and Theocharides, T. (September, January 31). Efficient Embedded Deep Neural-Network-based Object Detection Via Joint Quantization and Tiling. Proceedings of the 2020 2nd IEEE International Conference on Artificial Intelligence Circuits and Systems (AICAS), Genova, Italy.","DOI":"10.1109\/AICAS48895.2020.9073885"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Ozge Unel, F., Ozkalayci, B.O., and Cigla, C. (2019, January 15\u201320). The Power of Tiling for Small Object Detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, Long Beach, CA, USA.","DOI":"10.1109\/CVPRW.2019.00084"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"1895","DOI":"10.1109\/TIP.2022.3148874","article-title":"A General Gaussian Heatmap Label Assignment for Arbitrary-Oriented Object Detection","volume":"31","author":"Huang","year":"2022","journal-title":"IEEE Trans. Image Process."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"5602914","DOI":"10.1109\/TGRS.2023.3344195","article-title":"Dual Appearance-Aware Enhancement for Oriented Object Detection","volume":"62","author":"Gong","year":"2024","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Yao, Y., Cheng, G., Lang, C., Yuan, X., Xie, X., and Han, J. (2024). Hierarchical Mask Prompting and Robust Integrated Regression for Oriented Object Detection. IEEE Trans. Circuits Syst. Video Technol., 1.","DOI":"10.1109\/TCSVT.2024.3444795"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Tang, T., Zhou, S., Deng, Z., Zou, H., and Lei, L. (2017). Vehicle Detection in Aerial Images Based on Region Convolutional Neural Networks and Hard Negative Example Mining. Sensors, 17.","DOI":"10.3390\/s17020336"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Wu, Z., Suresh, K., Narayanan, P., Xu, H., Kwon, H., and Wang, Z. (November, January 27). Delving Into Robust Object Detection From Unmanned Aerial Vehicles: A Deep Nuisance Disentanglement Approach. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00129"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"4599","DOI":"10.1109\/TIP.2021.3073319","article-title":"GSDet: Object Detection in Aerial Images Based on Scale Reasoning","volume":"30","author":"Li","year":"2021","journal-title":"IEEE Trans. Image Process."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Weng, W., Wei, M., Ren, J., and Shen, F. (2024). Enhancing Aerial Object Detection with Selective Frequency Interaction Network. IEEE Trans. Artif. Intell., 1\u201312.","DOI":"10.1109\/TAI.2024.3381096"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Ammar, A., Koubaa, A., Ahmed, M., Saad, A., and Benjdira, B. (2021). Vehicle Detection from Aerial Images Using Deep Learning: A Comparative Study. Electronics, 10.","DOI":"10.3390\/electronics10070820"},{"key":"ref_50","unstructured":"Jocher, G., Chaurasia, A., Stoken, A., Borovec, J., NanoCode012, Kwon, Y., Tao, X., Michael, K., Fang, J. (2022). Ultralytics\/yolov5: v6.2\u2014YOLOv5 Classification Models, Apple M1, Reproducibility, ClearML and Deci.ai integrations. Zenodo."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., and Zagoruyko, S. (2020, January 23\u201328). End-to-End Object Detection with Transformers. Proceedings of the Computer Vision\u2014ECCV 2020, Glasgow, UK.","DOI":"10.1007\/978-3-030-58452-8_13"},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"104046","DOI":"10.1016\/j.imavis.2020.104046","article-title":"Deep learning-based object detection in low-altitude UAV datasets: A survey","volume":"104","author":"Mittal","year":"2020","journal-title":"Image Vis. Comput."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Robicquet, A., Sadeghian, A., Alahi, A., and Savarese, S. (2016, January 11\u201314). Learning Social Etiquette: Human Trajectory Understanding In Crowded Scenes. Proceedings of the Computer Vision\u2014ECCV 2016, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46484-8_33"},{"key":"ref_54","unstructured":"Kharuzhy, Y. (2019, December 01). Aerial-Cars-Dataset. Available online: https:\/\/github.com\/jekhor\/aerial-cars-dataset."},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Wang, J., Simeonova, S., and Shahbazi, M. (2019). Orientation- and Scale-Invariant Multi-Vehicle Detection and Tracking from Unmanned Aerial Videos. Remote Sens., 11.","DOI":"10.3390\/rs11182155"},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"1141","DOI":"10.1007\/s11263-019-01266-1","article-title":"The Unmanned Aerial Vehicle Benchmark: Object Detection, Tracking and Baseline","volume":"128","author":"Yu","year":"2020","journal-title":"Int. J. Comput. Vis."},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature Pyramid Networks for Object Detection. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_59","unstructured":"Cortes, C., Lawrence, N., Lee, D., Sugiyama, M., and Garnett, R. (2015). Attention-Based Models for Speech Recognition. Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA, 4\u20139 December 2017, Curran Associates, Inc."},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast R-CNN. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"642","DOI":"10.1007\/s11263-019-01204-1","article-title":"CornerNet: Detecting Objects as Paired Keypoints","volume":"128","author":"Law","year":"2020","journal-title":"Int. J. Comput. Vis."},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Duan, K., Bai, S., Xie, L., Qi, H., Huang, Q., and Tian, Q. (November, January 27). CenterNet: Keypoint Triplets for Object Detection. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00667"},{"key":"ref_63","unstructured":"Zhou, X., Wang, D., and Kr\u00e4henb\u00fchl, P. (2019). Objects as Points. arXiv."},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Newell, A., Yang, K., and Deng, J. (2016, January 11\u201314). Stacked Hourglass Networks for Human Pose Estimation. Proceedings of the Computer Vision\u2014ECCV 2016, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46484-8_29"},{"key":"ref_65","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1109\/TPAMI.2019.2929257","article-title":"OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields","volume":"43","author":"Cao","year":"2021","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_66","unstructured":"Insafutdinov, E. (2024, October 26). Towards Accurate Multi-Person Pose Estimation in the Wild. Available online: https:\/\/publikationen.sulb.uni-saarland.de\/handle\/20.500.11880\/31184."},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., and Savarese, S. (2019, January 15\u201320). Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00075"},{"key":"ref_68","doi-asserted-by":"crossref","unstructured":"Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., and Ren, D. (2020, January 7\u201312). Distance-IoU loss: Faster and better learning for bounding box regression. Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA.","DOI":"10.1609\/aaai.v34i07.6999"},{"key":"ref_69","doi-asserted-by":"crossref","first-page":"5175","DOI":"10.1109\/TIP.2020.2976856","article-title":"Small Object Augmentation of Urban Scenes for Real-Time Semantic Segmentation","volume":"29","author":"Yang","year":"2020","journal-title":"IEEE Trans. Image Process."},{"key":"ref_70","doi-asserted-by":"crossref","unstructured":"Dwibedi, D., Misra, I., and Hebert, M. (2017, January 22\u201329). Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.146"},{"key":"ref_71","unstructured":"Wu, Y., Kirillov, A., Massa, F., Lo, W.Y., and Girshick, R. (2020, January 01). Detectron2. Available online: https:\/\/github.com\/facebookresearch\/detectron2."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/21\/4065\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T16:25:33Z","timestamp":1760113533000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/21\/4065"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,10,31]]},"references-count":71,"journal-issue":{"issue":"21","published-online":{"date-parts":[[2024,11]]}},"alternative-id":["rs16214065"],"URL":"https:\/\/doi.org\/10.3390\/rs16214065","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2024,10,31]]}}}