{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,17]],"date-time":"2026-04-17T16:45:50Z","timestamp":1776444350119,"version":"3.51.2"},"reference-count":65,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2021,4,25]],"date-time":"2021-04-25T00:00:00Z","timestamp":1619308800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Tracking objects across multiple video frames is a challenging task due to several difficult issues such as occlusions, background clutter, lighting as well as object and camera view-point variations, which directly affect the object detection. These aspects are even more emphasized when analyzing unmanned aerial vehicles (UAV) based images, where the vehicle movement can also impact the image quality. A common strategy employed to address these issues is to analyze the input images at different scales to obtain as much information as possible to correctly detect and track the objects across video sequences. Following this rationale, in this paper, we introduce a simple yet effective novel multi-stream (MS) architecture, where different kernel sizes are applied to each stream to simulate a multi-scale image analysis. The proposed architecture is then used as backbone for the well-known Faster-R-CNN pipeline, defining a MS-Faster R-CNN object detector that consistently detects objects in video sequences. Subsequently, this detector is jointly used with the Simple Online and Real-time Tracking with a Deep Association Metric (Deep SORT) algorithm to achieve real-time tracking capabilities on UAV images. To assess the presented architecture, extensive experiments were performed on the UMCD, UAVDT, UAV20L, and UAV123 datasets. The presented pipeline achieved state-of-the-art performance, confirming that the proposed multi-stream method can correctly emulate the robust multi-scale image analysis paradigm.<\/jats:p>","DOI":"10.3390\/rs13091670","type":"journal-article","created":{"date-parts":[[2021,4,25]],"date-time":"2021-04-25T22:31:39Z","timestamp":1619389899000},"page":"1670","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":92,"title":["MS-Faster R-CNN: Multi-Stream Backbone for Improved Faster R-CNN Object Detection and Aerial Tracking from UAV Images"],"prefix":"10.3390","volume":"13","author":[{"given":"Danilo","family":"Avola","sequence":"first","affiliation":[{"name":"Department of Computer Science, Sapienza University, 00198 Rome, Italy"}]},{"given":"Luigi","family":"Cinque","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Sapienza University, 00198 Rome, Italy"}]},{"given":"Anxhelo","family":"Diko","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Sapienza University, 00198 Rome, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8111-9120","authenticated-orcid":false,"given":"Alessio","family":"Fagioli","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Sapienza University, 00198 Rome, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8425-6892","authenticated-orcid":false,"given":"Gian Luca","family":"Foresti","sequence":"additional","affiliation":[{"name":"Department of Mathematics, Computer Science and Physics, University of Udine, 33100 Udine, Italy"}]},{"given":"Alessio","family":"Mecca","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Sapienza University, 00198 Rome, Italy"}]},{"given":"Daniele","family":"Pannone","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Sapienza University, 00198 Rome, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5305-1520","authenticated-orcid":false,"given":"Claudio","family":"Piciarelli","sequence":"additional","affiliation":[{"name":"Department of Mathematics, Computer Science and Physics, University of Udine, 33100 Udine, Italy"}]}],"member":"1968","published-online":{"date-parts":[[2021,4,25]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Avola, D., Cinque, L., and Pannone, D. (2020). Design of a 3D Platform for Immersive Neurocognitive Rehabilitation. Information, 11.","DOI":"10.3390\/info11030134"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"102509","DOI":"10.1016\/j.ijhcs.2020.102509","article-title":"The impact of serious games with humanoid robots on mild cognitive impairment older adults","volume":"145","author":"Manca","year":"2021","journal-title":"Int. J. Hum. Comput. Stud."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"24955","DOI":"10.1007\/s11042-018-5730-1","article-title":"VRheab: A fully immersive motor rehabilitation system based on recurrent neural network","volume":"77","author":"Avola","year":"2018","journal-title":"Multimed. Tools Appl."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Ladakis, I., Kilintzis, V., Xanthopoulou, D., and Chouvarda, I. (2021, January 11\u201313). Virtual Reality and Serious Games for Stress Reduction with Application in Work Environments. Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies\u2013Volume 5: HEALTHINF, Online Streaming.","DOI":"10.5220\/0010300905410548"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1511","DOI":"10.1109\/TNSRE.2019.2926786","article-title":"Multipurpose virtual reality environment for biomedical and health applications","volume":"27","author":"Torner","year":"2019","journal-title":"IEEE Trans. Neural Syst. Rehabil. Eng."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Avola, D., Cinque, L., Foresti, G.L., Mercuri, C., and Pannone, D. (2016, January 24\u201326). A Practical Framework for the Development of Augmented Reality Applications by Using ArUco Markers. Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods, Rome, Italy.","DOI":"10.5220\/0005755806450654"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"3798","DOI":"10.1109\/ACCESS.2020.3047698","article-title":"Dynamic Pose Tracking Performance Evaluation of HTC Vive Virtual Reality System","volume":"9","author":"Ikbal","year":"2021","journal-title":"IEEE Access"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1080\/17538947.2020.1733680","article-title":"Three-dimensional CityGML building models in mobile augmented reality: A smartphone-based pose tracking system","volume":"14","author":"Blut","year":"2021","journal-title":"Int. J. Digit. Earth"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"9584","DOI":"10.1109\/ACCESS.2021.3049798","article-title":"Quality of Experience Comparison of Stereoscopic 3D Videos in Different Projection Devices: Flat Screen, Panoramic Screen and Virtual Reality Headset","volume":"9","author":"Choy","year":"2021","journal-title":"IEEE Access"},{"key":"ref_10","first-page":"1","article-title":"Applications of virtual and augmented reality in biomedical imaging","volume":"43","author":"Izard","year":"2019","journal-title":"J. Med. Syst."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Avola, D., Cinque, L., Foresti, G.L., and Pannone, D. (2019, January 9\u201311). Automatic Deception Detection in RGB Videos Using Facial Action Units. Proceedings of the 13th International Conference on Distributed Smart Cameras, Trento, Italy.","DOI":"10.1145\/3349801.3349806"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"114341","DOI":"10.1016\/j.eswa.2020.114341","article-title":"Deception in the eyes of deceiver: A computer vision and machine learning based automated deception detection","volume":"169","author":"Khan","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"455","DOI":"10.1016\/j.patrec.2020.08.014","article-title":"LieToMe: Preliminary study on hand gestures for deception detection via Fisher-LSTM","volume":"138","author":"Avola","year":"2020","journal-title":"Pattern Recognit. Lett."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Wu, Z., Singh, B., Davis, L., and Subrahmanian, V. (2018, January 2\u20137). Deception detection in videos. Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, LA, USA.","DOI":"10.1609\/aaai.v32i1.11502"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Avola, D., Cinque, L., Foresti, G.L., and Pannone, D. (2019, January 16\u201318). Visual Cryptography for Detecting Hidden Targets by Small-Scale Robots. Proceedings of the Pattern Recognition Applications and Methods, Funchal, Madeira, Portugal.","DOI":"10.1007\/978-3-030-05499-1_10"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"457","DOI":"10.1007\/s41315-019-00107-1","article-title":"A computer vision and artificial intelligence based cost-effective object sensing robot","volume":"3","author":"Roy","year":"2019","journal-title":"Int. J. Intell. Robot. Appl."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"18387","DOI":"10.1007\/s11042-020-08758-0","article-title":"Homography vs similarity transformation in aerial mosaicking: Which is the best at different altitudes?","volume":"79","author":"Avola","year":"2020","journal-title":"Multimed. Tools Appl."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1351","DOI":"10.1109\/LRA.2019.2895272","article-title":"Autonomous Navigation for Unmanned Underwater Vehicles: Real-Time Experiments Using Computer Vision","volume":"4","author":"Manzanilla","year":"2019","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"72","DOI":"10.1016\/j.foodcont.2018.04.037","article-title":"Robotics and computer vision techniques combined with non-invasive consumer biometrics to assess quality traits from beer foamability using machine learning: A potential for artificial intelligence applications","volume":"92","author":"Viejo","year":"2018","journal-title":"Food Control"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Lauterbach, H.A., Koch, C.B., Hess, R., Eck, D., Schilling, K., and N\u00fcchter, A. (2019, January 2\u20134). The Eins3D project\u2014Instantaneous UAV-Based 3D Mapping for Search and Rescue Applications. Proceedings of the 2019 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR), W\u00fcrzburg, Germany.","DOI":"10.1109\/SSRR.2019.8848972"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Ruetten, L., Regis, P.A., Feil-Seifer, D., and Sengupta, S. (2020, January 6\u20138). Area-Optimized UAV Swarm Network for Search and Rescue Operations. Proceedings of the 2020 10th Annual Computing and Communication Workshop and Conference (CCWC), Las Vegas, NV, USA.","DOI":"10.1109\/CCWC47524.2020.9031197"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"55817","DOI":"10.1109\/ACCESS.2019.2912306","article-title":"Lsar: Multi-uav collaboration for search and rescue missions","volume":"7","author":"Alotaibi","year":"2019","journal-title":"IEEE Access"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"7053","DOI":"10.1109\/TGRS.2017.2739133","article-title":"Quasi-polar-based FFBP algorithm for miniature UAV SAR imaging without navigational data","volume":"55","author":"Zhou","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_24","first-page":"102274","article-title":"A framework for registering UAV-based imagery for crop-tracking in Precision Agriculture","volume":"97","author":"Jurado","year":"2021","journal-title":"Int. J. Appl. Earth Obs. Geoinf."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Mazzia, V., Comba, L., Khaliq, A., Chiaberge, M., and Gay, P. (2020). UAV and Machine Learning Based Refinement of a Satellite-Driven Vegetation Index for Precision Agriculture. Sensors, 20.","DOI":"10.3390\/s20092530"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"2161","DOI":"10.1080\/01431161.2016.1249311","article-title":"Accurate ortho-mosaicked six-band multispectral UAV images as affected by mission planning for precision agriculture proposes","volume":"38","year":"2017","journal-title":"Int. J. Remote Sens."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Popescu, D., Stoican, F., Stamatescu, G., Ichim, L., and Dragana, C. (2020). Advanced UAV\u2013WSN system for intelligent monitoring in precision agriculture. Sensors, 20.","DOI":"10.3390\/s20030817"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Tsouros, D.C., Bibi, S., and Sarigiannidis, P.G. (2019). A review on UAV-based applications for precision agriculture. Information, 10.","DOI":"10.3390\/info10110349"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Avola, D., Cinque, L., Fagioli, A., Foresti, G.L., Pannone, D., and Piciarelli, C. (2021). Automatic estimation of optimal UAV flight parameters for real-time wide areas monitoring. Multimed. Tools Appl., 1\u201323.","DOI":"10.1007\/s11042-021-10859-3"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Avola, D., Foresti, G.L., Martinel, N., Micheloni, C., Pannone, D., and Piciarelli, C. (September, January 29). Aerial video surveillance system for small-scale UAV environment monitoring. Proceedings of the 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Lecce, Italy.","DOI":"10.1109\/AVSS.2017.8078523"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"452","DOI":"10.1049\/iet-cvi.2019.0963","article-title":"Drone swarm patrolling with uneven coverage requirements","volume":"14","author":"Piciarelli","year":"2020","journal-title":"IET Comput. Vis."},{"key":"ref_32","first-page":"130","article-title":"Comparison of four UAV georeferencing methods for environmental monitoring purposes focusing on the combined use with airborne and satellite remote sensing platforms","volume":"75","author":"Planas","year":"2019","journal-title":"Int. J. Appl. Earth Obs. Geoinf."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Avola, D., Cinque, L., Fagioli, A., Foresti, G.L., Massaroni, C., and Pannone, D. (2019, January 9\u201313). Feature-based SLAM algorithm for small scale UAV with nadir view. Proceedings of the International Conference on Image Analysis and Processing, Trento, Italy.","DOI":"10.1007\/978-3-030-30645-8_42"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards real-time object detection with region proposal networks","volume":"39","author":"Ren","year":"2016","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_35","unstructured":"Simonyan, K., and Zisserman, A. (2015, January 7\u20139). Very Deep Convolutional Networks for Large-Scale Image Recognition. Proceedings of the International Conference on Learning Representations, San Diego, CA, USA."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (July, January 27). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Wojke, N., Bewley, A., and Paulus, D. (2017, January 17\u201320). Simple online and realtime tracking with a deep association metric. Proceedings of the IEEE International Conference on Image Processing (ICIP), Beijing, China.","DOI":"10.1109\/ICIP.2017.8296962"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Du, D., Qi, Y., Yu, H., Yang, Y., Duan, K., Li, G., Zhang, W., Huang, Q., and Tian, Q. (2018, January 8\u201314). The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01249-6_23"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Leibe, B., Matas, J., Sebe, N., and Welling, M. (2016, January 8\u201316). A Benchmark and Simulator for UAV Tracking. Proceedings of the Computer Vision\u2014ECCV 2016, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46478-7"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"2139","DOI":"10.1109\/TSMC.2018.2804766","article-title":"A UAV Video Dataset for Mosaicking and Change Detection From Low-Altitude Flights","volume":"50","author":"Avola","year":"2020","journal-title":"IEEE Trans. Syst. Man Cybern. Syst."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3391743","article-title":"Video object segmentation and tracking: A survey","volume":"11","author":"Yao","year":"2020","journal-title":"ACM Trans. Intell. Syst. Technol. (TIST)"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"1183","DOI":"10.1109\/TMM.2018.2875360","article-title":"Deep alignment network based multi-person tracking with occlusion and motion reasoning","volume":"21","author":"Zhou","year":"2018","journal-title":"IEEE Trans. Multimed."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Chen, L., Ai, H., Zhuang, Z., and Shang, C. (2018, January 23\u201327). Real-time multiple people tracking with deeply learned candidate selection and person re-identification. Proceedings of the 2018 IEEE International Conference on Multimedia and Expo (ICME), San Diego, CA, USA.","DOI":"10.1109\/ICME.2018.8486597"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Tang, Z., Wang, G., Xiao, H., Zheng, A., and Hwang, J.N. (2018, January 18\u201322). Single-camera and inter-camera vehicle tracking and 3D speed estimation based on fusion of visual and semantic features. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPRW.2018.00022"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, faster, stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"24101","DOI":"10.1007\/s11432-018-9590-5","article-title":"Vehicle tracking by detection in UAV aerial video","volume":"62","author":"Liu","year":"2019","journal-title":"Sci. China Inf. Sci."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"104002","DOI":"10.1016\/j.imavis.2020.104002","article-title":"Multi-level prediction Siamese network for real-time UAV visual tracking","volume":"103","author":"Zhu","year":"2020","journal-title":"Image Vis. Comput."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Huang, W., Zhou, X., Dong, M., and Xu, H. (2021). Multiple objects tracking in the UAV system based on hierarchical deep high-resolution network. Multimed. Tools Appl., 1\u201319.","DOI":"10.1007\/s11042-020-10427-1"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 11\u201318). Fast R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_51","unstructured":"Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., and Antiga, L. (2019, January 8\u201314). PyTorch: An Imperative Style, High-Performance Deep Learning Library. Proceedings of the Advances in Neural Information Processing Systems 32, Vancouver, BC, Canada."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"3232","DOI":"10.1109\/TIP.2019.2895411","article-title":"Dynamic Saliency-Aware Regularization for Correlation Filter-Based Object Tracking","volume":"28","author":"Feng","year":"2019","journal-title":"IEEE Trans. Image Process."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Danelljan, M., Bhat, G., Shahbaz Khan, F., and Felsberg, M. (2017, January 21\u201326). ECO: Efficient Convolution Operators for Tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.733"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Li, F., Tian, C., Zuo, W., Zhang, L., and Yang, M. (2018, January 18\u201322). Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00515"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Mueller, M., Smith, N., and Ghanem, B. (2017, January 21\u201326). Context-Aware Correlation Filter Tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.152"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Danelljan, M., H\u00e4ger, G., Khan, F.S., and Felsberg, M. (2015, January 11\u201318). Learning Spatially Regularized Correlation Filters for Visual Tracking. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.490"},{"key":"ref_57","unstructured":"Danelljan, M., H\u00e4ger, G., Khan, F.S., and Felsberg, M. (\u20131, January 26). Adaptive Decontamination of the Training Set: A Unified Formulation for Discriminative Visual Tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA."},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Galoogahi, H.K., Fagg, A., and Lucey, S. (2017, January 22\u201329). Learning Background-Aware Correlation Filters for Visual Tracking. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.129"},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Wang, C., Zhang, L., Xie, L., and Yuan, J. (2018, January 2\u20137). Kernel Cross-Correlator. Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, LA, USA.","DOI":"10.1609\/aaai.v32i1.11710"},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"1561","DOI":"10.1109\/TPAMI.2016.2609928","article-title":"Discriminative Scale Space Tracking","volume":"39","author":"Danelljan","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_61","doi-asserted-by":"crossref","unstructured":"Li, Y., and Zhu, J. (2014, January 6\u201312). A Scale Adaptive Kernel Correlation Filter Tracker with Feature Integration. Proceedings of the Computer Vision\u2014ECCV Workshops, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-16181-5_18"},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Danelljan, M., H\u00e4ger, G., Shahbaz Khan, F., and Felsberg, M. (2014, January 1\u20135). Accurate Scale Estimation for Robust Visual Tracking. Proceedings of the British Machine Vision Conference, Nottingham, UK.","DOI":"10.5244\/C.28.65"},{"key":"ref_63","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1109\/TPAMI.2014.2345390","article-title":"High-Speed Tracking with Kernelized Correlation Filters","volume":"37","author":"Henriques","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_64","doi-asserted-by":"crossref","first-page":"8940","DOI":"10.1109\/TGRS.2020.2992301","article-title":"Object Saliency-Aware Dual Regularized Correlation Filter for Real-Time Aerial Tracking","volume":"58","author":"Fu","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_65","doi-asserted-by":"crossref","unstructured":"Huang, J., Rathod, V., Sun, C., Zhu, M., Korattikara, A., Fathi, A., Fischer, I., Wojna, Z., Song, Y., and Guadarrama, S. (2017, January 21\u201326). Speed\/Accuracy Trade-Offs for Modern Convolutional Object Detectors. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.351"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/9\/1670\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T05:52:44Z","timestamp":1760161964000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/9\/1670"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,25]]},"references-count":65,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2021,5]]}},"alternative-id":["rs13091670"],"URL":"https:\/\/doi.org\/10.3390\/rs13091670","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,4,25]]}}}