{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,7]],"date-time":"2026-05-07T18:54:54Z","timestamp":1778180094014,"version":"3.51.4"},"reference-count":86,"publisher":"MDPI AG","issue":"19","license":[{"start":{"date-parts":[[2022,9,23]],"date-time":"2022-09-23T00:00:00Z","timestamp":1663891200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"the National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["81671787"],"award-info":[{"award-number":["81671787"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Although the performance of unmanned aerial vehicle (UAV) tracking has benefited from the successful application of discriminative correlation filters (DCF) and convolutional neural networks (CNNs), UAV tracking under occlusion and deformation remains a challenge. The main dilemma is that challenging scenes, such as occlusion or deformation, are very complex and changeable, making it difficult to obtain training data covering all situations, resulting in trained networks that may be confused by new contexts that differ from historical information. Data-driven strategies are the main direction of current solutions, but gathering large-scale datasets with object instances under various occlusion and deformation conditions is difficult and lacks diversity. This paper proposes an attention-based mask generation network (AMGN) for UAV-specific tracking, which combines the attention mechanism and adversarial learning to improve the tracker\u2019s ability to handle occlusion and deformation. 
After the base CNN extracts the deep features of the candidate region, a series of masks is determined by the spatial attention module and sent to the generator, and the generator discards some features according to these masks to simulate the occlusion and deformation of the object, producing more hard positive samples. The discriminator seeks to distinguish these hard positive samples while guiding mask generation. Such adversarial learning can effectively complement occluded and deformable positive samples in the feature space, allowing the tracker to capture more robust features to distinguish objects from backgrounds. Comparative experiments show that our AMGN-based tracker achieves the highest area under the curve (AUC) scores of 0.490 and 0.349, and the highest precision scores of 0.742 and 0.662, on the UAV123 tracking benchmark with partial and full occlusion attributes, respectively. It also achieves the highest AUC of 0.555 and the highest precision score of 0.797 on the DTB70 tracking benchmark with the deformation attribute. 
On the UAVDT tracking benchmark with the large occlusion attribute, it achieves the highest AUC of 0.407 and the highest precision score of 0.582.<\/jats:p>","DOI":"10.3390\/rs14194756","type":"journal-article","created":{"date-parts":[[2022,9,26]],"date-time":"2022-09-26T03:34:17Z","timestamp":1664163257000},"page":"4756","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":13,"title":["Occlusion and Deformation Handling Visual Tracking for UAV via Attention-Based Mask Generative Network"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4953-5474","authenticated-orcid":false,"given":"Yashuo","family":"Bai","sequence":"first","affiliation":[{"name":"School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China"},{"name":"Beijing Key Laboratory for Precision Optoelectronic Measurement Instrument and Technology, Beijing Institute of Technology, Beijing 100081, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3108-2307","authenticated-orcid":false,"given":"Yong","family":"Song","sequence":"additional","affiliation":[{"name":"School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China"},{"name":"Beijing Key Laboratory for Precision Optoelectronic Measurement Instrument and Technology, Beijing Institute of Technology, Beijing 100081, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5919-7595","authenticated-orcid":false,"given":"Yufei","family":"Zhao","sequence":"additional","affiliation":[{"name":"School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China"},{"name":"Beijing Key Laboratory for Precision Optoelectronic Measurement Instrument and Technology, Beijing Institute of Technology, Beijing 100081, 
China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ya","family":"Zhou","sequence":"additional","affiliation":[{"name":"School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China"},{"name":"Beijing Key Laboratory for Precision Optoelectronic Measurement Instrument and Technology, Beijing Institute of Technology, Beijing 100081, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiyan","family":"Wu","sequence":"additional","affiliation":[{"name":"School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China"},{"name":"Beijing Key Laboratory for Precision Optoelectronic Measurement Instrument and Technology, Beijing Institute of Technology, Beijing 100081, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yuxin","family":"He","sequence":"additional","affiliation":[{"name":"School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China"},{"name":"Beijing Key Laboratory for Precision Optoelectronic Measurement Instrument and Technology, Beijing Institute of Technology, Beijing 100081, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zishuo","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China"},{"name":"Beijing Key Laboratory for Precision Optoelectronic Measurement Instrument and Technology, Beijing Institute of Technology, Beijing 100081, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xin","family":"Yang","sequence":"additional","affiliation":[{"name":"School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China"},{"name":"Beijing Key Laboratory for Precision Optoelectronic Measurement Instrument and Technology, Beijing Institute of Technology, Beijing 100081, 
China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qun","family":"Hao","sequence":"additional","affiliation":[{"name":"School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China"},{"name":"Beijing Key Laboratory for Precision Optoelectronic Measurement Instrument and Technology, Beijing Institute of Technology, Beijing 100081, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2022,9,23]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1429","DOI":"10.1109\/TMM.2015.2455418","article-title":"On-Road Pedestrian Tracking Across Multiple Driving Recorders","volume":"17","author":"Lee","year":"2015","journal-title":"IEEE Trans. Multimed."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"9882","DOI":"10.1109\/TIE.2019.2955411","article-title":"SAT: Single-shot adversarial tracker","volume":"67","author":"Wu","year":"2019","journal-title":"IEEE Trans. Ind. Electron."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"9360","DOI":"10.1109\/TIE.2019.2893829","article-title":"Vision-based target-following guider for mobile robot","volume":"66","author":"Zhang","year":"2019","journal-title":"IEEE Trans. Ind. Electron."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"2054","DOI":"10.1109\/TIE.2018.2835390","article-title":"Real-time event-triggered object tracking in the presence of model drift and occlusion","volume":"66","author":"Guan","year":"2018","journal-title":"IEEE Trans. Ind. Electron."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Bolme, D.S., Beveridge, J.R., Draper, B.A., and Lui, Y.M. (2010, January 13\u201318). Visual object tracking using adaptive correlation filters. 
Proceedings of the 2010 IEEE Computer SOCIETY Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA.","DOI":"10.1109\/CVPR.2010.5539960"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Li, B., Fu, C., Ding, F., Ye, J., and Lin, F. (June, January 30). ADTrack: Target-aware dual filter learning for real-time anti-dark UAV tracking. Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi\u2019an, Shaanxi, China.","DOI":"10.1109\/ICRA48506.2021.9561564"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"194601","DOI":"10.1109\/ACCESS.2020.3033481","article-title":"Vision-based moving UAV tracking by another UAV on low-cost hardware and a new ground control station","volume":"8","year":"2020","journal-title":"IEEE Access"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"10469","DOI":"10.1109\/TITS.2021.3094654","article-title":"ReCF: Exploiting Response Reasoning for Correlation Filters in Real-Time UAV Tracking","volume":"23","author":"Lin","year":"2021","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Huang, B., Chen, J., Xu, T., Wang, Y., Jiang, S., Wang, Y., Wang, L., and Li, J. (2021, January 11\u201317). SiamSTA: Spatio-Temporal Attention based Siamese Tracker for Tracking UAVs. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCVW54120.2021.00140"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Song, Y., Ma, C., Wu, X., Gong, L., Bao, L., Zuo, W., Shen, C., Lau, R.W., and Yang, M.H. (2018, January 18\u201323). Vital: Visual tracking via adversarial learning. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00937"},{"key":"ref_11","unstructured":"Bo, L., Yan, J., Wei, W., Zheng, Z., and Hu, X. (2018, January 18\u201323). 
High Performance Visual Tracking with Siamese Region Proposal Network. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Li, Y., Fu, C., Ding, F., Huang, Z., and Lu, G. (2020, January 13\u201319). AutoTrack: Towards high-performance visual tracking for UAV with automatic spatio-temporal regularization. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01194"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1183","DOI":"10.1109\/TMM.2018.2875360","article-title":"Deep Alignment Network Based Multi-Person Tracking With Occlusion and Motion Reasoning","volume":"21","author":"Zhou","year":"2019","journal-title":"IEEE Trans. Multimed."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"161349","DOI":"10.1109\/ACCESS.2020.3019206","article-title":"Stably Adaptive Anti-Occlusion Siamese Region Proposal Network for Real-Time Object Tracking","volume":"8","author":"Wu","year":"2020","journal-title":"IEEE Access"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1186\/s13640-020-0496-6","article-title":"A scale-adaptive object-tracking algorithm with occlusion detection","volume":"2020","author":"Yuan","year":"2020","journal-title":"Eurasip J. Image Video Process."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Wang, X., Shrivastava, A., and Gupta, A. (2017, January 21\u201326). A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.324"},{"key":"ref_17","unstructured":"Qi, Y., Zhang, S., Zhang, W., Su, L., Huang, Q., and Yang, M.H. (February, January 27). Learning attribute-specific representations for visual tracking. 
Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1016\/j.neucom.2018.10.035","article-title":"Robust visual tracking via scale-and-state-awareness","volume":"329","author":"Qi","year":"2019","journal-title":"Neurocomputing"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Chen, Y., Song, L., Hu, Y., and He, R. (2018, January 22\u201325). Adversarial occlusion-aware face detection. Proceedings of the 2018 IEEE 9th International Conference on Biometrics: Theory, Applications and Systems (BTAS), Redondo Beach, CA, USA.","DOI":"10.1109\/BTAS.2018.8698572"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"334","DOI":"10.1016\/j.neunet.2020.06.011","article-title":"Appearance variation adaptation tracker using adversarial network\u2014ScienceDirect","volume":"129","author":"Javanmardi","year":"2020","journal-title":"Neural Netw."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Souly, N., Spampinato, C., and Shah, M. (2017, January 22\u201329). Semi Supervised Semantic Segmentation Using Generative Adversarial Network. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.606"},{"key":"ref_22","unstructured":"Xiao, W., Li, C., Luo, B., and Jin, T. (2018, January 18\u201323). SINT++: Robust Visual Tracking via Adversarial Positive Instance Generation. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Zhang, T., Jia, K., Xu, C., Ma, Y., and Ahuja, N. (2014, January 23\u201328). Partial occlusion handling for visual tracking via robust part matching. 
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.164"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., and Batra, D. (2017, January 22\u201329). Grad-cam: Visual explanations from deep networks via gradient-based localization. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.74"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"307","DOI":"10.1016\/j.neucom.2018.11.083","article-title":"Sample-based adaptive Kalman filtering for accurate camera pose tracking","volume":"333","author":"Aa","year":"2019","journal-title":"Neurocomputing"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1016\/j.neucom.2019.03.077","article-title":"Multiple pedestrian tracking by combining particle filter and network flow model","volume":"351","author":"Cui","year":"2019","journal-title":"Neurocomputing"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"3864","DOI":"10.1007\/s10489-019-01480-x","article-title":"Research on scale adaptive particle filter tracker with feature integration","volume":"49","author":"Xiao","year":"2019","journal-title":"Appl. Intell."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"250","DOI":"10.1016\/j.patrec.2014.03.025","article-title":"Robust scale-adaptive mean-shift for tracking","volume":"49","author":"Vojir","year":"2014","journal-title":"Pattern Recognit. Lett."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"6129","DOI":"10.1007\/s10489-021-02694-8","article-title":"Distractor-aware visual tracking using hierarchical correlation filters adaptive selection","volume":"52","author":"Zhang","year":"2022","journal-title":"Appl. Intell."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Chen, D., and Zheng, Y. (2022). 
Satellite Video Tracking by Multi-Feature Correlation Filters with Motion Estimation. Remote Sens., 14.","DOI":"10.3390\/rs14112691"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Qi, Y., Yao, H., Sun, X., Sun, X., Zhang, Y., and Huang, Q. (2014, January 27\u201330). Structure-aware multi-object discovery for weakly supervised tracking. Proceedings of the 2014 IEEE International Conference on Image Processing (ICIP), Paris, France.","DOI":"10.1109\/ICIP.2014.7025093"},{"key":"ref_32","unstructured":"Yang, Y., Li, G., Qi, Y., and Huang, Q. (2020;, January 7\u201312). Release the power of online-training for robust visual tracking. Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Danelljan, M., Gool, L.V., and Timofte, R. (2020, January 13\u201319). Probabilistic regression for visual tracking. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00721"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Nam, H., and Han, B. (2016, January 27\u201330). Learning multi-domain convolutional neural networks for visual tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.465"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Yang, T., and Chan, A.B. (2018, January 8\u201314). Learning dynamic memory networks for object tracking. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01240-3_10"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Yang, T., and Chan, A.B. (2017, January 22\u201329). Recurrent filter learning for visual tracking. 
Proceedings of the IEEE International Conference on Computer Vision Workshops, Venice, Italy.","DOI":"10.1109\/ICCVW.2017.235"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Chen, Z., Zhong, B., Li, G., Zhang, S., and Ji, R. (2020, January 13\u201319). Siamese box adaptive network for visual tracking. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00670"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Danelljan, M., Bhat, G., Khan, F.S., and Felsberg, M. (2019, January 15\u201320). Atom: Accurate tracking by overlap maximization. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00479"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Dai, K., Zhang, Y., Wang, D., Li, J., Lu, H., and Yang, X. (2020, January 13\u201319). High-performance long-term tracking with meta-updater. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00633"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1109\/TITS.2019.2956813","article-title":"Overcoming occlusion in the automotive environment\u2014A review","volume":"22","author":"Gilroy","year":"2019","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Mehmood, K., Jalil, A., Ali, A., Khan, B., Murad, M., Cheema, K.M., and Milyani, A.H. (2021). Spatio-Temporal Context, Correlation Filter and Measurement Estimation Collaboration Based Visual Object Tracking. Sensors, 21.","DOI":"10.3390\/s21082841"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Mehmood, K., Ali, A., Jalil, A., Khan, B., Cheema, K.M., Murad, M., and Milyani, A.H. (2021). Efficient Online Object Tracking Scheme for Challenging Scenarios. 
Sensors, 21.","DOI":"10.3390\/s21248481"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Kortylewski, A., He, J., Liu, Q., and Yuille, A.L. (2020, January 13\u201319). Compositional convolutional neural networks: A deep architecture with innate robustness to partial occlusion. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00896"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Ren, Y., Zhu, C., and Xiao, S. (2018). Deformable Faster R-CNN with Aggregating Multi-Layer Features for Partially Occluded Object Detection in Optical Remote Sensing Images. Remote Sens., 10.","DOI":"10.3390\/rs10091470"},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"450","DOI":"10.1016\/j.neucom.2021.08.107","article-title":"Detector\u2013tracker integration framework and attention mechanism for multi\u2013object tracking","volume":"464","author":"Li","year":"2021","journal-title":"Neurocomputing"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Zeng, Y., Wang, H., and Lu, T. (2019, January 11\u201313). Learning spatial-channel attention for visual tracking. Proceedings of the 2019 IEEE\/CIC International Conference on Communications in China (ICCC), Changchun, China.","DOI":"10.1109\/ICCChina.2019.8855908"},{"key":"ref_47","unstructured":"Bahdanau, D., Cho, K., and Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. arXiv."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.Y., and Kweon, I.S. (2018). 
CBAM: Convolutional Block Attention Module, Springer.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_50","first-page":"2672","article-title":"Generative adversarial nets","volume":"27","author":"Goodfellow","year":"2014","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_51","unstructured":"Gui, J., Sun, Z., Wen, Y., Tao, D., and Ye, J. (2021). A review on generative adversarial networks: Algorithms, theory, and applications. IEEE Trans. Knowl. Data Eng."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"4700","DOI":"10.1523\/JNEUROSCI.13-11-04700.1993","article-title":"A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information","volume":"13","author":"Olshausen","year":"1993","journal-title":"J. Neurosci."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Chatfield, K., Simonyan, K., Vedaldi, A., and Zisserman, A. (2014). Return of the devil in the details: Delving deep into convolutional nets. arXiv.","DOI":"10.5244\/C.28.6"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Mueller, M., Smith, N., and Ghanem, B. (2016, January 11\u201314). A benchmark and simulator for uav tracking. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_27"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Li, S., and Yeung, D.Y. (2017, January 4\u20139). Visual object tracking for unmanned aerial vehicles: A benchmark and new motion models. Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, San Francisco, CA, USA.","DOI":"10.1609\/aaai.v31i1.11205"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Du, D., Qi, Y., Yu, H., Yang, Y., Duan, K., Li, G., Zhang, W., Huang, Q., and Tian, Q. (2018, January 8\u201314). The unmanned aerial vehicle benchmark: Object detection and tracking. 
Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01249-6_23"},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Wen, L., Zhu, P., Du, D., Bian, X., Ling, H., Hu, Q., Liu, C., Cheng, H., Liu, X., and Ma, W. (2018, January 8\u201314). Visdrone-sot2018: The vision meets drone single-object tracking challenge results. Proceedings of the European Conference on Computer Vision (ECCV) Workshops, Munich, Germany.","DOI":"10.1007\/978-3-030-11021-5_28"},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"1834","DOI":"10.1109\/TPAMI.2014.2388226","article-title":"Object Tracking Benchmark","volume":"37","author":"Wu","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_59","unstructured":"Kristan, M., Matas, J., Leonardis, A., Felsberg, M., Cehovin, L., Fern\u00e1ndez, G., Vojir, T., H\u00e4ger, G., Luke\u017ei\u010d, A., and Fern\u00e1ndez, G. (2016, January 11\u201314). The visual object tracking vot2016 challenge results. Proceedings of the European Conference on Computer Vision (ECCV) Workshops, Amsterdam, The Netherlands."},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"1562","DOI":"10.1109\/TPAMI.2019.2957464","article-title":"Got-10k: A large high-diversity benchmark for generic object tracking in the wild","volume":"43","author":"Huang","year":"2019","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"9152","DOI":"10.1109\/TIP.2020.3023621","article-title":"Siamese local and global networks for robust face tracking","volume":"29","author":"Qi","year":"2020","journal-title":"IEEE Trans. Image Process."},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Danelljan, M., Bhat, G., Shahbaz Khan, F., and Felsberg, M. (2017, January 21\u201326). ECO: Efficient convolution operators for tracking. 
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.733"},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Danelljan, M., Robinson, A., Shahbaz Khan, F., and Felsberg, M. (2016, January 11\u201314). Beyond correlation filters: Learning continuous convolution operators for visual tracking. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46454-1_29"},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Li, F., Tian, C., Zuo, W., Zhang, L., and Yang, M.H. (2018, January 18\u201323). Learning spatial-temporal regularized correlation filters for visual tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00515"},{"key":"ref_65","doi-asserted-by":"crossref","unstructured":"Li, X., Ma, C., Wu, B., He, Z., and Yang, M.H. (2019, January 15\u201320). Target-aware deep tracking. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00146"},{"key":"ref_66","doi-asserted-by":"crossref","unstructured":"Danelljan, M., Hager, G., Shahbaz Khan, F., and Felsberg, M. (2015, January 7\u201313). Learning spatially regularized correlation filters for visual tracking. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.490"},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Danelljan, M., Hager, G., Shahbaz Khan, F., and Felsberg, M. (2016, January 27\u201330). Adaptive decontamination of the training set: A unified formulation for discriminative visual tracking. Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.159"},{"key":"ref_68","doi-asserted-by":"crossref","unstructured":"Li, Y., and Zhu, J. (2014, January 6\u201312). 
A scale adaptive kernel correlation filter tracker with feature integration. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-16181-5_18"},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Bertinetto, L., Valmadre, J., Golodetz, S., Miksik, O., and Torr, P.H. (2016, January 27\u201330). Staple: Complementary learners for real-time tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.156"},{"key":"ref_70","doi-asserted-by":"crossref","unstructured":"Mueller, M., Smith, N., and Ghanem, B. (2017, January 21\u201326). Context-aware correlation filter tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.152"},{"key":"ref_71","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1109\/TPAMI.2014.2345390","article-title":"High-speed tracking with kernelized correlation filters","volume":"37","author":"Henriques","year":"2014","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_72","doi-asserted-by":"crossref","unstructured":"Wang, C., Zhang, L., Xie, L., and Yuan, J. (2018, January 2\u20137). Kernel Cross-Correlator. Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18), New Orleans, LA, USA.","DOI":"10.1609\/aaai.v32i1.11710"},{"key":"ref_73","doi-asserted-by":"crossref","unstructured":"Danelljan, M., H\u00e4ger, G., Khan, F., and Felsberg, M. (2014, January 1\u20135). Accurate scale estimation for robust visual tracking. Proceedings of the British Machine Vision Conference, Nottingham, UK.","DOI":"10.5244\/C.28.65"},{"key":"ref_74","doi-asserted-by":"crossref","unstructured":"Wang, N., Song, Y., Ma, C., Zhou, W., Liu, W., and Li, H. (2019, January 15\u201320). Unsupervised deep tracking. 
Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00140"},{"key":"ref_75","doi-asserted-by":"crossref","unstructured":"Kiani Galoogahi, H., Fagg, A., and Lucey, S. (2017, January 22\u201329). Learning background-aware correlation filters for visual tracking. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.129"},{"key":"ref_76","doi-asserted-by":"crossref","unstructured":"Lukezic, A., Vojir, T., \u010cehovin Zajc, L., Matas, J., and Kristan, M. (2017, January 21\u201326). Discriminative correlation filter with channel and spatial reliability. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.515"},{"key":"ref_77","doi-asserted-by":"crossref","unstructured":"Zhang, J., Ma, S., and Sclaroff, S. (2014, January 6\u201312). MEEM: Robust tracking via multiple experts using entropy minimization. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10599-4_13"},{"key":"ref_78","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1016\/j.patcog.2017.04.004","article-title":"Robust Visual Tracking via Co-trained Kernelized Correlation Filters","volume":"69","author":"Zhang","year":"2017","journal-title":"Pattern Recognit."},{"key":"ref_79","doi-asserted-by":"crossref","unstructured":"Wang, N., Zhou, W., Tian, Q., Hong, R., Wang, M., and Li, H. (2018, January 18\u201323). Multi-cue correlation filters for robust visual tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00509"},{"key":"ref_80","doi-asserted-by":"crossref","unstructured":"Zhang, T., Xu, C., and Yang, M.H. (2017, January 21\u201326). Multi-task correlation particle filter for robust object tracking. 
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.512"},{"key":"ref_81","doi-asserted-by":"crossref","first-page":"1561","DOI":"10.1109\/TPAMI.2016.2609928","article-title":"Discriminative scale space tracking","volume":"39","author":"Danelljan","year":"2016","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_82","doi-asserted-by":"crossref","unstructured":"Li, F., Yao, Y., Li, P., Zhang, D., Zuo, W., and Yang, M.H. (2017, January 22\u201329). Integrating boundary and center correlation filters for visual tracking with aspect ratio variation. Proceedings of the IEEE International Conference on Computer Vision Workshops, Venice, Italy.","DOI":"10.1109\/ICCVW.2017.234"},{"key":"ref_83","doi-asserted-by":"crossref","unstructured":"Bertinetto, L., Valmadre, J., Henriques, J.F., Vedaldi, A., and Torr, P. (2016). Fully-Convolutional Siamese Networks for Object Tracking, Springer.","DOI":"10.1007\/978-3-319-48881-3_56"},{"key":"ref_84","doi-asserted-by":"crossref","unstructured":"Song, Y., Ma, C., Gong, L., Zhang, J., Lau, R.W., and Yang, M.H. (2017, January 22\u201329). Crest: Convolutional residual learning for visual tracking. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.279"},{"key":"ref_85","doi-asserted-by":"crossref","unstructured":"Valmadre, J., Bertinetto, L., Henriques, J., Vedaldi, A., and Torr, P.H. (2017, January 21\u201326). End-to-end representation learning for correlation filter based tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.531"},{"key":"ref_86","doi-asserted-by":"crossref","first-page":"2137","DOI":"10.1109\/TPAMI.2016.2516982","article-title":"A novel performance evaluation methodology for single-target trackers","volume":"38","author":"Kristan","year":"2016","journal-title":"IEEE Trans. Pattern Anal. 
Mach. Intell."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/19\/4756\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T00:38:04Z","timestamp":1760143084000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/19\/4756"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9,23]]},"references-count":86,"journal-issue":{"issue":"19","published-online":{"date-parts":[[2022,10]]}},"alternative-id":["rs14194756"],"URL":"https:\/\/doi.org\/10.3390\/rs14194756","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,9,23]]}}}