{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,13]],"date-time":"2025-10-13T13:42:03Z","timestamp":1760362923116,"version":"build-2065373602"},"reference-count":52,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2020,1,19]],"date-time":"2020-01-19T00:00:00Z","timestamp":1579392000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>In urban environment monitoring, visual tracking on unmanned aerial vehicles (UAVs) can produce more applications owing to the inherent advantages, but it also brings new challenges for existing visual tracking approaches (such as complex background clutters, rotation, fast motion, small objects, and real-time issues due to camera motion and viewpoint changes). Based on the Siamese network, tracking can be conducted efficiently in recent UAV datasets. Unfortunately, the learned convolutional neural network (CNN) features are not discriminative when identifying the target from the background\/clutter, in particular for the distractor, and cannot capture the appearance variations temporally. Additionally, occlusion and disappearance are also reasons for tracking failure. In this paper, a semantic subspace module is designed to be integrated into the Siamese network tracker to encode the local fine-grained details of the target for UAV tracking. More specifically, the target\u2019s semantic subspace is learned online to adapt to the target in the temporal domain. Additionally, the pixel-wise response of the semantic subspace can be used to detect occlusion and disappearance of the target, and this enables reasonable updating to relieve model drifting. 
Substantial experiments conducted on challenging UAV benchmarks illustrate that the proposed method can obtain competitive results in both accuracy and efficiency when applied to UAV videos.<\/jats:p>","DOI":"10.3390\/rs12020325","type":"journal-article","created":{"date-parts":[[2020,1,20]],"date-time":"2020-01-20T04:27:09Z","timestamp":1579494429000},"page":"325","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["Online Semantic Subspace Learning with Siamese Network for UAV Tracking"],"prefix":"10.3390","volume":"12","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5013-2501","authenticated-orcid":false,"given":"Yufei","family":"Zha","sequence":"first","affiliation":[{"name":"National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology, School of Computer Science, Northwestern Polytechnical University, Xi\u2019an 710072, China"},{"name":"Aeronautics Engineering College, Air Force Engineering University, Xi\u2019an 710038, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Min","family":"Wu","sequence":"additional","affiliation":[{"name":"Aeronautics Engineering College, Air Force Engineering University, Xi\u2019an 710038, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhuling","family":"Qiu","sequence":"additional","affiliation":[{"name":"Aeronautics Engineering College, Air Force Engineering University, Xi\u2019an 710038, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jingxian","family":"Sun","sequence":"additional","affiliation":[{"name":"National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology, School of Computer Science, Northwestern Polytechnical University, Xi\u2019an 710072, 
China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Peng","family":"Zhang","sequence":"additional","affiliation":[{"name":"National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology, School of Computer Science, Northwestern Polytechnical University, Xi\u2019an 710072, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wei","family":"Huang","sequence":"additional","affiliation":[{"name":"School of Computer and Information Engineering, Jiangxi Normal University, Nanchang 330006, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2020,1,19]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Fu, C., Lin, F., Li, Y., and Chen, G. (2019). Correlation Filter-Based Visual Tracking for UAV with Online Multi-Feature Learning. Remote. Sens., 11.","DOI":"10.3390\/rs11050549"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1080\/10095020.2017.1420510","article-title":"Information processing for unmanned aerial vehicles (UAVs) in surveying, mapping, and navigation","volume":"21","author":"Xia","year":"2018","journal-title":"Geo-Spat. Inf. Sci."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1080\/10095020.2017.1420509","article-title":"A survey on vision-based UAV navigation","volume":"21","author":"Lu","year":"2018","journal-title":"Geo-Spat. Inf. Sci."},{"key":"ref_4","unstructured":"Lyu, Y., Vosselman, G., Xia, G., Yilmaz, A., and Yang, M.Y. (2018). The UAVid Dataset for Video Semantic Segmentation. arXiv."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1109\/MGRS.2019.2918840","article-title":"Mini-Unmanned Aerial Vehicle-Based Remote Sensing: Techniques, applications, and prospects","volume":"7","author":"Xiang","year":"2019","journal-title":"IEEE Geosci. Remote. Sens. 
Mag."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Mueller, M., Smith, N., and Ghanem, B. (2016, January 11\u201314). A Benchmark and Simulator for UAV Tracking. Proceedings of the European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_27"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Du, D., Qi, Y., Yu, H., Yang, Y., Duan, K., Li, G., Zhang, W., Huang, Q., and Tian, Q. (2018, January 8\u201314). The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking. Proceedings of the European Conference Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01249-6_23"},{"key":"ref_8","unstructured":"Zhu, P., Wen, L., Bian, X., Ling, H., and Hu, Q. (2018). Vision Meets Drones: A Challenge. arXiv."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L., Kai, L., and Li, F.-F. (2009, January 20\u201325). ImageNet: A large-scale hierarchical image database. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_10","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3\u20138). ImageNet Classification with Deep Convolutional Neural Networks. Proceedings of the Twenty-Sixth Annual Conference on Neural Information Processing Systems (NIPS), Lake Tahoe, NV, USA."},{"key":"ref_11","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. 
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1109\/TPAMI.2014.2345390","article-title":"High-Speed Tracking with Kernelized Correlation Filters","volume":"37","author":"Henriques","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"(2018). Visual object tracking by correlation filters and online learning. ISPRS J. Photogramm. Remote. Sens., 140, 77\u201389.","DOI":"10.1016\/j.isprsjprs.2017.07.009"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Danelljan, M., Khan, F.S., Felsberg, M., and van de Weijer, J. (2014, January 23\u201328). Adaptive Color Attributes for Real-Time Visual Tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.143"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Li, B., Yan, J., Wu, W., Zhu, Z., and Hu, X. (2018, January 18\u201323). High Performance Visual Tracking With Siamese Region Proposal Network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00935"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Li, B., Wu, W., Wang, Q., Zhang, F., Xing, J., and Yan, J. (2019, January 16\u201320). SiamRPN++: Evolution of Siamese Visual Tracking with Very Deep Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00441"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Guo, Q., Feng, W., Zhou, C., Huang, R., Wan, L., and Wang, S. (2017, January 22\u201329). Learning Dynamic Siamese Network for Visual Object Tracking. 
Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.196"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Wang, Q., Zhang, M., Xing, J., Gao, J., Hu, W., and Maybank, S. (2018, January 13\u201319). Do not Lose the Details: Reinforced Representation Learning for High Performance Visual Tracking. Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), Stockholm, Sweden.","DOI":"10.24963\/ijcai.2018\/137"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Bertinetto, L., Valmadre, J., Henriques, J.F., Vedaldi, A., and Torr, P.H.S. (2016, January 8\u201310). Fully-Convolutional Siamese Networks for Object Tracking. Proceedings of the European Conference Computer Vision Workshops (ECCVW), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-48881-3_56"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"He, A., Luo, C., Tian, X., and Zeng, W. (2018, January 18\u201323). A Twofold Siamese Network for Real-Time Object Tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00508"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"89777","DOI":"10.1109\/ACCESS.2019.2927211","article-title":"Distractor-Aware Visual Tracking by Online Siamese Network","volume":"7","author":"Zha","year":"2019","journal-title":"IEEE Access"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-Net: Convolutional Networks for Biomedical Image Segmentation. 
Proceedings of the Medical Image Computing and Computer-Assisted Intervention\u2014MICCAI, Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"323","DOI":"10.1016\/j.patcog.2017.11.007","article-title":"Deep visual tracking: Review and experimental comparison","volume":"76","author":"Li","year":"2018","journal-title":"Pattern Recognit."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Tao, R., Gavves, E., and Smeulders, A.W.M. (2016, January 27\u201330). Siamese Instance Search for Tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.158"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Wang, Q., Teng, Z., Xing, J., Gao, J., Hu, W., and Maybank, S. (2018, January 18\u201323). Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00510"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Zhang, Z., and Peng, H. (2019, January 16\u201320). Deeper and Wider Siamese Networks for Real-Time Visual Tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00472"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1007\/s11263-007-0075-7","article-title":"Incremental Learning for Robust Visual Tracking","volume":"77","author":"Ross","year":"2008","journal-title":"Int. J. Comput. Vis."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1109\/TPAMI.2008.79","article-title":"Robust Face Recognition via Sparse Representation","volume":"31","author":"Wright","year":"2009","journal-title":"IEEE Trans. Pattern Anal. Mach. 
Intell."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"2661","DOI":"10.1109\/TIP.2013.2255301","article-title":"Efficient Minimum Error Bounded Particle Resampling L1 Tracker With Occlusion Detection","volume":"22","author":"Mei","year":"2013","journal-title":"IEEE Trans. Image Process."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1301","DOI":"10.1109\/TCSVT.2013.2291355","article-title":"L2-RLS-Based Object Tracking","volume":"24","author":"Xiao","year":"2014","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"2323","DOI":"10.1126\/science.290.5500.2323","article-title":"Nonlinear Dimensionality Reduction by Locally Linear Embedding","volume":"290","author":"Roweis","year":"2000","journal-title":"Science"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"1786","DOI":"10.1109\/TNNLS.2017.2688448","article-title":"Manifold Regularized Correlation Object Tracking","volume":"29","author":"Hu","year":"2018","journal-title":"IEEE Trans. Neural Netw. Learning Syst."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"510","DOI":"10.1109\/TMM.2018.2859831","article-title":"Robust Object Tracking Using Manifold Regularized Convolutional Neural Networks","volume":"21","author":"Hu","year":"2019","journal-title":"IEEE Trans. Multimed."},{"key":"ref_35","unstructured":"Li, Y., and Zhu, J. (2014, January 6\u201312). A Scale Adaptive Kernel Correlation Filter Tracker with Feature Integration. Proceedings of the European Conference Computer Vision (ECCV), Zurich, Switzerland."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"1834","DOI":"10.1109\/TPAMI.2014.2388226","article-title":"Object Tracking Benchmark","volume":"37","author":"Wu","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Ma, C., Huang, J., Yang, X., and Yang, M. (2015, January 7\u201313). 
Hierarchical Convolutional Features for Visual Tracking. Proceedings of the IEEE Conference on International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.352"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Nam, H., and Han, B. (2016, January 27\u201330). Learning Multi-domain Convolutional Neural Networks for Visual Tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.465"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Song, Y., Ma, C., Gong, L., Zhang, J., Lau, R.W.H., and Yang, M. (2017, January 22\u201329). CREST: Convolutional Residual Learning for Visual Tracking. Proceedings of the IEEE Conference on International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.279"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Danelljan, M., H\u00e4ger, G., Khan, F.S., and Felsberg, M. (2015, January 7\u201313). Learning Spatially Regularized Correlation Filters for Visual Tracking. Proceedings of the IEEE Conference on International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.490"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Dai, K., Wang, D., Lu, H., Sun, C., and Li, J. (2019, January 16\u201320). Visual Tracking via Adaptive Spatially-Regularized Correlation Filters. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00480"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Luke\u017eic, A., Voj\u00edr, T., Zajc, L.C., Matas, J., and Kristan, M. (2017, January 21\u201326). Discriminative Correlation Filter with Channel and Spatial Reliability. 
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.515"},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"5596","DOI":"10.1109\/TIP.2019.2919201","article-title":"Learning Adaptive Discriminative Correlation Filters via Temporal Consistency Preserving Spatial Feature Selection for Robust Visual Object Tracking","volume":"28","author":"Xu","year":"2019","journal-title":"IEEE Trans. Image Process."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Galoogahi, H.K., Fagg, A., and Lucey, S. (2017, January 22\u201329). Learning Background-Aware Correlation Filters for Visual Tracking. Proceedings of the IEEE Conference on International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.129"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Wu, Y., Lim, J., and Yang, M. (2013, January 23\u201328). Online Object Tracking: A Benchmark. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Portland, OR, USA.","DOI":"10.1109\/CVPR.2013.312"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Danelljan, M., Bhat, G., Khan, F.S., and Felsberg, M. (2017, January 21\u201326). ECO: Efficient Convolution Operators for Tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.733"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Danelljan, M., Robinson, A., Shahbaz Khan, F., and Felsberg, M. (2016, January 11\u201314). Beyond Correlation Filters: Learning Continuous Convolution Operators for Visual Tracking. Proceedings of the European Conference Computer Vision (ECCV), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46454-1_29"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Bertinetto, L., Valmadre, J., Golodetz, S., Miksik, O., and Torr, P.H.S. (2016, January 27\u201330). 
Staple: Complementary Learners for Real-Time Tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.156"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Zhang, J., Ma, S., and Sclaroff, S. (2014, January 6\u201312). MEEM: Robust Tracking via Multiple Experts Using Entropy Minimization. Proceedings of the European Conference Computer Vision (ECCV), Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10599-4_13"},{"key":"ref_50","unstructured":"Li, Y., Zhu, J., Hoi, S.C.H., Song, W., Wang, Z., and Liu, H. (February, January 27). Robust Estimation of Similarity Transformation for Visual Object Tracking. Proceedings of the Thirty-Third Conference on Artificial Intelligence, AAAI, Honolulu, HI, USA."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"1409","DOI":"10.1109\/TPAMI.2011.239","article-title":"Tracking-Learning-Detection","volume":"34","author":"Kalal","year":"2012","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"1561","DOI":"10.1109\/TPAMI.2016.2609928","article-title":"Discriminative Scale Space Tracking","volume":"39","author":"Danelljan","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. 
Intell."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/12\/2\/325\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,13]],"date-time":"2025-10-13T13:20:13Z","timestamp":1760361613000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/12\/2\/325"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,1,19]]},"references-count":52,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2020,1]]}},"alternative-id":["rs12020325"],"URL":"https:\/\/doi.org\/10.3390\/rs12020325","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2020,1,19]]}}}