{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,1]],"date-time":"2026-02-01T10:29:21Z","timestamp":1769941761192,"version":"3.49.0"},"reference-count":43,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2022,12,27]],"date-time":"2022-12-27T00:00:00Z","timestamp":1672099200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Nature Science Foundation of China (NSFC)","award":["61931016"],"award-info":[{"award-number":["61931016"]}]},{"name":"Nature Science Foundation of China (NSFC)","award":["62071344"],"award-info":[{"award-number":["62071344"]}]},{"name":"Nature Science Foundation of China (NSFC)","award":["62001352"],"award-info":[{"award-number":["62001352"]}]},{"name":"Nature Science Foundation of China (NSFC)","award":["CLDL-20202412"],"award-info":[{"award-number":["CLDL-20202412"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61931016"],"award-info":[{"award-number":["61931016"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62071344"],"award-info":[{"award-number":["62071344"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62001352"],"award-info":[{"award-number":["62001352"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["CLDL-20202412"],"award-info":[{"award-number":["CLDL-20202412"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Open Foundation of CETC Key Laboratory of Data Link Technology","award":["61931016"],"award-info":[{"award-number":["61931016"]}]},{"name":"Open Foundation of CETC Key Laboratory of Data Link Technology","award":["62071344"],"award-info":[{"award-number":["62071344"]}]},{"name":"Open Foundation of CETC Key Laboratory of Data Link Technology","award":["62001352"],"award-info":[{"award-number":["62001352"]}]},{"name":"Open Foundation of CETC Key Laboratory of Data Link Technology","award":["CLDL-20202412"],"award-info":[{"award-number":["CLDL-20202412"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Shadows are widely used in the tracking of moving targets by video synthetic aperture radar (video SAR). However, they always appear in groups in video SAR images. In such cases, track effects produced by existing single-target tracking methods are no longer satisfactory. To this end, an effective way to obtain the capability of multiple target tracking (MTT) is in urgent demand. Note that tracking by detection (TBD) for MTT in optical images has achieved great success. However, TBD cannot be utilized in video SAR MTT directly. The reasons for this is that shadows of moving target are quite different from in video SAR image than optical images as they are time-varying and their pixel sizes are small. The aforementioned characteristics make shadows in video SAR images hard to detect in the process of TBD and lead to numerous matching errors in the data association process, which greatly affects the final tracking performance. Aiming at the above two problems, in this paper, we propose a multiple target tracking method based on TBD and the Siamese network. Specifically, to improve the detection accuracy, the multi-scale Faster-RCNN is first proposed to detect the shadows of moving targets. Meanwhile, dimension clusters are used to accelerate the convergence speed of the model in the training process as well as to obtain better network weights. Then, SiamNet is proposed for data association to reduce matching errors. Finally, we apply a Kalman filter to update the tracking results. The experimental results on two real video SAR datasets demonstrate that the proposed method outperforms other state-of-art methods, and the ablation experiment verifies the effectiveness of multi-scale Faster-RCNN and SimaNet.<\/jats:p>","DOI":"10.3390\/rs15010146","type":"journal-article","created":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T05:30:27Z","timestamp":1672205427000},"page":"146","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":15,"title":["Siam-Sort: Multi-Target Tracking in Video SAR Based on Tracking by Detection and Siamese Network"],"prefix":"10.3390","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1902-8156","authenticated-orcid":false,"given":"Hui","family":"Fang","sequence":"first","affiliation":[{"name":"National Laboratory of Radar Signal Processing, Xidian University, Xi\u2019an 710071, China"}]},{"given":"Guisheng","family":"Liao","sequence":"additional","affiliation":[{"name":"National Laboratory of Radar Signal Processing, Xidian University, Xi\u2019an 710071, China"}]},{"given":"Yongjun","family":"Liu","sequence":"additional","affiliation":[{"name":"National Laboratory of Radar Signal Processing, Xidian University, Xi\u2019an 710071, China"}]},{"given":"Cao","family":"Zeng","sequence":"additional","affiliation":[{"name":"National Laboratory of Radar Signal Processing, Xidian University, Xi\u2019an 710071, China"}]}],"member":"1968","published-online":{"date-parts":[[2022,12,27]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"8236","DOI":"10.1109\/JSTARS.2021.3104603","article-title":"Joint track-before-detect algorithm for high-maneuvering target indication in video SAR","volume":"14","author":"Qin","year":"2021","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Raynal, A.M., Bickel, D.L., and Doerry, A.W. (2014). Stationary and moving target shadow characteristics in synthetic aperture radar. Radar Sensor Technology XVIII, SPIE.","DOI":"10.1117\/12.2049729"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Miller, J., Bishop, E., Doerry, A., and Raynal, A. (2015). Impact of ground mover motion and windowing on stationary and moving shadows in synthetic aperture radar imagery. Algorithms for Synthetic Aperture Radar Imagery XXII, SPIE.","DOI":"10.1117\/12.2179173"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Xu, Z., Zhang, Y., Li, H., Mu, H., and Zhuang, Y. (2017, January 14). A new shadow tracking method to locate the moving target in SAR imagery based on KCF. Proceedings of the International Conference in Communications, Signal Processing, and Systems, Harbin, China.","DOI":"10.1007\/978-981-10-6571-2_322"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Yang, X., Shi, J., Zhou, Y., Wang, C., Hu, Y., and Zhang, X. (2020). Ground moving target tracking and refocusing using shadow in video-SAR. Remote Sens., 12.","DOI":"10.3390\/rs12183083"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Yang, S., Li, H., and Xu, Z. (2018, January 22). Shadow tracking of moving target based on CNN for video SAR system. Proceedings of the IGARSS 2018\u20142018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain.","DOI":"10.1109\/IGARSS.2018.8518431"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Bewley, A., Ge, Z., Ott, L., Ramos, F., and Upcroft, B. (2016, January 25\u201328). Simple online and realtime tracking. Proceedings of the 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA.","DOI":"10.1109\/ICIP.2016.7533003"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Wojke, N., Bewley, A., and Paulus, D. (2017, January 25). Simple online and realtime tracking with a deep association metric. Proceedings of the 2017 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA.","DOI":"10.1109\/ICIP.2017.8296962"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"3069","DOI":"10.1007\/s11263-021-01513-4","article-title":"Fairmot: On the fairness of detection and re-identification in multiple object tracking","volume":"129","author":"Zhang","year":"2021","journal-title":"Int. J. Comput. Vis."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Zhou, X., Koltun, V., and Kr\u00e4henb\u00fchl, P. (2020, January 23). Tracking objects as points. Proceedings of the European Conference on Computer Vision, Glasgow, UK.","DOI":"10.1007\/978-3-030-58548-8_28"},{"key":"ref_11","unstructured":"Gao, S. (2009). Grape Theory and Network Flow Theory, Higher Education Press."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1115\/1.3662552","article-title":"A new approach to linear filtering and prediction problems","volume":"82","author":"Kalman","year":"1960","journal-title":"J. Basic Eng. Mar."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1145\/3065386","article-title":"Imagenet classification with deep convolutional neural networks","volume":"60","author":"Krizhevsky","year":"2017","journal-title":"Commun. ACM"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5). U-net: Convolutional networks for biomedical image segmentation. Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Bertinetto, L., Valmadre, J., Henriques, J.F., Vedaldi, A., and Torr, P.H. (2016, January 8). Fully-convolutional siamese networks for object tracking. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-48881-3_56"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"735","DOI":"10.1016\/B978-0-444-59507-2.50139-6","article-title":"Statistical Monitoring of Water Systems","volume":"31","author":"Viteri","year":"2015","journal-title":"Comput. Aided Chem. Eng."},{"key":"ref_18","first-page":"1137","article-title":"Faster r-cnn: Towards real-time object detection with region proposal networks","volume":"28","author":"Ren","year":"2015","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016). You only look once: Unified, real-time object detection. Advances in Neural Information Processing Systems, Palais des Congr\u00e8s de Montr\u00e9al.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., and Fu, C.-Y. (2016, January 8). Ssd: Single shot multibox detector. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 26). YOLO9000: Better, faster, stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"2174","DOI":"10.1109\/TGRS.2020.3003264","article-title":"FEC: A feature fusion framework for SAR target recognition based on electromagnetic scattering features and deep CNN features","volume":"59","author":"Zhang","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1016\/j.isprsjprs.2020.12.015","article-title":"PBNet: Part-based convolutional neural network for complex composite object detection in remote sensing imagery","volume":"173","author":"Sun","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_24","first-page":"1","article-title":"DABNet: Deformable contextual and boundary-weighted network for cloud detection in remote sensing images","volume":"60","author":"He","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_25","unstructured":"Goyal, A., Bochkovskiy, A., Deng, J., and Koltun, V. (2021). Non-deep networks. arXiv."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 26). Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_28","unstructured":"MacQueen, J. (1967). Classification and analysis of multivariate observations. 5th Berkeley Symp. Math. Statist. Probability, Statistical Laboratory of the University of California."},{"key":"ref_29","first-page":"100","article-title":"Algorithm AS 136: A k-means clustering algorithm","volume":"28","author":"Hartigan","year":"1979","journal-title":"J. R. Stat. Soc. Ser. C (Appl. Stat.)"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1109\/TPAMI.1984.4767478","article-title":"K-means-type algorithms: A generalized convergence theorem and characterization of local optimality","volume":"PAMI-6","author":"Selim","year":"1984","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Bromley, J., Guyon, I., LeCun, Y., S\u00e4ckinger, E., and Shah, R. (1993). Signature verification using a \u201csiamese\u201d time delay neural network. Advances in Neural Information Processing Systems, Morgan Kaufmann Publishers Inc.","DOI":"10.1142\/9789812797926_0003"},{"key":"ref_32","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_33","unstructured":"Chopra, S., Hadsell, R., and LeCun, Y. (2005, January 20). Learning a similarity metric discriminatively, with application to face verification. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR\u201905), San Diego, CA, USA."},{"key":"ref_34","unstructured":"Hadsell, R., Chopra, S., and LeCun, Y. (2006, January 17). Dimensionality reduction by learning an invariant mapping. Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR\u201906), New York, NY, USA."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"14680","DOI":"10.3390\/rs71114680","article-title":"Transferring deep convolutional neural networks for the scene classification of high-resolution remote sensing imagery","volume":"7","author":"Hu","year":"2015","journal-title":"Remote Sens."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"1484","DOI":"10.1109\/LGRS.2017.2717486","article-title":"Improving SAR automatic target recognition models with transfer learning from simulated data","volume":"14","author":"Kusk","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Neubeck, A., and Van Gool, L. (2006, January 20). Efficient non-maximum suppression. Proceedings of the 18th International Conference on Pattern Recognition (ICPR\u201906), Hong Kong, China.","DOI":"10.1109\/ICPR.2006.479"},{"key":"ref_38","first-page":"1","article-title":"Generalizing from a few examples: A survey on few-shot learning","volume":"53","author":"Wang","year":"2020","journal-title":"ACM Comput. Surv."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1155\/2008\/246309","article-title":"Evaluating multiple object tracking performance: The clear mot metrics","volume":"2008","author":"Bernardin","year":"2008","journal-title":"Eurasip J. Image Video Process."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Jin, Z., Yu, D., Song, L., Yuan, Z., and Yu, L. (2022). You Should Look at All Objects. arXiv.","DOI":"10.1007\/978-3-031-20077-9_20"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1109\/MAES.2004.1263228","article-title":"Multiple hypothesis tracking for multiple target tracking","volume":"19","author":"Blackman","year":"2004","journal-title":"IEEE Aerosp. Electron. Syst. Mag."},{"key":"ref_42","unstructured":"Yan, H., Mao, X., Zhang, J., and Zhu, D. (2016, January 24). Frame rate analysis of video synthetic aperture radar (ViSAR). Proceedings of the 2016 International Symposium on Antennas and Propagation (ISAP), Okinawa, Japan."},{"key":"ref_43","unstructured":"Dalal, N., and Triggs, B. (2005, January 20). Histograms of oriented gradients for human detection. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR\u201905), San Diego, CA, USA."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/1\/146\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:52:44Z","timestamp":1760147564000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/1\/146"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,27]]},"references-count":43,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,1]]}},"alternative-id":["rs15010146"],"URL":"https:\/\/doi.org\/10.3390\/rs15010146","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,12,27]]}}}