{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T00:10:54Z","timestamp":1775175054885,"version":"3.50.1"},"reference-count":41,"publisher":"MDPI AG","issue":"17","license":[{"start":{"date-parts":[[2023,8,26]],"date-time":"2023-08-26T00:00:00Z","timestamp":1693008000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>In the field of computer vision, the detection of infrared small targets (IRSTD) is a crucial research area that plays an important role in space exploration, infrared warning systems, and other applications. However, the existing IRSTD methods are prone to generating a higher number of false alarms and an inability to accurately locate the target, especially in scenarios with low signal-to-noise ratio or high noise interference. To address this issue, we proposes a fully convolutional-based small target detection algorithm (FCST). The algorithm builds on the anchor-free detection method FCOS and adds a focus structure and a single aggregation approach to design a lightweight feature extraction network that efficiently extracts features for small targets. Furthermore, we propose a feature refinement mechanism to emphasize the target and suppress conflicting information at multiple scales, enhancing the detection of infrared small targets. Experimental results demonstrate that the proposed algorithm achieves a detection rate of 95% and a false alarm rate of 2.32% for IRSTD tasks. To tackle even more complex scenarios, we propose a temporally-aware fully convolutional infrared small target detection (TFCST) algorithm that leverages both spatial and temporal information from sequence images. Building on a single-frame detection network, the algorithm incorporates ConvLSTM units to extract spatiotemporal contextual information from the sequence images, boosting the detection of infrared small targets. The proposed algorithm shows fast detection speed and achieves a 2.73% improvement in detection rate and an 8.13% reduction in false alarm rate relative to the baseline single-frame detection networks.<\/jats:p>","DOI":"10.3390\/rs15174198","type":"journal-article","created":{"date-parts":[[2023,8,28]],"date-time":"2023-08-28T05:46:47Z","timestamp":1693201607000},"page":"4198","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Infrared Small Target Detection Based on a Temporally-Aware Fully Convolutional Neural Network"],"prefix":"10.3390","volume":"15","author":[{"given":"Lei","family":"Zhang","sequence":"first","affiliation":[{"name":"Beijing Institute of Tracking and Telecommunication Technology, Beijing 100094, China"},{"name":"College of Astronautics, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China"}]},{"given":"Peng","family":"Han","sequence":"additional","affiliation":[{"name":"School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China"}]},{"given":"Jiahua","family":"Xi","sequence":"additional","affiliation":[{"name":"School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-1325-6425","authenticated-orcid":false,"given":"Zhengrong","family":"Zuo","sequence":"additional","affiliation":[{"name":"School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,8,26]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"3632","DOI":"10.1109\/TPAMI.2020.2985395","article-title":"Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-Segmentation","volume":"43","author":"Chen","year":"2021","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Izadinia, H., Sadeghi, F., Divvala, S., Hajishirzi, H., Choi, Y., and Farhadi, A. (2015, January 7\u201313). Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.10"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"2650","DOI":"10.1109\/TCSVT.2019.2923712","article-title":"Spatial-Temporal Context-Aware Online Action Detection and Prediction","volume":"30","author":"Huang","year":"2020","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"3854","DOI":"10.1049\/ipr2.12599","article-title":"Class-wise boundary regression by uncertainty in temporal action detection","volume":"16","author":"Chen","year":"2022","journal-title":"IET Image Process."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"2350","DOI":"10.1109\/TIP.2021.3052069","article-title":"Depth-Quality-Aware Salient Object Detection","volume":"30","author":"Chen","year":"2021","journal-title":"IEEE Trans. Image Process."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Shivappriya, S., Priyadarsini, M., Stateczny, A., Puttamadappa, C., and Parameshachari, B. (2021). Cascade Object Detection and Remote Sensing Object Detection Method Based on Trainable Activation Function. Remote Sens., 13.","DOI":"10.3390\/rs13020200"},{"key":"ref_7","first-page":"9","article-title":"Discussion on the suppression and method of background noise of small target infrared image","volume":"2","author":"Zhao","year":"2004","journal-title":"Opt. Optoelectron. Technol."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1117\/12.364049","article-title":"Max-mean and max-median filters for detection of small targets","volume":"3809","author":"Deshpande","year":"1999","journal-title":"Proc. SPIE"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"485","DOI":"10.1109\/31.1775","article-title":"The Two-Dimensional Adaptive LMS (TDLMS) Algorithm","volume":"35","author":"Hadhoud","year":"1988","journal-title":"IEEE Trans. Circuits Syst."},{"key":"ref_10","first-page":"735","article-title":"Small Target Detection Using Bilateral Filter Based on Edge Component","volume":"31","author":"Bae","year":"2010","journal-title":"J. Infrared Millim. Terahertz Waves"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1016\/j.infrared.2005.04.006","article-title":"The design of top-hat morphological filter and application to infrared target detection","volume":"48","author":"Zeng","year":"2006","journal-title":"Infrared Phys. Technol."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"574","DOI":"10.1109\/TGRS.2013.2242477","article-title":"A Local Contrast Method for Small Infrared Target Detection","volume":"52","author":"Chen","year":"2013","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"2168","DOI":"10.1109\/LGRS.2014.2323236","article-title":"A Robust Infrared Small Target Detection Algorithm Based on Human Visual System","volume":"11","author":"Han","year":"2014","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Wang, H., Wang, H., Zhou, L., and Wang, L. (November, January 27). Miss Detection vs. False Alarm: Adversarial Learning for Small Object Segmentation in Infrared Images. Proceedings of the International Conference on Computer Vision 2019, Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00860"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"1469","DOI":"10.1007\/s11036-019-01377-6","article-title":"Infrared Dim and Small Target Detection Based on Denoising Autoencoder Network","volume":"25","author":"Shi","year":"2020","journal-title":"Mob. Netw. Appl."},{"key":"ref_16","first-page":"1","article-title":"A Novel Pattern for Infrared Small Target Detection With Generative Adversarial Network","volume":"99","author":"Zhao","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1109\/TMM.2021.3070138","article-title":"Deep-IRTarget: An automatic target detector in infrared imagery using dual-domain feature extraction and allocation","volume":"24","author":"Zhang","year":"2022","journal-title":"IEEE Trans. Multimed."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1016\/j.neucom.2021.10.110","article-title":"Graph-based few-shot learning with transformed feature propagation and optimal class allocation","volume":"470","author":"Zhang","year":"2022","journal-title":"Neurocomputing"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/j.infrared.2019.05.010","article-title":"Infrared dim and small target detection based on three-dimensional collaborative filtering and spatial inversion modeling","volume":"101","author":"Ren","year":"2019","journal-title":"Infrared Phys. Technol."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Irsoy, O., and Cardie, C. (2014, January 26\u201328). Opinion mining with deep recurrent neural networks. Proceedings of the Conference on Empirical Methods in Natural Language Processing 2014, Doha, Qatar.","DOI":"10.3115\/v1\/D14-1080"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Xiao, F., Xiao, F., and Lee, Y. (2018, January 8\u201314). Video Object Detection with an Aligned Spatial-Temporal Memory. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01237-3_30"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Lu, Y., Lu, Y., Lu, C., and Tang, C. (2017, January 22\u201329). Online video object detection using association LSTM. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.257"},{"key":"ref_23","unstructured":"Shi, X., Shi, X., Chen, Z., Wang, H., Yeung, D., Wong, W., and Woo, W. (2015, January 7\u201312). Convolutional LSTM network: A machine learning apporach for precipitation nowcasting. Proceedings of the 28th International Conference on Neural Infromation Processing Systems, Montreal, QC, Canada."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Liu, W., Liu, W., Anguelov, D., and Erhan, D. (2016). SSD: Single Shot MultiBox Detector, Springer International Publishing.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"363","DOI":"10.1109\/TNNLS.2020.2978756","article-title":"3-d quasi-recurrent neural network for hyperspectral image denoising","volume":"32","author":"Wei","year":"2020","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_26","unstructured":"Tian, Z., Tian, Z., Shen, C., Chen, H., and He, T. (November, January 27). FCOS: Fully convolutional one-stage object detection. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Seoul, Republic of Korea."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"He, K., He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Huang, G., Huang, G., Liu, Z., and Laurens, V. (2017, January 21\u201326). Densely Connected Convolutional Networks. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.243"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Lee, Y., Lee, Y., Hwang, J., Lee, S., Bae, Y., and Park, J. (2019, January 16\u201317). An Energy and GPU-Computation Efficient Backbone Network for Real-Time Object Detection. Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Long Beach, CA, USA.","DOI":"10.1109\/CVPRW.2019.00103"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Lin, T., Lin, T., Dollar, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature Pyramid Networks for Object Detection. Proceedings of the IEEE\/CVF Conference on Computer Vision & Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_31","first-page":"2999","article-title":"Focal Loss for Dense Object Detection","volume":"99","author":"Lin","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Rezatofighi, H., Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., and Savarese, S. (2019, January 15\u201320). Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00075"},{"key":"ref_33","unstructured":"Ballas, N., Ballas, N., Yao, L., Pal, C., and Courville, A. Delving Deeper into Convolutional Networks for Learning Video Representations. Comput. Sci., 2015."},{"key":"ref_34","unstructured":"Liu, M., and Zhu, M. (2018, January 18\u201323). Mobile Video Object Detection with Temporally-Aware Feature Maps. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Cho, K., Cho, K., Van Merrienboer, B., and Gulcehre, C. (2014). Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. Comput. Sci.","DOI":"10.3115\/v1\/D14-1179"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1016\/j.patcog.2016.04.002","article-title":"Multiscale patch-based contrast measure for small infrared target detection","volume":"58","author":"Wei","year":"2016","journal-title":"Pattern Recognit."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"4996","DOI":"10.1109\/TIP.2013.2281420","article-title":"Infrared patch-image model for small target detection in a single image","volume":"22","author":"Gao","year":"2013","journal-title":"IEEE Trans. Image Process."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Zhang, L., Zhang, L., Peng, L., Zhang, T., Cao, S., and Peng, Z. (2018). Infrared Small Target Detection via Non-Convex Rank Approximation Minimization Joint l2,1 Norm. Remote Sens., 10.","DOI":"10.3390\/rs10111821"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Zhang, L., Zhang, L., and Peng, Z. (2019). Infrared small target detection based on partial sum of the tensor nuclear norm. Remote Sens., 11.","DOI":"10.3390\/rs11040382"},{"key":"ref_40","unstructured":"Redmon, J., Redmon, J., and Farhadi, A. (2018). YOLOv3: An Incremental Improvement. arXiv."},{"key":"ref_41","unstructured":"Ge, Z., Ge, Z., Liu, S., Wang, F., and Sun, J. (2021). YOLOX: Exceeding YOLO Series in 2021. arXiv."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/17\/4198\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T20:39:38Z","timestamp":1760128778000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/17\/4198"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,26]]},"references-count":41,"journal-issue":{"issue":"17","published-online":{"date-parts":[[2023,9]]}},"alternative-id":["rs15174198"],"URL":"https:\/\/doi.org\/10.3390\/rs15174198","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,8,26]]}}}