{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T17:28:46Z","timestamp":1771003726245,"version":"3.50.1"},"reference-count":31,"publisher":"SAGE Publications","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["JCM"],"published-print":{"date-parts":[[2022,7,8]]},"abstract":"<jats:p>Target detection is the basis for automatic target recognition system of infrared imaging guidance to complete subsequent tasks such as recognition and tracking. Existing systems have not the autonomous learning ability of target feature, and it will be powerless once the task environment exceeds the pre-planned condition. The single-stage target detection based on deep learning has the ability of autonomous learning and high computational efficiency, which is an effective way to solve the problem of infrared imaging guidance target detection in complex environment. SSD (Single Shot MultiBox Detector) is a classical single-stage detection model, however, the convolution layer with strong semantic information in SSD has low resolution, which is not conducive to small target detection. In addition, the location loss of SSD does not consider the impact of target scale change. Therefore, this paper puts forward two improvement ideas in view of SSD: (1) Starting from FPN (Feature Pyramid Network), feature channel\u2019s importance is distinguished through efficient channel attention mechanism, the contribution of each feature layer to the fusion output is described based on the learnable weight, and the feature weighted fusion of bidirectional multi-scale is realized between the feature layer which has low resolution and strong semantics and the feature layer which has high resolution and weak semantics. (2) Starting from IoU (Intersection over Union) and considering non overlapping parts and geometric relationship between the predicted box and the ground-truth box, the location loss of SSD that remains invariable to the target scale change is constructed to improve the sensitivity of the detection model to the locating error of small target. The experimental results show that, for 300 \u00d7 300 input, the presented method achieves 84.7% mAP (mean Average Precision) on VOC2007 test and for 512 \u00d7 512 input, it reaches 86.6%. On the self-built infrared aircraft data set, the proposed method achieves 81.1% mAP and can detect more small targets. Without affecting detection speed, the presented method on experimental results outperforms some comparable state-of-the-art models such as YOLOv3 (You Only Look Once), DSSD (Deconvolutional Single Shot Multibox Detector), RSSD (Rainbow Single Shot Multibox Detector) and FSSD (Fusion Single Shot Multibox Detector).<\/jats:p>","DOI":"10.3233\/jcm-226112","type":"journal-article","created":{"date-parts":[[2022,5,10]],"date-time":"2022-05-10T12:24:56Z","timestamp":1652185496000},"page":"1393-1408","source":"Crossref","is-referenced-by-count":3,"title":["An improved SSD method for infrared target detection based on convolutional neural network"],"prefix":"10.1177","volume":"22","author":[{"given":"Gang","family":"Liu","sequence":"first","affiliation":[]},{"given":"Zixuan","family":"Cao","sequence":"additional","affiliation":[]},{"given":"Sen","family":"Liu","sequence":"additional","affiliation":[]},{"given":"Bin","family":"Song","sequence":"additional","affiliation":[]},{"given":"Zhonghua","family":"Liu","sequence":"additional","affiliation":[]}],"member":"179","reference":[{"key":"10.3233\/JCM-226112_ref1","doi-asserted-by":"crossref","first-page":"2429","DOI":"10.1016\/S0042-6989(97)00464-1","article-title":"The role of the primary visual cortex in higher level vision","volume":"38","author":"Sing","year":"1998","journal-title":"Vision Research"},{"issue":"4","key":"10.3233\/JCM-226112_ref2","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1109\/MCI.2010.938364","article-title":"Deep machine learning-a new frontier in artificial intelligence research","volume":"5","author":"Arel","year":"2010","journal-title":"IEEE Transactions on Computational Intelligence Magazine"},{"key":"10.3233\/JCM-226112_ref3","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning review","volume":"521","author":"LeCun","year":"2015","journal-title":"Nature"},{"issue":"8","key":"10.3233\/JCM-226112_ref4","first-page":"1289","article-title":"Advances and perspectives on applications of deep learning in visual object detection","volume":"43","author":"Zhang","year":"2017","journal-title":"Acta Automatica Sinica"},{"issue":"2","key":"10.3233\/JCM-226112_ref5","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1007\/s11263-019-01247-4","article-title":"Deep learning for generic object detection: a survey","volume":"128","author":"Liu","year":"2020","journal-title":"International Journal of Computer Vision"},{"issue":"3","key":"10.3233\/JCM-226112_ref6","first-page":"44","article-title":"Research on single-stage object detection algorithm based on deep learning","volume":"27","author":"Liu","year":"2019","journal-title":"Aero Weaponry"},{"key":"10.3233\/JCM-226112_ref7","doi-asserted-by":"crossref","unstructured":"Redmon J, Divvala S, Girshick R, Farhadi A. You only look once: unified, real-Time object detection, in: Proc 2016 IEEE conference on computer vision and pattern recognition. 2016; 779-788.","DOI":"10.1109\/CVPR.2016.91"},{"key":"10.3233\/JCM-226112_ref8","doi-asserted-by":"crossref","unstructured":"Redmon J, Farhadi A. YOLO9000: better, faster, stronger, in: Proc. 2017 IEEE conference on computer vision and pattern recognition. 2017; 6517-6525.","DOI":"10.1109\/CVPR.2017.690"},{"key":"10.3233\/JCM-226112_ref9","unstructured":"Redmon J, Farhadi A. YOLOv3: an incremental improvement, arXiv preprint arXiv: 180402767, 2018."},{"key":"10.3233\/JCM-226112_ref10","doi-asserted-by":"crossref","unstructured":"Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC. SSD: single shot multibox detector, in: Proc 2016 Europe Conference-Computer Vision. 2016; 21-37.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"10.3233\/JCM-226112_ref11","unstructured":"Fu C, Liu W, Ananth R, Ambrish T, Alexander CB. DSSD: deconvolutional single shot detector, arXiv preprint arXiv: 170106659, 2017."},{"key":"10.3233\/JCM-226112_ref12","doi-asserted-by":"crossref","unstructured":"Jeong J, Park H, Kwak N. Enhancement of SSD by concatenating feature maps for object detection, in: Proc. 2017 British Machine Vision Conference, 2017.","DOI":"10.5244\/C.31.76"},{"key":"10.3233\/JCM-226112_ref13","unstructured":"Li Z, Zhou F. FSSD: Feature fusion single shot multibox detector, arXiv preprint arXiv: 171200960, 2018."},{"issue":"2","key":"10.3233\/JCM-226112_ref14","doi-asserted-by":"crossref","first-page":"318","DOI":"10.1109\/TPAMI.2018.2858826","article-title":"Focal loss for dense object detection","volume":"42","author":"Lin","year":"2018","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"10.3233\/JCM-226112_ref15","doi-asserted-by":"crossref","unstructured":"Lin T, Dollar P, Girshick R, He KM, Hariharan B, Belongie S. Feature pyramid networks for object detection, in: Proc. 2017 IEEE conference on computer vision and pattern recognition, 2017, 936-944.","DOI":"10.1109\/CVPR.2017.106"},{"issue":"7","key":"10.3233\/JCM-226112_ref16","first-page":"0615001","article-title":"Enhancement of single shot multibox detector for aerial infrared target detection","volume":"39","author":"Xie","year":"2019","journal-title":"Acta Optica Sinica"},{"issue":"11","key":"10.3233\/JCM-226112_ref17","first-page":"11040031","article-title":"Infrared camouflage detection method for special vehicles based on improved SSD","volume":"48","author":"Zhao","year":"2019","journal-title":"Infrared and Laser Engineering"},{"key":"10.3233\/JCM-226112_ref18","unstructured":"Marcel S, Andrew W, Mehryar E, Wang S, Barry C. POL-LWIR vehicle detection: Convolutional neural networks meet polarised infrared sensors, in: Proc. 2018 IEEE conference on computer vision and pattern recognition, 2018, 1328-1334."},{"key":"10.3233\/JCM-226112_ref19","unstructured":"Alperen E, Bensu A, Burak B, Yusuf A. Comparison of image classification and object detection for passenger seat belt violation detection using NIR RGB surveillance camera images, in: Proc. 2018 IEEE international conference on advanced video and signal-based surveillance, 2018."},{"key":"10.3233\/JCM-226112_ref20","unstructured":"Shu LZ. Research on key technology of fast image fusion, object detection and tracking based on UAV, PhD. dissertation, National University of Defense Technology, 2018."},{"issue":"1","key":"10.3233\/JCM-226112_ref21","first-page":"42","article-title":"An infrared image pedestrian detection algorithm based on improved SSD algorithm","volume":"27","author":"Liu","year":"2020","journal-title":"Electronics Optics & Control"},{"key":"10.3233\/JCM-226112_ref22","unstructured":"Zhu DW. Infrared image plane target detection method based on deep learning, PhD. dissertation, Xidian University, 2018."},{"key":"10.3233\/JCM-226112_ref23","unstructured":"Saksham J, Indu S. Robust detection of iris region using an adapted SSD framework, in: Proc 2018 Indian conference on computer vision, graphics and image processing, 2018."},{"key":"10.3233\/JCM-226112_ref24","doi-asserted-by":"crossref","unstructured":"Liu S, Qi L, Qin HF, Shi JP, Jia JY. Path aggregation network for instance segmentation, in Proc. 2018 IEEE conference on computer vision and pattern recognition, 2018, 8759-8768.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"10.3233\/JCM-226112_ref25","doi-asserted-by":"crossref","unstructured":"Zhou P, Ni BB, Geng C, Hu JG, Xu Y. Scale-transferrable object detection. in: Proc. 2018 IEEE conference on computer vision and pattern recognition, 2018, 528-537.","DOI":"10.1109\/CVPR.2018.00062"},{"key":"10.3233\/JCM-226112_ref26","doi-asserted-by":"crossref","unstructured":"Zhao QJ, Sheng T, Wang YT, Tang Z, Chen Y, Cai L, Ling HB. M2det: a Single-shot object detector based on multi-level feature pyramid network, in: Proc. 2019 Innovative applications of artificial intelligence conference, 2019, 9259-9266.","DOI":"10.1609\/aaai.v33i01.33019259"},{"key":"10.3233\/JCM-226112_ref27","doi-asserted-by":"crossref","unstructured":"Islam MA, Rochan M, Bruce N, Wang Y. Gated feedback refinement network for dense image labeling, in: Proc. 2017 IEEE conference on computer vision and pattern recognition, 2017, 4877-4885.","DOI":"10.1109\/CVPR.2017.518"},{"key":"10.3233\/JCM-226112_ref28","doi-asserted-by":"crossref","unstructured":"Ghiasi G, Lin T, Pang RM, Le QV. Nas-fpn: learning scalable feature pyramid architecture for object detection, in: Proc. 2019 IEEE conference on computer vision and pattern recognition, 2019, 7029-7038.","DOI":"10.1109\/CVPR.2019.00720"},{"key":"10.3233\/JCM-226112_ref29","first-page":"11531","article-title":"ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks,","author":"Wang","year":"2020","journal-title":"Proc. 2020 IEEE conference on computer vision and pattern recognition"},{"key":"10.3233\/JCM-226112_ref30","doi-asserted-by":"crossref","unstructured":"Rezatofighi H, Tsoi N, Gwak JY, Sadeghian A, Reid I, Savarese S. Generalized Intersection over Union: A Metric and a Loss for Bounding Box Regression, in: Proc. 2019 IEEE conference on computer vision and pattern recognition, 2019, 658-666.","DOI":"10.1109\/CVPR.2019.00075"},{"key":"10.3233\/JCM-226112_ref31","doi-asserted-by":"crossref","unstructured":"Wang X, Girshick R, Gupta A, He KM. Non-local Neural Net-works. in: Proc. 2018 IEEE conference on computer vision and pattern recognition, 2018, 7794-7803.","DOI":"10.1109\/CVPR.2018.00813"}],"container-title":["Journal of Computational Methods in Sciences and Engineering"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/JCM-226112","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T16:32:35Z","timestamp":1771000355000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/JCM-226112"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,8]]},"references-count":31,"journal-issue":{"issue":"4"},"URL":"https:\/\/doi.org\/10.3233\/jcm-226112","relation":{},"ISSN":["1472-7978","1875-8983"],"issn-type":[{"value":"1472-7978","type":"print"},{"value":"1875-8983","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,7,8]]}}}