{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,23]],"date-time":"2026-01-23T08:20:07Z","timestamp":1769156407110,"version":"3.49.0"},"reference-count":33,"publisher":"MDPI AG","issue":"13","license":[{"start":{"date-parts":[[2020,6,28]],"date-time":"2020-06-28T00:00:00Z","timestamp":1593302400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Chung-Ang University Graduate Research Scholarship","award":["2018"],"award-info":[{"award-number":["2018"]}]},{"name":"Korean Ministry of Trade, Industry and Energy (MOTIE)","award":["P0002397"],"award-info":[{"award-number":["P0002397"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>The single shot multi-box detector (SSD) exhibits low accuracy in small-object detection; this is because it does not consider the scale contextual information between its layers, and the shallow layers lack adequate semantic information. To improve the accuracy of the original SSD, this paper proposes a new single shot multi-box detector using trident feature and squeeze and extraction feature fusion (SSD-TSEFFM); this detector employs the trident network and the squeeze and excitation feature fusion module. Furthermore, a trident feature module (TFM) is developed, inspired by the trident network, to consider the scale contextual information. The use of this module makes the proposed model robust to scale changes owing to the application of dilated convolution. Further, the squeeze and excitation block feature fusion module (SEFFM) is used to provide more semantic information to the model. The SSD-TSEFFM is compared with the faster regions with convolution neural network features (RCNN) (2015), SSD (2016), and DF-SSD (2020) on the PASCAL VOC 2007 and 2012 datasets. The experimental results demonstrate the high accuracy of the proposed model in small-object detection, in addition to a good overall accuracy. The SSD-TSEFFM achieved 80.4% mAP and 80.2% mAP on the 2007 and 2012 datasets, respectively. This indicates an average improvement of approximately 2% over other models.<\/jats:p>","DOI":"10.3390\/s20133630","type":"journal-article","created":{"date-parts":[[2020,6,29]],"date-time":"2020-06-29T11:17:17Z","timestamp":1593429437000},"page":"3630","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":13,"title":["SSD-TSEFFM: New SSD Using Trident Feature and Squeeze and Extraction Feature Fusion"],"prefix":"10.3390","volume":"20","author":[{"given":"Young-Joon","family":"Hwang","sequence":"first","affiliation":[{"name":"School of Electrical and Electronics Engineering, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1587-7684","authenticated-orcid":false,"given":"Jin-Gu","family":"Lee","sequence":"additional","affiliation":[{"name":"School of Electrical and Electronics Engineering, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Un-Chul","family":"Moon","sequence":"additional","affiliation":[{"name":"School of Electrical and Electronics Engineering, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ho-Hyun","family":"Park","sequence":"additional","affiliation":[{"name":"School of Electrical and Electronics Engineering, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2020,6,28]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Wu, Y., Tang, S., Zhang, S., and Ogai, H. (2019). An Enhanced Feature Pyramid Object Detection Network for Autonomous Driving. Sensors, 9.","DOI":"10.3390\/app9204363"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Peng, C., Bu, W., Xiao, J., Wong, K.-C., and Yang, M. (2018). An Improved Neural Network Cascade for Face Detection in Large Scene Surveillance. Sensors, 8.","DOI":"10.3390\/app8112222"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Zhu, X., Ding, M., Huang, T., Jin, X., and Zhang, X. (2018). PCANet-Based Structural Representation for Nonrigid Multimodal Medical Image Registration. Sensors, 18.","DOI":"10.3390\/s18051477"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Cal\u00ec, M., and Ambu, R. (2018). Advanced 3D Photogrammetric Surface Reconstruction of Extensive Objects by UAV Camera Image Acquisition. Sensors, 18.","DOI":"10.3390\/s18092815"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Ryu, J., and Kim, S. (2019). Chinese Character Boxes: Single Shot Detector Network for Chinese Character Detection. Sensors, 9.","DOI":"10.3390\/app9020315"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Ali, S., Scovanner, P., and Shah, M. (2007, January 24\u201329). A 3-Dimensional SIFT descriptor and its application to action recognition. Proceedings of the 15th International Conference on Multimedia, Augsburg, Germany.","DOI":"10.1145\/1291233.1291311"},{"key":"ref_7","unstructured":"Dalal, N., and Triggs, B. (2005, January 25). Histograms of oriented gradients for human detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2005), San Diego, CA, USA."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive image features from scale-invariant keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1016\/j.patcog.2004.04.010","article-title":"Illuminant and device invariant colour using histogram equalisation","volume":"38","author":"Finlayson","year":"2005","journal-title":"Pattern Recognit."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2014), Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast R-CNN. Proceedings of the International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_12","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015). Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in Neural Information Processing Systems, Proceedings of the Neural Information Processing Systems Conference (NIPS 2015), Montreal, QC, Canada, 7\u201312 December 2015, Neural Information Processing Systems Foundation, Inc."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Dollar, P., and Girshick, R. (2017, January 22\u201329). Mask R-CNN. Proceedings of the 16th IEEE International Conference on Computer Vision (ICCV 2017), Venice, Italy.","DOI":"10.1109\/ICCV.2017.322"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 17\u201322). Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), Minneapolis, MN, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2015, January 7\u201312). You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2015), Boston, MA, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Fu, C., and Berg, A.C. (2016, January 8\u201316). SSD: Single Shot Multibox Detector. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Goyal, P., Girshick, R., He, K., and Dollar, P. (2017, January 22\u201329). Focal loss for dense object detection. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV 2017), Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_18","unstructured":"Fu, C.Y., Liu, W., Ranga, A., Tyagi, A., and Berg, A.C. (2017, January 21\u201326). DSSD: Deconvolutional single shot detector. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Hawaii, HI, USA."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Jeong, J., Park, H., and Kwak, N. (2017, January 4\u20137). Enhancement of SSD by concatenating feature maps for object detection. Proceedings of the British Machine Vision Conference (BMVC 2017), London, UK.","DOI":"10.5244\/C.31.76"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"24344","DOI":"10.1109\/ACCESS.2020.2971026","article-title":"DF-SSD: An Improved SSD Object Detection Algorithm Based on Dense Net and Feature Fusion","volume":"8","author":"Zhai","year":"2020","journal-title":"IEEE Access"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Li, Y., Chen, Y., Wang, N., and Zhang, Z. (November, January 27). Scale-Aware Trident Networks for Object Detection. Proceedings of the IEEE International Conference on Computer Vision (ICCV 2019), Seoul, Korea.","DOI":"10.1109\/ICCV.2019.00615"},{"key":"ref_22","unstructured":"Yu, J., Shen, L., and Sun, G. (2018, January 18\u201322). Squeeze-and-Excitation Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018), Salt Lake City, UT, USA."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The PASCAL visual object classes (voc) challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis."},{"key":"ref_24","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Singh, B., and Davis, L.S. (2018, January 19\u201321). An Analysis of Scale Invariance in Object Detection\u2014SNIP. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00377"},{"key":"ref_26","first-page":"33","article-title":"Pyramid methods in image processing","volume":"29","author":"Adelson","year":"1984","journal-title":"RCA Eng."},{"key":"ref_27","unstructured":"Singh, B., Najibi, M., and Davis, L.S. (2018, January 2\u20138). SNIPER: Efficient Multi-Scale Training. Proceedings of the Advances in Neural Information Processing Systems (NIPS 2018), Montreal, QC, Canada."},{"key":"ref_28","unstructured":"Yu, F., and Koltun, V. (2016, January 2\u20134). Multi-scale context aggregation by dilated convolutions. Proceedings of the 2016 International Conference on Learning Representations, San Juan, Puerto Rico."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 8\u201316). Deep Residual Learning for Image Recognition. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1007\/s11263-015-0816-y","article-title":"ImageNet Large Scale Visual Recognition Challenge","volume":"115","author":"Russakovsky","year":"2014","journal-title":"Int. J. Comput. Vis."},{"key":"ref_31","unstructured":"Glorot, X., and Bengio, Y. (2010, January 13\u201315). Understanding the difficulty of training deep feedforward neural networks. Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Sardinia, Italy."},{"key":"ref_32","unstructured":"Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z., Desmaison, A., Antiga, L., and Lerer, A. (2017, January 8\u20139). Automatic differentiation in PyTorch. Proceedings of the Advances in Neural Information Processing Systems (NIPS) Workshops, Long Beach, CA, USA."},{"key":"ref_33","unstructured":"Chetlur, S., Woolley, C., Vandermersch, P., Cohen, J., Tran, J., Catanzaro, B., and Shelhamer, E. (2014). cudnn: Efficient primitives for deep learning. arXiv."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/20\/13\/3630\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T09:44:09Z","timestamp":1760175849000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/20\/13\/3630"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,28]]},"references-count":33,"journal-issue":{"issue":"13","published-online":{"date-parts":[[2020,7]]}},"alternative-id":["s20133630"],"URL":"https:\/\/doi.org\/10.3390\/s20133630","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,6,28]]}}}