{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,6]],"date-time":"2026-03-06T18:53:22Z","timestamp":1772823202210,"version":"3.50.1"},"reference-count":39,"publisher":"MDPI AG","issue":"14","license":[{"start":{"date-parts":[[2023,7,14]],"date-time":"2023-07-14T00:00:00Z","timestamp":1689292800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100007128","name":"Natural Science Foundation of Shaanxi Province","doi-asserted-by":"publisher","award":["2020JM-206"],"award-info":[{"award-number":["2020JM-206"]}],"id":[{"id":"10.13039\/501100007128","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100007128","name":"Natural Science Foundation of Shaanxi Province","doi-asserted-by":"publisher","award":["SKLLIM2103"],"award-info":[{"award-number":["SKLLIM2103"]}],"id":[{"id":"10.13039\/501100007128","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100007128","name":"Natural Science Foundation of Shaanxi Province","doi-asserted-by":"publisher","award":["B17035"],"award-info":[{"award-number":["B17035"]}],"id":[{"id":"10.13039\/501100007128","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100015245","name":"State Key Laboratory of Laser Interaction with Matter","doi-asserted-by":"publisher","award":["2020JM-206"],"award-info":[{"award-number":["2020JM-206"]}],"id":[{"id":"10.13039\/501100015245","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100015245","name":"State Key Laboratory of Laser Interaction with Matter","doi-asserted-by":"publisher","award":["SKLLIM2103"],"award-info":[{"award-number":["SKLLIM2103"]}],"id":[{"id":"10.13039\/501100015245","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100015245","name":"State Key Laboratory of Laser Interaction with 
Matter","doi-asserted-by":"publisher","award":["B17035"],"award-info":[{"award-number":["B17035"]}],"id":[{"id":"10.13039\/501100015245","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100013314","name":"111 project","doi-asserted-by":"publisher","award":["2020JM-206"],"award-info":[{"award-number":["2020JM-206"]}],"id":[{"id":"10.13039\/501100013314","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100013314","name":"111 project","doi-asserted-by":"publisher","award":["SKLLIM2103"],"award-info":[{"award-number":["SKLLIM2103"]}],"id":[{"id":"10.13039\/501100013314","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100013314","name":"111 project","doi-asserted-by":"publisher","award":["B17035"],"award-info":[{"award-number":["B17035"]}],"id":[{"id":"10.13039\/501100013314","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>The You Only Look Once (YOLO) series has been widely adopted across various domains. With the increasing prevalence of continuous satellite observation, the resulting video streams can be subjected to intelligent analysis for various applications, such as traffic flow statistics, military operations, and other fields. Nevertheless, the signal-to-noise ratio of objects in satellite videos is considerably low, and their size is often smaller, ranging from tens to one percent, when compared to those taken by drones and other equipment. Consequently, the original YOLO algorithm\u2019s performance is inadequate when detecting tiny objects in satellite videos. Hence, we propose an improved framework, named HB-YOLO. To enable the backbone to extract features, we replaced the universal convolution with an improved HorNet that enables higher-order spatial interactions. 
We replaced all Extended Efficient Layer Aggregation Networks (ELANs) with the BoTNet attention mechanism to make the features fully fused. In addition, anchors were re-adjusted, and image segmentation was integrated to achieve detection results, which are tracked using the BoT-SORT algorithm. The experimental results indicate that the original algorithm failed to learn using the satellite video dataset, whereas our proposed approach yielded improved recall and precision. Specifically, the F1-score and mean average precision increased to 0.58 and 0.53, respectively, and the object-tracking performance was enhanced by incorporating the image segmentation method.<\/jats:p>","DOI":"10.3390\/rs15143551","type":"journal-article","created":{"date-parts":[[2023,7,17]],"date-time":"2023-07-17T00:56:47Z","timestamp":1689555407000},"page":"3551","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":18,"title":["HB-YOLO: An Improved YOLOv7 Algorithm for Dim-Object Tracking in Satellite Remote Sensing Videos"],"prefix":"10.3390","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0009-0009-8200-1990","authenticated-orcid":false,"given":"Chaoran","family":"Yu","sequence":"first","affiliation":[{"name":"School of Optoelectronic Engineering, Xidian University, 2 South Taibai Road, Xi\u2019an 710071, China"}]},{"given":"Zhejun","family":"Feng","sequence":"additional","affiliation":[{"name":"School of Optoelectronic Engineering, Xidian University, 2 South Taibai Road, Xi\u2019an 710071, China"}]},{"given":"Zengyan","family":"Wu","sequence":"additional","affiliation":[{"name":"School of Optoelectronic Engineering, Xidian University, 2 South Taibai Road, Xi\u2019an 710071, China"}]},{"given":"Runxi","family":"Wei","sequence":"additional","affiliation":[{"name":"School of Optoelectronic Engineering, Xidian University, 2 South Taibai Road, Xi\u2019an 710071, 
China"}]},{"given":"Baoming","family":"Song","sequence":"additional","affiliation":[{"name":"School of Optoelectronic Engineering, Xidian University, 2 South Taibai Road, Xi\u2019an 710071, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8901-880X","authenticated-orcid":false,"given":"Changqing","family":"Cao","sequence":"additional","affiliation":[{"name":"School of Optoelectronic Engineering, Xidian University, 2 South Taibai Road, Xi\u2019an 710071, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,7,14]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"5612518","DOI":"10.1109\/TGRS.2021.3130436","article-title":"Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark","volume":"60","author":"Yin","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_2","first-page":"5617611","article-title":"SatSOT: A Benchmark Dataset for Satellite Video Single Object Tracking","volume":"60","author":"Zhao","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Ye, F., Ai, T., Wang, J., Yao, Y., and Zhou, Z. (2022). A Method for Classifying Complex Features in Urban Areas Using Video Satellite Remote Sensing Data. Remote Sens., 14.","DOI":"10.3390\/rs14102324"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Yang, L., Yuan, G., Zhou, H., Liu, H., Chen, J., and Wu, H. (2022). RS-YOLOX: A High-Precision Detector for Object Detection in Satellite Remote Sensing Images. Appl. Sci., 12.","DOI":"10.3390\/app12178707"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"7010","DOI":"10.1109\/TGRS.2020.2978512","article-title":"Small Object Tracking in Satellite Videos Using Background Compensation","volume":"58","author":"Wang","year":"2020","journal-title":"IEEE Trans. Geosci. 
Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"44069","DOI":"10.1109\/ACCESS.2021.3059487","article-title":"Research on Multiview Stereo Mapping Based on Satellite Video Images","volume":"9","author":"Li","year":"2021","journal-title":"IEEE Access"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"7860","DOI":"10.1109\/TGRS.2019.2916953","article-title":"Tracking Objects From Satellite Videos: A Velocity Feature Based Correlation Filter","volume":"57","author":"Shao","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Wu, J., Cao, C., Zhou, Y., Zeng, X., Feng, Z., Wu, Q., and Huang, Z. (2021). Multiple Ship Tracking in Remote Sensing Images Using Deep Learning. Remote Sens., 13.","DOI":"10.3390\/rs13183601"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1742","DOI":"10.1109\/ACCESS.2023.3233964","article-title":"YOLO-Extract: Improved YOLOv5 for Aircraft Object Detection in Remote Sensing Images","volume":"11","author":"Liu","year":"2023","journal-title":"IEEE Access"},{"key":"ref_10","unstructured":"Etten, A.V. (2018). You Only Look Twice: Rapid Multi-Scale Object Detection In Satellite Imagery. arXiv."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 11\u201318). Fast R-CNN. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Doll\u00e1r, P., and Girshick, R. (2017, January 22\u201329). Mask R-CNN. Proceedings of the IEEE ICCV, Venice, Italy.","DOI":"10.1109\/ICCV.2017.322"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans. 
Pattern Anal. Mach. Intell."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the IEEE CVPR, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Xu, D., and Wu, Y. (2020). Improved YOLO-V3 with DenseNet for multi-scale remote sensing object detection. Sensors, 20.","DOI":"10.3390\/s20154276"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_17","unstructured":"Bochkovskiy, A., Wang, C.Y., and Liao, H.Y.M. (2020). YOLOv4: Optimal speed and accuracy of object detection. arXiv."},{"key":"ref_18","unstructured":"Wang, C.Y., Bochkovskiy, A., and Mark Liao, H.-Y. (2022). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. arXiv."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S.E., Fu, C.Y., and Berg, A.C. (2016, January 11\u201314). SSD: Single Shot MultiBox Detector. Proceedings of the IEEE ECCV, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1016\/j.cosrev.2018.03.001","article-title":"New trends on moving object detection in video images captured by a moving camera: A survey","volume":"28","author":"Mehran","year":"2018","journal-title":"Comput. Sci. 
Rev."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"2882","DOI":"10.1109\/TIM.2017.2729378","article-title":"Object tracking using multiple features and adaptive model updating","volume":"66","author":"Hu","year":"2017","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"ref_22","unstructured":"Luca, B., Jack, V., Jo\u00e3o, F.H., Andrea, V., and Philip, H.S.T. (2016). Fully-convolutional Siamese networks for object tracking. arXiv."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Bewley, A., Ge, Z., Ott, L., Ramos, F., and Upcroft, B. (2016, January 25\u201328). Simple online and real-time tracking. Proceedings of the 23rd IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA.","DOI":"10.1109\/ICIP.2016.7533003"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"198","DOI":"10.1007\/s11263-013-0624-1","article-title":"Multiframe many\u2013many point correspondence for vehicle tracking in high density wide area aerial videos","volume":"104","author":"Saleemi","year":"2013","journal-title":"Int. J. Comput. Vis."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Wojke, N., Bewley, A., and Paulus, D. (2017, January 17\u201320). Simple online and real-time tracking with a deep association metric. Proceedings of the IEEE International Conference on Image Processing (ICIP), Beijing, China.","DOI":"10.1109\/ICIP.2017.8296962"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Lalonde, R., Zhang, D., and Shah, M. (2018, January 18\u201322). ClusterNet: Detecting small objects in large scenes by exploiting spatio-temporal information. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake, Utah.","DOI":"10.1109\/CVPR.2018.00421"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Sun, P., Jiang, Y., Yu, D., Yuan, Z., Luo, P., Liu, W., and Wang, X. (2021). ByteTrack: Multi-Object Tracking by Associating Every Detection Box. 
arXiv.","DOI":"10.1007\/978-3-031-20047-2_1"},{"key":"ref_28","unstructured":"Aharon, N., Orfaig, R., and Bobrovsky, B. (2022). BoT-SORT: Robust Associations Multi-Pedestrian Tracking. arXiv."},{"key":"ref_29","unstructured":"Rao, Y., Zhao, W., Tang, Y., Zhou, J., Lim, S., and Lu, J. (2022). HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions. arXiv."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Srinivas, A., Lin, T., Parmar, N., Shlens, J., Abbeel, P., and Vaswani, A. (2021, January 20\u201325). Bottleneck Transformers for Visual Recognition. Proceedings of the 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01625"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","article-title":"Spatial pyramid pooling in deep convolutional networks for visual recognition","volume":"37","author":"He","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Cao, Y., Wang, G., Yan, D., and Zhao, Z. (2016). Two algorithms for the detection and tracking of moving vehicle objects in aerial infrared image sequences. Remote Sens., 8.","DOI":"10.3390\/rs8010028"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"773","DOI":"10.1016\/j.patrec.2005.11.005","article-title":"Efficient adaptive density estimation per image pixel for the task of background subtraction","volume":"27","author":"Zivkovic","year":"2006","journal-title":"Pattern Recognit. Lett."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"1709","DOI":"10.1109\/TIP.2010.2101613","article-title":"ViBe: A universal background subtraction algorithm for video sequences","volume":"20","author":"Barnich","year":"2011","journal-title":"IEEE Trans. Image Process."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Rezaei, B., and Ostadabbas, S. 
(2017, January 22\u201329). Background subtraction via fast robust matrix completion. Proceedings of the IEEE International Conference on Computer Vision Workshops (ICCVW), Venice, Italy.","DOI":"10.1109\/ICCVW.2017.221"},{"key":"ref_36","unstructured":"Pflugfelder, R., Weissenfeld, A., and Wagner, J. (2020). On learning vehicle detection in satellite video. arXiv."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_38","first-page":"186","article-title":"Detection and tracking of large number of objects in wide area surveillance","volume":"6313","author":"Reilly","year":"2010","journal-title":"Proc. Eur. Conf. Comput. Vis."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Rodriguez, P., and Wohlberg, B. (2013, January 15\u201318). Fast principal component pursuit via alternating minimization. 
Proceedings of the 2013 IEEE International Conference on Image Processing, Melbourne, VIC, Australia.","DOI":"10.1109\/ICIP.2013.6738015"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/14\/3551\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T20:12:22Z","timestamp":1760127142000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/14\/3551"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,14]]},"references-count":39,"journal-issue":{"issue":"14","published-online":{"date-parts":[[2023,7]]}},"alternative-id":["rs15143551"],"URL":"https:\/\/doi.org\/10.3390\/rs15143551","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,7,14]]}}}