{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,24]],"date-time":"2026-03-24T04:52:49Z","timestamp":1774327969616,"version":"3.50.1"},"reference-count":30,"publisher":"MDPI AG","issue":"17","license":[{"start":{"date-parts":[[2019,8,26]],"date-time":"2019-08-26T00:00:00Z","timestamp":1566777600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61673376"],"award-info":[{"award-number":["61673376"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>An ultra-high-speed algorithm based on Histogram of Oriented Gradient (HOG) and Support Vector Machine (SVM) for hardware implementation at 10,000 frames per second (FPS) under complex backgrounds is proposed for object detection. The algorithm is implemented on the field-programmable gate array (FPGA) in the high-speed-vision platform, in which 64 pixels are input per clock cycle. The high pixel parallelism of the vision platform limits its performance, as it is difficult to reduce the strides between detection windows below 16 pixels, thus introduce non-negligible deviation of object detection. In addition, limited by the transmission bandwidth, only one frame in every four frames can be transmitted to PC for post-processing, that is, 75% image information is wasted. To overcome the mentioned problem, a multi-frame information fusion model is proposed in this paper. Image data and synchronization signals are first regenerated according to image frame numbers. The maximum HOG feature value and corresponding coordinates of each frame are stored in the bottom of the image with that of adjacent frames\u2019. The compensated ones will be obtained through information fusion with the confidence of continuous frames. Several experiments are conducted to demonstrate the performance of the proposed algorithm. As the evaluation result shows, the deviation is reduced with our proposed method compared with the existing one.<\/jats:p>","DOI":"10.3390\/s19173707","type":"journal-article","created":{"date-parts":[[2019,8,26]],"date-time":"2019-08-26T10:54:53Z","timestamp":1566816893000},"page":"3707","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":17,"title":["An FPGA-Based Ultra-High-Speed Object Detection Algorithm with Multi-Frame Information Fusion"],"prefix":"10.3390","volume":"19","author":[{"given":"Xianlei","family":"Long","sequence":"first","affiliation":[{"name":"The Research Center of Precision Sensing and Control, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"The School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 101408, China"}]},{"given":"Shenhua","family":"Hu","sequence":"additional","affiliation":[{"name":"The Research Center of Precision Sensing and Control, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"The School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 101408, China"}]},{"given":"Yiming","family":"Hu","sequence":"additional","affiliation":[{"name":"The Research Center of Precision Sensing and Control, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"The School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 101408, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8332-5350","authenticated-orcid":false,"given":"Qingyi","family":"Gu","sequence":"additional","affiliation":[{"name":"The Research Center of Precision Sensing and Control, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"The School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 101408, China"}]},{"given":"Idaku","family":"Ishii","sequence":"additional","affiliation":[{"name":"The Robotics Laboratory, Department of System Cybernetics, Hiroshima University, Hiroshima 739-8527, Japan"}]}],"member":"1968","published-online":{"date-parts":[[2019,8,26]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"837","DOI":"10.1109\/TMM.2014.2298377","article-title":"An advanced moving object detection algorithm for automatic traffic monitoring in real-world limited bandwidth networks","volume":"16","author":"Chen","year":"2014","journal-title":"IEEE Trans. Multimed."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"982","DOI":"10.1109\/TCSVT.2018.2828606","article-title":"A Robust Moving Object Detection in Multi-Scenario Big Data for Video Surveillance","volume":"29","author":"Chen","year":"2018","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Chen, X., Chen, B., Guan, J., Huang, Y., and He, Y. (2018). Space-range-Doppler focus-based low-observable moving target detection using frequency diverse array MIMO radar. IEEE Access, 43892\u201343904.","DOI":"10.1109\/ACCESS.2018.2863745"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Calonder, M., Lepetit, V., Strecha, C., and Fua, P. (2010). Brief: Binary Robust Independent Elementary Features, European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-642-15561-1_56"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Rublee, E., Rabaud, V., Konolige, K., and Bradski, G. (2011, January 6\u201313). ORB: An efficient alternative to SIFT or SURF. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126544"},{"key":"ref_6","unstructured":"Dalal, N., and Triggs, B. (2005, January 20\u201325). Histograms of Oriented Gradients for Human Detection. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Diego, CA, USA."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You only look once: Unified, realtime object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, faster, stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_10","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv Preprint."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Sharma, A., Shimasaki, K., Gu, Q., Chen, J., Aoyama, T., Takaki, T., Ishii, I., Tamura, K., and Tajima, K. (2016, January 13\u201315). Super high-speed vision platform for processing 1024 \u00d7 1024 images in real time at 12,500 fps. Proceedings of the International Symposium on System Integration, Sapporo, Japan.","DOI":"10.1109\/SII.2016.7844055"},{"key":"ref_12","first-page":"2090","article-title":"Data-driven background subtraction algorithm for in-camera acceleration in thermal imagery","volume":"18","author":"Makantasis","year":"2017","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"585","DOI":"10.3390\/s120100585","article-title":"FPGA implementation for real-time background subtraction based on horprasert model","volume":"12","author":"Diaz","year":"2012","journal-title":"Sensors"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Acevedo-Avila, R., Gonzalez-Mendoza, M., and Garcia-Garcia, A. (2016). A linked list-based algorithm for blob detection on embedded vision-based sensors. Sensors, 16.","DOI":"10.3390\/s16060782"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Li, J., Yin, Y., Liu, X., Xu, D., and Gu, Q. (2017, January 24\u201328). 12,000-fps Multi-object detection using HOG descriptor and SVM classifier. Proceedings of the 2017 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada.","DOI":"10.1109\/IROS.2017.8206487"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"3818","DOI":"10.1109\/JSEN.2019.2895294","article-title":"A Hardware-Oriented Algorithm for Ultra-High-Speed Object Detection","volume":"19","author":"Li","year":"2019","journal-title":"IEEE Sens. J."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1109\/TCSVT.2012.2202195","article-title":"Fast FPGA-based multiobject feature extraction","volume":"23","author":"Gu","year":"2013","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_18","unstructured":"Bauer, S., Brunsmann, U., and Schlotterbeck-Macht, S. (2009, January 10). FPGA implementation of a HOG-based pedestrian recognition system. Proceedings of the MPC-Workshop, Karlsruhe, Germany."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Kadota, R., Sugano, H., Hiromoto, M., Ochi, H., Miyamoto, R., and Nakamura, Y. (2009, January 12\u201314). Hardware architecture for HOG feature extraction. Proceedings of the Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, Kyoto, Japan.","DOI":"10.1109\/IIH-MSP.2009.216"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1007\/BF00994018","article-title":"Support-vector networks","volume":"20","author":"Cortes","year":"1995","journal-title":"Mach. Learn."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive image features from scale-invariant keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_22","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3\u20136). Imagenet classification with deep convolutional neural networks. Proceedings of the Advances in Neural Information Processing Systems 25 (NIPS 2012), Lake Tahoe, NV, USA."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast r-cnn. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., and Berg, A.C. (2016, January 11\u201314). Ssd: Single shot multibox detector. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_25","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015, January 7\u201310). Faster r-cnn: Towards real-time object detection with region proposal networks. Proceedings of the Advances in Neural Information Processing Systems 28 (NIPS 2015), Montreal, QC, Canada."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1007\/s11263-013-0620-5","article-title":"Selective search for object recognition","volume":"104","author":"Uijlings","year":"2013","journal-title":"Int. J. Comput. Vis."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Shen, Y., Ferdman, M., and Milder, P. (September, January 29). Overcoming resource underutilization in spatial CNN accelerators. Proceedings of the 2016 26th International Conference on Field Programmable Logic and Applications (FPL), Lausanne, Switzerland.","DOI":"10.1109\/FPL.2016.7577315"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Zhao, R., Niu, X., Wu, Y., Luk, W., and Liu, Q. (2017, January 3\u20137). Optimizing CNN-based object detection algorithms on embedded FPGA platforms. Proceedings of the International Symposium on Applied Reconfigurable Computing, Delft, The Netherlands.","DOI":"10.1007\/978-3-319-56258-2_22"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"305","DOI":"10.1007\/s11633-016-1024-0","article-title":"Review of some advances and applications in real-time high-speed vision: Our views and experiences","volume":"13","author":"Gu","year":"2016","journal-title":"Int. J. Autom. Comput."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"013010","DOI":"10.1117\/1.JEI.21.1.013010","article-title":"Color-histogram-based tracking at 2000 fps","volume":"21","author":"Ishii","year":"2012","journal-title":"J. Electron. Imaging"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/19\/17\/3707\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:14:09Z","timestamp":1760188449000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/19\/17\/3707"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,8,26]]},"references-count":30,"journal-issue":{"issue":"17","published-online":{"date-parts":[[2019,9]]}},"alternative-id":["s19173707"],"URL":"https:\/\/doi.org\/10.3390\/s19173707","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,8,26]]}}}