{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,27]],"date-time":"2026-04-27T07:52:05Z","timestamp":1777276325574,"version":"3.51.4"},"reference-count":80,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2023,4,21]],"date-time":"2023-04-21T00:00:00Z","timestamp":1682035200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004543","name":"China Scholarship Council","doi-asserted-by":"publisher","award":["202006540007"],"award-info":[{"award-number":["202006540007"]}],"id":[{"id":"10.13039\/501100004543","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>This study proposes a visual tracking system that can detect and track multiple fast-moving appearance-varying targets simultaneously with 500 fps image processing. The system comprises a high-speed camera and a pan-tilt galvanometer system, which can rapidly generate large-scale high-definition images of the wide monitored area. We developed a CNN-based hybrid tracking algorithm that can robustly track multiple high-speed moving objects simultaneously. Experimental results demonstrate that our system can track up to three moving objects with velocities lower than 30 m per second simultaneously within an 8-m range. The effectiveness of our system was demonstrated through several experiments conducted on simultaneous zoom shooting of multiple moving objects (persons and bottles) in a natural outdoor scene. Moreover, our system demonstrates high robustness to target loss and crossing situations.<\/jats:p>","DOI":"10.3390\/s23084150","type":"journal-article","created":{"date-parts":[[2023,4,21]],"date-time":"2023-04-21T02:05:31Z","timestamp":1682042731000},"page":"4150","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":17,"title":["An Active Multi-Object Ultrafast Tracking System with CNN-Based Hybrid Object Detection"],"prefix":"10.3390","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6823-2217","authenticated-orcid":false,"given":"Qing","family":"Li","sequence":"first","affiliation":[{"name":"Smart Robotics Laboratory, Graduate School of Advanced Science and Engineering, Hiroshima University, 1-4-1 Kagamiyama, Higashi-Hiroshima 739-8527, Hiroshima, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9176-3606","authenticated-orcid":false,"given":"Shaopeng","family":"Hu","sequence":"additional","affiliation":[{"name":"Smart Robotics Laboratory, Graduate School of Advanced Science and Engineering, Hiroshima University, 1-4-1 Kagamiyama, Higashi-Hiroshima 739-8527, Hiroshima, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4773-1575","authenticated-orcid":false,"given":"Kohei","family":"Shimasaki","sequence":"additional","affiliation":[{"name":"Smart Robotics Laboratory, Graduate School of Advanced Science and Engineering, Hiroshima University, 1-4-1 Kagamiyama, Higashi-Hiroshima 739-8527, Hiroshima, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7728-2363","authenticated-orcid":false,"given":"Idaku","family":"Ishii","sequence":"additional","affiliation":[{"name":"Smart Robotics Laboratory, Graduate School of Advanced Science and Engineering, Hiroshima University, 1-4-1 Kagamiyama, Higashi-Hiroshima 739-8527, Hiroshima, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2023,4,21]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"103448","DOI":"10.1016\/j.artint.2020.103448","article-title":"Multiple object tracking: A literature review","volume":"293","author":"Luo","year":"2021","journal-title":"Artif. Intell."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Mwamba, I.C., Morshedi, M., Padhye, S., Davatgari, A., Yoon, S., Labi, S., and Hastak, M. (2021). Synthesis Study of Best Practices for Mapping and Coordinating Detours for Maintenance of Traffic (MOT) and Risk Assessment for Duration of Traffic Control Activities, Purdue University. Joint Transportation Research Program.","DOI":"10.5703\/1288284317344"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Ghafir, I., Prenosil, V., Svoboda, J., and Hammoudeh, M. (2016, January 22\u201324). A survey on network security monitoring systems. Proceedings of the 2016 IEEE 4th International Conference on Future Internet of Things and Cloud Workshops (FiCloudW), Vienna, Austria.","DOI":"10.1109\/W-FiCloud.2016.30"},{"key":"ref_4","first-page":"629","article-title":"A survey on intelligent transportation systems","volume":"15","author":"Qureshi","year":"2013","journal-title":"Middle-East J. Sci. Res."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Weng, X., Wang, J., Held, D., and Kitani, K. (2020, January 24\u201330). 3d multi-object tracking: A baseline and new evaluation metrics. Proceedings of the 2020 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, USA.","DOI":"10.1109\/IROS45743.2020.9341164"},{"key":"ref_6","unstructured":"Rzadca, K., Findeisen, P., Swiderski, J., Zych, P., Broniek, P., Kusmierek, J., Nowak, P., Strack, B., Witusowski, P., and Hand, S. (2023, January 8\u201312). Autopilot: Workload autoscaling at Google. Proceedings of the Fifteenth European Conference on Computer Systems, New York, NY, USA."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Bohush, R., and Zakharava, I. (2019, January 21\u201323). Robust person tracking algorithm based on convolutional neural network for indoor video surveillance systems. Proceedings of the Pattern Recognition and Information Processing: 14th International Conference, PRIP 2019, Minsk, Belarus. Revised Selected Papers 14.","DOI":"10.1007\/978-3-030-35430-5_24"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Kaiser, R., Thaler, M., Kriechbaum, A., Fassold, H., Bailer, W., and Rosner, J. (2011, January 16\u201317). Real-time Person Tracking in High-resolution Panoramic Video for Automated Broadcast Production. Proceedings of the 2011 Conference for Visual Media Production, Washington, DC, USA.","DOI":"10.1109\/CVMP.2011.9"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"106","DOI":"10.1017\/jfm.2019.238","article-title":"Super-resolution reconstruction of turbulent flows with machine learning","volume":"870","author":"Fukami","year":"2019","journal-title":"J. Fluid Mech."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3390462","article-title":"A deep journey into super-resolution: A survey","volume":"53","author":"Anwar","year":"2020","journal-title":"ACM Comput. Surv. (CSUR)"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1889","DOI":"10.1364\/OL.39.001889","article-title":"Super-resolution imaging using a camera array","volume":"39","author":"Carles","year":"2014","journal-title":"Opt. Lett."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Kang, S., Paik, J.K., Koschan, A., Abidi, B.R., and Abidi, M.A. (2003, January 19\u201322). Real-time video tracking using PTZ cameras. Proceedings of the Sixth International Conference on Quality Control by Artificial Vision, Gatlinburg, TN, USA.","DOI":"10.1117\/12.514945"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"3282","DOI":"10.1109\/TIP.2012.2188806","article-title":"Collaborative sensing in a distributed PTZ camera network","volume":"21","author":"Ding","year":"2012","journal-title":"IEEE Trans. Image Process."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Li, Q., Chen, M., Gu, Q., and Ishii, I. (2022, January 23\u201327). A Flexible Calibration Algorithm for High-speed Bionic Vision System based on Galvanometer. Proceedings of the 2022 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan.","DOI":"10.1109\/IROS47612.2022.9981947"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"305","DOI":"10.1007\/s11633-016-1024-0","article-title":"Review of some advances and applications in real-time high-speed vision: Our views and experiences","volume":"13","author":"Gu","year":"2016","journal-title":"Int. J. Autom. Comput."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Okumura, K., Oku, H., and Ishikawa, M. (2011, January 9\u201313). High-speed gaze controller for millisecond-order pan\/tilt camera. Proceedings of the 2011 IEEE International Conference on Robotics and Automation, Shanghai, China.","DOI":"10.1109\/ICRA.2011.5980080"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1109\/TMECH.2017.2764504","article-title":"Vibration sensing of a bridge model using a multithread active vision system","volume":"23","author":"Aoyama","year":"2017","journal-title":"IEEE\/ASME Trans. Mechatron."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Hu, S., Shimasaki, K., Jiang, M., Takaki, T., and Ishii, I. (2019, January 6\u20138). A dual-camera-based ultrafast tracking system for simultaneous multi-target zooming. Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics (ROBIO), Dali, China.","DOI":"10.1109\/ROBIO49542.2019.8961601"},{"key":"ref_19","first-page":"1","article-title":"500-fps omnidirectional visual tracking using three-axis active vision system","volume":"70","author":"Jiang","year":"2021","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1109\/81.222795","article-title":"The CNN paradigm","volume":"40","author":"Chua","year":"1993","journal-title":"IEEE Trans. Circuits Syst. Fundam. Theory Appl."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"691","DOI":"10.1109\/LRA.2020.3048653","article-title":"A 500-fps pan-tilt tracking system with deep-learning-based object detection","volume":"6","author":"Jiang","year":"2021","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1109\/JPROC.2023.3238524","article-title":"Object detection in 20 years: A survey","volume":"111","author":"Zou","year":"2023","journal-title":"Proc. IEEE"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1162\/jocn.1991.3.1.71","article-title":"Eigenfaces for Recognition","volume":"3","author":"Turk","year":"1991","journal-title":"J. Cogn. Neurosci."},{"key":"ref_24","unstructured":"Amit, Y., Felzenszwalb, P., and Girshick, R. (2020). Computer Vision: A Reference Guide, Springer International Publishing."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Li, G., Xu, D., Cheng, X., Si, L., and Zheng, C. (2022, January 18\u201322). Simvit: Exploring a simple vision transformer with sliding windows. Proceedings of the 2022 IEEE International Conference on Multimedia and Expo (ICME), Taipei, Taiwan.","DOI":"10.1109\/ICME52920.2022.9859907"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"103661","DOI":"10.1016\/j.compind.2022.103661","article-title":"Deep learning-based object detection in augmented reality: A systematic review","volume":"139","author":"Ghasemi","year":"2022","journal-title":"Comput. Ind."},{"key":"ref_27","unstructured":"Dalal, N., and Triggs, B. (2005, January 20\u201326). Histograms of oriented gradients for human detection. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR\u201905), San Diego, CA, USA."},{"key":"ref_28","unstructured":"Viola, P., and Jones, M. (2001, January 8\u201314). Rapid object detection using a boosted cascade of simple features. Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2001, Kauai, HI, USA."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"3634","DOI":"10.1109\/TITS.2018.2877200","article-title":"Real-time vehicle detection using an effective region proposal-based depth and 3-channel pattern","volume":"20","author":"Nguyen","year":"2018","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_30","unstructured":"Iandola, F.N., Han, S., Moskewicz, M.W., Ashraf, K., Dally, W.J., and Keutzer, K. (2016). SqueezeNet: AlexNet-level accuracy with 50\u00d7 fewer parameters and <0.5 MB model size. arXiv."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_33","unstructured":"Targ, S., Almeida, D., and Lyman, K. (2016). Resnet in resnet: Generalizing residual architectures. arXiv."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Xie, S., Girshick, R., Doll\u00e1r, P., Tu, Z., and He, K. (2017, January 21\u201326). Aggregated Residual Transformations for Deep Neural Networks. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.634"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Wang, C.Y., Liao, H.Y.M., Wu, Y.H., Chen, P.Y., Hsieh, J.W., and Yeh, I.H. (2020, January 14\u201319). CSPNet: A new backbone that can enhance learning capability of CNN. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops, Seattle, WA, USA.","DOI":"10.1109\/CVPRW50498.2020.00203"},{"key":"ref_36","unstructured":"Tan, M., and Le, Q. (2019, January 10\u201315). Efficientnet: Rethinking model scaling for convolutional neural networks. Proceedings of the International Conference on Machine Learning, Long Beach, CA, USA."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1016\/j.isprsjprs.2020.12.010","article-title":"Review on Convolutional Neural Networks (CNN) in vegetation remote sensing","volume":"173","author":"Kattenborn","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_39","unstructured":"Purkait, P., Zhao, C., and Zach, C. (2017). SPP-Net: Deep absolute pose regression with synthetic views. arXiv."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast r-cnn. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_41","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015). Faster r-cnn: Towards real-time object detection with region proposal networks. Adv. Neural Inf. Process. Syst., 28."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_43","unstructured":"Dai, J., Li, Y., He, K., and Sun, J. (2016). R-fcn: Object detection via region-based fully convolutional networks. Adv. Neural Inf. Process. Syst., 29."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Qiao, S., Chen, L.C., and Yuille, A. (2021, January 20\u201325). Detectors: Detecting objects with recursive feature pyramid and switchable atrous convolution. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01008"},{"key":"ref_45","unstructured":"Tian, Z., Shen, C., Chen, H., and He, T. (November, January 27). Fcos: Fully convolutional one-stage object detection. Proceedings of the IEEE\/CVF International Conference on Computer Cision, Seoul, Republic of Korea."},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 11\u201314). Ssd: Single shot multibox detector. Proceedings of the Computer Vision\u2013ECCV 2016: 14th European Conference, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., and Doll\u00e1r, P. (2017, January 22\u201329). Focal Loss for Dense Object Detection. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Duan, K., Bai, S., Xie, L., Qi, H., Huang, Q., and Tian, Q. (November, January 27). CenterNet: Keypoint Triplets for Object Detection. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00667"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Tan, M., Pang, R., and Le, Q.V. (2020, January 13\u201319). EfficientDet: Scalable and Efficient Object Detection. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01079"},{"key":"ref_51","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., and Gelly, S. (2020). An image is worth 16x16 words: Transformers for image recognition at scale. arXiv."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"1066","DOI":"10.1016\/j.procs.2022.01.135","article-title":"A Review of Yolo algorithm developments","volume":"199","author":"Jiang","year":"2022","journal-title":"Procedia Comput. Sci."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Sang, J., Wu, Z., Guo, P., Hu, H., Xiang, H., Zhang, Q., and Cai, B. (2018). An improved YOLOv2 for vehicle detection. Sensors, 18.","DOI":"10.3390\/s18124272"},{"key":"ref_54","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv."},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Yu, J., and Zhang, W. (2021). Face mask wearing detection algorithm based on improved YOLO-v4. Sensors, 21.","DOI":"10.3390\/s21093263"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Zhu, X., Lyu, S., Wang, X., and Zhao, Q. (2021, January 11\u201317). TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-captured Scenarios. Proceedings of the 2021 IEEE\/CVF International Conference on Computer Vision Workshops (ICCVW), Montreal, BC, Canada.","DOI":"10.1109\/ICCVW54120.2021.00312"},{"key":"ref_57","unstructured":"Li, C., Li, L., Jiang, H., Weng, K., Geng, Y., Li, L., Ke, Z., Li, Q., Cheng, M., and Nie, W. (2022). YOLOv6: A single-stage object detection framework for industrial applications. arXiv."},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Soleimanitaleb, Z., Keyvanrad, M.A., and Jafari, A. (2019, January 24\u201325). Object Tracking Methods:A Review. Proceedings of the 2019 9th International Conference on Computer and Knowledge Engineering (ICCKE), Mashhad, Khorasan Razavi, Iran.","DOI":"10.1109\/ICCKE48569.2019.8964761"},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"790","DOI":"10.1109\/34.400568","article-title":"Mean shift, mode seeking, and clustering","volume":"17","author":"Cheng","year":"1995","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1049\/ip-rsn:19990255","article-title":"Improved particle filter for nonlinear problems","volume":"146","author":"Carpenter","year":"1999","journal-title":"IEEE Proc.-Radar Sonar Navig."},{"key":"ref_61","unstructured":"Li, Y., and Zhu, J. (2014, January 6\u20137). A scale adaptive kernel correlation filter tracker with feature integration. Proceedings of the ECCV Workshops (2), Zurich, Switzerland."},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Bolme, D.S., Beveridge, J.R., Draper, B.A., and Lui, Y.M. (2010, January 13\u201318). Visual object tracking using adaptive correlation filters. Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA.","DOI":"10.1109\/CVPR.2010.5539960"},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Bewley, A., Ge, Z., Ott, L., Ramos, F., and Upcroft, B. (2016, January 25\u201328). Simple online and realtime tracking. Proceedings of the 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA.","DOI":"10.1109\/ICIP.2016.7533003"},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Wojke, N., Bewley, A., and Paulus, D. (2017, January 17\u201320). Simple online and realtime tracking with a deep association metric. Proceedings of the 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China.","DOI":"10.1109\/ICIP.2017.8296962"},{"key":"ref_65","doi-asserted-by":"crossref","unstructured":"Wang, Z., Zheng, L., Liu, Y., Li, Y., and Wang, S. (2020, January 23\u201328). Towards real-time multi-object tracking. Proceedings of the Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK.","DOI":"10.1007\/978-3-030-58621-8_7"},{"key":"ref_66","doi-asserted-by":"crossref","unstructured":"Chen, L., Ai, H., Zhuang, Z., and Shang, C. (2018, January 23\u201327). Real-time multiple people tracking with deeply learned candidate selection and person re-identification. Proceedings of the 2018 IEEE International Conference on Multimedia and Expo (ICME), San Diego, CA, USA.","DOI":"10.1109\/ICME.2018.8486597"},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Bergmann, P., Meinhardt, T., and Leal-Taixe, L. (2019, January 15\u201320). Tracking without bells and whistles. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Long Beach, CA, USA.","DOI":"10.1109\/ICCV.2019.00103"},{"key":"ref_68","unstructured":"Zhang, J., Zhou, S., Chang, X., Wan, F., Wang, J., Wu, Y., and Huang, D. (2020). Multiple object tracking by flowing and fusing. arXiv."},{"key":"ref_69","unstructured":"Sun, P., Cao, J., Jiang, Y., Zhang, R., Xie, E., Yuan, Z., Wang, C., and Luo, P. (2020). Transtrack: Multiple object tracking with transformer. arXiv."},{"key":"ref_70","doi-asserted-by":"crossref","unstructured":"Meinhardt, T., Kirillov, A., Leal-Taixe, L., and Feichtenhofer, C. (2022, January 18\u201324). Trackformer: Multi-object tracking with transformers. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00864"},{"key":"ref_71","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1109\/MVT.2009.935537","article-title":"Intelligent transportation systems","volume":"5","author":"Dimitrakopoulos","year":"2010","journal-title":"IEEE Veh. Technol. Mag."},{"key":"ref_72","first-page":"1","article-title":"Special issue on high-speed vision-based autonomous navigation of uavs","volume":"1","author":"Loianno","year":"2018","journal-title":"J. Field Robot."},{"key":"ref_73","doi-asserted-by":"crossref","unstructured":"Huang, S., Bergstr\u00f6m, N., Yamakawa, Y., Senoo, T., and Ishikawa, M. (2016). Applying high-speed vision sensing to an industrial robot for high-performance position regulation under uncertainties. Sensors, 16.","DOI":"10.3390\/s16081195"},{"key":"ref_74","doi-asserted-by":"crossref","unstructured":"Ishii, I., Taniguchi, T., Sukenobe, R., and Yamamoto, K. (2022, January 18\u201324). Development of high-speed and real-time vision platform, H3 vision. Proceedings of the 2009 IEEE\/RSJ International Conference on Intelligent Robots and Systems, New Orleans, LA, USA.","DOI":"10.1109\/IROS.2009.5354718"},{"key":"ref_75","doi-asserted-by":"crossref","unstructured":"Sharma, A., Shimasaki, K., Gu, Q., Chen, J., Aoyama, T., Takaki, T., Ishii, I., Tamura, K., and Tajima, K. (2016, January 13\u201315). Super high-speed vision platform for processing 1024 \u00d7 1024 images in real time at 12500 fps. Proceedings of the 2016 IEEE\/SICE International Symposium on System Integration (SII), Shiroishi, Japan.","DOI":"10.1109\/SII.2016.7844055"},{"key":"ref_76","doi-asserted-by":"crossref","unstructured":"Gu, Q., Nakamura, N., Aoyama, T., Takaki, T., and Ishii, I. (2015, January 2\u20134). A Full-Pixel Optical Flow System Using a GPU-Based High-Frame-Rate Vision. Proceedings of the 2015 Conference on Advances In Robotics, Goa, India.","DOI":"10.1145\/2783449.2783501"},{"key":"ref_77","doi-asserted-by":"crossref","first-page":"836","DOI":"10.1109\/LRA.2016.2524981","article-title":"Design principles of a high-speed omni-scannable gaze controller","volume":"1","author":"Oku","year":"2016","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_78","doi-asserted-by":"crossref","first-page":"9436","DOI":"10.1109\/JSEN.2021.3054425","article-title":"A simultaneous multi-object zooming system using an ultrafast pan-tilt camera","volume":"21","author":"Hu","year":"2021","journal-title":"IEEE Sens. J."},{"key":"ref_79","doi-asserted-by":"crossref","first-page":"819","DOI":"10.1007\/s00034-009-9130-7","article-title":"Fast normalized cross-correlation","volume":"28","author":"Yoo","year":"2009","journal-title":"Circuits Syst. Signal Process."},{"key":"ref_80","doi-asserted-by":"crossref","first-page":"2587","DOI":"10.1109\/TIP.2020.3048632","article-title":"SFace: Sigmoid-Constrained Hypersphere Loss for Robust Face Recognition","volume":"30","author":"Zhong","year":"2021","journal-title":"IEEE Trans. Image Process."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/8\/4150\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T19:20:17Z","timestamp":1760124017000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/8\/4150"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,4,21]]},"references-count":80,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2023,4]]}},"alternative-id":["s23084150"],"URL":"https:\/\/doi.org\/10.3390\/s23084150","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,4,21]]}}}