{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,20]],"date-time":"2026-05-20T23:28:25Z","timestamp":1779319705777,"version":"3.51.4"},"reference-count":33,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2018,11,12]],"date-time":"2018-11-12T00:00:00Z","timestamp":1541980800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100003662","name":"Korea Evaluation Institute of Industrial Technology","doi-asserted-by":"publisher","award":["10063413"],"award-info":[{"award-number":["10063413"]}],"id":[{"id":"10.13039\/501100003662","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Person-following technology is an important robot service. The major trend of person-following is to utilize computer vision technology to localize the target person, due to the wide view and rich information that is obtained from the real world through a camera. However, most existing approaches employ the detecting-by-tracking strategy, which suffers from low speed, accompanied with more complicated detecting models and unstable region of interest (ROI) outputs in unexpressed situations. In this paper, we propose a novel classification-lock strategy to localize the target person, which incorporates the visual tracking technology with object detection technology, to adapt the localization model to different environments online, and to keep a high frame-per-second (FPS) on the mobile platform. This person-following approach consists of three key parts. In the first step, a pairwise cluster tracker is employed to localize the person. A positive and negative classifier is then utilized to verify the tracker\u2019s result and to update the tracking model. In addition, a detector pre-trained by a CPU-optimized convolutional neural network is used to further improve the result of tracking. In the experiment, our approach is compared with other state-of-art approaches by a Vojir tracking dataset, with three sequences in the items of human to prove the quality of person localization. Moreover, the common challenges during the following task are evaluated by several image sequences in a static scene, and a dynamic scene is used to evaluate the improvement from the classification-lock strategy. Finally, our approach is deployed on a mobile robot to test its performance on the function of the person-following. Compared with other state-of-art methods, our approach achieves the highest score (0.91 recall rate). In the static and dynamic scene, the output of the ROI based on the classification-lock strategy is significantly better than that without it. Our approach also succeeds in a long-term following task in an indoor multi-floor scenario.<\/jats:p>","DOI":"10.3390\/s18113903","type":"journal-article","created":{"date-parts":[[2018,11,14]],"date-time":"2018-11-14T10:58:22Z","timestamp":1542193102000},"page":"3903","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["A Classification-Lock Tracking Strategy Allowing a Person-Following Robot to Operate in a Complicated Indoor Environment \u2020"],"prefix":"10.3390","volume":"18","author":[{"given":"Shenlu","family":"Jiang","sequence":"first","affiliation":[{"name":"College of Information and Communication Engineering, Sungkyunkwan University, Suwon 440-746, Korea"},{"name":"Department of Land Surveying and Geo-informatics, The Hongkong Polytechnic University, Hung Hom, Hong Kong, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7704-0615","authenticated-orcid":false,"given":"Wei","family":"Yao","sequence":"additional","affiliation":[{"name":"Department of Land Surveying and Geo-informatics, The Hongkong Polytechnic University, Hung Hom, Hong Kong, China"},{"name":"Research Institute for Sustainable Urban Development, The Hongkong Polytechnic University, Hung Hom, Hong Kong, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0045-1066","authenticated-orcid":false,"given":"Zhonghua","family":"Hong","sequence":"additional","affiliation":[{"name":"College of Information Technology, Shanghai Ocean University, Shanghai 201306, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ling","family":"Li","sequence":"additional","affiliation":[{"name":"College of Information and Communication Engineering, Sungkyunkwan University, Suwon 440-746, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Cheng","family":"Su","sequence":"additional","affiliation":[{"name":"College of Information and Communication Engineering, Sungkyunkwan University, Suwon 440-746, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tae-Yong","family":"Kuc","sequence":"additional","affiliation":[{"name":"College of Information and Communication Engineering, Sungkyunkwan University, Suwon 440-746, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2018,11,12]]},"reference":[{"key":"ref_1","unstructured":"Nishimura, S., Takemura, H., and Mizoguchi, H. (2007, January 15\u201318). Development of Attachable Modules for Robotizing Daily Items\u2014Person Following Shopping Cart Robot. Proceedings of the IEEE International Conference on Robotics and Biomimetics (ROBIO 2007), Sanya, China."},{"key":"ref_2","first-page":"56","article-title":"Key Technologies in Robot Assistants: Motion Coordination between a Human and a Mobile Robot","volume":"4","author":"Prassler","year":"2002","journal-title":"Trans. Control Autom. Syst. Eng."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Spinello, L., Luber, M., and Arras, K.O. (2011, January 9\u201313). Tracking People in 3D Using a Bottom-up top-down Detector. Proceedings of the IEEE International Conference on Robotics and Automation (ICRA 2011), Shanghai, China.","DOI":"10.1109\/ICRA.2011.5980085"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Arras, K.O., Mozos, O.M., and Burgard, W. (2007, January 10\u201314). Using Boosted Features for the Detection of People in 2D Range Data. Proceedings of the IEEE International Conference on Robotics and Automation. (ICRA 2007), Roma, Italy.","DOI":"10.1109\/ROBOT.2007.363998"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1177\/0278364903022002002","article-title":"People Tracking with A Mobile Robot Using Sample-based Joint Probabilistic Data Association Filters","volume":"22","author":"Schulz","year":"2003","journal-title":"Int. J. Robot. Res."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1516","DOI":"10.1177\/0278364910370216","article-title":"Pedestrian Detection and Tracking Using Three-dimensional LADAR Data","volume":"29","author":"Mertz","year":"2010","journal-title":"Int. J. Robot. Res."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Yoshimi, T., Nishiyama, M., Sonoura, T., Hideichi, N., Seiji, T., Hirokazu, S., Fumio, O., Nobuto, M., and Hiroshi, M. (2006, January 9\u201315). Development of A Person Following Robot with Vision Based Target Detection. Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS 2006), Beijing, China.","DOI":"10.1109\/IROS.2006.282029"},{"key":"ref_8","unstructured":"Viola, P., and Jones, M. (2001, January 8\u201314). Rapid Object Detection Using a Boosted Cascade of Simple Features. Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR 2001), Kauai, HI, USA."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Livingston, A.M., Sebastian, J., Ai, Z., and Decker, W.J. (2012, January 4\u20138). Performance Measurements for The Microsoft Kinect Skeleton. Proceedings of the IEEE International Conference on Virtual Reality and 3D User Interfaces (IEEE VR 2012), Costa Mesa, CA, USA.","DOI":"10.1109\/VR.2012.6180911"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Bian, Z., Xu, T., Su, C., and Luo, X. (2016, January 23\u201324). Human Abnormal Behavior Detection Based on RGBD Video\u2019s Skeleton Information Entropy. Proceedings of the IEEE International Conference on Communications, Signal Processing, and Systems, Chengdu, China.","DOI":"10.1007\/978-3-662-49831-6_74"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Naseer, T., Sturm, J., and Cremers, D. (2013, January 3\u20137). Followme: Person Following and Gesture Recognition with A Quadrocopter. Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS 2013), Tokyo, Japan.","DOI":"10.1109\/IROS.2013.6696416"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1466","DOI":"10.1177\/0278364909341884","article-title":"A Fast Stereo-Based System for Detecting and Tracking Pedestrians from a Moving Vehicle","volume":"28","author":"Bajracharya","year":"2009","journal-title":"Int. J. Robot. Res."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"76","DOI":"10.1016\/j.robot.2016.07.004","article-title":"Identification of a specific person using color, height, and gait features for a person following robot","volume":"84","author":"Koide","year":"2016","journal-title":"Robot. Autom. Syst."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Perre, J. (2018, January 21\u201323). End-to-End Deep Learning for Robotic Follow. Proceedings of the 2018 2nd International Conference on Mechatronics Systems and Control Engineering (ICMSCE 2018), Amsterdam, The Netherlands.","DOI":"10.1145\/3185066.3185084"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"478","DOI":"10.1145\/2508037.2508039","article-title":"A Survey of Appearance Models in Visual Object Tracking","volume":"4","author":"Li","year":"2013","journal-title":"ACM Trans. Intell. Syst. Technol."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1409","DOI":"10.1109\/TPAMI.2011.239","article-title":"Tracking-learning-detection","volume":"34","author":"Kalal","year":"2012","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Hare, S., Saffari, A., and Torr, P.H. (2011, January 6\u201313). Struck: Structured Output Tracking with Kernels. Proceedings of the IEEE International on Computer Vision (ICCV 2011), Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126251"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Nebehay, G., and Pflugfelder, R. (2015, January 7\u201312). Clustering of Static-adaptive Correspondences for Deformable Object Tracking. Proceedings of the IEEE International conference on Computer Vision and Pattern Recognition (CVPR 2015), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298895"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Kalal, Z., Matas, J., and Mikolajczyk, K. (2010, January 13\u201318). P-N learning: Bootstrapping Binary Classifiers by Structural Constraints. Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR 2014), San Francisco, CA, USA.","DOI":"10.1109\/CVPR.2010.5540231"},{"key":"ref_20","unstructured":"Lenc, K., and Vedaldi, A. (2018, November 11). R-CNN Minus R. Available online: https:\/\/arxiv.org\/abs\/1506.06981."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Girshick, R.B. (2015, January 8\u201310). Fast R-CNN. Proceedings of the IEEE international conference on computer vision (CVPR 2015), Boston, MA, USA.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_22","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015, January 7\u201312). Faster R-CNN: Towards real-time Object Detection with Region Proposal Networks. Proceedings of the International Conference on Neural Information Processing Systems (NIPS 2015), Montreal, PQ, Canada."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (July, January 26). You only look once: Unified, Real-time Object Detection. Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR 2016), Las Vegas, NE, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C., and Berg, A.C. (2016, January 8\u201316). SSD: Single Shot MultiBox Detector. Proceedings of the IEEE European Conference on Computer Vision (ECCV 2016), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_25","unstructured":"Andrew, H., Menglong, Z., Bo, C., Dmitry, K., Weijun, W., Tobias, W., Marco, A., and Hartwig, A. (2018, November 11). Mobilenets: Efficient Convolutional Neural Networks for Mobile Vision Applications. Available online: https:\/\/arxiv.org\/abs\/1704.04861."},{"key":"ref_26","unstructured":"Vojir, T. (2018, November 11). Tracking Dataset. Available online: http:\/\/cmp.felk.cvut.cz\/~vojirtom\/dataset\/ tv77\/."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Jiang, S., Hang, M., Kuc, T., Joo, S., Yang, S., and Hong, Z. (2017, January 5\u20138). Classification-lock Tracking Approach Applied on Person Following Robot. Proceedings of the IEEE International Conference on Robotics and Biomimetics (ROBIO 2017), Macau, China.","DOI":"10.1109\/ROBIO.2017.8324803"},{"key":"ref_28","unstructured":"Dalal, N., and Triggs, B. (2005, January 20\u201325). Histograms of Oriented Gradients for Human Detection. Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR 2005), San Diego, CA, USA."},{"key":"ref_29","unstructured":"Shi, S.B. (1994, January 21\u201323). Good features to track. Proceedings of the IEEE International on Computer Vision and Pattern Recognition (CVPR 1994), Seattle, WA, USA."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Leutenegger, S., Chili, M., and Siegwart, R.Y. (2011, January 6\u201313). BRISK: Binary Robust invariant scalable keypoints. Proceedings of the IEEE International Conference on Computer Vision (ICCV 2011), Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126542"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"832","DOI":"10.1109\/34.709601","article-title":"The Random Subspace Method for Constructing Decision Forests","volume":"20","author":"Ho","year":"1998","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Zhang, K., Zhang, L., and Yang, H.M. (2012, January 7\u201313). Real-time Compressive Tracking. Proceedings of the IEEE International Conference on Computer Vision (ECCV 2012), Firenze, Italy.","DOI":"10.1007\/978-3-642-33712-3_62"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Zhang, K., Zhang, L., Liu, Q., Zhang, D., and Yang, M.H. (2014, January 6\u201312). Fast Tracking Via Dense Spatio-temporal Context Learning. Proceedings of the IEEE European Conference on Computer Vision (ECCV 2014), Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_9"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/18\/11\/3903\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T15:29:19Z","timestamp":1760196559000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/18\/11\/3903"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,11,12]]},"references-count":33,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2018,11]]}},"alternative-id":["s18113903"],"URL":"https:\/\/doi.org\/10.3390\/s18113903","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,11,12]]}}}