{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,17]],"date-time":"2026-02-17T15:02:24Z","timestamp":1771340544948,"version":"3.50.1"},"reference-count":50,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2019,1,3]],"date-time":"2019-01-03T00:00:00Z","timestamp":1546473600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41871329"],"award-info":[{"award-number":["41871329"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Shenzhen Future Industry Development Funding Program","award":["201607281039561400"],"award-info":[{"award-number":["201607281039561400"]}]},{"name":"Shenzhen Scientific Research and Development Funding Program","award":["JCYJ20170818092931604"],"award-info":[{"award-number":["JCYJ20170818092931604"]}]},{"name":"Horizon Centre for Doctoral Training at the University of Nottingham","award":["EP\/L015463\/1"],"award-info":[{"award-number":["EP\/L015463\/1"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>This paper presents a novel indoor topological localization method based on mobile phone videos. Conventional methods suffer from indoor dynamic environmental changes and scene ambiguity. The proposed Visual Landmark Sequence-based Indoor Localization (VLSIL) method is capable of addressing problems by taking steady indoor objects as landmarks. Unlike many feature or appearance matching-based localization methods, our method utilizes highly abstracted landmark sematic information to represent locations and thus is invariant to illumination changes, temporal variations, and occlusions. We match consistently detected landmarks against the topological map based on the occurrence order in the videos. The proposed approach contains two components: a convolutional neural network (CNN)-based landmark detector and a topological matching algorithm. The proposed detector is capable of reliably and accurately detecting landmarks. The other part is the matching algorithm built on the second order hidden Markov model and it can successfully handle the environmental ambiguity by fusing sematic and connectivity information of landmarks. To evaluate the method, we conduct extensive experiments on the real world dataset collected in two indoor environments, and the results show that our deep neural network-based indoor landmark detector accurately detects all landmarks and is expected to be utilized in similar environments without retraining and that VLSIL can effectively localize indoor landmarks.<\/jats:p>","DOI":"10.3390\/rs11010073","type":"journal-article","created":{"date-parts":[[2019,1,3]],"date-time":"2019-01-03T11:11:56Z","timestamp":1546513916000},"page":"73","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":19,"title":["Indoor Topological Localization Using a Visual Landmark Sequence"],"prefix":"10.3390","volume":"11","author":[{"given":"Jiasong","family":"Zhu","sequence":"first","affiliation":[{"name":"Shenzhen Key Laboratory of Spatial Smart Sensing and Services &amp; Key Laboratory for Geo-Environmental Monitoring of Coastal Zone of the National Administration of Surveying, Mapping and Geoinformation, Shenzhen University, Shenzhen 518060, China"}]},{"given":"Qing","family":"Li","sequence":"additional","affiliation":[{"name":"Shenzhen Key Laboratory of Spatial Smart Sensing and Services &amp; Key Laboratory for Geo-Environmental Monitoring of Coastal Zone of the National Administration of Surveying, Mapping and Geoinformation, Shenzhen University, Shenzhen 518060, China"},{"name":"School of Computer Science, The University of Nottingham, Nottingham NG8 1BB, UK"},{"name":"College of Information Engineering &amp; Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University, Shenzhen 518060, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1440-4175","authenticated-orcid":false,"given":"Rui","family":"Cao","sequence":"additional","affiliation":[{"name":"Shenzhen Key Laboratory of Spatial Smart Sensing and Services &amp; Key Laboratory for Geo-Environmental Monitoring of Coastal Zone of the National Administration of Surveying, Mapping and Geoinformation, Shenzhen University, Shenzhen 518060, China"},{"name":"College of Information Engineering &amp; Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University, Shenzhen 518060, China"},{"name":"International Doctoral Innovation Centre &amp; School of Computer Science, The University of Nottingham Ningbo China, Ningbo 315100, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3116-3203","authenticated-orcid":false,"given":"Ke","family":"Sun","sequence":"additional","affiliation":[{"name":"Shenzhen Key Laboratory of Spatial Smart Sensing and Services &amp; Key Laboratory for Geo-Environmental Monitoring of Coastal Zone of the National Administration of Surveying, Mapping and Geoinformation, Shenzhen University, Shenzhen 518060, China"}]},{"given":"Tao","family":"Liu","sequence":"additional","affiliation":[{"name":"College of Resource and Environment, Henan University of Economics and Law, Zhengzhou 450046, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9690-7074","authenticated-orcid":false,"given":"Jonathan M.","family":"Garibaldi","sequence":"additional","affiliation":[{"name":"School of Computer Science, The University of Nottingham, Nottingham NG8 1BB, UK"}]},{"given":"Qingquan","family":"Li","sequence":"additional","affiliation":[{"name":"Shenzhen Key Laboratory of Spatial Smart Sensing and Services &amp; Key Laboratory for Geo-Environmental Monitoring of Coastal Zone of the National Administration of Surveying, Mapping and Geoinformation, Shenzhen University, Shenzhen 518060, China"}]},{"given":"Bozhi","family":"Liu","sequence":"additional","affiliation":[{"name":"College of Information Engineering &amp; Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University, Shenzhen 518060, China"}]},{"given":"Guoping","family":"Qiu","sequence":"additional","affiliation":[{"name":"School of Computer Science, The University of Nottingham, Nottingham NG8 1BB, UK"},{"name":"College of Information Engineering &amp; Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University, Shenzhen 518060, China"}]}],"member":"1968","published-online":{"date-parts":[[2019,1,3]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1016\/S0921-8890(02)00276-2","article-title":"Topological navigation and qualitative localization for indoor environment using multi-sensory perception","volume":"41","author":"Ranganathan","year":"2002","journal-title":"Robot. Auton. Syst."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"729","DOI":"10.1109\/TASE.2014.2351814","article-title":"Topological Indoor Localization and Navigation for Autonomous Mobile Robot","volume":"12","author":"Cheng","year":"2015","journal-title":"IEEE Trans. Autom. Sci. Eng."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Bradley, D.M., Patel, R., Vandapel, N., and Thayer, S.M. (2005, January 2\u20136). Real-time image-based topological localization in large outdoor environments. Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems, Edmonton, AB, Canada.","DOI":"10.1109\/IROS.2005.1545442"},{"key":"ref_4","unstructured":"Becker, C., Salas, J., Tokusei, K., and Latombe, J.C. (1995, January 21\u201327). Reliable navigation using landmarks. Proceedings of the 1995 IEEE International Conference on Robotics and Automation, Nagoya, Japan."},{"key":"ref_5","unstructured":"Kosecka, J., Zhou, L., Barber, P., and Duric, Z. (2003, January 18\u201320). Qualitative image based localization in indoors environments. Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Madison, WI, USA."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Li, Q., Zhu, J., Liu, T., Garibaldi, J., Li, Q., and Qiu, G. (2017, January 7\u201310). Visual landmark sequence-based indoor localization. Proceedings of the 1st Workshop on Artificial Intelligence and Deep Learning for Geographic Knowledge Discovery, Los Angeles, CA, USA.","DOI":"10.1145\/3149808.3149812"},{"key":"ref_7","unstructured":"Ahn, S.J., Rauh, W., and Recknagel, M. (1999, January 17\u201321). Circular coded landmark for optical 3D-measurement and robot vision. Proceedings of the 1999 IEEE\/RSJ International Conference on Intelligent Robots and Systems, Kyongju, Korea."},{"key":"ref_8","unstructured":"Jang, G., Lee, S., and Kweon, I. (2002, January 11\u201315). Color landmark based self-localization for indoor mobile robots. Proceedings of the 2002 IEEE International Conference on Robotics and Automation, Washington, DC, USA."},{"key":"ref_9","first-page":"897103","article-title":"The use of quick response (qr) codes in landmark-based pedestrian navigation","volume":"2014","author":"Basiri","year":"2014","journal-title":"Int. J. Navig. Obs."},{"key":"ref_10","unstructured":"Briggs, A.J., Scharstein, D., Braziunas, D., Dima, C., and Wall, P. (2000, January 24\u201328). Mobile robot navigation using self-similar landmarks. Proceedings of the IEEE International Conference on Robotics and Automation, San Francisco, CA, USA."},{"key":"ref_11","unstructured":"Hayet, J.B., Lerasle, F., and Devy, M. (2002, January 11\u201315). A visual landmark framework for indoor mobile robot navigation. Proceedings of the 2002 IEEE International Conference on Robotics and Automation, Washington, DC, USA."},{"key":"ref_12","unstructured":"Ayala, V., Hayet, J.B., Lerasle, F., and Devy, M. (November, January 31). Visual localization of a mobile robot in indoor environments using planar landmarks. Proceedings of the 2000 IEEE\/RSJ International Conference on Intelligent Robots and Systems, Takamatsu, Japan."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"521","DOI":"10.1007\/s00138-012-0431-7","article-title":"Toward a computer vision-based wayfinding aid for blind persons to access unfamiliar indoor environments","volume":"24","author":"Tian","year":"2013","journal-title":"Mach. Vis. Appl."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"3261","DOI":"10.1109\/TVT.2010.2052079","article-title":"Vision-based autonomous vehicle guidance for indoor security patrolling by a SIFT-based vehicle-localization technique","volume":"59","author":"Chen","year":"2010","journal-title":"IEEE Trans. Veh. Technol."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Bai, Y., Jia, W., Zhang, H., Mao, Z.H., and Sun, M. (2014, January 19\u201323). Landmark-based indoor positioning for visually impaired individuals. Proceedings of the 2014 12th International Conference on Signal Processing, Hangzhou, China.","DOI":"10.1109\/ICOSP.2014.7015087"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1016\/j.procs.2012.10.008","article-title":"Indoor localization and navigation for blind persons using visual landmarks and a GIS","volume":"14","author":"Rodrigues","year":"2012","journal-title":"Procedia Comput. Sci."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Kawaji, H., Hatada, K., Yamasaki, T., and Aizawa, K. (2010, January 29). Image-based indoor positioning system: Fast image matching using omnidirectional panoramic images. Proceedings of the 1st ACM International Workshop on Multimodal Pervasive Video Analysis, Firenze, Italy.","DOI":"10.1145\/1878039.1878041"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"541","DOI":"10.1016\/S0167-8655(99)00031-8","article-title":"Landmark recognition using invariant features","volume":"20","author":"Flusser","year":"1999","journal-title":"Pattern Recognit. Lett."},{"key":"ref_19","first-page":"609","article-title":"Indoor localization system based on artificial landmarks and monocular vision","volume":"10","author":"Pinto","year":"2012","journal-title":"TELKOMNIKA Telecommun. Comput. Electron. Control"},{"key":"ref_20","first-page":"1191","article-title":"A Robot Indoor Position and Orientation Method based on 2D Barcode Landmark","volume":"6","author":"Lin","year":"2011","journal-title":"JCP"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Kosmopoulos, D.I., and Chandrinos, K.V. (2002). Definition and Extraction of Visual Landmarks for Indoor Robot Navigation, Springer.","DOI":"10.1007\/3-540-46014-4_36"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_23","unstructured":"Zhou, B., Lapedriza, A., Xiao, J., Torralba, A., and Oliva, A. (2014). Learning deep features for scene recognition using places database. Advances in Neural Information Processing Systems, Available online: http:\/\/places.csail.mit.edu\/places_NIPS14.pdf."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Werner, M., Kessel, M., and Marouane, C. (2011, January 21\u201323). Indoor positioning using smartphone camera. Proceedings of the 2011 International Conference on Indoor Positioning and Indoor Navigation, Guimaraes, Portugal.","DOI":"10.1109\/IPIN.2011.6071954"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Liang, J.Z., Corso, N., Turner, E., and Zakhor, A. (2013, January 22\u201324). Image based localization in indoor environments. Proceedings of the 2013 Fourth International Conference on Computing for Geospatial Research and Application, San Jose, CA, USA.","DOI":"10.1109\/COMGEO.2013.11"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Chen, C., Yang, B., Song, S., Tian, M., Li, J., Dai, W., and Fang, L. (2018). Calibrate Multiple Consumer RGB-D Cameras for Low-Cost and Efficient 3D Indoor Mapping. Remote Sens., 10.","DOI":"10.3390\/rs10020328"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Zhao, P., Hu, Q., Wang, S., Ai, M., and Mao, Q. (2018). Panoramic Image and Three-Axis Laser Scanner Integrated Approach for Indoor 3D Mapping. Remote Sens., 10.","DOI":"10.3390\/rs10081269"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Lu, G., and Kambhamettu, C. (2014). Image-based indoor localization system based on 3d sfm model. IS&T\/SPIE Electronic Imaging, International Society for Optics and Photonics. Available online: https:\/\/www.researchgate.net\/publication\/269323831_Image-based_indoor_localization_system_ based_on_3D_SfM_model.","DOI":"10.1117\/12.2038582"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Van Opdenbosch, D., Schroth, G., Huitl, R., Hilsenbeck, S., Garcea, A., and Steinbach, E. (2014, January 27\u201330). Camera-based indoor positioning using scalable streaming of compressed binary image signatures. Proceedings of the 2014 IEEE International Conference on Image Processing (ICIP), Paris, France.","DOI":"10.1109\/ICIP.2014.7025567"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Hile, H., and Borriello, G. (2008). Positioning and orientation in indoor environments using camera phones. IEEE Comput. Gr. Appl., 28.","DOI":"10.1109\/MCG.2008.80"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1109\/MPRV.2009.30","article-title":"Indoor positioning and navigation with camera phones","volume":"8","author":"Mulloni","year":"2009","journal-title":"IEEE Pervasive Comput."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1016\/j.cviu.2017.05.003","article-title":"Indoor localization via multi-view images and videos","volume":"161","author":"Lu","year":"2017","journal-title":"Comput. Vis. Image Understand."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1016\/j.neucom.2015.07.106","article-title":"Where am i in the dark: Exploring active transfer learning on the use of indoor localization based on thermal imaging","volume":"173","author":"Lu","year":"2016","journal-title":"Neurocomputing"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"1330","DOI":"10.1109\/LSP.2016.2593958","article-title":"Visual indoor localization in known environments","volume":"23","author":"Piciarelli","year":"2016","journal-title":"IEEE Signal Process. Lett."},{"key":"ref_35","unstructured":"Vedadi, F., and Valaee, S. (2017). Automatic Visual Fingerprinting for Indoor Image-Based Localization Applications. IEEE Trans. Syst. Man Cybern. Syst."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"418","DOI":"10.1007\/s12205-016-1057-5","article-title":"Development of indoor localization system using a mobile data acquisition platform and BoW image matching","volume":"21","author":"Lee","year":"2017","journal-title":"KSCE J. Civ. Eng."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"715","DOI":"10.3390\/s150100715","article-title":"Fusion of WiFi, smartphone sensors and landmarks using the Kalman filter for indoor localization","volume":"15","author":"Chen","year":"2015","journal-title":"Sensors"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Deng, Z.A., Wang, G., Qin, D., Na, Z., Cui, Y., and Chen, J. (2016). Continuous indoor positioning fusing WiFi, smartphone sensors and landmarks. Sensors, 16.","DOI":"10.3390\/s16091427"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Gu, F., Khoshelham, K., Shang, J., and Yu, F. (2016, January 2\u20134). Sensory landmarks for indoor localization. Proceedings of the 2016 Fourth International Conference on Ubiquitous Positioning, Indoor Navigation and Location Based Services (UPINLBS), Shanghai, China.","DOI":"10.1109\/UPINLBS.2016.7809972"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1109\/TITS.2006.889439","article-title":"Developing landmark-based pedestrian-navigation systems","volume":"8","author":"Millonig","year":"2007","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"251","DOI":"10.1109\/70.563647","article-title":"Mobile robot localization using landmarks","volume":"13","author":"Betke","year":"1997","journal-title":"IEEE Trans. Robot. Autom."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1023\/B:JINT.0000015401.49928.a4","article-title":"Symbolic place recognition in voronoi-based maps by using hidden markov models","volume":"39","author":"Boada","year":"2004","journal-title":"J. Intell. Robot. Syst."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"562","DOI":"10.1109\/THMS.2014.2368092","article-title":"Activity sequence-based indoor pedestrian localization using smartphones","volume":"45","author":"Zhou","year":"2015","journal-title":"IEEE Trans. Hum.-Mach. Syst."},{"key":"ref_44","unstructured":"Koseck\u00e1, J., and Li, F. (May, January 26). Vision based topological Markov localization. Proceedings of the IEEE International Conference on Robotics and Automation, New Orleans, LA, USA."},{"key":"ref_45","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015). Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in Neural Information Processing Systems, Available online: https:\/\/arxiv.org\/abs\/1506.01497."},{"key":"ref_46","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012). Imagenet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems, Available online: https:\/\/papers.nips.cc\/paper\/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1007\/s11263-013-0620-5","article-title":"Selective search for object recognition","volume":"104","author":"Uijlings","year":"2013","journal-title":"Int. J. Comput. Vis."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Thede, S.M., and Harper, M.P. (1999, January 20\u201326). A second-order hidden Markov model for part-of-speech tagging. Proceedings of the the 37th Annual Meeting of the Association for Computational Linguistics on Computational Linguistics, College Park, MD, USA.","DOI":"10.3115\/1034678.1034712"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., and Darrell, T. (2014, January 3\u20137). Caffe: Convolutional architecture for fast feature embedding. Proceedings of the 22nd ACM International Conference on Multimedia, Orlando, FL, USA.","DOI":"10.1145\/2647868.2654889"},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1023\/A:1011139631724","article-title":"Modeling the shape of the scene: A holistic representation of the spatial envelope","volume":"42","author":"Oliva","year":"2001","journal-title":"Int. J. Comput. Vis."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/1\/73\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T12:23:23Z","timestamp":1760185403000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/1\/73"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,1,3]]},"references-count":50,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2019,1]]}},"alternative-id":["rs11010073"],"URL":"https:\/\/doi.org\/10.3390\/rs11010073","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,1,3]]}}}