{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,29]],"date-time":"2025-10-29T03:47:33Z","timestamp":1761709653613,"version":"build-2065373602"},"reference-count":56,"publisher":"MDPI AG","issue":"21","license":[{"start":{"date-parts":[[2019,11,1]],"date-time":"2019-11-01T00:00:00Z","timestamp":1572566400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program of China","doi-asserted-by":"publisher","award":["2016YFB0502102","2018YFB1305004"],"award-info":[{"award-number":["2016YFB0502102","2018YFB1305004"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>High-accuracy indoor positioning is a prerequisite to satisfy the increasing demands of position-based services in complex indoor scenes. Current indoor visual-positioning methods mainly include image retrieval-based methods, visual landmarks-based methods, and learning-based methods. To better overcome the limitations of traditional methods such as them being labor-intensive, of poor accuracy, and time-consuming, this paper proposes a novel indoor-positioning method with automated red, green, blue and depth (RGB-D) image database construction. First, strategies for automated database construction are developed to reduce the workload of manually selecting database images and ensure the requirements of high-accuracy indoor positioning. The database is automatically constructed according to the rules, which is more objective and improves the efficiency of the image-retrieval process. Second, by combining the automated database construction module, convolutional neural network (CNN)-based image-retrieval module, and strict geometric relations-based pose estimation module, we obtain a high-accuracy indoor-positioning system. Furthermore, in order to verify the proposed method, we conducted extensive experiments on the public indoor environment dataset. The detailed experimental results demonstrated the effectiveness and efficiency of our indoor-positioning method.<\/jats:p>","DOI":"10.3390\/rs11212572","type":"journal-article","created":{"date-parts":[[2019,11,1]],"date-time":"2019-11-01T12:30:50Z","timestamp":1572611450000},"page":"2572","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["A High-Accuracy Indoor-Positioning Method with Automated RGB-D Image Database Construction"],"prefix":"10.3390","volume":"11","author":[{"given":"Runzhi","family":"Wang","sequence":"first","affiliation":[{"name":"State Key Laboratory of Remote Sensing Science, Aerospace Information Research Institute, Chinese Academy of Sciences, No. 20A, Datun Road, Chaoyang District, Beijing 100101, China"},{"name":"College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100049, China"}]},{"given":"Wenhui","family":"Wan","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Remote Sensing Science, Aerospace Information Research Institute, Chinese Academy of Sciences, No. 20A, Datun Road, Chaoyang District, Beijing 100101, China"}]},{"given":"Kaichang","family":"Di","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Remote Sensing Science, Aerospace Information Research Institute, Chinese Academy of Sciences, No. 20A, Datun Road, Chaoyang District, Beijing 100101, China"}]},{"given":"Ruilin","family":"Chen","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Remote Sensing Science, Aerospace Information Research Institute, Chinese Academy of Sciences, No. 20A, Datun Road, Chaoyang District, Beijing 100101, China"}]},{"given":"Xiaoxue","family":"Feng","sequence":"additional","affiliation":[{"name":"Institute of Remote Sensing and GIS, School of Earth and Space Sciences, Peking University, Beijing 100871, China"}]}],"member":"1968","published-online":{"date-parts":[[2019,11,1]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"712","DOI":"10.1016\/j.compenvurbsys.2006.02.003","article-title":"Location-based services and GIS in perspective","volume":"30","author":"Jiang","year":"2006","journal-title":"Comput. Environ. Urban Syst."},{"key":"ref_2","first-page":"3","article-title":"The computer for the 21st century","volume":"3","author":"Weiser","year":"1999","journal-title":"IEEE Pervasive Comput."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1347","DOI":"10.1109\/COMST.2016.2637663","article-title":"A survey of selected indoor positioning methods for smartphones","volume":"19","author":"Davidson","year":"2017","journal-title":"IEEE Commun. Surv. Tutor."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1774","DOI":"10.1109\/TMC.2012.143","article-title":"Dynamic Online-Calibrated Radio Maps for Indoor Positioning in Wireless Local Area Networks","volume":"12","author":"Atia","year":"2013","journal-title":"IEEE Trans. Mob. Comput."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"8358","DOI":"10.3390\/s150408358","article-title":"A Novel Method for Constructing a WIFI Positioning System with Efficient Manpower","volume":"15","author":"Du","year":"2015","journal-title":"Sensors"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Moghtadaiee, V., Dempster, A.G., and Lim, S. (2011, January 21\u201323). Indoor localization using FM radio signals: A fingerprinting approach. Proceedings of the 2011 International Conference on Indoor Positioning & Indoor Navigation (IPIN), Guimar\u00e3es, Portugal.","DOI":"10.1109\/IPIN.2011.6071932"},{"key":"ref_7","first-page":"775","article-title":"In RADAR: An in-building RF-based user location and tracking system","volume":"2","author":"Bahl","year":"2000","journal-title":"IEEE Infocom"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Youssef, M., and Agrawala, A. (2005, January 6\u20138). The Horus WLAN location determination system. Proceedings of the 3rd International Conference on Mobile Systems, Application, and Services (MobiSys 2005), Seattle, WA, USA.","DOI":"10.1145\/1067170.1067193"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Cant\u00f3n Paterna, V., Calveras Aug\u00e9, A., Paradells Aspas, J., and P\u00e9rez Bullones, M.A. (2017). A Bluetooth Low Energy Indoor Positioning System with Channel Diversity, Weighted Trilateration and Kalman Filtering. Sensors, 17.","DOI":"10.3390\/s17122927"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1007\/s11277-012-0777-1","article-title":"Bayesian Fusion for Indoor Positioning Using Bluetooth Fingerprints","volume":"70","author":"Chen","year":"2013","journal-title":"Wirel. Pers. Commun."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"150","DOI":"10.1016\/j.pmcj.2017.08.003","article-title":"A fine-grained indoor fingerprinting localization based on magnetic field strength and channel state information","volume":"41","author":"Huang","year":"2017","journal-title":"Pervasive Mob. Comput."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Kim, H.-S., Seo, W., and Baek, K.-R. (2017). Indoor Positioning System Using Magnetic Field Map Navigation and an Encoder System. Sensors, 17.","DOI":"10.3390\/s17030651"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Gronat, P., Obozinski, G., Sivic, J., and Pajdla, T. (2013, January 23\u201328). Learning and Calibrating Per-Location Classifiers for Visual Place Recognition. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Portland, OR, USA.","DOI":"10.1109\/CVPR.2013.122"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Vaca-Castano, G., Zamir, A.R., and Shah, M. (2012, January 16\u201321). City scale geo-spatial trajectory estimation of a moving camera. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA.","DOI":"10.1109\/CVPR.2012.6247800"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Arandjelovi\u0107, R., and Zisserman, A. (2012, January 16\u201321). Three things everyone should know to improve pbject retrieval. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA.","DOI":"10.1109\/CVPR.2012.6248018"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Zamir, A.R., and Shah, M. (2010, January 5\u201311). Accurate Image Localization Based on Google Maps Street View. Proceedings of the 11th European Conference on Computer Vision, Heraklion, Crete, Greece.","DOI":"10.1007\/978-3-642-15561-1_19"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"1546","DOI":"10.1109\/TPAMI.2014.2299799","article-title":"Image Geo-Localization Based on MultipleNearest Neighbor Feature Matching UsingGeneralized Graphs","volume":"36","author":"Zamir","year":"2014","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Babenko, A., Slesarev, A., Chigorin, A., and Lempitsky, V. (2014, January 6\u201312). Neural Codes for Image Retrieval. Proceedings of the 13th European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10590-1_38"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Azzi, C., Asmar, D., Fakih, A., and Zelek, J. (2016, January 19\u201322). Filtering 3D keypoints using GIST for accurate image-based positioning. Proceedings of the 27th British Machine Vision Conference, York, UK.","DOI":"10.5244\/C.30.127"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive Image Features from Scale-Invariant Keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_21","first-page":"117","article-title":"Product Quantization for Nearest Neighbor Search","volume":"33","author":"Douze","year":"2010","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_22","unstructured":"Nist\u00e9r, D., and Stew\u00e9nius, H. (2006, January 17\u201322). Scalable Recognition with a Vocabulary Tree. Proceedings of the Computer Vision and Pattern Recognition, New York, NY, USA."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Kim, H.J., Dunn, E., and Frahm, J.M. (2015, January 13\u201316). Predicting Good Features for Image Geo-Localization Using Per-Bundle VLAD. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.139"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Torii, A., Arandjelovic, R., Sivic, J., Okutomi, M., and Pajdla, T. (2015, January 7\u201312). 24\/7 place recognition by view synthesis. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298790"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Milford, M.J., Lowry, S., Shirazi, S., Pepperell, E., Shen, C., Lin, G., Liu, F., Cadena, C., and Reid, I. (2015, January 7\u201312). Sequence searching with deep-learnt depth for condition-and viewpoint-invariant route-based place recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPRW.2015.7301395"},{"key":"ref_26","unstructured":"Poglitsch, C., Arth, C., Schmalstieg, D., and Ventura, J. (October, January 29). A Particle Filter Approach to Outdoor Localization Using Image-Based Rendering. Proceedings of the IEEE International Symposium on Mixed & Augmented Reality, Fukuoka, Japan."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Razavian, A.S., Azizpour, H., Sullivan, J., and Carlsson, S. (2014, January 23\u201328). CNN features off-the-shelf: An astounding baseline for recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern RecognitionWorkshops, Columbus, OH, USA.","DOI":"10.1109\/CVPRW.2014.131"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Yandex, A.B., and Lempitsky, V. (2015, January 13\u201316). Aggregating Local Deep Convolutional Features for Image Retrieval. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.150"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1309","DOI":"10.1109\/TRO.2016.2624754","article-title":"Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age","volume":"32","author":"Cadena","year":"2016","journal-title":"IEEE Trans. Robot."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1016\/j.robot.2017.09.010","article-title":"Keyframe-based monocular SLAM: Design, survey, and future directions","volume":"98","author":"Younes","year":"2017","journal-title":"Robot. Auton. Syst."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Campbell, D., Petersson, L., Kneip, L., and Li, H. (2017, January 22\u201329). Globally-optimal inlier set maximisation for simultaneous camera pose and feature correspondence. Proceedings of the IEEE Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.10"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Liu, L., Li, H., and Dai, Y. (2017, January 22\u201329). Efficient Global 2D-3D Matching for Camera Localization in a Large-Scale 3D Map. Proceedings of the IEEE Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.260"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Yousif, K., Taguchi, Y., and Ramalingam, S. (June, January 29). MonoRGBD-SLAM: Simultaneous localization and mapping using both monocular and RGBD cameras. Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA), Singapore.","DOI":"10.1109\/ICRA.2017.7989521"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1007\/s41315-017-0036-4","article-title":"A non-rigid map fusion-based rgb-depth slam method for endoscopic capsule robots","volume":"1","author":"Turan","year":"2017","journal-title":"Int. J. Intell. Robot. Appl."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Sattler, T., Torii, A., Sivic, J., Pollefeys, M., Taira, H., Okutomi, M., and Pajdla, T. (2017, January 21\u201326). Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.654"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Degol, J., Bretl, T., and Hoiem, D. (2017, January 22\u201329). ChromaTag: A Colored Marker and Fast Detection Algorithm. Proceedings of the IEEE Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.164"},{"key":"ref_37","first-page":"158","article-title":"Mapping and Localization from Planar Markers","volume":"73","year":"2017","journal-title":"Pattern Recognit."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"2024","DOI":"10.1109\/TPAMI.2006.252","article-title":"Robust pose estimation from a planar target","volume":"28","author":"Schweighofer","year":"2007","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Valentin, J., Niebner, M., Shotton, J., Fitzgibbon, A., Izadi, S., and Torr, P. (2015, January 7\u201312). Exploiting uncertainty in regression forests for accurate camera relocalization. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299069"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Tateno, K., Tombari, F., Laina, I., and Navab, N. (2017, January 21\u201326). CNN-SLAM: Real-Time Dense Monocular SLAM with Learned Depth Prediction. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.695"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Shotton, J., Glocker, B., Zach, C., Izadi, S., Criminisi, A., and Fitzgibbon, A. (2013, January 23\u201328). Scene Coordinate Regression Forests for Camera Relocalization in RGB-D Images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Portland, OR, USA.","DOI":"10.1109\/CVPR.2013.377"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Kendall, A., and Cipolla, R. (2017, January 21\u201326). Geometric loss functions for camera pose regression with deep learning. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.694"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Tekin, B., Sinha, S.N., and Fua, P. (2018, January 18\u201322). Real-Time Seamless Single Shot 6D Object Pose Prediction. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00038"},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1016\/j.patcog.2017.09.013","article-title":"A survey on Visual-Based Localization: On the benefit of heterogeneous data","volume":"74","author":"Piasco","year":"2018","journal-title":"Pattern Recognit."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Ummenhofer, B., Zhou, H., Uhrig, J., Mayer, N., Ilg, E., Dosovitskiy, A., and Brox, T. (2017, January 21\u201326). DeMoN: Depth and motion network for learning monocular stereo. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.596"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Mahjourian, R., Wicke, M., and Angelova, A. (2018, January 18\u201322). Unsupervised learning of depth and ego-motion from monocular video using 3d geometric constraints. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00594"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Zhan, H., Garg, R., Weerasekera, C.S., Li, K., Agarwal, H., and Reid, I. (2018, January 18\u201322). Unsupervised learning of monocular depth estimation and visual odometry with deep feature reconstruction. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00043"},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s10514-015-9516-2","article-title":"Unsupervised learning to detect loops using deep neural networks for visual SLAM system","volume":"41","author":"Gao","year":"2017","journal-title":"Auton. Robot."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Wu, J., Ma, L., and Hu, X. (June, January 29). Delving deeper into convolutional neural networks for camera relocalization. Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA), Singapore.","DOI":"10.1109\/ICRA.2017.7989663"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Chen, Y., Chen, R., Liu, M., Xiao, A., Wu, D., and Zhao, S. (2018). Indoor Visual Positioning Aided by CNN-Based Image Retrieval: Training-Free, 3D Modeling-Free. Sensors, 18.","DOI":"10.3390\/s18082692"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Sturm, J., Engelhard, N., Endres, F., Burgard, W., and Cremers, D. (2012, January 7\u201312). A benchmark for the evaluation of RGB-D SLAM systems. Proceedings of the 2012 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Algarve, Portugal.","DOI":"10.1109\/IROS.2012.6385773"},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"1437","DOI":"10.1109\/TPAMI.2017.2711011","article-title":"Netvlad: CNN architecture for weakly supervised place recognition","volume":"40","author":"Arandjelovic","year":"2018","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_53","unstructured":"Richard, H., and Andrew, Z. (2003). Multiple View Geometry in Computer Vision, Cambridge University Press. [2nd ed.]."},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1145\/358669.358692","article-title":"Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography","volume":"24","author":"Fischler","year":"1981","journal-title":"Commun. Acm"},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1007\/s11263-008-0152-6","article-title":"EPnP: An accurate O(n) solution to the PnP problem","volume":"81","author":"Lepetit","year":"2009","journal-title":"Int. J. Comput. Vis."},{"key":"ref_56","unstructured":"K\u00fcmmerle, R., Grisetti, G., Strasdat, H., Konolige, K., and Burgard, W. (2011, January 9\u201313). G2o: A general framework for graph optimization. Proceedings of the 2011 IEEE International Conference on Robotics and Automation (ICRA), Shanghai, China."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/21\/2572\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:31:21Z","timestamp":1760189481000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/21\/2572"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,11,1]]},"references-count":56,"journal-issue":{"issue":"21","published-online":{"date-parts":[[2019,11]]}},"alternative-id":["rs11212572"],"URL":"https:\/\/doi.org\/10.3390\/rs11212572","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2019,11,1]]}}}