{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,19]],"date-time":"2026-03-19T12:29:31Z","timestamp":1773923371471,"version":"3.50.1"},"reference-count":53,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2019,2,23]],"date-time":"2019-02-23T00:00:00Z","timestamp":1550880000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>The urban environments represent challenging areas for handheld device pose estimation (i.e., 3D position and 3D orientation) in large displacements. It is even more challenging with low-cost sensors and computational resources that are available in pedestrian mobile devices (i.e., monocular camera and Inertial Measurement Unit). To address these challenges, we propose a continuous pose estimation based on monocular Visual Odometry. To solve the scale ambiguity and suppress the scale drift, an adaptive pedestrian step lengths estimation is used for the displacements on the horizontal plane. To complete the estimation, a handheld equipment height model, with respect to the Digital Terrain Model contained in Geographical Information Systems, is used for the displacement on the vertical axis. In addition, an accurate pose estimation based on the recognition of known objects is punctually used to correct the pose estimate and reset the monocular Visual Odometry. To validate the benefit of our framework, experimental data have been collected on a 0.7 km pedestrian path in an urban environment for various people. Thus, the proposed solution allows to achieve a positioning error of 1.6\u20137.5% of the walked distance, and confirms the benefit of the use of an adaptive step length compared to the use of a fixed-step length.<\/jats:p>","DOI":"10.3390\/s19040953","type":"journal-article","created":{"date-parts":[[2019,2,25]],"date-time":"2019-02-25T03:06:52Z","timestamp":1551064012000},"page":"953","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["Solving Monocular Visual Odometry Scale Factor with Adaptive Step Length Estimates for Pedestrians Using Handheld Devices"],"prefix":"10.3390","volume":"19","author":[{"given":"Nicolas","family":"Antigny","sequence":"first","affiliation":[{"name":"Institut Fran\u00e7ais des Sciences et Technologies des Transports, de l\u2019Am\u00e9nagement et des R\u00e9seaux (IFSTTAR) AME GEOLOC, 44340 Bouguenais, France"},{"name":"Centrale Nantes, 44300 Nantes, France"},{"name":"Institut de Recherche en Sciences et Techniques de la Ville (IRSTV), 44300 Nantes, France"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6119-1184","authenticated-orcid":false,"given":"Hideaki","family":"Uchiyama","sequence":"additional","affiliation":[{"name":"Laboratory for Image and Media Understanding (LIMU), Kyushu University, Fukuoka 819-0395, Japan"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5749-1590","authenticated-orcid":false,"given":"Myriam","family":"Servi\u00e8res","sequence":"additional","affiliation":[{"name":"Centrale Nantes, 44300 Nantes, France"},{"name":"Institut de Recherche en Sciences et Techniques de la Ville (IRSTV), 44300 Nantes, France"},{"name":"Centre de Recherche Nantais Architectures Urbanit\u00e9s (CRENAU) AAU, 44262 Nantes, France"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4535-5406","authenticated-orcid":false,"given":"Val\u00e9rie","family":"Renaudin","sequence":"additional","affiliation":[{"name":"Institut Fran\u00e7ais des Sciences et Technologies des Transports, de l\u2019Am\u00e9nagement et des R\u00e9seaux (IFSTTAR) AME GEOLOC, 44340 Bouguenais, France"},{"name":"Institut de Recherche en Sciences et Techniques de la Ville (IRSTV), 44300 Nantes, France"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8525-7133","authenticated-orcid":false,"given":"Diego","family":"Thomas","sequence":"additional","affiliation":[{"name":"Laboratory for Image and Media Understanding (LIMU), Kyushu University, Fukuoka 819-0395, Japan"}]},{"given":"Rin-ichiro","family":"Taniguchi","sequence":"additional","affiliation":[{"name":"Laboratory for Image and Media Understanding (LIMU), Kyushu University, Fukuoka 819-0395, Japan"}]}],"member":"1968","published-online":{"date-parts":[[2019,2,23]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Schall, G., Wagner, D., Reitmayr, G., Taichmann, E., Wieser, M., Schmalstieg, D., and Hofmann-Wellenhof, B. (2009, January 19\u201322). Global pose estimation using multi-sensor fusion for outdoor Augmented Reality. Proceedings of the 2009 8th IEEE International Symposium on Mixed and Augmented Reality, Orlando, FL, USA.","DOI":"10.1109\/ISMAR.2009.5336489"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Arth, C., Klopschitz, M., Reitmayr, G., and Schmalstieg, D. (2011, January 26\u201329). Real-time self-localization from panoramic images on mobile devices. Proceedings of the 2011 10th IEEE International Symposium on Mixed and Augmented Reality, Basel, Switzerland.","DOI":"10.1109\/ISMAR.2011.6162870"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1109\/TVCG.2014.27","article-title":"Global localization from monocular slam on a mobile phone","volume":"20","author":"Ventura","year":"2014","journal-title":"IEEE Trans. Visual Comput. Graphics"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1017\/S0373463311000051","article-title":"Positional Accuracy of Assisted GPS Data from High-Sensitivity GPS-enabled Mobile Phones","volume":"64","author":"Zandbergen","year":"2011","journal-title":"J. Navig."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Lategahn, H., Schreiber, M., Ziegler, J., and Stiller, C. (2013, January 23\u201326). Urban localization with camera and inertial measurement unit. Proceedings of the 2013 IEEE Intelligent Vehicles Symposium (IV), Gold Coast, Australia.","DOI":"10.1109\/IVS.2013.6629552"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Yu, L., Joly, C., Bresson, G., and Moutarde, F. (2016, January 1\u20134). Improving robustness of monocular urban localization using augmented Street View. Proceedings of the 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), Rio de Janeiro, Brazil.","DOI":"10.1109\/ITSC.2016.7795603"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"402","DOI":"10.1109\/SURV.2012.031412.00077","article-title":"Mobile Phone Sensing Systems: A Survey","volume":"15","author":"Khan","year":"2013","journal-title":"IEEE Commun. Surv. Tutor."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1052","DOI":"10.1109\/TPAMI.2007.1049","article-title":"MonoSLAM: Real-time single camera SLAM","volume":"29","author":"Davison","year":"2007","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Engel, J., Sch\u00f6ps, T., and Cremers, D. (2014). LSD-SLAM: Large-scale direct monocular SLAM. Lecture Notes in Computer Science, Springer.","DOI":"10.1007\/978-3-319-10605-2_54"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1255","DOI":"10.1109\/TRO.2017.2705103","article-title":"ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo and RGB-D Cameras","volume":"33","year":"2017","journal-title":"IEEE Trans. Rob."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1100","DOI":"10.1177\/0278364910385483","article-title":"Appearance-only SLAM at large scale with FAB-MAP 2.0","volume":"30","author":"Cummins","year":"2011","journal-title":"Int. J. Rob. Res."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Lynen, S., Sattler, T., Bosse, M., Hesch, J., Pollefeys, M., and Siegwart, R. (2015, January 13\u201317). Get Out of My Lab: Large-scale, Real-Time Visual-Inertial Localization. Proceedings of the Robotics: Science and Systems XI, Rome, Italy.","DOI":"10.15607\/RSS.2015.XI.037"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Jimenez, A.R., Seco, F., Prieto, C., and Guevara, J. (2009, January 26\u201328). A comparison of pedestrian dead-reckoning algorithms using a low-cost MEMS IMU. Proceedings of the 2009 IEEE International Symposium on Intelligent Signal Processing, Budapest, Hungary.","DOI":"10.1109\/WISP.2009.5286542"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1109\/MPRV.2012.16","article-title":"Tutorial: Implementing a pedestrian tracker using inertial sensors","volume":"12","author":"Fischer","year":"2013","journal-title":"IEEE Pervasive Comput."},{"key":"ref_15","unstructured":"Jiang, B., Neumann, U., and You, S. (2004, January 27\u201331). A robust hybrid tracking system for outdoor augmented reality. Proceedings of the IEEE Virtual Reality 2004, Chicago, IL, USA."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Beich, M.J., and Veth, C.M. (2010, January 4\u20136). Tightly coupled image-aided inertial relative navigation using Statistical Predictive Rendering (SPR) techniques and a priori world Models. Proceedings of the IEEE\/ION Position, Location and Navigation Symposium, Indian Wells, CA, USA.","DOI":"10.1109\/PLANS.2010.5507309"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Oskiper, T., Samarasekera, S., and Kumar, R. (2011, January 26\u201329). Tightly coupled robust vision aided inertial navigation algorithm for augmented reality using monocular camera and IMU. Proceedings of the 2011 10th IEEE International Symposium on Mixed and Augmented Reality, Basel, Switzerland.","DOI":"10.1109\/ISMAR.2011.6143485"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Oskiper, T., Samarasekera, S., and Kumar, R. (2017, January 9\u201313). [POSTER] CamSLAM: Vision Aided Inertial Tracking and Mapping Framework for Large Scale AR Applications. Proceedings of the 2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct), Nantes, France.","DOI":"10.1109\/ISMAR-Adjunct.2017.71"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1109\/38.799738","article-title":"Orientation tracking for outdoor augmented reality registration","volume":"19","author":"You","year":"1999","journal-title":"IEEE Comput. Graph. Appl."},{"key":"ref_20","unstructured":"Satoh, K., Anabuki, M., Yamamoto, H., and Tamura, H. (2001, January 29\u201330). A hybrid registration method for outdoor augmented reality. Proceedings of the IEEE and ACM International Symposium on Augmented Reality, New York, NY, USA."},{"key":"ref_21","unstructured":"Nist\u00e9r, D., Naroditsky, O., and Bergen, J. (July, January 27). Visual odometry. Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Washington, DC, USA."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1109\/MRA.2011.943233","article-title":"Visual odometry [tutorial]","volume":"18","author":"Scaramuzza","year":"2011","journal-title":"IEEE Rob. Autom Mag."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Aqel, M.O.A., Marhaban, M.H., Saripan, M.I., and Ismail, N. (2016). Review of visual odometry: Types, approaches, challenges, and applications. SpringerPlus, Springer.","DOI":"10.1186\/s40064-016-3573-7"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Klein, G., and Murray, D. (2007, January 13\u201316). Parallel tracking and mapping for small AR workspaces. Proceedings of the 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality, Nara, Japan.","DOI":"10.1109\/ISMAR.2007.4538852"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Strasdat, H., Montiel, J., and Davison, A.J. (2010). Scale drift-aware large scale monocular SLAM. Robotics: Science and Systems VI, MIT Press.","DOI":"10.15607\/RSS.2010.VI.010"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Wang, R., Schw\u00f6rer, M., and Cremers, D. (2017, January 22\u201329). Stereo DSO: Large-scale direct sparse visual odometry with stereo cameras. Proceedings of the International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.421"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"1767","DOI":"10.1109\/TSMCB.2012.2230164","article-title":"Correcting Scale Drift by Object Recognition in Single-Camera SLAM","volume":"43","author":"Botterill","year":"2013","journal-title":"IEEE Trans. Cybern."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Lothe, P., Bourgeois, S., Royer, E., Dhome, M., and Naudet-Collette, S. (2010, January 13\u201318). Real-time vehicle global localisation with a single camera in dense urban areas: Exploitation of coarse 3D city models. Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA.","DOI":"10.1109\/CVPR.2010.5540127"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Gakne, P.V., and O\u2019Keefe, K. (2018, January 13\u201316). Tackling the Scale Factor Issue in A Monocular Visual Odometry Using A 3D City Model. Proceedings of the ITSNT 2018, International Technical Symposium on Navigation and Timing, Toulouse, France.","DOI":"10.31701\/itsnt2018.20"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Weiss, S., and Siegwart, R. (2011, January 9\u201313). Real-time metric state estimation for modular vision-inertial systems. Proceedings of the 2011 IEEE International Conference on Robotics and Automation, Shanghai, China.","DOI":"10.1109\/ICRA.2011.5979982"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1007\/s10846-010-9490-z","article-title":"Fusion of IMU and vision for absolute scale estimation in monocular SLAM","volume":"61","author":"Weiss","year":"2011","journal-title":"J. Intell. Rob. Syst."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Knorr, S.B., and Kurz, D. (2016, January 19\u201323). Leveraging the User\u2019s Face for Absolute Scale Estimation in Handheld Monocular SLAM. Proceedings of the 2016 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), Merida, Mexico.","DOI":"10.1109\/ISMAR.2016.20"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Lupton, T., and Sukkarieh, S. (2008, January 19\u201323). Removing scale biases and ambiguity from 6DoF monocular SLAM using inertial. Proceedings of the 2008 IEEE International Conference on Robotics and Automation, Pasadena, CA, USA.","DOI":"10.1109\/ROBOT.2008.4543778"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Guti\u00e9rrez-G\u00f3mez, D., and Guerrero, J.J. (2013, January 8\u201312). Scaled monocular SLAM for walking people. Proceedings of the 2013 International Symposium on Wearable Computers, Zurich, Switzerland.","DOI":"10.1145\/2493988.2494351"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Lepp\u00e4koski, H., Collin, J., and Takala, J. (2012, January 25\u201330). Pedestrian navigation based on inertial sensors, indoor map, and WLAN signals. Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Kyoto, Japan.","DOI":"10.1109\/ICASSP.2012.6288192"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/j.compind.2015.05.003","article-title":"Fusion of visual odometry and inertial navigation system on a smartphone","volume":"74","author":"Tomazic","year":"2015","journal-title":"Comput. Ind."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"8507","DOI":"10.3390\/s120708507","article-title":"Step Length Estimation Using Handheld Inertial Sensors","volume":"12","author":"Renaudin","year":"2012","journal-title":"Sensors"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"1539","DOI":"10.3390\/s130201539","article-title":"Motion Mode Recognition and Step Detection Algorithms for Mobile Phone Users","volume":"13","author":"Susi","year":"2013","journal-title":"Sensors"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1186\/s41074-017-0027-2","article-title":"Visual SLAM algorithms: A survey from 2010 to 2016","volume":"9","author":"Taketomi","year":"2017","journal-title":"IPSJ Trans. Comput. Vis. Appl."},{"key":"ref_40","unstructured":"Bouguet, J.Y. (2019, February 22). Camera Calibration Toolbox for Matlab. Available online: http:\/\/www.vision.caltech.edu\/bouguetj\/calib_doc\/."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Hartley, R., and Zisserman, A. (2003). Multiple View Geometry in Computer Vision, Cambridge University Press.","DOI":"10.1017\/CBO9780511811685"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"1330","DOI":"10.1109\/34.888718","article-title":"A flexible new technique for camera calibration","volume":"22","author":"Zhang","year":"2000","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1016\/j.cviu.2007.09.014","article-title":"Speeded-Up Robust Features (SURF)","volume":"110","author":"Bay","year":"2008","journal-title":"Comput. Vis. Image Underst."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"138","DOI":"10.1006\/cviu.1999.0832","article-title":"MLESAC","volume":"78","author":"Torr","year":"2000","journal-title":"Comput. Vis. Image Underst."},{"key":"ref_45","unstructured":"Wu, Y. (arXiv, 2016). Image Based Camera Localization: An Overview, arXiv."},{"key":"ref_46","unstructured":"Cai, J., and Miklavcic, S. (2011, January 26\u201328). The generation of digital terrain models from LiDAR data using seeding and filtering and its application to flood modelling. Proceedings of the 2011 International Conference on Multimedia Technology, Hangzhou, China."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"431","DOI":"10.1002\/navi.254","article-title":"Fusion of 3D GIS, Vision, Inertial and Magnetic Data for Improved Urban Pedestrian Navigation and Augmented Reality Applications","volume":"65","author":"Antigny","year":"2018","journal-title":"Navigation"},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"4080479","DOI":"10.1155\/2017\/4080479","article-title":"A New PDR Navigation Device for Challenging Urban Environments","volume":"2017","author":"Ortiz","year":"2017","journal-title":"J. Sens."},{"key":"ref_49","unstructured":"Signorelli, V., and Leduc, T. (2015). Utilisation du Socle 3D CityGML Nantes-Secteur Centre-Ville, UMR CNRS 1563 AAU\/CRENAU. Technical Report."},{"key":"ref_50","unstructured":"Scornec, J.L., Ortiz, M., and Renaudin, V. (2017, January 18\u201321). Foot-mounted pedestrian navigation reference with tightly coupled GNSS carrier phases, inertial and magnetic data. Proceedings of the 2017 International Conference on Indoor Positioning and Indoor Navigation (IPIN), Sapporo, Japan."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"22864","DOI":"10.3390\/s141222864","article-title":"Magnetic, Acceleration Fields and Gyroscope Quaternion (MAGYQ)-Based Attitude Estimation with Smartphone Sensors for Indoor Pedestrian Navigation","volume":"14","author":"Renaudin","year":"2014","journal-title":"Sensors"},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Antigny, N., Servieres, M., and Renaudin, V. (2017, January 18\u201321). Pedestrian Track Estimation with Handheld Monocular Camera and Inertial-Magnetic Sensor for Urban Augmented Reality. Proceedings of the 2017 International Conference on Indoor Positioning and Indoor Navigation (IPIN), Sapporo, Japan.","DOI":"10.1109\/IPIN.2017.8115934"},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Engel, J., Sturm, J., and Cremers, D. (2013, January 1\u20138). Semi-dense visual odometry for a monocular camera. Proceedings of the 2013 IEEE International Conference on Computer Vision (ICCV), Sydney, Australia.","DOI":"10.1109\/ICCV.2013.183"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/19\/4\/953\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T12:34:23Z","timestamp":1760186063000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/19\/4\/953"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,2,23]]},"references-count":53,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2019,2]]}},"alternative-id":["s19040953"],"URL":"https:\/\/doi.org\/10.3390\/s19040953","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,2,23]]}}}