{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,20]],"date-time":"2026-01-20T09:11:06Z","timestamp":1768900266472,"version":"3.49.0"},"reference-count":29,"publisher":"MDPI AG","issue":"22","license":[{"start":{"date-parts":[[2022,11,9]],"date-time":"2022-11-09T00:00:00Z","timestamp":1667952000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Science and Technology on Complex System Control and Intelligent Agent Cooperation Laboratory","award":["201101"],"award-info":[{"award-number":["201101"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>We propose a robust RGB-Depth (RGB-D) Visual Odometry (VO) system to improve the localization performance of indoor scenes by using geometric features, including point and line features. Previous VO\/Simultaneous Localization and Mapping (SLAM) algorithms estimate the low-drift camera poses with the Manhattan World (MW)\/Atlanta World (AW) assumption, which limits the applications of such systems. In this paper, we divide the indoor environments into two different scenes: MW and non-MW scenes. The Manhattan scenes are modeled as a Mixture of Manhattan Frames, in which each Manhattan Frame in itself defines a Manhattan World of a specific orientation. Moreover, we provide a method to detect Manhattan Frames (MFs) using the dominant directions extracted from the parallel lines. Our approach is designed with lower computational complexity than existing techniques using planes to detect Manhattan Frame (MF). For MW scenes, we separately estimate rotational and translational motion. A novel method is proposed to estimate the drift-free rotation using MF observations, unit direction vectors of lines, and surface normal vectors. Then, the translation part is recovered from point-line tracking. In non-MW scenes, the tracked and matched dominant directions are combined with the point and line features to estimate the full 6 degree of freedom (DoF) camera poses. Additionally, we exploit the rotation constraints generated from the multi-view dominant directions observations. The constraints are combined with the reprojection errors of points and lines to refine the camera pose through local map bundle adjustment. Evaluations on both synthesized and real-world datasets demonstrate that our approach outperforms state-of-the-art methods. On synthesized datasets, average localization accuracy is 1.5 cm, which is equivalent to state-of-the-art methods. On real-world datasets, the average localization accuracy is 1.7 cm, which outperforms the state-of-the-art methods by 43%. Our time consumption is reduced by 36%.<\/jats:p>","DOI":"10.3390\/s22228644","type":"journal-article","created":{"date-parts":[[2022,11,10]],"date-time":"2022-11-10T02:11:15Z","timestamp":1668046275000},"page":"8644","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Robust Visual Odometry Leveraging Mixture of Manhattan Frames in Indoor Environments"],"prefix":"10.3390","volume":"22","author":[{"given":"Huayu","family":"Yuan","sequence":"first","affiliation":[{"name":"School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chengfeng","family":"Wu","sequence":"additional","affiliation":[{"name":"Science and Technology on Complex System Control and Intelligent Agent Cooperation Laboratory, Beijing 100074, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhongliang","family":"Deng","sequence":"additional","affiliation":[{"name":"School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jiahui","family":"Yin","sequence":"additional","affiliation":[{"name":"School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2022,11,9]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1147","DOI":"10.1109\/TRO.2015.2463671","article-title":"ORB-SLAM: A versatile and accurate monocular SLAM system","volume":"31","author":"Montiel","year":"2015","journal-title":"IEEE Trans. Robot."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"734","DOI":"10.1109\/TRO.2019.2899783","article-title":"PL-SLAM: A stereo SLAM system through the combination of points and line segments","volume":"35","author":"Moreno","year":"2019","journal-title":"IEEE Trans. Robot."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Pumarola, A., Vakhitov, A., Agudo, A., Sanfeliu, A., and Moreno-Noguer, F. (June, January 29). PL-SLAM: Real-time monocular visual SLAM with points and lines. Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA), Singapore.","DOI":"10.1109\/ICRA.2017.7989522"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"14651","DOI":"10.3182\/20080706-5-KR-1001.02481","article-title":"SLAM and data fusion from visual landmarks and 3D planes","volume":"41","author":"Zureiki","year":"2008","journal-title":"IFAC Proc. Vol."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Zhang, X., Wang, W., Qi, X., Liao, Z., and Wei, R. (2019). Point-plane slam using supposed planes for indoor environments. Sensors, 19.","DOI":"10.3390\/s19173795"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Sun, C., Qiao, N., Ge, W., and Sun, J. (2022, January 25\u201327). Robust RGB-D Visual Odometry Using Point and Line Features. Proceedings of the 2022 41st Chinese Control Conference (CCC), Hefei, China.","DOI":"10.23919\/CCC55666.2022.9901644"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"9958","DOI":"10.1109\/ACCESS.2021.3049801","article-title":"PL-GM: RGB-D SLAM With a Novel 2D and 3D Geometric Constraint Model of Point and Line Features","volume":"9","author":"Zhang","year":"2021","journal-title":"IEEE Access"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Kim, P., Coltin, B., and Kim, H.J. (2017, January 4\u20137). Visual Odometry with Drift-Free Rotation Estimation Using Indoor Scene Regularities. Proceedings of the BMVC, London, UK.","DOI":"10.5244\/C.31.62"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Kim, P., Coltin, B., and Kim, H.J. (2018, January 21\u201325). Low-drift visual odometry in structured environments by decoupling rotational and translational motion. Proceedings of the 2018 IEEE International Conference on Robotics and Automation (ICRA), South Brisbane, Australia.","DOI":"10.1109\/ICRA.2018.8463207"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Kim, P., Coltin, B., and Kim, H.J. (2018, January 8\u201314). Linear RGB-D SLAM for planar environments. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01225-0_21"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Joo, K., Oh, T.-H., Rameau, F., Bazin, J.-C., and Kweon, I.S. (31, January May). Linear rgb-d slam for atlanta world. Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France.","DOI":"10.1109\/ICRA40945.2020.9196561"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"6583","DOI":"10.1109\/LRA.2020.3015456","article-title":"Structure-slam: Low-drift monocular slam in indoor environments","volume":"5","author":"Li","year":"2020","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Li, Y., Yunus, R., Brasch, N., Navab, N., and Tombari, F. (June, January 30). RGB-D SLAM with structural regularities. Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi\u2032an, China.","DOI":"10.1109\/ICRA48506.2021.9561560"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Zhou, Y., Kneip, L., Rodriguez, C., and Li, H. (2016). Divide and conquer: Efficient density-based tracking of 3D sensors in Manhattan worlds. Asian Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-319-54193-8_1"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"6512","DOI":"10.1109\/LRA.2020.3014648","article-title":"Visual SLAM With Drift-Free Rotation Estimation in Manhattan World","volume":"5","author":"Liu","year":"2020","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1628","DOI":"10.1007\/s11431-020-1602-3","article-title":"A robust visual SLAM system in dynamic man-made environments","volume":"63","author":"Liu","year":"2020","journal-title":"Sci. China Technol. Sci."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Shu, F., Xie, Y., Rambach, J., Pagani, A., and Stricker, D. (2021, January 4\u20138). Visual SLAM with Graph-Cut Optimized Multi-Plane Reconstruction. Proceedings of the 2021 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct), Bari, Italy.","DOI":"10.1109\/ISMAR-Adjunct54149.2021.00042"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"327","DOI":"10.5194\/isprs-archives-XLIII-B4-2022-327-2022","article-title":"Structural line feature selection for improving indoor visual slam","volume":"43","author":"Xia","year":"2022","journal-title":"Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Yunus, R., Li, Y., and Tombari, F. (June, January 30). Manhattanslam: Robust planar tracking and mapping leveraging mixture of manhattan frames. Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi\u2032an, China.","DOI":"10.1109\/ICRA48506.2021.9562030"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"2803","DOI":"10.1109\/LRA.2022.3142900","article-title":"MSC-VO: Exploiting Manhattan and Structural Constraints for Visual Odometry","volume":"7","author":"Ortiz","year":"2022","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Straub, J., Rosman, G., Freifeld, O., Leonard, J.J., and Fisher, J.W. (2014, January 23\u201328). A mixture of manhattan frames: Beyond the manhattan world. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.488"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1255","DOI":"10.1109\/TRO.2017.2705103","article-title":"Orb-slam2: An open-source slam system for monocular, stereo, and rgb-d cameras","volume":"33","year":"2017","journal-title":"IEEE Trans. Robot."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Rublee, E., Rabaud, V., Konolige, K., and Bradski, G. (2011, January 6\u201313). ORB: An efficient alternative to SIFT or SURF. Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126544"},{"key":"ref_24","first-page":"722","article-title":"LSD: A fast line segment detector with a false detection control","volume":"32","author":"Jakubowicz","year":"2008","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"794","DOI":"10.1016\/j.jvcir.2013.05.006","article-title":"An efficient and robust line segment matching approach based on LBD descriptor and pairwise geometric consistency","volume":"24","author":"Zhang","year":"2013","journal-title":"J. Vis. Commun. Image Represent."},{"key":"ref_26","unstructured":"K\u00fcmmerle, R., Grisetti, G., Strasdat, H., Konolige, K., and Burgard, W. (2011, January 9\u201313). g2o: A general framework for graph optimization. Proceedings of the 2011 IEEE International Conference on Robotics and Automation, Shanghai, China."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Handa, A., Whelan, T., McDonald, J., and Davison, A.J. (June, January 31). A benchmark for RGB-D visual odometry, 3D reconstruction and SLAM. Proceedings of the 2014 IEEE International Conference on Robotics and Automation (ICRA), Hong Kong, China.","DOI":"10.1109\/ICRA.2014.6907054"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Sturm, J., Engelhard, N., Endres, F., Burgard, W., and Cremers, D. (2012, January 7\u201312). A benchmark for the evaluation of RGB-D SLAM systems. Proceedings of the 2012 IEEE\/RSJ International Conference on Intelligent Robots and Systems, Loul\u00e9, Portugal.","DOI":"10.1109\/IROS.2012.6385773"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Lu, Y., and Song, D. (2015, January 7\u201313). Robust RGB-D odometry using point and line features. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.448"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/22\/8644\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:13:12Z","timestamp":1760145192000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/22\/8644"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,11,9]]},"references-count":29,"journal-issue":{"issue":"22","published-online":{"date-parts":[[2022,11]]}},"alternative-id":["s22228644"],"URL":"https:\/\/doi.org\/10.3390\/s22228644","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,11,9]]}}}