{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,2]],"date-time":"2026-02-02T22:12:24Z","timestamp":1770070344965,"version":"3.49.0"},"reference-count":65,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2022,2,17]],"date-time":"2022-02-17T00:00:00Z","timestamp":1645056000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41771457"],"award-info":[{"award-number":["41771457"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012130","name":"Aeronautical Science Foundation of China","doi-asserted-by":"publisher","award":["2019460S5001"],"award-info":[{"award-number":["2019460S5001"]}],"id":[{"id":"10.13039\/501100012130","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Unmanned Aerial Vehicles (UAVs) require the ability to robustly perceive surrounding scenes for autonomous navigation. The semantic reconstruction of the scene is a truly functional understanding of the environment. However, high-performance computing is generally not available on most UAVs, so a lightweight real-time semantic reconstruction method is necessary. Existing methods rely on GPU, and it is difficult to achieve real-time semantic reconstruction on CPU. To solve the problem, an indoor dense semantic Simultaneous Localization and Mapping (SLAM) method using CPU computing is proposed in this paper, named CDSFusion. The CDSFusion is the first system integrating RGBD-based Visual-Inertial Odometry (VIO), semantic segmentation and 3D reconstruction in real-time on a CPU. In our VIO method, the depth information is introduced to improve the accuracy of pose estimation, and FAST features are used for faster tracking. In our semantic reconstruction method, the PSPNet (Pyramid Scene Parsing Network) pre-trained model is optimized to provide the semantic information in real-time on the CPU, and the semantic point clouds are fused using Voxblox. The experimental results demonstrate that camera tracking is accelerated without loss of accuracy in our VIO, and a 3D semantic map is reconstructed in real-time, which is comparable to one generated by the GPU-dependent method.<\/jats:p>","DOI":"10.3390\/rs14040979","type":"journal-article","created":{"date-parts":[[2022,2,17]],"date-time":"2022-02-17T20:26:41Z","timestamp":1645129601000},"page":"979","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["CDSFusion: Dense Semantic SLAM for Indoor Environment Using CPU Computing"],"prefix":"10.3390","volume":"14","author":[{"given":"Sheng","family":"Wang","sequence":"first","affiliation":[{"name":"State Key Laboratory Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430070, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7198-4735","authenticated-orcid":false,"given":"Guohua","family":"Gou","sequence":"additional","affiliation":[{"name":"State Key Laboratory Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430070, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Haigang","family":"Sui","sequence":"additional","affiliation":[{"name":"State Key Laboratory Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430070, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yufeng","family":"Zhou","sequence":"additional","affiliation":[{"name":"State Key Laboratory Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430070, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hao","family":"Zhang","sequence":"additional","affiliation":[{"name":"State Key Laboratory Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430070, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jiajie","family":"Li","sequence":"additional","affiliation":[{"name":"State Key Laboratory Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430070, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2022,2,17]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Enqvist, O., Kahl, F., and Olsson, C. (2011, January 6\u201313). Non-sequential structure from motion. Proceedings of the 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), Barcelona, Spain.","DOI":"10.1109\/ICCVW.2011.6130252"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Sch\u00f6ps, T., Sch\u00f6nberger, J.L., Galliani, S., Sattler, T., Schindler, K., Pollefeys, M., and Geiger, A. (2017, January 21\u201326). A Multi-view Stereo Benchmark with High-Resolution Images and Multi-camera Videos. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.272"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1309","DOI":"10.1109\/TRO.2016.2624754","article-title":"Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age","volume":"32","author":"Cadena","year":"2016","journal-title":"IEEE Trans. Robot."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"2481","DOI":"10.1109\/TPAMI.2016.2644615","article-title":"SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation","volume":"39","author":"Badrinarayanan","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017, January 21\u201326). Pyramid Scene Parsing Network. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.660"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Hu, R., Doll\u00e1r, P., He, K., Darrell, T., and Girshick, R. (2018, January 18\u201323). Learning to Segment Every Thing. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00445"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Bao, S.Y.Z., and Savarese, S. (2011, January 21\u201323). Semantic Structure from Motion. Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Colorado Springs, CO, USA.","DOI":"10.1109\/CVPR.2011.5995462"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Bowman, S.L., Atanasov, N., Daniilidis, K., and Pappas, G.J. (June, January 29). Probabilistic data association for semantic SLAM. Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA), Singapore.","DOI":"10.1109\/ICRA.2017.7989203"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"3037","DOI":"10.1109\/LRA.2019.2923960","article-title":"Volumetric Instance-Aware Semantic Mapping and 3D Object Discovery","volume":"4","author":"Grinvald","year":"2019","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1111\/cgf.13820","article-title":"Active Scene Understanding via Online Semantic Reconstruction","volume":"38","author":"Zheng","year":"2019","journal-title":"Comput. Graph. Forum."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Salas-Moreno, R.F., Newcombe, R.A., Strasdat, H., Kelly, P.H.J., and Davison, A.J. (2013, January 23\u201328). SLAM++: Simultaneous Localisation and Mapping at the Level of Objects. Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA.","DOI":"10.1109\/CVPR.2013.178"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"McCormac, J., Handa, A., Davison, A., and Leutenegger, S. (June, January 29). SemanticFusion: Dense 3D semantic mapping with convolutional neural networks. Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA), Singapore.","DOI":"10.1109\/ICRA.2017.7989538"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Rosinol, A., Abate, M., Chang, Y., and Carlone, L. (August, January 31). Kimera: An Open-Source Library for Real-Time Metric-Semantic Localization and Mapping. Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France.","DOI":"10.1109\/ICRA40945.2020.9196885"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1004","DOI":"10.1109\/TRO.2018.2853729","article-title":"VINS-Mono: A Robust and Versatile Monocular Visual-Inertial State Estimator","volume":"34","author":"Qin","year":"2018","journal-title":"IEEE Trans. Robot."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1109\/TPAMI.2008.275","article-title":"Faster and Better: A Machine Learning Approach to Corner Detection","volume":"32","author":"Rosten","year":"2010","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Jianbo, S. (1994, January 21\u201323). Good features to track. Proceedings of the 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR.1994.323794"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Oleynikova, H., Taylor, Z., Fehr, M., Siegwart, R., and Nieto, J. (2017, January 24\u201328). Voxblox: Incremental 3D Euclidean Signed Distance Fields for on-board MAV planning. Proceedings of the 2017 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada.","DOI":"10.1109\/IROS.2017.8202315"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Davison (2003, January 13\u201316). Real-time simultaneous localisation and mapping with a single camera. Proceedings of the Proceedings Ninth IEEE International Conference on Computer Vision, Nice, France.","DOI":"10.1109\/ICCV.2003.1238654"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"1052","DOI":"10.1109\/TPAMI.2007.1049","article-title":"MonoSLAM: Real-Time Single Camera SLAM","volume":"29","author":"Davison","year":"2007","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"932","DOI":"10.1109\/TRO.2008.2003276","article-title":"Inverse Depth Parametrization for Monocular SLAM","volume":"24","author":"Civera","year":"2008","journal-title":"IEEE Trans. Robot."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Klein, G., and Murray, D. (2007, January 13\u201316). Parallel Tracking and Mapping for Small AR Workspaces. Proceedings of the 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality, Nara, Japan.","DOI":"10.1109\/ISMAR.2007.4538852"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Strasdat, H., Davison, A.J., Montiel, J.M.M., and Konolige, K. (2011, January 6\u201313). Double window optimisation for constant time visual SLAM. Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126517"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1147","DOI":"10.1109\/TRO.2015.2463671","article-title":"ORB-SLAM: A Versatile and Accurate Monocular SLAM System","volume":"31","author":"Montiel","year":"2015","journal-title":"IEEE Trans. Robot."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"1255","DOI":"10.1109\/TRO.2017.2705103","article-title":"ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras","volume":"33","year":"2017","journal-title":"IEEE Trans. Robot."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Rublee, E., Rabaud, V., Konolige, K., and Bradski, G. (2011, January 6\u201313). ORB: An efficient alternative to SIFT or SURF. Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126544"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"1188","DOI":"10.1109\/TRO.2012.2197158","article-title":"Bags of Binary Words for Fast Place Recognition in Image Sequences","volume":"28","author":"Tardos","year":"2012","journal-title":"IEEE Trans. Robot."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Engel, J., Schoeps, T., and Cremers, D. (2014, January 6\u201312). LSD-SLAM: Large-Scale Direct Monocular SLAM. Proceedings of the Computer Vision\u2014ECCV 2014, PT II, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10605-2_54"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Forster, C., Pizzoli, M., and Scaramuzza, D. (June, January 31). SVO: Fast semi-direct monocular visual odometry. Proceedings of the 2014 IEEE International Conference on Robotics and Automation (ICRA), Hong Kong, China.","DOI":"10.1109\/ICRA.2014.6906584"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1109\/TRO.2016.2623335","article-title":"SVO: Semidirect Visual Odometry for Monocular and Multicamera Systems","volume":"33","author":"Forster","year":"2017","journal-title":"IEEE Trans. Robot."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"611","DOI":"10.1109\/TPAMI.2017.2658577","article-title":"Direct Sparse Odometry","volume":"40","author":"Engel","year":"2018","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1363","DOI":"10.1109\/TRO.2020.2991614","article-title":"Direct Sparse Mapping","volume":"36","author":"Zubizarreta","year":"2020","journal-title":"IEEE Trans. Robot."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Weiss, S., Achtelik, M.W., Lynen, S., Chli, M., and Siegwart, R. (2012, January 14\u201318). Real-time onboard visual-inertial state estimation and self-calibration of MAVs in unknown environments. Proceedings of the 2012 IEEE International Conference on Robotics and Automation, Saint Paul, MN, USA.","DOI":"10.1109\/ICRA.2012.6225147"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Lynen, S., Achtelik, M.W., Weiss, S., Chli, M., and Siegwart, R. (2013, January 3\u20137). A robust and modular multi-sensor fusion approach applied to MAV navigation. Proceedings of the 2013 IEEE\/RSJ International Conference on Intelligent Robots and Systems, Tokyo, Japan.","DOI":"10.1109\/IROS.2013.6696917"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Mourikis, A.I., and Roumeliotis, S.I. (2007, January 10\u201314). A Multi-State Constraint Kalman Filter for Vision-aided Inertial Navigation. Proceedings of the Proceedings 2007 IEEE International Conference on Robotics and Automation, Rome, Italy.","DOI":"10.1109\/ROBOT.2007.364024"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Bloesch, M., Omari, S., Hutter, M., and Siegwart, R. (October, January 28). Robust visual inertial odometry using a direct EKF-based approach. Proceedings of the 2015 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Hamburg, Germany.","DOI":"10.1109\/IROS.2015.7353389"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"314","DOI":"10.1177\/0278364914554813","article-title":"Keyframe-based visual-inertial odometry using nonlinear optimization","volume":"34","author":"Leutenegger","year":"2015","journal-title":"Int. J. Robot. Res."},{"key":"ref_37","unstructured":"Qin, T., Cao, S., Pan, J., Li, P., and Shen, S. (2019, January 12). VINS-Fusion: An Optimization-Based Multi-Sensor State Estimator. Available online: https:\/\/github.com\/HKUST-Aerial-Robotics\/VINS-Fusion."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"1874","DOI":"10.1109\/TRO.2021.3075644","article-title":"ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual\u2013Inertial, and Multimap SLAM","volume":"37","author":"Campos","year":"2021","journal-title":"IEEE Trans. Robot."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Brunetto, N., Salti, S., Fioraio, N., Cavallari, T., and Stefano, L.D. (2015, January 7\u201313). Fusion of Inertial and Visual Measurements for RGB-D SLAM on Mobile Devices. Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop (ICCVW), Santiago, Chile.","DOI":"10.1109\/ICCVW.2015.29"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Falquez, J.M., Kasper, M., and Sibley, G. (2016, January 9\u201314). Inertial aided dense & semi-dense methods for robust direct visual odometry. Proceedings of the 2016 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Daejeon, Korea.","DOI":"10.1109\/IROS.2016.7759530"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Laidlow, T., Bloesch, M., Li, W., and Leutenegger, S. (2017, January 24\u201328). Dense RGB-D-inertial SLAM with map deformations. Proceedings of the 2017 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada.","DOI":"10.1109\/IROS.2017.8206591"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Ling, Y., Liu, H., Zhu, X., Jiang, J., and Liang, B. (2018, January 5\u20138). RGB-D Inertial Odometry for Indoor Robot via Keyframe-based Nonlinear Optimization. Proceedings of the 2018 IEEE International Conference on Mechatronics and Automation (ICMA), Changchun, China.","DOI":"10.1109\/ICMA.2018.8484687"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Shan, Z., Li, R., and Schwertfeger, S. (2019). RGBD-Inertial Trajectory Estimation and Mapping for Ground Robots. Sensors, 19.","DOI":"10.3390\/s19102251"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Izadi, S., Kim, D., Hilliges, O., Molyneaux, D., Newcombe, R., Kohli, P., Shotton, J., Hodges, S., Freeman, D., and Davison, A. (2011, January 16\u201319). KinectFusion: Real-time 3D reconstruction and interaction using a moving depth camera. Proceedings of the 24th annual ACM symposium on User interface software and technology, Santa Barbara, CA, USA.","DOI":"10.1145\/2047196.2047270"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Niessner, M., Zollhoefer, M., Izadi, S., and Stamminger, M. (2013). Real-time 3D Reconstruction at Scale using Voxel Hashing. ACM Trans. Graph., 32.","DOI":"10.1145\/2508363.2508374"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Whelan, T., Leutenegger, S., Salas-Moreno, R.E., Ben, G., and Davison, A.J. (2015, January 13\u201317). ElasticFusion: Dense SLAM Without A Pose Graph. Proceedings of the Robotics: Science and Systems XI, Rome, Italy.","DOI":"10.15607\/RSS.2015.XI.001"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3072959.3054739","article-title":"BundleFusion: Real-Time Globally Consistent 3D Reconstruction Using On-the-Fly Surface Reintegration","volume":"36","author":"Dai","year":"2017","journal-title":"ACM Trans. Graph."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1007\/s10514-012-9321-0","article-title":"OctoMap: An efficient probabilistic 3D mapping framework based on octrees","volume":"34","author":"Hornung","year":"2013","journal-title":"Auton. Robot."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"1241","DOI":"10.1109\/TVCG.2015.2459891","article-title":"Very High Frame Rate Volumetric Integration of Depth Images on Mobile Devices","volume":"21","author":"Kaehler","year":"2015","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Han, L., and Fang, L. (2018, January 26\u201330). FlashFusion: Real-time Globally Consistent Dense 3D Reconstruction using CPU Computing. Proceedings of the Robotics: Science and Systems XIV, Pittsburgh, PA, USA.","DOI":"10.15607\/RSS.2018.XIV.006"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Wang, K., Gao, F., and Shen, S. (2019, January 20\u201324). Real-time Scalable Dense Surfel Mapping. Proceedings of the 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada.","DOI":"10.1109\/ICRA.2019.8794101"},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1109\/LRA.2019.2953859","article-title":"Voxgraph: Globally Consistent, Volumetric Mapping Using Signed Distance Function Submaps","volume":"5","author":"Reijgwart","year":"2020","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"R\u00fcnz, M., and Agapito, L. (June, January 29). Co-fusion: Real-time segmentation, tracking and fusion of multiple objects. Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA), Singapore.","DOI":"10.1109\/ICRA.2017.7989518"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Runz, M., Buffier, M., and Agapito, L. (2018, January 16\u201320). MaskFusion: Real-Time Recognition, Tracking and Reconstruction of Multiple Moving Objects. Proceedings of the 2018 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), Munich, Germany.","DOI":"10.1109\/ISMAR.2018.00024"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Xu, B., Li, W., Tzoumanikas, D., Bloesch, M., Davison, A., and Leutenegger, S. (2019, January 20\u201324). MID-Fusion: Octree-based Object-Level Multi-Instance Dynamic SLAM. Proceedings of the 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada.","DOI":"10.1109\/ICRA.2019.8794371"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Narita, G., Seno, T., Ishikawa, T., and Kaji, Y. (2019, January 3\u20138). PanopticFusion: Online Volumetric Semantic Mapping at the Level of Stuff and Things. Proceedings of the 2019 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Macau, China.","DOI":"10.1109\/IROS40897.2019.8967890"},{"key":"ref_57","unstructured":"Lucas, B.D., and Kanade, T. (1981, January 24\u201328). An Iterative Image Registration Technique with an Application to Stereo Vision. Proceedings of the International Joint Conference on Artificial Intelligence, Vancouver, BC, Canada."},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1007\/s11263-008-0152-6","article-title":"EPnP: An Accurate O(n) Solution to the PnP Problem","volume":"81","author":"Lepetit","year":"2009","journal-title":"Int. J. Comput. Vis."},{"key":"ref_59","unstructured":"Agarwal, S., and Mierle, K. (2020, May 14). Ceres Solver. Available online: http:\/\/ceres-solver.org."},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Calonder, M., Lepetit, V., Strecha, C., and Fua, P. (2010, January 5\u201311). BRIEF: Binary Robust Independent Elementary Features. Proceedings of the Computer Vision-ECCV 2010, PT IV, Crete, Greece.","DOI":"10.1007\/978-3-642-15561-1_56"},{"key":"ref_61","doi-asserted-by":"crossref","unstructured":"Zhang, Z., Scaramuzza, D., and Kosecka, J. (2018, January 1\u20135). A Tutorial on Quantitative Trajectory Evaluation for Visual(-Inertial) Odometry. Proceedings of the 2018 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain.","DOI":"10.1109\/IROS.2018.8593941"},{"key":"ref_62","unstructured":"(2021, November 25). OptiTrack. Available online: https:\/\/optitrack.com\/."},{"key":"ref_63","unstructured":"Xuan, Z., and David, F. (2021, July 25). Real-Time Voxel Based 3D Semantic Mapping with a Hand Held RGB-D Camera. Available online: https:\/\/github.com\/floatlazer\/semantic_slam."},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Song, S., Lichtenberg, S.P., and Xiao, J. (2015, January 7\u201312). SUN RGB-D: A RGB-D scene understanding benchmark suite. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298655"},{"key":"ref_65","doi-asserted-by":"crossref","unstructured":"Howard, A., Sandler, M., Chu, G., Chen, L.-C., Chen, B., Tan, M., Wang, W., Zhu, Y., Pang, R., and Vasudevan, V. (November, January 27). Searching for MobileNetV3. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV 2019), Seoul, Korea.","DOI":"10.1109\/ICCV.2019.00140"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/4\/979\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T22:21:21Z","timestamp":1760134881000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/4\/979"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,2,17]]},"references-count":65,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2022,2]]}},"alternative-id":["rs14040979"],"URL":"https:\/\/doi.org\/10.3390\/rs14040979","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,2,17]]}}}