{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T03:23:40Z","timestamp":1740108220743,"version":"3.37.3"},"reference-count":91,"publisher":"Springer Science and Business Media LLC","issue":"6","license":[{"start":{"date-parts":[[2024,10,17]],"date-time":"2024-10-17T00:00:00Z","timestamp":1729123200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,10,17]],"date-time":"2024-10-17T00:00:00Z","timestamp":1729123200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100008349","name":"Universit\u00e4t Duisburg-Essen","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100008349","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Machine Vision and Applications"],"published-print":{"date-parts":[[2024,11]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Pose estimation is an important component of many real-world computer vision systems. Most existing pose estimation algorithms need a large number of point correspondences to accurately determine the pose of an object. Since the number of point correspondences depends on the object\u2019s appearance, lighting and other external conditions, detecting many points may not be feasible. In many real-world applications, the movement of objects is limited, e.g. due to gravity. Hence, detecting objects with only three degrees of freedom is usually sufficient. This allows us to improve the accuracy of pose estimation by changing the underlying equations of the perspective-n-point problem to three variables instead of six. By using the simplified equations, our algorithm is more robust against detection errors with limited point correspondences. In this article, we study three scenarios where such constraints apply. The first one is about parking a vehicle on a specific spot. Here, a stationary camera is detecting the vehicle to assist the driver. The second scenario describes the perspective of a moving camera detecting objects in its environment. This scenario is common for driver assistance systems, autonomous cars or mobile robots. Third, we describe a camera observing objects from a birds-eye view, which occurs in industrial applications. In all three scenarios, observed objects can only move in the ground plane and rotate around the vertical axis. Hence, three degrees of freedom are sufficient to estimate the pose. Experiments with synthetic data and real-world photographs have shown that our algorithm outperforms state-of-the-art pose estimation algorithms. Depending on the scenario, our algorithm is able to achieve 50% better accuracy, while being equally fast.<\/jats:p>","DOI":"10.1007\/s00138-024-01618-z","type":"journal-article","created":{"date-parts":[[2024,10,17]],"date-time":"2024-10-17T14:02:54Z","timestamp":1729173774000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Axes-aligned non-linear optimized PnP algorithm"],"prefix":"10.1007","volume":"35","author":[{"given":"Peter","family":"Roch","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bijan","family":"Shahbaz\u00a0Nejad","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Marcus","family":"Handte","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Pedro Jos\u00e9","family":"Marr\u00f3n","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,10,17]]},"reference":[{"key":"1618_CR1","doi-asserted-by":"publisher","unstructured":"Ahmadyan, A., Zhang, L., Ablavatski, A., et\u00a0al.: Objectron: A large scale dataset of object-centric videos in the wild with pose annotations. In: 2021 IEEE\/CVF conference on computer vision and pattern recognition (CVPR), (2021). https:\/\/doi.org\/10.1109\/CVPR46437.2021.00773","DOI":"10.1109\/CVPR46437.2021.00773"},{"key":"1618_CR2","doi-asserted-by":"publisher","DOI":"10.1057\/jors.1985.68","author":"M Al-Baali","year":"1985","unstructured":"Al-Baali, M., Fletcher, R.: Variational methods for non-linear least-squares. J. Op. Res. Soc. (1985). https:\/\/doi.org\/10.1057\/jors.1985.68","journal-title":"J. Op. Res. Soc."},{"key":"1618_CR3","doi-asserted-by":"publisher","unstructured":"Barrois, B., Hristova, S., Wohler, C., et\u00a0al.: 3D Pose estimation of vehicles using a stereo camera. In: 2009 IEEE intelligent vehicles symposium (2009). https:\/\/doi.org\/10.1109\/IVS.2009.5164289","DOI":"10.1109\/IVS.2009.5164289"},{"key":"1618_CR4","doi-asserted-by":"publisher","unstructured":"Bujnak, M., Kukelova, Z., Pajdla, T.: A general solution to the p4p problem for camera with unknown focal length. In: 2008 IEEE conference on computer vision and pattern recognition (2008). https:\/\/doi.org\/10.1109\/CVPR.2008.4587793","DOI":"10.1109\/CVPR.2008.4587793"},{"key":"1618_CR5","doi-asserted-by":"publisher","DOI":"10.1007\/BF01585997","author":"JV Burke","year":"1995","unstructured":"Burke, J.V., Ferris, M.C.: A gauss\u2013newton method for convex composite optimization. Math. Program. (1995). https:\/\/doi.org\/10.1007\/BF01585997","journal-title":"Math. Program."},{"key":"1618_CR6","doi-asserted-by":"publisher","unstructured":"Caesar, H., Bankiti, V., Lang, AH., et\u00a0al.: nuScenes: A multimodal dataset for autonomous driving. In: 2020 IEEE\/CVF conference on computer vision and pattern recognition (CVPR), (2020). https:\/\/doi.org\/10.1109\/CVPR42600.2020.01164","DOI":"10.1109\/CVPR42600.2020.01164"},{"key":"1618_CR7","doi-asserted-by":"publisher","unstructured":"Chaudhury, K., DiVerdi, S., Ioffe, S.: Auto-rectification of user photos. In: 2014 IEEE international conference on image processing (ICIP), (2014). https:\/\/doi.org\/10.1109\/ICIP.2014.7025706","DOI":"10.1109\/ICIP.2014.7025706"},{"key":"1618_CR8","doi-asserted-by":"publisher","unstructured":"Chen, J., Zhang, L., Liu, Y., et\u00a0al.: Survey on 6d pose estimation of rigid object. In: 2020 39th Chinese control conference (CCC), (2020). https:\/\/doi.org\/10.23919\/CCC50068.2020.9189304","DOI":"10.23919\/CCC50068.2020.9189304"},{"key":"1618_CR9","doi-asserted-by":"publisher","unstructured":"Choi, C., Christensen, HI.: 3d pose estimation of daily objects using an rgb-d camera. In: 2012 IEEE\/RSJ international conference on intelligent robots and systems (2012). https:\/\/doi.org\/10.1109\/IROS.2012.6386067","DOI":"10.1109\/IROS.2012.6386067"},{"key":"1618_CR10","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-014-0725-5","author":"T Collins","year":"2014","unstructured":"Collins, T., Bartoli, A.: Infinitesimal plane-based pose estimation. Int. J. Comput. Visi (2014). https:\/\/doi.org\/10.1007\/s11263-014-0725-5","journal-title":"Int. J. Comput. Visi"},{"key":"1618_CR11","doi-asserted-by":"publisher","unstructured":"Dhall, A., Dai, D., Van\u00a0Gool, L.: Real-time 3d traffic cone detection for autonomous driving. In: 2019 IEEE intelligent vehicles symposium (2019). (IVhttps:\/\/doi.org\/10.1109\/IVS.2019.8814089","DOI":"10.1109\/IVS.2019.8814089"},{"key":"1618_CR12","doi-asserted-by":"publisher","unstructured":"Ding, Y., Barath, D., Yang, J., et\u00a0al.: Globally optimal relative pose estimation with gravity prior. In: 2021 IEEE\/CVF Conference on computer vision and pattern recognition (CVPR), (2021). https:\/\/doi.org\/10.1109\/CVPR46437.2021.00046","DOI":"10.1109\/CVPR46437.2021.00046"},{"key":"1618_CR13","doi-asserted-by":"publisher","unstructured":"Einsiedler, J., Becker, D., Radusch, I.: External visual positioning system for enclosed carparks. In: 2014 11th Workshop on positioning, navigation and communication (2014). (WPNChttps:\/\/doi.org\/10.1109\/WPNC.2014.6843287","DOI":"10.1109\/WPNC.2014.6843287"},{"key":"1618_CR14","doi-asserted-by":"publisher","DOI":"10.1145\/3524496","author":"Z Fan","year":"2022","unstructured":"Fan, Z., Zhu, Y., He, Y., et al.: Deep learning on monocular object pose detection and tracking: a comprehensive overview. ACM Comput. Surv. (2022). https:\/\/doi.org\/10.1145\/3524496","journal-title":"ACM Comput. Surv."},{"issue":"1145\/358669","key":"1618_CR15","first-page":"358692","volume":"10","author":"MA Fischler","year":"1981","unstructured":"Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. A 10(1145\/358669), 358692 (1981)","journal-title":"Commun. A"},{"key":"1618_CR16","doi-asserted-by":"publisher","DOI":"10.1093\/imanum\/7.3.371","author":"R Fletcher","year":"1987","unstructured":"Fletcher, R., Xu, C.: Hybrid methods for nonlinear least squares. IMA J. Numer. Anal. (1987). https:\/\/doi.org\/10.1093\/imanum\/7.3.371","journal-title":"IMA J. Numer. Anal."},{"key":"1618_CR17","doi-asserted-by":"publisher","unstructured":"Fragoso, V., DeGol, J., Hua, G.: gDLS*: Generalized pose-and-scale estimation given scale and gravity priors. In: 2021 IEEE international conference on robotics and automation (ICRA), (2020). https:\/\/doi.org\/10.1109\/CVPR42600.2020.00228","DOI":"10.1109\/CVPR42600.2020.00228"},{"key":"1618_CR18","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2003.1217599","author":"XS Gao","year":"2003","unstructured":"Gao, X.S., Hou, X.R., Tang, J., et al.: Complete solution classification for the perspective-three-point problem. IEEE Trans. Pattern Anal. Mach. Intell. (2003). https:\/\/doi.org\/10.1109\/TPAMI.2003.1217599","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"1618_CR19","doi-asserted-by":"publisher","unstructured":"Garro, V., Crosilla, F., Fusiello, A.: Solving the pnp problem with anisotropic orthogonal procrustes analysis. In: visualization & transmission 2012 second international conference on 3D imaging, modeling, processing (2012). https:\/\/doi.org\/10.1109\/3DIMPVT.2012.40","DOI":"10.1109\/3DIMPVT.2012.40"},{"key":"1618_CR20","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2018.08.004","author":"AR Gaspar","year":"2018","unstructured":"Gaspar, A.R., Nunes, A., Pinto, A.M., et al.: Urban@CRAS dataset: benchmarking of visual odometry and SLAM techniques. Robot. Auton. Syst. (2018). https:\/\/doi.org\/10.1016\/j.robot.2018.08.004","journal-title":"Robot. Auton. Syst."},{"key":"1618_CR21","doi-asserted-by":"publisher","DOI":"10.1177\/0278364913491297","author":"A Geiger","year":"2013","unstructured":"Geiger, A., Lenz, P., Stiller, C., et al.: Vision meets robotics: the kitti dataset. Int. J. Robot. Res (2013). https:\/\/doi.org\/10.1177\/0278364913491297","journal-title":"Int. J. Robot. Res"},{"key":"1618_CR22","doi-asserted-by":"publisher","DOI":"10.1007\/s001900050089","author":"EW Grafarend","year":"1997","unstructured":"Grafarend, E.W., Shan, J.: Closed-form solution of P4P or the three-dimensional resection problem in terms of M\u00f6bius barycentric coordinates. J. Geode (1997). https:\/\/doi.org\/10.1007\/s001900050089","journal-title":"J. Geode"},{"key":"1618_CR23","unstructured":"Grunert, JA.: Das Pothenot\u2019sche problem, in erweiterter Gestalt; nebst Bemerkungen \u00fcber seine Anwendung in der Geod\u00e4sie. Archiv der Mathematik und Physik (1841)"},{"key":"1618_CR24","doi-asserted-by":"publisher","unstructured":"Gu, R., Wang, G., Hwang, Jn.: Efficient multi-person hierarchical 3d pose estimation for autonomous driving. In: 2019 IEEE conference on multimedia information processing and retrieval (MIPR), (2019).https:\/\/doi.org\/10.1109\/MIPR.2019.00036","DOI":"10.1109\/MIPR.2019.00036"},{"key":"1618_CR25","doi-asserted-by":"publisher","unstructured":"Hagelskj\u00e6r, F., Savarimuthu, TR., Kr\u00fcger, N., et\u00a0al.: Using spatial constraints for fast set-up of precise pose estimation in an industrial setting. In: 2019 IEEE 15th international conference on automation science and engineering (CASE) (2019). https:\/\/doi.org\/10.1109\/COASE.2019.8842876","DOI":"10.1109\/COASE.2019.8842876"},{"key":"1618_CR26","doi-asserted-by":"publisher","unstructured":"Hajder, L., Barath, D.: Least-squares optimal relative planar motion for vehicle-mounted cameras. In: 2020 IEEE international conference on robotics and automation (ICRA) (2020). https:\/\/doi.org\/10.1109\/ICRA40945.2020.9196755","DOI":"10.1109\/ICRA40945.2020.9196755"},{"key":"1618_CR27","doi-asserted-by":"publisher","unstructured":"Hampali, S., Rad, M., Oberweger, M., et\u00a0al.: HOnnotate: A method for 3D annotation of hand and object poses. In: 2020 IEEE\/CVF conference on computer vision and pattern recognition (CVPR), (2020). https:\/\/doi.org\/10.1109\/CVPR42600.2020.00326","DOI":"10.1109\/CVPR42600.2020.00326"},{"key":"1618_CR28","doi-asserted-by":"publisher","DOI":"10.1016\/j.vrih.2019.10.001","author":"P Han","year":"2019","unstructured":"Han, P., Zhao, G.: A review of edge-based 3d tracking of rigid objects. Virtual Real. & Int. Hardw. (2019). https:\/\/doi.org\/10.1016\/j.vrih.2019.10.001","journal-title":"Virtual Real. & Int. Hardw."},{"key":"1618_CR29","doi-asserted-by":"publisher","unstructured":"Hesch, JA., Roumeliotis, SI.: A direct least-squares (dls) method for pnp. In: 2011 international conference on computer vision, (2011). https:\/\/doi.org\/10.1109\/ICCV.2011.6126266","DOI":"10.1109\/ICCV.2011.6126266"},{"key":"1618_CR30","doi-asserted-by":"publisher","unstructured":"Hinterstoisser, S., Lepetit, V., Ilic, S., et\u00a0al.: Model based training, detection and pose estimation of texture-less 3D objects in heavily cluttered scenes. In: computer Vision \u2013 ACCV 2012, (2013). https:\/\/doi.org\/10.1007\/978-3-642-37331-2_42","DOI":"10.1007\/978-3-642-37331-2_42"},{"key":"1618_CR31","unstructured":"IAM.: Universit\u00e4t Duisburg-Essen Taxiladekonzept f\u00fcr Elektrotaxis im \u00f6ffentlichen Raum. URL: talako.uni-due.de, last accessed: 14-01-2022 (2022)"},{"key":"1618_CR32","doi-asserted-by":"publisher","unstructured":"Jiao, Y., Liu, L., Fu, B., et\u00a0al.: Robust localization for planar moving robot in changing environment: A perspective on density of correspondence and depth. In: 2021 IEEE international conference on robotics and automation (ICRA), (2021). https:\/\/doi.org\/10.1109\/ICRA48506.2021.9561539","DOI":"10.1109\/ICRA48506.2021.9561539"},{"key":"1618_CR33","doi-asserted-by":"publisher","unstructured":"Kaskman, R., Zakharov, S., Shugurov, I., et\u00a0al.: HomebrewedDB: RGB-D dataset for 6D pose estimation of 3D objects. In: 2019 IEEE\/CVF international conference on computer vision workshop (ICCVW), (2019). https:\/\/doi.org\/10.1109\/ICCVW.2019.00338","DOI":"10.1109\/ICCVW.2019.00338"},{"key":"1618_CR34","doi-asserted-by":"publisher","first-page":"295","DOI":"10.17703\/IJACT.2021.9.4.295","volume":"9","author":"IS Kim","year":"2021","unstructured":"Kim, I.S., Jung, T.W., Jung, K.D.: Augmented reality service based on object pose prediction using pnp algorithm. Int. J. Adv. Culture Technol. 9, 295 (2021). https:\/\/doi.org\/10.17703\/IJACT.2021.9.4.295","journal-title":"Int. J. Adv. Culture Technol."},{"key":"1618_CR35","doi-asserted-by":"publisher","DOI":"10.1109\/JSYST.2020.3019296","author":"ST Kim","year":"2021","unstructured":"Kim, S.T., Fan, M., Jung, S.W., et al.: External vehicle positioning system using multiple fish-eye surveillance cameras for indoor parking lots. IEEE Syst. J. (2021). https:\/\/doi.org\/10.1109\/JSYST.2020.3019296","journal-title":"IEEE Syst. J."},{"key":"1618_CR36","doi-asserted-by":"publisher","unstructured":"Kirsanov, P., Gaskarov, A., Konokhov, F., et\u00a0al.: DISCOMAN: Dataset of indoor sCenes for odometry, mapping and navigation. In: 2019 IEEE\/RSJ international conference on intelligent robots and systems (IROS), (2019). https:\/\/doi.org\/10.1109\/IROS40897.2019.8967921","DOI":"10.1109\/IROS40897.2019.8967921"},{"key":"1618_CR37","doi-asserted-by":"publisher","unstructured":"Kneip, L., Scaramuzza, D., Siegwart, R .: A novel parametrization of the perspective-three-point problem for a direct computation of absolute camera position and orientation. In: CVPR 201(2011). https:\/\/doi.org\/10.1109\/CVPR.2011.5995464","DOI":"10.1109\/CVPR.2011.5995464"},{"key":"1618_CR38","doi-asserted-by":"publisher","unstructured":"Kneip, L., Li, H., Seo, Y.: Upnp: An optimal o(n) solution to the absolute pose problem with universal applicability. In: Computer Vision \u2013 ECCV 2014, (2014). https:\/\/doi.org\/10.1007\/978-3-319-10590-1_9","DOI":"10.1007\/978-3-319-10590-1_9"},{"key":"1618_CR39","doi-asserted-by":"publisher","DOI":"10.1007\/s12591-024-00688-9","author":"K Kumar","year":"2024","unstructured":"Kumar, K., Kostina, E.: Optimal parameter estimation techniques for complex nonlinear systems. Differ. Equ. Dyn. Syst. (2024). https:\/\/doi.org\/10.1007\/s12591-024-00688-9","journal-title":"Differ. Equ. Dyn. Syst."},{"key":"1618_CR40","doi-asserted-by":"publisher","unstructured":"Lee, S., Moon, YK.: Camera pose estimation using voxel-based features for autonomous vehicle localization tracking. In: 2022 37th international technical conference on circuits\/systems, computers and communications (ITC-CSCC), (2022). https:\/\/doi.org\/10.1109\/ITC-CSCC55581.2022.9895071","DOI":"10.1109\/ITC-CSCC55581.2022.9895071"},{"key":"1618_CR41","doi-asserted-by":"publisher","unstructured":"Lee, TE., Tremblay, J., To, T., et\u00a0al.: Camera-to-robot pose estimation from a single image. In: 2020 IEEE international conference on robotics and automation (ICRA), (2020).https:\/\/doi.org\/10.1109\/ICRA40945.2020.9196596","DOI":"10.1109\/ICRA40945.2020.9196596"},{"key":"1618_CR42","doi-asserted-by":"publisher","unstructured":"Lepetit, V., Fua, P.: Monocular model-based 3D tracking of rigid objects: a survey. Foundations and trends\u00c2\u00ae in computer graphics and visi (2005).https:\/\/doi.org\/10.1561\/0600000001","DOI":"10.1561\/0600000001"},{"key":"1618_CR43","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-008-0152-6","author":"V Lepetit","year":"2009","unstructured":"Lepetit, V., Moreno-Noguer, F., Fua, P.: Epnp: an accurate o(n) solution to the pnp problem. Int. J. Comput. Vision (2009). https:\/\/doi.org\/10.1007\/s11263-008-0152-6","journal-title":"Int. J. Comput. Vision"},{"key":"1618_CR44","doi-asserted-by":"publisher","first-page":"164","DOI":"10.1090\/qam\/10666","volume":"2","author":"K Levenberg","year":"1944","unstructured":"Levenberg, K.: A method for the solution of certain non-linear problems in least squares. Quart. Appl. Math. 2, 164 (1944)","journal-title":"Quart. Appl. Math."},{"key":"1618_CR45","doi-asserted-by":"publisher","DOI":"10.1007\/s101070100249","author":"C Li","year":"2002","unstructured":"Li, C., Wang, X.: On convergence of the gauss-newton method for convex composite optimization. Math. Progr. (2002). https:\/\/doi.org\/10.1007\/s101070100249","journal-title":"Math. Progr."},{"key":"1618_CR46","doi-asserted-by":"publisher","unstructured":"Li, X., Ma, T., Hou, Y., et\u00a0al.: LoGoNet: Towards accurate 3D object detection with local-to-global cross-modal fusion. In: 2023 IEEE\/CVF conference on computer vision and pattern recognition (CVPR), (2023).https:\/\/doi.org\/10.1109\/CVPR52729.2023.01681","DOI":"10.1109\/CVPR52729.2023.01681"},{"key":"1618_CR47","doi-asserted-by":"publisher","unstructured":"Lin, Y., Tremblay, J., Tyree, S., et\u00a0al.: Multi-view fusion for multi-level robotic scene understanding. In: 2021 IEEE\/RSJ International conference on intelligent robots and systems (IROS), (2021).https:\/\/doi.org\/10.1109\/IROS51168.2021.9635994","DOI":"10.1109\/IROS51168.2021.9635994"},{"key":"1618_CR48","doi-asserted-by":"publisher","unstructured":"Liu, J., He, S.: 6d object pose estimation without pnp. CoRR (2019). https:\/\/doi.org\/10.48550\/arXiv.1902.01728","DOI":"10.48550\/arXiv.1902.01728"},{"key":"1618_CR49","doi-asserted-by":"publisher","unstructured":"Liu, X., Iwase, S., Kitani, KM.: Stereobj-1m: Large-scale stereo image dataset for 6d object pose estimation. In: 2021 IEEE\/CVF International conference on computer vision (ICCV), (2021). https:\/\/doi.org\/10.1109\/ICCV48922.2021.01069","DOI":"10.1109\/ICCV48922.2021.01069"},{"key":"1618_CR50","doi-asserted-by":"publisher","DOI":"10.1088\/1742-6596\/1087\/5\/052009","author":"XX Lu","year":"2018","unstructured":"Lu, X.X.: A review of solutions for perspective-n-point problem in camera pose estimation. J. Phys: Conf. Ser. (2018). https:\/\/doi.org\/10.1088\/1742-6596\/1087\/5\/052009","journal-title":"J. Phys: Conf. Ser."},{"key":"1618_CR51","doi-asserted-by":"publisher","DOI":"10.1177\/0278364916679498","author":"W Maddern","year":"2017","unstructured":"Maddern, W., Pascoe, G., Linegar, C., et al.: 1 year, 1000 km: The Oxford RobotCar dataset. Int. J. Robot. Res. (2017). https:\/\/doi.org\/10.1177\/0278364916679498","journal-title":"Int. J. Robot. Res."},{"key":"1618_CR52","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2015.2513408","author":"E Marchand","year":"2016","unstructured":"Marchand, E., Uchiyama, H., Spindler, F.: Pose estimation for augmented reality: a hands-on survey. IEEE Trans. Vis. Comput. Graph. (2016). https:\/\/doi.org\/10.1109\/TVCG.2015.2513408","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"1618_CR53","doi-asserted-by":"publisher","DOI":"10.1137\/0111030","author":"DW Marquardt","year":"1963","unstructured":"Marquardt, D.W.: An algorithm for least-squares estimation of nonlinear parameters. J. Soc. Ind. Appl. Math. (1963). https:\/\/doi.org\/10.1137\/0111030","journal-title":"J. Soc. Ind. Appl. Math."},{"key":"1618_CR54","unstructured":"Martull, S., Peris, M., Fukui, K.: Realistic CG stereo image dataset with ground truth disparity maps. technical report of IEICE PRMU (2012)"},{"key":"1618_CR55","doi-asserted-by":"publisher","unstructured":"Mousavian, A., Anguelov, D., Flynn, J., et\u00a0al.: 3D Bounding box estimation using deep learning and geometry. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR), (2017). https:\/\/doi.org\/10.1109\/CVPR.2017.597","DOI":"10.1109\/CVPR.2017.597"},{"key":"1618_CR56","doi-asserted-by":"publisher","DOI":"10.1093\/comjnl\/7.4.308","author":"JA Nelder","year":"1965","unstructured":"Nelder, J.A., Mead, R.: A simplex method for function minimization. Comput. J. (1965). https:\/\/doi.org\/10.1093\/comjnl\/7.4.308","journal-title":"Comput. J."},{"key":"1618_CR57","unstructured":"OpenCV team: OpenCV - Open computer vision library. https:\/\/opencv.org\/, last accessed: 11-09-2024 (2024)"},{"key":"1618_CR58","doi-asserted-by":"publisher","DOI":"10.1017\/S0263574700003143","author":"D Ort\u00edn","year":"2001","unstructured":"Ort\u00edn, D., Montiel, J.M.M.: Indoor robot motion based on monocular images. Roboti (2001). https:\/\/doi.org\/10.1017\/S0263574700003143","journal-title":"Roboti"},{"key":"1618_CR59","doi-asserted-by":"publisher","unstructured":"Pan, S., Wang, X.: A survey on perspective-n-point problem. In: 2021 40th Chinese control conference (CCC), (2021).https:\/\/doi.org\/10.23919\/CCC52363.2021.9549863","DOI":"10.23919\/CCC52363.2021.9549863"},{"key":"1618_CR60","doi-asserted-by":"publisher","unstructured":"Parameshwara, CM., Hari, G., Ferm\u00fcller, C., et\u00a0al.: DiffPoseNet: Direct differentiable camera pose estimation. In: 2022 IEEE\/CVF conference on computer vision and pattern recognition (CVPR), (2022).https:\/\/doi.org\/10.1109\/CVPR52688.2022.00672","DOI":"10.1109\/CVPR52688.2022.00672"},{"key":"1618_CR61","doi-asserted-by":"publisher","unstructured":"Peng, S., Liu, Y., Huang, Q., et\u00a0al.: Pvnet: Pixel-wise voting network for 6dof pose estimation. In: 2019 IEEE\/CVF Conference on computer vision and pattern recognition (CVPR), (2019). https:\/\/doi.org\/10.1109\/CVPR.2019.00469","DOI":"10.1109\/CVPR.2019.00469"},{"key":"1618_CR62","doi-asserted-by":"publisher","unstructured":"Persson, M., Nordberg, K.: Lambda twist: An accurate fast robust perspective three point (p3p) solver. In: Computer Vision - ECCV 2018, (2018).https:\/\/doi.org\/10.1007\/978-3-030-01225-0_20","DOI":"10.1007\/978-3-030-01225-0_20"},{"key":"1618_CR63","doi-asserted-by":"publisher","DOI":"10.1162\/EVCO_a_00087","author":"P Po\u0161\u00edk","year":"2012","unstructured":"Po\u0161\u00edk, P., Huyer, W.: Restarted local search algorithms for continuous black box optimization. Evol. Comput. (2012). https:\/\/doi.org\/10.1162\/EVCO_a_00087","journal-title":"Evol. Comput."},{"key":"1618_CR64","doi-asserted-by":"publisher","unstructured":"Qingxuan, J., Ping, Z., Hanxu, S.: The study of positioning with high-precision by single camera based on p3p algorithm. In: 2006 4th IEEE International conference on industrial informati (2006). https:\/\/doi.org\/10.1109\/INDIN.2006.275618","DOI":"10.1109\/INDIN.2006.275618"},{"key":"1618_CR65","doi-asserted-by":"publisher","unstructured":"Roch, P., Shahbaz\u00a0Nejad, B., Handte, M., et\u00a0al.: Car pose estimation through wheel detection. In: Advances in visual computing, (2021). https:\/\/doi.org\/10.1007\/978-3-030-90439-5_21","DOI":"10.1007\/978-3-030-90439-5_21"},{"key":"1618_CR66","doi-asserted-by":"publisher","unstructured":"Roch, P., Shahbaz\u00a0Nejad, B., Handte, M., et\u00a0al. Optimizing PnP-algorithms for\u00c2 limited point correspondences using spatial constraints. In: Advances in visual computin (2023).https:\/\/doi.org\/10.1007\/978-3-031-47966-3_17","DOI":"10.1007\/978-3-031-47966-3_17"},{"key":"1618_CR67","doi-asserted-by":"publisher","unstructured":"Rublee, E., Rabaud, V., Konolige, K., et\u00a0al.: Orb: An efficient alternative to sift or surf. In: 2011 International conference on computer visio (2011). https:\/\/doi.org\/10.1109\/ICCV.2011.6126544","DOI":"10.1109\/ICCV.2011.6126544"},{"key":"1618_CR68","doi-asserted-by":"publisher","DOI":"10.1177\/0278364917695640","author":"J Ruiz-Sarmiento","year":"2017","unstructured":"Ruiz-Sarmiento, J., Galindo, C., Gonzalez-Jimenez, J.: Robot@Home, a robotic dataset for semantic mapping of home environments. Int. J. Robot. Res. (2017). https:\/\/doi.org\/10.1177\/0278364917695640","journal-title":"Int. J. Robot. Res."},{"key":"1618_CR69","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-011-0441-3","author":"D Scaramuzza","year":"2011","unstructured":"Scaramuzza, D.: 1-Point-RANSAC Structure from Motion for Vehicle-Mounted Cameras by Exploiting Non-holonomic Constraints. Int. J. Comput. Visi (2011). https:\/\/doi.org\/10.1007\/s11263-011-0441-3","journal-title":"Int. J. Comput. Visi"},{"key":"1618_CR70","doi-asserted-by":"publisher","unstructured":"Schwarz, M., Schulz, H., Behnke, S.: Rgb-d object recognition and pose estimation based on pre-trained convolutional neural network features. In: 2015 IEEE International conference on robotics and automation (ICRA), (2015). https:\/\/doi.org\/10.1109\/ICRA.2015.7139363","DOI":"10.1109\/ICRA.2015.7139363"},{"key":"1618_CR71","doi-asserted-by":"crossref","unstructured":"Schweighofer, G., Pinz, A.: Globally optimal o(n) solution to the pnp problem for general camera models. In: British machine vision conference (2008)","DOI":"10.5244\/C.22.55"},{"key":"1618_CR72","doi-asserted-by":"publisher","DOI":"10.1137\/20M1373190","author":"HJM Shi","year":"2022","unstructured":"Shi, H.J.M., Xie, Y., Byrd, R., et al.: A noise-tolerant quasi-Newton algorithm for unconstrained optimization. SIAM J. Opt. (2022). https:\/\/doi.org\/10.1137\/20M1373190","journal-title":"SIAM J. Opt."},{"key":"1618_CR73","doi-asserted-by":"publisher","unstructured":"Shi, X., Li, D., Zhao, P., et\u00a0al.: Are we ready for service robots? The openLORIS-scene datasets for lifelong SLAM. In: 2020 IEEE International conference on robotics and automation (ICRA), (2020). https:\/\/doi.org\/10.1109\/ICRA40945.2020.9196638","DOI":"10.1109\/ICRA40945.2020.9196638"},{"key":"1618_CR74","doi-asserted-by":"publisher","unstructured":"Sihombing, DP., Nugroho, HA., Wibirama, S.: Perspective rectification in vehicle number plate recognition using 2D-2D transformation of Planar Homography. In: 2015 International conference on science in information technology (ICSITech),(2015). https:\/\/doi.org\/10.1109\/ICSITech.2015.7407810","DOI":"10.1109\/ICSITech.2015.7407810"},{"key":"1618_CR75","doi-asserted-by":"publisher","unstructured":"Sturm, J., Engelhard, N., Endres, F., et\u00a0al.: A benchmark for the evaluation of RGB-D SLAM systems. In: 2012 IEEE\/RSJ International conference on intelligent robots and systems (2012). https:\/\/doi.org\/10.1109\/IROS.2012.6385773","DOI":"10.1109\/IROS.2012.6385773"},{"key":"1618_CR76","doi-asserted-by":"publisher","unstructured":"Sweeney, C., Flynn, J., Nuernberger, B., et\u00a0al.: Efficient computation of absolute pose for gravity-aware augmented reality. In: 2015 IEEE international symposium on mixed and augmented reality, (2015). https:\/\/doi.org\/10.1109\/ISMAR.2015.20","DOI":"10.1109\/ISMAR.2015.20"},{"key":"1618_CR77","doi-asserted-by":"publisher","unstructured":"Terzakis, G., Lourakis, M.: A consistently fast and globally optimal solution to the perspective-n-point problem. In: Computer vision \u2013 ECCV 2020, (2020). https:\/\/doi.org\/10.1007\/978-3-030-58452-8_28","DOI":"10.1007\/978-3-030-58452-8_28"},{"key":"1618_CR78","unstructured":"The apache software foundation: Math - commons math: the apache commons mathematics library. https:\/\/commons.apache.org\/proper\/commons-math\/, last accessed: 11-09-2024 (2024)"},{"key":"1618_CR79","doi-asserted-by":"publisher","unstructured":"Tremblay, J., To, T., Sundaralingam, B., et\u00a0al.: Deep object pose estimation for semantic robotic grasping of household objects. CoRR (2018).https:\/\/doi.org\/10.48550\/arXiv.1809.10790","DOI":"10.48550\/arXiv.1809.10790"},{"key":"1618_CR80","doi-asserted-by":"publisher","DOI":"10.5194\/isprs-annals-iii-3-131-2016","author":"S Urban","year":"2016","unstructured":"Urban, S., Leitloff, J., Hinz, S.: MLPnP - A real-time maximum likelihood solution to the perspective-n-point problem. ISPRS Annal. Photogramm. Remote Sens. Spat. Inf. Sci. (2016). https:\/\/doi.org\/10.5194\/isprs-annals-iii-3-131-2016","journal-title":"ISPRS Annal. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"1618_CR81","doi-asserted-by":"publisher","DOI":"10.3390\/app112110255","author":"BM Velichkovsky","year":"2021","unstructured":"Velichkovsky, B.M., Kotov, A., Arinkin, N., et al.: From social gaze to indirect speech constructions: How to induce the impression that your companion robot is a conscious creature. Appl. Sci. (2021). https:\/\/doi.org\/10.3390\/app112110255","journal-title":"Appl. Sci."},{"key":"1618_CR82","doi-asserted-by":"publisher","unstructured":"Wang, Z., Yang, X.: V-head: Face detection and alignment for facial augmented reality applications. In: MultiMedia Modeling, (2017). https:\/\/doi.org\/10.1007\/978-3-319-51814-5_40","DOI":"10.1007\/978-3-319-51814-5_40"},{"key":"1618_CR83","doi-asserted-by":"publisher","unstructured":"Wu, H., Wen, C., Shi, S., et\u00a0al.: Virtual Sparse Convolution for Multimodal 3D Object Detection. In: 2023 IEEE\/CVF Conference on computer vision and pattern recognition (CVPR) (2023).https:\/\/doi.org\/10.1109\/CVPR52729.2023.02074","DOI":"10.1109\/CVPR52729.2023.02074"},{"key":"1618_CR84","doi-asserted-by":"publisher","DOI":"10.3934\/naco.2020050","author":"M Xi","year":"2020","unstructured":"Xi, M., Sun, W., Chen, J.: Survey of derivative-free optimization. Numeri. Algebr, Control Opt. (2020). https:\/\/doi.org\/10.3934\/naco.2020050","journal-title":"Numeri. Algebr, Control Opt."},{"key":"1618_CR85","doi-asserted-by":"publisher","unstructured":"Xiang, Y., Schmidt, T., Narayanan, V., et\u00a0al.: Posecnn: A convolutional neural network for 6d object pose estimation in cluttered scenes. CoRR (2017). https:\/\/doi.org\/10.48550\/arXiv.1711.00199","DOI":"10.48550\/arXiv.1711.00199"},{"key":"1618_CR86","doi-asserted-by":"publisher","DOI":"10.1137\/19M1240794","author":"Y Xie","year":"2020","unstructured":"Xie, Y., Byrd, R.H., Nocedal, J.: Analysis of the bfgs method with errors. SIAM J. Opt. (2020). https:\/\/doi.org\/10.1137\/19M1240794","journal-title":"SIAM J. Opt."},{"key":"1618_CR87","doi-asserted-by":"publisher","unstructured":"Yang, SJ., Ho, CC., Chen, JY., et\u00a0al.: Practical Homography-based perspective correction method for License Plate Recognition. In: 2012 International conference on information security and intelligent control (2012). https:\/\/doi.org\/10.1109\/ISIC.2012.6449740","DOI":"10.1109\/ISIC.2012.6449740"},{"key":"1618_CR88","doi-asserted-by":"publisher","unstructured":"Zakharov, S., Shugurov, I., Ilic, S.: Dpod: 6d pose object detector and refiner. In: 2019 IEEE\/CVF International conference on computer vision (ICCV), (2019). https:\/\/doi.org\/10.1109\/ICCV.2019.00203","DOI":"10.1109\/ICCV.2019.00203"},{"key":"1618_CR89","doi-asserted-by":"publisher","unstructured":"Zhang, B., Zhang, Q., Wang, Y., et\u00a0al.: The method of solving the non-coplanar perspective-four-point (p4p) problem. In: Proceedings of the 33rd Chinese Control conference (2014).https:\/\/doi.org\/10.1109\/ChiCC.2014.6896771","DOI":"10.1109\/ChiCC.2014.6896771"},{"key":"1618_CR90","doi-asserted-by":"publisher","unstructured":"Zhou, G., Wang, H., Chen, J., et\u00a0al.: PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6D Pose Estimation. In: 2021 IEEE\/CVF international conference on computer vision (ICCV), (2021).https:\/\/doi.org\/10.1109\/ICCV48922.2021.00279","DOI":"10.1109\/ICCV48922.2021.00279"},{"key":"1618_CR91","doi-asserted-by":"publisher","unstructured":"Zhu, Y., Li, M., Yao, W., et\u00a0al.: A review of 6d object pose estimation. In: 2022 IEEE 10th joint international information technology and artificial intelligence conference (ITAIC),(2022) https:\/\/doi.org\/10.1109\/ITAIC54216.2022.9836663","DOI":"10.1109\/ITAIC54216.2022.9836663"}],"container-title":["Machine Vision and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00138-024-01618-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00138-024-01618-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00138-024-01618-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,8]],"date-time":"2024-11-08T03:07:11Z","timestamp":1731035231000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00138-024-01618-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,10,17]]},"references-count":91,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2024,11]]}},"alternative-id":["1618"],"URL":"https:\/\/doi.org\/10.1007\/s00138-024-01618-z","relation":{},"ISSN":["0932-8092","1432-1769"],"issn-type":[{"type":"print","value":"0932-8092"},{"type":"electronic","value":"1432-1769"}],"subject":[],"published":{"date-parts":[[2024,10,17]]},"assertion":[{"value":"14 March 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 September 2024","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 September 2024","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 October 2024","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"We declare that we have no Conflict of interest that influence the work reported in this article.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}],"article-number":"137"}}