{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,16]],"date-time":"2026-01-16T11:23:58Z","timestamp":1768562638228,"version":"3.49.0"},"reference-count":34,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2018,8,12]],"date-time":"2018-08-12T00:00:00Z","timestamp":1534032000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004663","name":"Ministry of Science and Technology, Taiwan","doi-asserted-by":"publisher","award":["MOST 106-2221-E-032-006"],"award-info":[{"award-number":["MOST 106-2221-E-032-006"]}],"id":[{"id":"10.13039\/501100004663","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>Three-Dimensional (3D) object pose estimation plays a crucial role in computer vision because it is an essential function in many practical applications. In this paper, we propose a real-time model-based object pose estimation algorithm, which integrates template matching and Perspective-n-Point (PnP) pose estimation methods to deal with this issue efficiently. The proposed method firstly extracts and matches keypoints of the scene image and the object reference image. Based on the matched keypoints, a two-dimensional (2D) planar transformation between the reference image and the detected object can be formulated by a homography matrix, which can initialize a template tracking algorithm efficiently. Based on the template tracking result, the correspondence between image features and control points of the Computer-Aided Design (CAD) model of the object can be determined efficiently, thus leading to a fast 3D pose tracking result. Finally, the 3D pose of the object with respect to the camera is estimated by a PnP solver based on the tracked 2D-3D correspondences, which improves the accuracy of the pose estimation. Experimental results show that the proposed method not only achieves real-time performance in tracking multiple objects, but also provides accurate pose estimation results. These advantages make the proposed method suitable for many practical applications, such as augmented reality.<\/jats:p>","DOI":"10.3390\/a11080122","type":"journal-article","created":{"date-parts":[[2018,8,13]],"date-time":"2018-08-13T11:27:13Z","timestamp":1534159633000},"page":"122","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Efficient Model-Based Object Pose Estimation Based on Multi-Template Tracking and PnP Algorithms"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9872-4338","authenticated-orcid":false,"given":"Chi-Yi","family":"Tsai","sequence":"first","affiliation":[{"name":"Department of Electrical and Computer Engineering, Tamkang University, 151 Ying-chuan Road, Danshui District, New Taipei City 251, Taiwan"}]},{"given":"Kuang-Jui","family":"Hsu","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, Tamkang University, 151 Ying-chuan Road, Danshui District, New Taipei City 251, Taiwan"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2026-5666","authenticated-orcid":false,"given":"Humaira","family":"Nisar","sequence":"additional","affiliation":[{"name":"Department of Electronic Engineering, Universiti Tunku Abdul Rahman, Jalan Universiti, Bandar Barat, 31900 Kampar, Perak, Malaysia"}]}],"member":"1968","published-online":{"date-parts":[[2018,8,12]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive image features from scale-invariant keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1016\/j.cviu.2007.09.014","article-title":"Speeded-up robust features (SURF)","volume":"110","author":"Bay","year":"2008","journal-title":"Compu. Vis. Image Underst."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1281","DOI":"10.1109\/TPAMI.2011.222","article-title":"BRIEF: Computing a local binary descriptor very fast","volume":"34","author":"Calonder","year":"2012","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Rublee, E., Rabaud, V., Konolige, K., and Bradski, G. (2011, January 6\u201313). ORB: An efficient alternative to SIFT and SURF. Proceedings of the IEEE International Conference on Computer Vision, Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126544"},{"key":"ref_5","unstructured":"Lepetit, V., Lagger, P., and Fua, P. (2005, January 20\u201325). Randomized trees for real-time keypoint recognition. Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, San Diego, CA, USA."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1465","DOI":"10.1109\/TPAMI.2006.188","article-title":"Keypoint recognition using randomized trees","volume":"28","author":"Lepetit","year":"2006","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_7","unstructured":"Decker, P., and Paulus, D. (2010, January 8\u201312). Model based pose estimation using SURF. Proceedings of the 10th Asian Conference on Computer Vision, Queenstown, New Zealand."},{"key":"ref_8","unstructured":"Choi, C., and Christensen, H.I. (2010, January 3\u20137). Real-time 3D model-based tracking using edge and keypoint features for robotic manipulation. Proceedings of the IEEE International Conference on Robotics and Automation, Anchorage, AK, USA."},{"key":"ref_9","unstructured":"Harris, C. (1992). Tracking with Rigid Objects, MIT Press."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"475","DOI":"10.1049\/iet-cvi.2015.0099","article-title":"Probabilistic approach for maximum likelihood estimation of pose using lines","volume":"10","author":"Zhang","year":"2016","journal-title":"IET Comput. Vis."},{"key":"ref_11","first-page":"563","article-title":"3D object pose tracking using a novel model-based contour fitting algorithm","volume":"10","author":"Tsai","year":"2016","journal-title":"ICIC Express Lett."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"28859","DOI":"10.1109\/ACCESS.2018.2808225","article-title":"Simultaneous 3D object recognition and pose estimation based on RGB-D images","volume":"6","author":"Tsai","year":"2018","journal-title":"IEEE Access."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1330","DOI":"10.1109\/34.888718","article-title":"A flexible new technique for camera calibration","volume":"22","author":"Zhang","year":"2000","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_14","unstructured":"Sturm, P. (2000, January 15). Algorithms for plane-based pose estimation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Hilton Head Island, SC, USA."},{"key":"ref_15","unstructured":"Collins, T., Durou, J.-D., Gurdjos, P., and Bartoli, A. (2010, January 17\u201320). Single-view perspective shape-from-texture with focal length estimation: A piecewise affine approach. Proceedings of the 5th International Symposium on 3D Data Processing, Visualization, and Transmission, Paris, France."},{"key":"ref_16","unstructured":"Simon, G., Fitzgibbon, A.W., and Zisserman, A. (2000, January 5\u20136). Markerless tracking using planar structures in the scene. Proceedings of the IEEE and ACM International Symposium on Augmented Reality, Munich, Germany."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"1025","DOI":"10.1109\/34.722606","article-title":"Efficient region tracking with parametric models of geometry and illumination","volume":"20","author":"Hager","year":"1998","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_18","unstructured":"Buenaposada, J.M., and Baumela, L. (2002, January 11\u201315). Real-time tracking and estimation of plane pose. Proceedings of the IEEE International Conference on Pattern Recognition, Quebec, QC, Canada."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"996","DOI":"10.1109\/TPAMI.2002.1017625","article-title":"Hyperplane approximation for template matching","volume":"24","author":"Jurie","year":"2002","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Zheng, Q., Wang, Y., and Heng, P.A. (2017, January 22\u201329). Online robust image alignment via subspace learning from gradient orientations. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.195"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Tjaden, H., Schwanecke, U., and Sch\u00f6mer, E. (2017, January 22\u201329). Real-time monocular pose estimation of 3D objects using temporally consistent local color histograms. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.23"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Chang, C.-H., Chou, C.-N., and Chang, E.Y. (2017, January 21\u201326). CLKN: Cascaded Lucas-Kanade networks for image alignment. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.402"},{"key":"ref_23","unstructured":"Br\u00e4unl, T., McCane, B., Rivera, M., and Yu, X. (2015, January 25\u201327). 6-DOF direct homography tracking with extended Kalman filter. Proceedings of the 7th Pacific-Rim Symposium, PSIVT 2015, Auckland, New Zealand. Lecture Notes in Computer Science."},{"key":"ref_24","unstructured":"(2018, August 11). A GPU Implementation of David Lowe\u2019s Scale Invariant Feature Transform. Available online: https:\/\/github.com\/pitzer\/SiftGPU."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"212","DOI":"10.1049\/iet-cvi.2015.0137","article-title":"Graphics processing unit-accelerated multi-resolution exhaustive search algorithm for real-time keypoint descriptor matching in high-dimensional spaces","volume":"10","author":"Tsai","year":"2016","journal-title":"IET Comput. Vis."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1145\/358669.358692","article-title":"Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography","volume":"24","author":"Fischler","year":"1981","journal-title":"Commun. ACM"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Hartley, R., and Zisserman, A. (2004). Multiple View Geometry in Computer Vision, Cambridge University Press. [2nd ed.].","DOI":"10.1017\/CBO9780511811685"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1007\/s11263-014-0729-1","article-title":"Efficient learning of linear predictors for template tracking","volume":"111","author":"Holzer","year":"2015","journal-title":"Int. J. Comput. Vis."},{"key":"ref_29","unstructured":"Crivellaro, A., Fau, P., and Lepetit, V. (2018, August 11). Dense Methods for Image Alignment with an Application to 3D Tracking. Available online: https:\/\/infoscience.epfl.ch\/record\/197866\/files\/denseMethods2014.pdf."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1007\/s11263-014-0725-5","article-title":"Infinitesimal plane-based pose estimation","volume":"109","author":"Collins","year":"2014","journal-title":"Int. J. Comput. Vis."},{"key":"ref_31","unstructured":"(2018, August 11). NVIDIA\u2019s Tesla C2050 Board Specification. Available online: http:\/\/www.nvidia.com\/docs\/io\/43395\/tesla_c2050_board_specification.pdf."},{"key":"ref_32","unstructured":"(2018, August 11). Experimental Result of 3d Pose Tracking of Multiple Objects. Available online: https:\/\/www.youtube.com\/watch?v=wBEHEQMFer4."},{"key":"ref_33","unstructured":"(2018, August 11). Experimental Result of 3d Pose Tracking of a Single Object. Available online: https:\/\/www.youtube.com\/watch?v=-uVUbBIPAH4."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, faster, stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/11\/8\/122\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T15:18:16Z","timestamp":1760195896000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/11\/8\/122"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,8,12]]},"references-count":34,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2018,8]]}},"alternative-id":["a11080122"],"URL":"https:\/\/doi.org\/10.3390\/a11080122","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,8,12]]}}}