{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T04:07:42Z","timestamp":1760242062196,"version":"build-2065373602"},"reference-count":37,"publisher":"MDPI AG","issue":"12","license":[{"start":{"date-parts":[[2018,12,15]],"date-time":"2018-12-15T00:00:00Z","timestamp":1544832000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"European Union, Infrastructure and Environment Operational Programme","award":["POIS.11.01.00-00.068\/14"],"award-info":[{"award-number":["POIS.11.01.00-00.068\/14"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IJGI"],"abstract":"<jats:p>Camera pose tracking is a fundamental task in Augmented Reality (AR) applications. In this paper, we present CATCHA, a method to achieve camera pose tracking in cultural heritage interiors with rigorous conservatory policies. Our solution is real-time model-based camera tracking according to textured point cloud, regardless of its registration technique. We achieve this solution using orthographic model rendering that allows us to achieve real-time performance, regardless of point cloud density. Our developed algorithm is used to create a novel tool to help both cultural heritage restorers and individual visitors visually compare the actual state of a culture heritage location with its previously scanned state from the same point of view in real time. The provided application can directly achieve a frame rate of over 15 Hz on VGA frames on a mobile device and over 40 Hz using remote processing. The performance of our approach is evaluated using a model of the King\u2019s Chinese Cabinet (Museum of King Jan III\u2019s Palace at Wilan\u00f3w, Warsaw, Poland) that was scanned in 2009 using the structured light technique and renovated and scanned again in 2015. Additional tests are performed on a model of the Al Fresco Cabinet in the same museum, scanned using a time-of-flight laser scanner.<\/jats:p>","DOI":"10.3390\/ijgi7120479","type":"journal-article","created":{"date-parts":[[2018,12,18]],"date-time":"2018-12-18T02:15:59Z","timestamp":1545099359000},"page":"479","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["CATCHA: Real-Time Camera Tracking Method for Augmented Reality Applications in Cultural Heritage Interiors"],"prefix":"10.3390","volume":"7","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9101-4608","authenticated-orcid":false,"given":"Piotr","family":"Sieka\u0144ski","sequence":"first","affiliation":[{"name":"Faculty of Mechatronics, Warsaw University of Technology, ul. \u015bw. Andrzeja Boboli 8, 02-525 Warszawa, Poland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7284-8463","authenticated-orcid":false,"given":"Jakub","family":"Micho\u0144ski","sequence":"additional","affiliation":[{"name":"Faculty of Mechatronics, Warsaw University of Technology, ul. \u015bw. Andrzeja Boboli 8, 02-525 Warszawa, Poland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Eryk","family":"Bunsch","sequence":"additional","affiliation":[{"name":"Museum of King Jan III\u2019s Palace, ul. Stanis\u0142awa Kostki Potockiego 10\/16, 02-958 Warszawa, Poland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8156-5462","authenticated-orcid":false,"given":"Robert","family":"Sitnik","sequence":"additional","affiliation":[{"name":"Faculty of Mechatronics, Warsaw University of Technology, ul. \u015bw. Andrzeja Boboli 8, 02-525 Warszawa, Poland"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2018,12,15]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3145534","article-title":"A Survey of Augmented, Virtual, and Mixed Reality for Cultural Heritage","volume":"11","author":"Bekele","year":"2018","journal-title":"J. Comput. Cult. Herit."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Sitnik, R., Bunsch, E., M\u0105czkowski, G., and Za\u0142uski, W. (2016). Towards automated, high resolution 3D scanning of large surfaces for cultural heritage documentation. IS&T\/SPIE Symposium on Electronic Imaging: Science &Technology, IS&T, Society for Imaging Science and Technology.","DOI":"10.2352\/ISSN.2470-1173.2016.21.3DIPM-051"},{"key":"ref_3","first-page":"178","article-title":"Laser Scanner Survey of An Archaeological Site: Scala Di Furno (Lecce, Italy)","volume":"38","author":"Costantino","year":"2010","journal-title":"Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1016\/j.culher.2017.11.006","article-title":"Recent trends in cultural heritage 3D survey: The photogrammetric computer vision approach","volume":"32","author":"Aicardi","year":"2018","journal-title":"J. Cult. Herit."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"2633","DOI":"10.1109\/TVCG.2015.2513408","article-title":"Pose estimation for augmented reality: A hands-on survey","volume":"22","author":"Marchand","year":"2016","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Pauwels, K., Rubio, L., Diaz, J., and Ros, E. (2013, January 23\u201328). Real-time model-based rigid object pose estimation and tracking combining dense and sparse visual cues. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Portland, OR, USA.","DOI":"10.1109\/CVPR.2013.304"},{"key":"ref_7","first-page":"55","article-title":"Visual simultaneous localization and mapping: A survey","volume":"43","year":"2012","journal-title":"Artif. Intell. Rev."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Zeisl, B., Sattler, T., and Pollefeys, M. (2015, January 7\u201313). Camera Pose Voting for Large-Scale Image-Based Localization. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.310"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Rubio, A., Villamizar, M., Ferraz, L., Ramisa, A., and Sanfeliu, A. (2015, January 26\u201330). Efficient Monocular Pose Estimation for Complex 3D Models. Proceedings of the 2015 IEEE International Conference on Robotics and Automation (ICRA), Seattle, WA, USA.","DOI":"10.1109\/ICRA.2015.7139372"},{"key":"ref_10","unstructured":"Yang, G., Becker, J., and Stewart, C.V. (2007, January 21\u201323). Estimating the location of a camera with respect to a 3D model. Proceedings of the 3DIM 2007 6th International Conference on 3-D Digital Imaging and Modeling, Montreal, QC, Canada."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Guan, W., You, S., and Pang, G. (2013, January 15\u201317). Estimation of camera pose with respect to terrestrial LiDAR data. Proceedings of the IEEE Workshop on Applications of Computer Vision, Tampa, FL, USA.","DOI":"10.1109\/WACV.2013.6475045"},{"key":"ref_12","unstructured":"Wu, C., Fraundorfer, F., Frahm, J.-M., and Pollefeys, M. (2008, January 23\u201328). 3D model search and pose estimation from single images using VIP features. Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, Anchorage, AK, USA."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive image features from scale-invariant keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"768","DOI":"10.1016\/j.cag.2011.04.007","article-title":"Real-time and accurate extrinsic camera parameter estimation using feature landmark database for augmented reality","volume":"35","author":"Taketomi","year":"2011","journal-title":"Comput. Graph."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Jaramillo, C., Dryanovski, I., Valenti, R.G., and Xiao, J. (2013, January 12\u201314). 6-DoF pose localization in 3D point-cloud dense maps using a monocular camera. Proceedings of the 2013 IEEE International Conference on Robotics and Biomimetics, ROBIO 2013, Shenzhen, China.","DOI":"10.1109\/ROBIO.2013.6739720"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Crombez, N., Caron, G., and Mouaddib, E.M. (2015, January 28\u201330). Using dense point clouds as environment model for visual localization of mobile robot. Proceedings of the 2015 12th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI), Goyang, Korea.","DOI":"10.1109\/URAI.2015.7358924"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Rambach, J., Pagani, A., Schneider, M., Artemenko, O., and Stricker, D. (2018). 6DoF Object Tracking based on 3D Scans for Augmented Reality Remote Live Support. Computers, 7.","DOI":"10.3390\/computers7010006"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Vanoni, D., Seracini, M., and Kuester, F. (2012, January 10\u201312). ARtifact: Tablet-Based Augmented Reality for Interactive Analysis of Cultural Artifacts. Proceedings of the 2012 IEEE International Symposium on Multimedia, Irvine, CA, USA.","DOI":"10.1109\/ISM.2012.17"},{"key":"ref_19","unstructured":"PTC Inc. (2018, October 03). Vuforia Library. Available online: https:\/\/www.vuforia.com\/."},{"key":"ref_20","first-page":"223","article-title":"Virtual restoration of deteriorated religious heritage objects using augmented reality technologies","volume":"9","author":"Butnariu","year":"2013","journal-title":"Eur. J. Sci. Theol."},{"key":"ref_21","unstructured":"(2018, October 03). The Fraunhofer Institute for Computer Graphics Research IGD Instant Player. Available online: http:\/\/www.instantreality.org\/."},{"key":"ref_22","first-page":"279","article-title":"3D visualization via augmented reality: The case of the middle stoa in the ancient agora of Athens","volume":"Volume 8740","author":"Verykokou","year":"2014","journal-title":"Lecture Notes in Computer Science, Proceedings of the Euro-Mediterranean Conference, Limassol, Cyprus, 3\u20138 November 2014"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"375","DOI":"10.5194\/isprsarchives-XL-5-W4-375-2015","article-title":"3D tracking based augmented reality for cultural heritage data management","volume":"40","author":"Battini","year":"2015","journal-title":"Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_24","unstructured":"(2018, October 03). Inglobe Technologies ARmedia 3D SDK. Available online: http:\/\/dev.inglobetechnologies.com\/index.php."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Mansouri, A., Nouboud, F., Chalifour, A., Mammass, D., Meunier, J., and Elmoataz, A. (2016). Color Correction in 3D Digital Documentation: Case Study. Image and Signal Processing, Springer International Publishing.","DOI":"10.1007\/978-3-319-33618-3"},{"key":"ref_26","unstructured":"CharLS (2018, October 03). CharLS Library. Available online: https:\/\/github.com\/team-charls\/charls."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Rublee, E., Rabaud, V., Konolige, K., and Bradski, G. (2011, January 6\u201313). ORB: An efficient alternative to SIFT or SURF. Proceedings of the IEEE International Conference on Computer Vision, Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126544"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Sibbing, D., Sattler, T., Leibe, B., and Kobbelt, L. (July, January 29). SIFT-realistic rendering. Proceedings of the 2013 International Conference on 3D Vision, 3DV 2013, Seattle, WA, USA.","DOI":"10.1109\/3DV.2013.16"},{"key":"ref_29","unstructured":"Nister, D., and Stewenius, H. (2006, January 17\u201322). Scalable recognition with a vocabulary tree. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York, NY, USA."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1007\/s11263-008-0152-6","article-title":"EPnP: An Accurate O(n) Solution to the PnP Problem","volume":"81","author":"Lepetit","year":"2008","journal-title":"Int. J. Comput. Vis."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"930","DOI":"10.1109\/TPAMI.2003.1217599","article-title":"Complete solution classification for the perspective-three-point problem","volume":"25","author":"Gao","year":"2003","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_32","first-page":"4521","article-title":"Seeing the past: An augmented reality application for visualization the previous state of cultural heritage locations","volume":"Volume 2018","author":"Bunsch","year":"2018","journal-title":"Electronic Imaging"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Bay, H., Tuytelaars, T., and Van Gool, L. (2006). SURF: Speeded Up Robust Features. Lecture Notes in Computer Science, European Conference on Computer Vision, Graz, Austria, 7\u201313 May 2006, Springer.","DOI":"10.1007\/11744023_32"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Irschara, A., Zach, C., Frahm, J.M., and Bischof, H. (2009, January 20\u201325). From structure-from-motion point clouds to fast location recognition. Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops 2009, Miami, FL, USA.","DOI":"10.1109\/CVPRW.2009.5206587"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"438","DOI":"10.1137\/080732730","article-title":"ASIFT: A New Framework for Fully Affine Invariant Image Comparison","volume":"2","author":"Morel","year":"2009","journal-title":"SIAM J. Imaging Sci."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"2280","DOI":"10.1016\/j.patcog.2014.01.005","article-title":"Automatic generation and detection of highly reliable fiducial markers under occlusion","volume":"47","year":"2014","journal-title":"Pattern Recognit."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Ferraz, L., Binefa, X., and Moreno-Noguer, F. (2014, January 23\u201328). Very fast solution to the PnP problem with algebraic outlier rejection. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.71"}],"container-title":["ISPRS International Journal of Geo-Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2220-9964\/7\/12\/479\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T15:34:12Z","timestamp":1760196852000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2220-9964\/7\/12\/479"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,12,15]]},"references-count":37,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2018,12]]}},"alternative-id":["ijgi7120479"],"URL":"https:\/\/doi.org\/10.3390\/ijgi7120479","relation":{},"ISSN":["2220-9964"],"issn-type":[{"type":"electronic","value":"2220-9964"}],"subject":[],"published":{"date-parts":[[2018,12,15]]}}}