{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,22]],"date-time":"2026-02-22T23:11:22Z","timestamp":1771801882743,"version":"3.50.1"},"reference-count":31,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2021,11,15]],"date-time":"2021-11-15T00:00:00Z","timestamp":1636934400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,11,15]],"date-time":"2021-11-15T00:00:00Z","timestamp":1636934400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100010418","name":"Institute for Information and Communications Technology Promotion","doi-asserted-by":"publisher","award":["IITP2020000103001"],"award-info":[{"award-number":["IITP2020000103001"]}],"id":[{"id":"10.13039\/501100010418","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002460","name":"Chung-Ang University","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100002460","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Supercomput"],"published-print":{"date-parts":[[2022,4]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>As augmented reality technologies develop, real-time interactions between objects present in the real world and virtual space are required. Generally, recognition and location estimation in augmented reality are carried out using tracking techniques, typically markers. However, using markers creates spatial constraints in simultaneous tracking of space and objects. Therefore, we propose a system that enables camera tracking in the real world and visualizes virtual visual information through the recognition and positioning of objects. We scanned the space using an RGB-D camera. A three-dimensional (3D) dense point cloud map is created using point clouds generated through video images. Among the generated point cloud information, objects are detected and retrieved based on the pre-learned data. Finally, using the predicted pose of the detected objects, other information may be augmented. Our system estimates object recognition and 3D pose based on simple camera information, enabling the viewing of virtual visual information based on object location.<\/jats:p>","DOI":"10.1007\/s11227-021-04161-0","type":"journal-article","created":{"date-parts":[[2021,11,15]],"date-time":"2021-11-15T12:02:43Z","timestamp":1636977763000},"page":"7509-7528","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["A study on recognizing multi-real world object and estimating 3D position in augmented reality"],"prefix":"10.1007","volume":"78","author":[{"given":"Taemin","family":"Lee","sequence":"first","affiliation":[]},{"given":"Changhun","family":"Jung","sequence":"additional","affiliation":[]},{"given":"Kyungtaek","family":"Lee","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4824-3517","authenticated-orcid":false,"given":"Sanghyun","family":"Seo","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,11,15]]},"reference":[{"key":"4161_CR1","doi-asserted-by":"crossref","unstructured":"Azuma RT (1997) A survey of augmented reality. Presence Teleoper Virt Environ 6(4):355\u2013385","DOI":"10.1162\/pres.1997.6.4.355"},{"key":"4161_CR2","doi-asserted-by":"crossref","unstructured":"Coltekin A, Lochhead I, Madden M, Christophe S, Devaux A, Pettit C, Kubicek P (2020) Extended reality in spatial sciences: a review of research challenges and future directions. ISPRS Int J Geo-Inform 9(7):439","DOI":"10.3390\/ijgi9070439"},{"key":"4161_CR3","doi-asserted-by":"publisher","unstructured":"Keisuke T, Itaru K, Yuichi O (2007) Nested marker for augmented reality. In: 2007 IEEE Virtual Reality Conference. https:\/\/doi.org\/10.1109\/VR.2007.352495","DOI":"10.1109\/VR.2007.352495"},{"issue":"5","key":"4161_CR4","first-page":"441","volume":"2","author":"K Anuroop","year":"2015","unstructured":"Anuroop K, Karan K, Chetan G (2015) Marker based augmented reality. Adv Comput Sci Inform Technol 2(5):441\u2013445","journal-title":"Adv Comput Sci Inform Technol"},{"issue":"5","key":"4161_CR5","doi-asserted-by":"publisher","first-page":"1255","DOI":"10.1109\/TRO.2017.2705103","volume":"33","author":"R Mur-Artal","year":"2017","unstructured":"Mur-Artal R, Tardos JD (2017) Orb-slam2: an open-source slam system for monocular, stereo, and rgb-d cameras. IEEE Trans Robot 33(5):1255\u20131262","journal-title":"IEEE Trans Robot"},{"key":"4161_CR6","unstructured":"Ze Y, Liwie W (2019) Learning relationships for multi-view 3D object recognition. In: Proceedings of the IEEE\/CVF International Conference on Computer Vision, pp 7505\u20137514"},{"key":"4161_CR7","doi-asserted-by":"publisher","first-page":"104458","DOI":"10.1016\/j.conengprac.2020.104458","volume":"104458","author":"W Ning","year":"2020","unstructured":"Ning W, Yuanyuan W, Meng J (2020) Review on deep learning techniques for marine object recognition: architectures and algorithms. Control Eng Pract 104458:104458. https:\/\/doi.org\/10.1016\/j.conengprac.2020.104458","journal-title":"Control Eng Pract"},{"key":"4161_CR8","unstructured":"Jingru T, Cahngbao W, Buyu L, Quanguan L, Wanli O, Changqing Y, Junjie Y (2020) Equalization loss for long-tailed object recognition. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 11662\u201311671"},{"issue":"2","key":"4161_CR9","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","volume":"60","author":"DG Lowe","year":"2004","unstructured":"Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int j Comput Vis 60(2):91\u2013110","journal-title":"Int j Comput Vis"},{"key":"4161_CR10","doi-asserted-by":"crossref","unstructured":"Bay H, Tuytelaars T, Van Gool L (2006) Surf: speeded up robust features. In: European Conference on Computer Vision. Springer, Berlin, Heidelberg, pp 404\u2013417","DOI":"10.1007\/11744023_32"},{"key":"4161_CR11","doi-asserted-by":"crossref","unstructured":"Rublee E, Rabaud V, Konolige K, Bradski G (2011) ORB: an efficient alternative to SIFT or SURF. In: 2011 International Conference on Computer Vision, pp 2564\u20132571","DOI":"10.1109\/ICCV.2011.6126544"},{"key":"4161_CR12","doi-asserted-by":"crossref","unstructured":"Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580\u2013587","DOI":"10.1109\/CVPR.2014.81"},{"issue":"9","key":"4161_CR13","doi-asserted-by":"publisher","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","volume":"37","author":"K He","year":"2015","unstructured":"He K, Zhang X, Ren S, Sun J (2015) Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Patt Anal Mach Intell 37(9):1904\u20131916","journal-title":"IEEE Trans Patt Anal Mach Intell"},{"key":"4161_CR14","doi-asserted-by":"crossref","unstructured":"Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) Ssd: single shot multibox detector. In: European Conference on Computer Vision, pp 21\u201337","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"4161_CR15","doi-asserted-by":"crossref","unstructured":"Rothganger F, Lazebnik S, Schmid C, Ponce J (2003) 3D object modeling and recognition using affine-invariant patches and multi-view spatial constraints. In: 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol 2, pp 272\u2013277","DOI":"10.1109\/CVPR.2003.1211480"},{"key":"4161_CR16","doi-asserted-by":"crossref","unstructured":"Ozyesil O, Voroninski V, Basri R, Singer A (2017) A survey of structure from motion. arXiv preprint arXiv:1701.08493","DOI":"10.1017\/S096249291700006X"},{"key":"4161_CR17","unstructured":"Xiao J, Russell B, Torralba A (2012) Localizing 3D cuboids in single-view images. Adv Neural Inform Process Syst, pp 746\u2013754"},{"key":"4161_CR18","doi-asserted-by":"crossref","unstructured":"Pavlakos G, Zhou X, Chan A, Derpanis K.G, Daniilidis K (2017) 6-dof object pose from semantic keypoints. In: 2017 IEEE International Conference on Robotics and Automation (ICRA), 2011\u20132018","DOI":"10.1109\/ICRA.2017.7989233"},{"key":"4161_CR19","doi-asserted-by":"crossref","unstructured":"Peng S, Liu Y, Huang Q, Zhou X, Bao H (2019) Pvnet: pixel-wise voting network for 6dof pose estimation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 4561\u20134570","DOI":"10.1109\/CVPR.2019.00469"},{"key":"4161_CR20","unstructured":"Thomas PC, David WM (1992) Augmented reality: an application of heads-up display technology to manual manufacturing processes. In: Hawaii International Conference on System Sciences, pp 659\u2013669"},{"key":"4161_CR21","doi-asserted-by":"crossref","unstructured":"Kalkusch M, Lidy T, Knapp N, Reitmayr G, Kaufmann H, Schmalstieg D (2002) Structured visual markers for indoor pathfinding. In: The First IEEE international workshop agumented reality toolkit, pp 1\u20138","DOI":"10.1109\/ART.2002.1107018"},{"key":"4161_CR22","doi-asserted-by":"crossref","unstructured":"Wagner D, Reitmayr G, Mulloni A, Drummond T, Schmalstieg D (2008) Pose tracking from natural features on mobile phones. In: 2008 7th IEEE\/ACM international symposium on mixed and augmented reality, pp 125\u2013134","DOI":"10.1109\/ISMAR.2008.4637338"},{"key":"4161_CR23","doi-asserted-by":"crossref","unstructured":"Heok AD , Fong SW, Goh KH, Yang X, Liu W, Farzbiz F (2003) Human Pacman: a sensing-based mobile entertainment system with ubiquitous computing and tangible interaction. In: Proceedings of the 2nd workshop on network and system support for games, pp 106\u2013117","DOI":"10.1145\/963900.963911"},{"key":"4161_CR24","doi-asserted-by":"publisher","first-page":"2453","DOI":"10.1109\/ACCESS.2018.2886627","volume":"7","author":"F Munoz-Montoya","year":"2018","unstructured":"Munoz-Montoya F, Juan MC, Mendez-Lopez M, Fidalgo C (2018) Augmented reality based on SLAM to assess spatial short-term memory. IEEE Access 7:2453\u20132466","journal-title":"IEEE Access"},{"key":"4161_CR25","doi-asserted-by":"crossref","unstructured":"Runz M, Buffier M, Agapito L (2018) Maskfusion: real-time recognition, tracking and reconstruction of multiple moving objects. In: 2018 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), pp 10\u201320","DOI":"10.1109\/ISMAR.2018.00024"},{"key":"4161_CR26","doi-asserted-by":"crossref","unstructured":"Seo S.H, Kang D.W, Park S.O (2018) Real-time adaptable and coherent rendering for outdoor augmented reality. EURASIP J Image Video Process 118","DOI":"10.1186\/s13640-018-0357-8"},{"key":"4161_CR27","doi-asserted-by":"publisher","first-page":"166528","DOI":"10.1109\/ACCESS.2019.2952161","volume":"7","author":"C Linyan","year":"2019","unstructured":"Linyan C, Chaowei M (2019) SOF-SLAM: a semantic visual SLAM for dynamic environments. IEEE Access 7:166528\u2013166539","journal-title":"IEEE Access"},{"key":"4161_CR28","doi-asserted-by":"publisher","unstructured":"Chao Y, Zuxin L, Xin-Jun L, Fugui X, Yi Y, Qi W, Qiao F (2018) DS-SLAM: a semantic visual SLAM towards dynamic environments. In: 2018 IEEE\/RSJ International Conference on Intelligent Robots and Systems. https:\/\/doi.org\/10.1109\/IROS.2018.8593691","DOI":"10.1109\/IROS.2018.8593691"},{"key":"4161_CR29","unstructured":"Shinya S, Mikiya S, Ken S (2019) Openvslam: a versatile visual slam framework. In: Proceedings of the 27th ACM International Conference on Multimedia, pp 2292\u20132295"},{"key":"4161_CR30","unstructured":"Redmon J, Farhadi A (2018) Yolov3: an incremental improvement. arXiv preprint arXiv:1804.02767"},{"issue":"11","key":"4161_CR31","doi-asserted-by":"publisher","first-page":"2241","DOI":"10.1109\/TPAMI.2015.2513405","volume":"38","author":"J Yang","year":"2015","unstructured":"Yang J, Li H, Campbell D, Jia Y (2015) Go-ICP: a globally optimal solution to 3D ICP point-set registration. IEEE Trans Patt Anal Mach intelligence 38(11):2241\u20132254","journal-title":"IEEE Trans Patt Anal Mach intelligence"}],"container-title":["The Journal of Supercomputing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-021-04161-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11227-021-04161-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-021-04161-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,3,18]],"date-time":"2022-03-18T16:38:51Z","timestamp":1647621531000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11227-021-04161-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,15]]},"references-count":31,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,4]]}},"alternative-id":["4161"],"URL":"https:\/\/doi.org\/10.1007\/s11227-021-04161-0","relation":{},"ISSN":["0920-8542","1573-0484"],"issn-type":[{"value":"0920-8542","type":"print"},{"value":"1573-0484","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,11,15]]},"assertion":[{"value":"20 October 2021","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 November 2021","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}