{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,27]],"date-time":"2026-01-27T23:32:26Z","timestamp":1769556746500,"version":"3.49.0"},"reference-count":49,"publisher":"MDPI AG","issue":"22","license":[{"start":{"date-parts":[[2019,11,13]],"date-time":"2019-11-13T00:00:00Z","timestamp":1573603200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100010198","name":"Ministerio de Econom\u00eda, Industria y Competitividad, Gobierno de Espa\u00f1a","doi-asserted-by":"publisher","award":["DPI2017-84827-R"],"award-info":[{"award-number":["DPI2017-84827-R"]}],"id":[{"id":"10.13039\/501100010198","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100008530","name":"European Regional Development Fund","doi-asserted-by":"publisher","award":["DPI2017-84827-R"],"award-info":[{"award-number":["DPI2017-84827-R"]}],"id":[{"id":"10.13039\/501100008530","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000780","name":"European Commission","doi-asserted-by":"publisher","award":["ICT-26-2016b-GA-732158"],"award-info":[{"award-number":["ICT-26-2016b-GA-732158"]}],"id":[{"id":"10.13039\/501100000780","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100014440","name":"Ministerio de Ciencia, Innovaci\u00f3n y Universidades","doi-asserted-by":"publisher","award":["FPU18\/01526"],"award-info":[{"award-number":["FPU18\/01526"]}],"id":[{"id":"10.13039\/100014440","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Human\u2013Robot interaction represents a cornerstone of mobile robotics, especially within the field of social robots. In this context, user localization becomes of crucial importance for the interaction. This work investigates the capabilities of wide field-of-view RGB cameras to estimate the 3D position and orientation (i.e., the pose) of a user in the environment. For that, we employ a social robot endowed with a fish-eye camera hosted in a tilting head and develop two complementary approaches: (1) a fast method relying on a single image that estimates the user pose from the detection of their feet and does not require either the robot or the user to remain static during the reconstruction; and (2) a method that takes some views of the scene while the camera is being tilted and does not need the feet to be visible. Due to the particular setup of the tilting camera, special equations for 3D reconstruction have been developed. In both approaches, a CNN-based skeleton detector (OpenPose) is employed to identify humans within the image. A set of experiments with real data validate our two proposed methods, yielding similar results than commercial RGB-D cameras while surpassing them in terms of coverage of the scene (wider FoV and longer range) and robustness to light conditions.<\/jats:p>","DOI":"10.3390\/s19224943","type":"journal-article","created":{"date-parts":[[2019,11,13]],"date-time":"2019-11-13T09:11:27Z","timestamp":1573636287000},"page":"4943","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":33,"title":["Human 3D Pose Estimation with a Tilting Camera for Social Mobile Robot Interaction"],"prefix":"10.3390","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3382-5872","authenticated-orcid":false,"given":"Mercedes","family":"Garcia-Salguero","sequence":"first","affiliation":[{"name":"Machine Perception and Intelligent Robotics Group (MAPIR), Dept. of System Engineering and Automation Biomedical Research Institute of Malaga (IBIMA), University of Malaga, 29071 M\u00e1laga, Spain"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3845-3497","authenticated-orcid":false,"given":"Javier","family":"Gonzalez-Jimenez","sequence":"additional","affiliation":[{"name":"Machine Perception and Intelligent Robotics Group (MAPIR), Dept. of System Engineering and Automation Biomedical Research Institute of Malaga (IBIMA), University of Malaga, 29071 M\u00e1laga, Spain"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2997-7571","authenticated-orcid":false,"given":"Francisco-Angel","family":"Moreno","sequence":"additional","affiliation":[{"name":"Machine Perception and Intelligent Robotics Group (MAPIR), Dept. of System Engineering and Automation Biomedical Research Institute of Malaga (IBIMA), University of Malaga, 29071 M\u00e1laga, Spain"}]}],"member":"1968","published-online":{"date-parts":[[2019,11,13]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1561\/1100000005","article-title":"Human\u2013robot interaction: A survey","volume":"1","author":"Goodrich","year":"2008","journal-title":"Found. Trends\u00ae Hum. Comput. Interact."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1016\/j.cviu.2016.03.004","article-title":"A real-time human-robot interaction system based on gestures for assistive scenarios","volume":"149","author":"Canal","year":"2016","journal-title":"Comput. Vision Image Underst."},{"key":"ref_3","unstructured":"Saleh, S., Sahu, M., Zafar, Z., and Berns, K. (2015, January 4\u20136). A multimodal nonverbal human-robot communication system. Proceedings of the Sixth International Conference on Computational Bioengineering, ICCB, Belgrade, Serbia."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Gockley, R., Forlizzi, J., and Simmons, R. (2007, January 10\u201312). Natural person-following behavior for social robots. Proceedings of the ACM\/IEEE International Conference on Human-robot Interaction, Arlington, VA, USA.","DOI":"10.1145\/1228716.1228720"},{"key":"ref_5","unstructured":"Cesta, A., Coradeschi, S., Cortellessa, G., Gonzalez, J., Tiberio, L., and Von Rump, S. (2010, January 5\u20137). Enabling social interaction through embodiment in ExCITE. Proceedings of the ForItAAL: Second Italian Forum on Ambient Assisted Living, Trento, Italy."},{"key":"ref_6","first-page":"1567","article-title":"Human Detection Using Color and Depth Information by Kinect Based on the Fusion Method of Decision Template","volume":"7","author":"Shi","year":"2016","journal-title":"ICIC Express Lett."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Zimmermann, C., Welschehold, T., Dornhege, C., Burgard, W., and Brox, T. (2018, January 21\u201325). 3D human pose estimation in RGBD images for robotic task learning. Proceedings of the 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, QLD, Australia.","DOI":"10.1109\/ICRA.2018.8462833"},{"key":"ref_8","unstructured":"Moreno, F.A., Ruiz Sarmiento, J.R., Monroy, J., Fernandez, M., and Gonzalez-Jimenez, J. (2018, January 8\u201312). Analyzing interference between RGB-D cameras for human motion tracking. Proceedings of the International Conference on Applications of Intelligent Systems (APPIS), Las Palmas de Gran Canaria, Spain."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Butler, D.A., Izadi, S., Hilliges, O., Molyneaux, D., Hodges, S., and Kim, D. (2012, January 5\u201310). Shake\u2019n\u2019sense: reducing interference for overlapping structured light depth cameras. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, Austin, TX, USA.","DOI":"10.1145\/2207676.2208335"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Gonz\u00e1lez-Jim\u00e9nez, J., Galindo, C., and Ruiz-Sarmiento, J. (2012, January 9\u201313). Technical improvements of the Giraff telepresence robot based on users\u2019 evaluation. Proceedings of the 2012 IEEE RO-MAN: The 21st IEEE International Symposium on Robot and Human Interactive Communication, Paris, France.","DOI":"10.1109\/ROMAN.2012.6343854"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Cao, Z., Hidalgo, G., Simon, T., Wei, S.E., and Sheikh, Y. (2018). OpenPose: Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields. arXiv.","DOI":"10.1109\/CVPR.2017.143"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Gong, W., Zhang, X., Gonz\u00e0lez, J., Sobral, A., Bouwmans, T., Tu, C., and Zahzah, E.H. (2016). Human pose estimation from monocular images: A comprehensive survey. Sensors, 16.","DOI":"10.3390\/s16121966"},{"key":"ref_13","unstructured":"Choo, K., and Fleet, D.J. (2001, January 7\u201314). People tracking using hybrid Monte Carlo filtering. Proceedings of the Eighth IEEE International Conference on Computer Vision. ICCV 2001, Vancouver, BC, Canada."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Andriluka, M., Roth, S., and Schiele, B. (2010, January 13\u201318). Monocular 3d pose estimation and tracking by detection. Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA.","DOI":"10.1109\/CVPR.2010.5540156"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"349","DOI":"10.1006\/cviu.2000.0878","article-title":"Reconstruction of articulated objects from point correspondences in a single uncalibrated image","volume":"80","author":"Taylor","year":"2000","journal-title":"Comput. Vision Image Underst."},{"key":"ref_16","unstructured":"Guan, P., Weiss, A., Balan, A.O., and Black, M.J. (October, January 29). Estimating human shape and pose from a single image. Proceedings of the 2009 IEEE 12th International Conference on Computer Vision, Kyoto, Japan."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Ramakrishna, V., Kanade, T., and Sheikh, Y. (2012, January 7\u201313). Reconstructing 3d human pose from 2d image landmarks. Proceedings of the European Conference on Computer Vision, Florence, Italy.","DOI":"10.1007\/978-3-642-33765-9_41"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Freifeld, O., and Black, M.J. (2012, January 7\u201313). Lie bodies: A manifold representation of 3D human shape. Proceedings of the European Conference on Computer Vision, Florence, Italy.","DOI":"10.1007\/978-3-642-33718-5_1"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"520","DOI":"10.1109\/TPAMI.2008.101","article-title":"Tracking people on a torus","volume":"31","author":"Elgammal","year":"2008","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1016\/j.cviu.2006.08.006","article-title":"Temporal motion models for monocular and multiview 3D human body tracking","volume":"104","author":"Urtasun","year":"2006","journal-title":"Comput. Vision Image Underst."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Alldieck, T., Kassubeck, M., Wandt, B., Rosenhahn, B., and Magnor, M. (2017, January 9\u201312). Optical flow-based 3d human motion estimation from monocular video. Proceedings of the German Conference on Pattern Recognition, Stuttgart, Germany.","DOI":"10.1007\/978-3-319-66709-6_28"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Toshev, A., and Szegedy, C. (2014, January 24\u201327). Deeppose: Human pose estimation via deep neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.214"},{"key":"ref_23","unstructured":"Tompson, J.J., Jain, A., LeCun, Y., and Bregler, C. (2014, January 8\u201313). Joint training of a convolutional network and a graphical model for human pose estimation. Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Li, S., Zhang, W., and Chan, A.B. (2015, January 7\u201313). Maximum-margin structured learning with deep networks for 3d human pose estimation. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.326"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"993","DOI":"10.1007\/s11263-018-1071-9","article-title":"Image-based synthesis for deep 3D human pose estimation","volume":"126","author":"Rogez","year":"2018","journal-title":"Int. J. Comput. Vision"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Martinez, J., Hossain, R., Romero, J., and Little, J.J. (2017, January 22\u201329). A simple yet effective baseline for 3d human pose estimation. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.288"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Li, X., Fan, Z., Liu, Y., Li, Y., and Dai, Q. (2019). 3D Pose Detection of Closely Interactive Humans Using Multi-View Cameras. Sensors, 19.","DOI":"10.3390\/s19122831"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"204","DOI":"10.1162\/PRES_a_00262","article-title":"ExCITE project: A review of forty-two months of robotic telepresence technology evolution","volume":"25","author":"Orlandini","year":"2016","journal-title":"Presence Teleoperators Virtual Environ."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Coradeschi, S., Cesta, A., Cortellessa, G., Coraci, L., Galindo, C., Gonzalez, J., Karlsson, L., Forsberg, A., Frennert, S., and Furfari, F. (2014). GiraffPlus: A system for monitoring activities and physiological parameters and promoting social interaction for elderly. Human-Computer Systems Interaction: Backgrounds and Applications 3, Springer.","DOI":"10.1007\/978-3-319-08491-6_22"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Luperto, M., Monroy, J., Ruiz-Sarmiento, J.R., Moreno, F.A., Basilico, N., Gonzalez-Jimenez, J., and Borghese, N.A. (2019, January 4\u20136). Towards Long-Term Deployment of a Mobile Robot for at-Home Ambient Assisted Living of the Elderly. Proceedings of the 2019 European Conference on Mobile Robots (ECMR), Prague, Czech Republic.","DOI":"10.1109\/ECMR.2019.8870924"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Cheng, C., Hao, X., and Li, J. (2017, January 13\u201316). Relative camera pose estimation method using optimization on the manifold. Proceedings of the International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Salzburg, Austria.","DOI":"10.5194\/isprs-archives-XLII-1-W1-41-2017"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Hartley, R., and Zisserman, A. (2003). Multiple View Geometry in Computer Vision, Cambridge University Press.","DOI":"10.1017\/CBO9780511811685"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"955","DOI":"10.1016\/j.robot.2009.03.002","article-title":"Stereo vision specific models for particle filter-based SLAM","volume":"57","author":"Moreno","year":"2009","journal-title":"Robot. Auton. Syst."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"1036","DOI":"10.1177\/0278364915619238","article-title":"A constant-time SLAM back-end in the continuum between global mapping and submapping: application to visual stereo SLAM","volume":"35","author":"Moreno","year":"2016","journal-title":"Int. J. Robot. Res."},{"key":"ref_35","unstructured":"Wei, Y., Lhuillier, M., and Quan, L. (2004, January 27\u201330). Fast segmentation-based dense stereo from quasi-dense matching. Proceedings of the Asian Conference on Computer Vision, Jeju, Korea."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"435","DOI":"10.1080\/15599610802438680","article-title":"Review of stereo vision algorithms: from software to hardware","volume":"2","author":"Lazaros","year":"2008","journal-title":"Int. J. Optomechatronics"},{"key":"ref_37","unstructured":"Monasse, P., Morel, J.M., and Tang, Z. (September, January 30). Three-step image rectification. Proceedings of the British Machine Vision Conference (BMVA), Aberystwyth, UK."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Laveau, S., and Faugeras, O. (1996, January 14\u201318). Oriented projective geometry for computer vision. Proceedings of the European Conference on Computer Vision, Cambridge, UK.","DOI":"10.1007\/BFb0015531"},{"key":"ref_39","unstructured":"(2019, July 29). Body Tracking SDK. Available online: https:\/\/orbbec3d.com\/bodytracking-sdk\/."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Garcia-Salguero, M., Monroy, J., Solano, A., and Gonzalez-Jimenez, J. (2019, January 7\u20139). Socially Acceptable Approach to Humans by a Mobile Robot. Proceedings of the 2nd International Conference on Applications of Intelligent Systems (APPIS), Las Palmas de Gran Canaria, Spain.","DOI":"10.1145\/3309772.3309793"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Coroiu, A.D.C.A., and Coroiu, A. (2018, January 6\u20138). Interchangeability of Kinect and Orbbec Sensors for Gesture Recognition. Proceedings of the 2018 IEEE 14th International Conference on Intelligent Computer Communication and Processing (ICCP), Cluj-Napoca, Romania.","DOI":"10.1109\/ICCP.2018.8516586"},{"key":"ref_42","unstructured":"Microsoft (2019, October 24). Microsoft Kinect. Available online: https:\/\/developer.microsoft.com\/en-us\/windows\/kinect."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Lange, B., Koenig, S., McConnell, E., Chang, C.Y., Juang, R., Suma, E., Bolas, M., and Rizzo, A. (2012, January 4\u20138). Interactive game-based rehabilitation using the Microsoft Kinect. Proceedings of the 2012 IEEE Virtual Reality Workshops (VRW), Costa Mesa, CA, USA.","DOI":"10.1109\/VR.2012.6180935"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"El-laithy, R.A., Huang, J., and Yeh, M. (2012, January 23\u201326). Study on the use of Microsoft Kinect for robotics applications. Proceedings of the 2012 IEEE\/ION Position, Location and Navigation Symposium, Myrtle Beach, SC, USA.","DOI":"10.1109\/PLANS.2012.6236985"},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"1555008","DOI":"10.1142\/S0218001415550083","article-title":"A survey of applications and human motion recognition with microsoft kinect","volume":"29","author":"Lun","year":"2015","journal-title":"Int. J. Pattern Recognit. Artif. Intell."},{"key":"ref_46","first-page":"63","article-title":"MIRA-Upper Limb Rehabilitation System Using Microsoft Kinect","volume":"56","author":"Cantea","year":"2011","journal-title":"Studia Univ. Babes-Bolyai Inform."},{"key":"ref_47","unstructured":"Gaber, A., Taher, M.F., and Waheb, M. (2015, January 13\u201314). A comparison of virtual rehabilitation techniques. Proceedings of the World Congress on Electrical Engineering and Computer Systems and Science(EECSS), Barcelona, Spain."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Hughes, C., Glavin, M., Jones, E., and Denny, P. (2008, January 18\u201319). Review of geometric distortion compensation in fish-eye cameras. Proceedings of the IET Irish Signals and Systems Conference(ISSC), Galway, Ireland.","DOI":"10.1049\/cp:20080656"},{"key":"ref_49","unstructured":"(2019, November 11). MAPIR-UMA Youtube Channel. Available online: https:\/\/www.youtube.com\/channel\/UC-thsUlVVKvB_vIANQXLLeA."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/19\/22\/4943\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:34:04Z","timestamp":1760189644000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/19\/22\/4943"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,11,13]]},"references-count":49,"journal-issue":{"issue":"22","published-online":{"date-parts":[[2019,11]]}},"alternative-id":["s19224943"],"URL":"https:\/\/doi.org\/10.3390\/s19224943","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,11,13]]}}}