{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,20]],"date-time":"2026-05-20T16:30:31Z","timestamp":1779294631132,"version":"3.51.4"},"reference-count":45,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2024,9,26]],"date-time":"2024-09-26T00:00:00Z","timestamp":1727308800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Hum.-Robot Interact."],"published-print":{"date-parts":[[2024,9,30]]},"abstract":"<jats:p>Teleoperation systems find many applications from earlier search-and-rescue to more recent daily tasks. It is widely acknowledged that using external sensors can decouple the view of the remote scene from the motion of the robot arm during manipulation, facilitating the control task. However, this design requires the coordination of multiple operators or may exhaust a single operator as s\/he needs to control both the manipulator arm and the external sensors. To address this challenge, our work introduces a viewpoint prediction model, the first data-driven approach that autonomously adjusts the viewpoint of a dynamic camera to assist in telemanipulation tasks. This model is parameterized by a deep neural network and trained on a set of human demonstrations. We propose a contrastive learning scheme that leverages viewpoints in a camera trajectory as contrastive data for network training. We demonstrated the effectiveness of the proposed viewpoint prediction model by integrating it into a real-world robotic system for telemanipulation. User studies reveal that our model outperforms several camera control methods in terms of control experience and reduces the perceived task load compared to manual camera control. As an assistive module of a telemanipulation system, our method significantly reduces task completion time for users who choose to adopt its recommendation.<\/jats:p>","DOI":"10.1145\/3660348","type":"journal-article","created":{"date-parts":[[2024,4,24]],"date-time":"2024-04-24T14:13:29Z","timestamp":1713968009000},"page":"1-23","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":9,"title":["Learning Autonomous Viewpoint Adjustment from Human Demonstrations for Telemanipulation"],"prefix":"10.1145","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6655-7192","authenticated-orcid":false,"given":"Ruixing","family":"Jia","sequence":"first","affiliation":[{"name":"The University of Hong Kong, Pokfulam, Hong Kong SAR, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3284-4019","authenticated-orcid":false,"given":"Lei","family":"Yang","sequence":"additional","affiliation":[{"name":"TransGP and The University of Hong Kong, Pokfulam, Hong Kong SAR, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9288-3167","authenticated-orcid":false,"given":"Ying","family":"Cao","sequence":"additional","affiliation":[{"name":"ShanghaiTech University, Shanghai, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9819-8865","authenticated-orcid":false,"given":"Calvin","family":"Kalun Or","sequence":"additional","affiliation":[{"name":"The University of Hong Kong, Pokfulam, Hong Kong SAR, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2284-3952","authenticated-orcid":false,"given":"Wenping","family":"Wang","sequence":"additional","affiliation":[{"name":"Texas A&amp;M University, College Station, TX, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9003-2054","authenticated-orcid":false,"given":"Jia","family":"Pan","sequence":"additional","affiliation":[{"name":"The University of Hong Kong, Pokfulam, Hong Kong SAR, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,9,26]]},"reference":[{"key":"e_1_3_2_2_2","unstructured":"Universal Robots A\/S. 2023. Universal_Robots_ROS_Driver. Retrieved from https:\/\/github.com\/UniversalRobots\/Universal_Robots_ROS_Driver"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/ROBOT.2001.932683"},{"key":"e_1_3_2_4_2","doi-asserted-by":"crossref","unstructured":"Andre Cleaver and Jivko Sinapov. 2023. Helping humans become better teachers for robots with augmented reality. Retrieved from https:\/\/openreview.net\/forum?id=LXgHj3JZtB","DOI":"10.1145\/3568294.3580207"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1145\/3385956.3422092"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/THMS.2021.3090765"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1145\/1514095.1514105"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1145\/3357251.3359444"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1016\/S0166-4115(08)62386-9"},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1145\/3171221.3171251"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1109\/THMS.2019.2904558"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2020.2974707"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1145\/2909824.3020249"},{"key":"e_1_3_2_14_2","unstructured":"Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv:1412.6980. Retrieved from https:\/\/arxiv.org\/abs\/1412.6980"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIE.2005.855696"},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1167\/18.6.18"},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-19824-3_18"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.1995.525785"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-39516-6_37"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.1002\/rob.21578"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-05321-5_11"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1177\/02783649211050677"},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2018.2792143"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/RO-MAN47096.2020.9223445"},{"key":"e_1_3_2_25_2","first-page":"8026","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"32","author":"Paszke Adam","year":"2019","unstructured":"Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An imperative style, high-performance deep learning library. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 32 (2019), 8026\u20138037."},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.5555\/3523760.3523818"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.1145\/2909824.3020254"},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1145\/3171221.3171279"},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.15607\/RSS.2019.XV.068"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA48506.2021.9561505"},{"key":"e_1_3_2_31_2","unstructured":"Nicklas Ritola Alberto Giaretta and Andrey Kiselev. 2023. Operator identification in a VR-based robot teleoperation scenario using head hands and eyes movement data. Retrieved from https:\/\/openreview.net\/forum?id=Xals4UE6ZS"},{"key":"e_1_3_2_32_2","first-page":"1","volume-title":"Proceedings of the 1st International Workshop on Virtual, Augmented, and Mixed Reality for HRI (VAM-HRI)","author":"Rosen Eric","year":"2018","unstructured":"Eric Rosen, David Whitney, Elizabeth Phillips, Daniel Ullman, and Stefanie Tellex. 2018. Testing robot teleoperation using a virtual reality interface with ROS reality. In Proceedings of the 1st International Workshop on Virtual, Augmented, and Mixed Reality for HRI (VAM-HRI), 1\u20134."},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS47612.2022.9982063"},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.3389\/frobt.2021.707149"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.1145\/3434074.3447280"},{"key":"e_1_3_2_36_2","first-page":"7462","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"33","author":"Sitzmann Vincent","year":"2020","unstructured":"Vincent Sitzmann, Julien Martel, Alexander Bergman, David Lindell, and Gordon Wetzstein. 2020. Implicit neural representations with periodic activation functions. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 33 (2020), 7462\u20137473."},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.1109\/SSRR56537.2022.10018630"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA48506.2021.9561361"},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA40945.2020.9197578"},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.21105\/joss.01026"},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1109\/HRI.2019.8673306"},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2021.3109348"},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2020.3005121"},{"key":"e_1_3_2_44_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-15-9460-1_2"},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.14505"},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1145\/1978942.1978952"}],"container-title":["ACM Transactions on Human-Robot Interaction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3660348","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3660348","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T23:56:49Z","timestamp":1750291009000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3660348"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,26]]},"references-count":45,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2024,9,30]]}},"alternative-id":["10.1145\/3660348"],"URL":"https:\/\/doi.org\/10.1145\/3660348","relation":{},"ISSN":["2573-9522"],"issn-type":[{"value":"2573-9522","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,9,26]]},"assertion":[{"value":"2023-02-06","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-03-07","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-09-26","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}