{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,20]],"date-time":"2026-04-20T08:47:55Z","timestamp":1776674875558,"version":"3.51.2"},"reference-count":66,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2026,4,20]],"date-time":"2026-04-20T00:00:00Z","timestamp":1776643200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2026,4,20]],"date-time":"2026-04-20T00:00:00Z","timestamp":1776643200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Auton Robot"],"published-print":{"date-parts":[[2026,6]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>This paper presents Diver Interest via Pointing in Three Dimensions (DIP-3D), a method to indicate an object of interest from a diver to an autonomous underwater vehicle (AUV) by pointing that includes three-dimensional distance information to discriminate between multiple objects in the AUV\u2019s field of view. Traditional dense stereo vision for distance estimation underwater is challenging because of the relative lack of saliency of scene features and degraded lighting conditions. Yet in many applications, including distance information is necessary for robotic perception of diver pointing when multiple objects appear within the robot\u2019s image view. We subvert the challenges of underwater distance estimation by using sparse reconstruction of specific keypoints in both the left and right images from the robot\u2019s stereo camera to perform pose estimation. Triangulated pose keypoints, along with any object detection method, enable DIP-3D to infer the location of an object of interest when multiple objects are in the AUV\u2019s field of view. By allowing the scuba diver to point at an arbitrary object of interest and enabling the AUV to autonomously decide which object the diver is pointing to, this method permits more natural interaction between AUVs and humans in underwater-human robot collaborative tasks.<\/jats:p>","DOI":"10.1007\/s10514-026-10246-7","type":"journal-article","created":{"date-parts":[[2026,4,20]],"date-time":"2026-04-20T07:59:35Z","timestamp":1776671975000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Diver interest via pointing in three dimensions: 3D pointing reconstruction for diver-AUV communication"],"prefix":"10.1007","volume":"50","author":[{"given":"Chelsey","family":"Edge","sequence":"first","affiliation":[]},{"given":"Demetrious","family":"Kutzke","sequence":"additional","affiliation":[]},{"given":"Megdalia","family":"Bromhal","sequence":"additional","affiliation":[]},{"given":"Junaed","family":"Sattar","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2026,4,20]]},"reference":[{"key":"10246_CR1","doi-asserted-by":"crossref","unstructured":"Abidi, S., Williams, M., & Johnston, B. (2013). Human pointing as a robot directive. Proceedings of the ACM\/IEEE International Conference on Human-Robot Interaction (HRI) (pp. 67\u201368).","DOI":"10.1109\/HRI.2013.6483504"},{"key":"10246_CR2","unstructured":"Amazon (2025). Amazon mechanical turk.https:\/\/www.mturk.com\/. (Accessed: 2025-11-01)"},{"key":"10246_CR3","doi-asserted-by":"crossref","unstructured":"Andriluka, M., Pishchulin, L., Gehler, P., & Schiele, B. (2014). 2D human pose estimation: New benchmark and state of the art analysis. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","DOI":"10.1109\/CVPR.2014.471"},{"key":"10246_CR4","doi-asserted-by":"crossref","unstructured":"Azari, B., Lim, A., & Vaughan, R. (2019). Commodifying pointing in HRI: Simple and fast pointing gesture detection from RGB-D images. Proceedings of the Conference on Computer and Robot Vision (CRV) (pp. 174\u2013180).","DOI":"10.1109\/CRV.2019.00031"},{"key":"10246_CR5","unstructured":"Bazarevsky, V., Grishchenko, I., Raveendran, K., Zhu, T., Zhang, F., & Grundmann, M. (2020). BlazePose: On-device real-time body pose tracking. arXiv."},{"key":"10246_CR6","doi-asserted-by":"publisher","first-page":"702","DOI":"10.1002\/rob.20350","volume":"27","author":"B Bingham","year":"2010","unstructured":"Bingham, B., Foley, B., Singh, H., Camilli, R., Delaporta, K., Eustice, R., & Sakellariou, D. (2010). Robotic tools for deep water archaeology: Surveying an ancient shipwreck with an autonomous underwater vehicle. Journal of Field Robotics, 27, 702\u2013717. https:\/\/doi.org\/10.1002\/rob.20350","journal-title":"Journal of Field Robotics"},{"issue":"4","key":"10246_CR7","doi-asserted-by":"publisher","first-page":"199","DOI":"10.1007\/s43154-022-00092-7","volume":"3","author":"A Birk","year":"2022","unstructured":"Birk, A. (2022). A survey of underwater human-robot interaction (U-HRI). Current Robotics Reports, 3(4), 199\u2013211. https:\/\/doi.org\/10.1007\/s43154-022-00092-7","journal-title":"Current Robotics Reports"},{"key":"10246_CR8","unstructured":"Bradski, G., & Kaehler, A. (2008). Learning OpenCV: Computer vision with the OpenCV library. O\u2019Reilly Media, Inc."},{"key":"10246_CR9","doi-asserted-by":"publisher","DOI":"10.1016\/j.conengprac.2024.105864","volume":"145","author":"M Bresciani","year":"2024","unstructured":"Bresciani, M., Zacchini, L., Topini, A., Ridolfi, A., & Costanzi, R. (2024). Automatic target recognition and geolocalisation of natural gas seeps using an autonomous underwater vehicle. Control Engineering Practice, 145, Article 105864.","journal-title":"Control Engineering Practice"},{"issue":"1","key":"10246_CR10","doi-asserted-by":"publisher","first-page":"172","DOI":"10.1109\/TPAMI.2019.2929257","volume":"43","author":"Z Cao","year":"2021","unstructured":"Cao, Z., Hidalgo, G., Simon, T., Wei, S.-E., & Sheikh, Y. (2021). OpenPose: Realtime multi-person 2D pose estimation using part affinity fields. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(1), 172\u2013186. https:\/\/doi.org\/10.1109\/TPAMI.2019.2929257","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"10246_CR11","doi-asserted-by":"crossref","unstructured":"Chavez, A.G., Mueller, C.A., Birk, A., Babic, A., & Miskovic, N. (2017). Stereo-vision based diver pose estimation using LSTM recurrent neural networks for AUV navigation guidance. Proceedings of the MTS\/IEEE OCEANS (pp. 1\u20137).","DOI":"10.1109\/OCEANSE.2017.8085020"},{"key":"10246_CR12","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2019.102897","volume":"192","author":"Y Chen","year":"2020","unstructured":"Chen, Y., Tian, Y., & He, M. (2020). Monocular human pose estimation: A survey of deep learning-based methods. Computer Vision and Image Understanding, 192, Article 102897. https:\/\/doi.org\/10.1016\/j.cviu.2019.102897","journal-title":"Computer Vision and Image Understanding"},{"key":"10246_CR13","doi-asserted-by":"publisher","unstructured":"Chiarella, D., Bibuli, M., Bruzzone, G., Caccia, M., Ranieri, A., Zereik, E., & Cutugno, P. (2018). A novel gesture-based language for underwater human-robot interaction. Journal of Marine Science and Engineering, 6(3), , https:\/\/doi.org\/10.3390\/jmse6030091","DOI":"10.3390\/jmse6030091"},{"key":"10246_CR14","doi-asserted-by":"crossref","unstructured":"Delmerico, J., Mintchev, S., Giusti, A., Gromov, B., Melo, K., Horvat, T., & Scaramuzza, D. (2019). The current state and future outlook of rescue robotics. Journal of Field Robotics,36(7), 1171\u20131191.","DOI":"10.1002\/rob.21887"},{"issue":"1","key":"10246_CR15","doi-asserted-by":"publisher","first-page":"46","DOI":"10.1109\/MC.2007.6","volume":"40","author":"G Dudek","year":"2007","unstructured":"Dudek, G., Giguere, P., Prahacs, C., Saunderson, S., Sattar, J., Torres-Mendez, L., & Georgiades, C. (2007). AQUA: An amphibious autonomous robot. Computer, 40(1), 46\u201353. https:\/\/doi.org\/10.1109\/MC.2007.6","journal-title":"Computer"},{"key":"10246_CR16","doi-asserted-by":"crossref","unstructured":"Edge, C., Enan, S.S., Fulton, M., Hong, J., Mo, J., Barthelemy, K., & Sattar, J. (2020). Design and experiments with LoCO AUV: A low cost open-source autonomous underwater vehicle. Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS) (pp. 1761\u20131768). Las Vegas, Nevada, USA.","DOI":"10.1109\/IROS45743.2020.9341007"},{"key":"10246_CR17","doi-asserted-by":"crossref","unstructured":"Edge, C., & Sattar, J. (2023). Diver interest via pointing: Human-directed object inspection for AUVs. Proceedings of the IEEE International Conference on Robotics and Automation (ICRA) (pp. 3146\u20133153).","DOI":"10.1109\/ICRA48891.2023.10160292"},{"key":"10246_CR18","doi-asserted-by":"crossref","unstructured":"Fridman, L., Reimer, B., Mehler, B., & Freeman, W.T. (2018). Cognitive load estimation in the wild. Proceedings of the Chi Conference on Human Factors in Computing Systems (pp. 1\u20139).","DOI":"10.1145\/3173574.3174226"},{"key":"10246_CR19","doi-asserted-by":"crossref","unstructured":"Fulton, M., Hong, J., & Sattar, J. (2022). Using monocular vision and human body priors for AUVs to autonomously approach divers. Proceedings of the IEEE International Conference on Robotics and Automation (ICRA) (pp. 1076\u20131082).","DOI":"10.1109\/ICRA46639.2022.9811905"},{"key":"10246_CR20","doi-asserted-by":"crossref","unstructured":"Girdhar, Y., McGuire, N., Cai, L., Jamieson, S., McCammon, S., Claus, B., & Mooney, T.A. (2023). CUREE: A curious underwater robot for ecosystem exploration. Proceedings of the IEEE International Conference on Robotics and Automation (ICRA) (pp. 11411\u201311417).","DOI":"10.1109\/ICRA48891.2023.10161282"},{"issue":"3","key":"10246_CR21","doi-asserted-by":"publisher","first-page":"67","DOI":"10.1109\/MRA.2021.3075560","volume":"28","author":"A Gomez Chavez","year":"2021","unstructured":"Gomez Chavez, A., Ranieri, A., Chiarella, D., & Birk, A. (2021). Underwater vision-based gesture recognition: A robustness validation for safe human-robot interaction. IEEE Robotics & Automation Magazine, 28(3), 67\u201378. https:\/\/doi.org\/10.1109\/MRA.2021.3075560","journal-title":"IEEE Robotics & Automation Magazine"},{"issue":"3","key":"10246_CR22","doi-asserted-by":"publisher","first-page":"67","DOI":"10.1109\/MRA.2021.3075560","volume":"28","author":"A Gomez Chavez","year":"2021","unstructured":"Gomez Chavez, A., Ranieri, A., Chiarella, D., & Birk, A. (2021). Underwater vision-based gesture recognition: A robustness validation for safe human-robot interaction. IEEE Robotics & Automation Magazine, 28(3), 67\u201378. https:\/\/doi.org\/10.1109\/MRA.2021.3075560","journal-title":"IEEE Robotics & Automation Magazine"},{"key":"10246_CR23","doi-asserted-by":"publisher","unstructured":"Gomez\u00a0Chavez, A., Ranieri, A., Chiarella, D., Zereik, E., Babic, A., & Birk, A. (2019). CADDY underwater stereo-vision dataset for human-robot interaction (HRI) in the context of diver activities. Journal of Marine Science and Engineering, 7(1), , https:\/\/doi.org\/10.3390\/jmse7010016","DOI":"10.3390\/jmse7010016"},{"key":"10246_CR24","doi-asserted-by":"crossref","unstructured":"Gro\u00dfmann, B., Pedersen, M.R., Klonovs, J., Herzog, D., Nalpantidis, L., & Kr\u00fcger, V. (2014). Communicating unknown objects to robots through pointing gestures. M.\u00a0Mistry, A.\u00a0Leonardis, M.\u00a0Witkowski, and C.\u00a0Melhuish (Eds.), Proceedings of the Conference on Advances in Autonomous Robotics Systems (pp. 209\u2013220). Cham: Springer International Publishing.","DOI":"10.1007\/978-3-319-10401-0_19"},{"issue":"2","key":"10246_CR25","doi-asserted-by":"publisher","first-page":"328","DOI":"10.1109\/TPAMI.2007.1166","volume":"30","author":"H Hirschmuller","year":"2007","unstructured":"Hirschmuller, H. (2007). Stereo processing by semiglobal matching and mutual information. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(2), 328\u2013341. https:\/\/doi.org\/10.1109\/TPAMI.2007.1166","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"1","key":"10246_CR26","first-page":"1","volume":"10","author":"G Hoffman","year":"2020","unstructured":"Hoffman, G., & Zhao, X. (2020). A primer for conducting experiments in human-robot interaction. ACM Transactions on Human-Robot Interaction (THRI), 10(1), 1\u201331.","journal-title":"ACM Transactions on Human-Robot Interaction (THRI)"},{"key":"10246_CR27","doi-asserted-by":"crossref","unstructured":"Islam, M.J., Ho, M., & Sattar, J. (2018a). Dynamic reconfiguration of mission parameters in underwater human-robot collaboration. Proceedings of the IEEE International Conference on Robotics and Automation (ICRA) (pp. 6212\u20136219).","DOI":"10.1109\/ICRA.2018.8461197"},{"key":"10246_CR28","doi-asserted-by":"crossref","unstructured":"Islam, M.J., Ho, M., & Sattar, J. (2018b). Dynamic reconfiguration of mission parameters in underwater human-robot collaboration. Proceedings of the IEEE International Conference on Robotics and Automation (ICRA) (pp. 6212\u20136219).","DOI":"10.1109\/ICRA.2018.8461197"},{"key":"10246_CR29","doi-asserted-by":"publisher","unstructured":"Islam, M. J., Ho, M., & Sattar, J. (2018). Understanding human motion and gestures for underwater human-robot collaboration. Journal of Field Robotics,1\u201323. https:\/\/doi.org\/10.1002\/rob.21837","DOI":"10.1002\/rob.21837"},{"issue":"4","key":"10246_CR30","doi-asserted-by":"publisher","first-page":"579","DOI":"10.1007\/s10514-021-09985-6","volume":"45","author":"MJ Islam","year":"2021","unstructured":"Islam, M. J., Mo, J., & Sattar, J. (2021). Robot-to-robot relative pose estimation using humans as markers. Autonomous Robots, 45(4), 579\u2013593. https:\/\/doi.org\/10.1007\/s10514-021-09985-6","journal-title":"Autonomous Robots"},{"key":"10246_CR31","unstructured":"Jackson, J.D. (1999). Classical electrodynamics. American Association of Physics Teachers."},{"key":"10246_CR32","doi-asserted-by":"publisher","DOI":"10.3389\/fphys.2022.842612","volume":"13","author":"KR Kelly","year":"2022","unstructured":"Kelly, K. R., Arrington, L. J., Bernards, J. R., & Jensen, A. E. (2022). Prolonged extreme cold water diving and the acute stress response during military dive training. Frontiers in Physiology, 13, Article 842612.","journal-title":"Frontiers in Physiology"},{"key":"10246_CR33","first-page":"24","volume":"2","author":"K Kray","year":"2023","unstructured":"Kray, K. (2023). Breathing-Gas Giant: Jess Stark. Alert Diver Magazine, 2, 24\u201325.","journal-title":"Alert Diver Magazine"},{"key":"10246_CR34","doi-asserted-by":"crossref","unstructured":"Lavest, J.-M., Rives, G., & Laprest\u00e9, J.-T. (2000). Underwater camera calibration. Proceedings of the European Conference on Computer Vision (ECCV) (pp. 654\u2013668).","DOI":"10.1007\/3-540-45053-X_42"},{"key":"10246_CR35","doi-asserted-by":"crossref","unstructured":"Littmann, E., Drees, A., & Ritter, H. (1996). Robot guidance by human pointing gestures. Proceedings of international workshop on neural networks for identification, control, robotics and signal\/image processing (pp. 449\u2013457).","DOI":"10.1109\/NICRSP.1996.542789"},{"key":"10246_CR36","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., & Berg, A.C. (2016). SSD: Single shot multibox detector. B.\u00a0Leibe, J.\u00a0Matas, N.\u00a0Sebe, and M.\u00a0Welling (Eds.), Proceedings of the European Conference on Computer Vision (ECCV) (pp. 21\u201337). Cham: Springer International Publishing.","DOI":"10.1007\/978-3-319-46448-0_2"},{"issue":"2","key":"10246_CR37","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","volume":"60","author":"DG Lowe","year":"2004","unstructured":"Lowe, D. G. (2004). Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision (IJCV), 60(2), 91\u2013110. https:\/\/doi.org\/10.1023\/B:VISI.0000029664.99615.94","journal-title":"International Journal of Computer Vision (IJCV)"},{"key":"10246_CR38","unstructured":"Lugaresi, C., Tang, J., Nash, H., McClanahan, C., Uboweja, E., Hays, M., & Grundmann, M. (2019). MediaPipe: A framework for perceiving and processing reality. Proceedings from the Third Workshop on Computer Vision for AR\/VR at IEEE Computer Vision and Pattern Recognition (CVPR)."},{"key":"10246_CR39","doi-asserted-by":"publisher","unstructured":"Medeiros, A.C.S., Ratsamee, P., Orlosky, J., Uranishi, Y., Higashida, M., & Takemura, H. (2021). 3D pointing gestures as target selection tools: Guiding monocular UAVs during window selection in an outdoor environment. ROBOMECH Journal, 8(1), , https:\/\/doi.org\/10.1186\/s40648-021-00200-w","DOI":"10.1186\/s40648-021-00200-w"},{"key":"10246_CR40","doi-asserted-by":"crossref","unstructured":"Modasshir, M., Rahman, S., Youngquist, O., & Rekleitis, I. (2018). Coral identification and counting with an autonomous underwater vehicle. Proceedings of the IEEE International Conference on Robotics and Biomimetics (ROBIO) (pp. 524\u2013529). Kuala Lumpur, Malaysia, (Finalist of T. J. Tarn Best Paper in Robotics).","DOI":"10.1109\/ROBIO.2018.8664785"},{"key":"10246_CR41","doi-asserted-by":"crossref","unstructured":"Nad, D., Ferreira, F., Kvasic, I., Mandic, L., Walker, C., Antillon, D.O., & Anderson, I. (2021). Diver-robot communication using wearable sensing diver glove. Proceedings of the MTS\/IEEE OCEANS (pp. 1\u20136).","DOI":"10.23919\/OCEANS44145.2021.9706117"},{"key":"10246_CR42","doi-asserted-by":"publisher","first-page":"1240276","DOI":"10.3389\/frobt.2023.1240276","volume":"10","author":"F Nauert","year":"2023","unstructured":"Nauert, F., & Kampmann, P. (2023). Inspection and maintenance of industrial infrastructure with autonomous underwater robots. Frontiers in Robotics and AI, 10, 1240276.","journal-title":"Frontiers in Robotics and AI"},{"key":"10246_CR43","unstructured":"Nvidia (2020). https:\/\/github.com\/NVIDIA-AI-IOT\/trt_pose."},{"key":"10246_CR44","doi-asserted-by":"crossref","unstructured":"Oth, L., Furgale, P., Kneip, L., & Siegwart, R. (2013). Rolling shutter camera calibration. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 1360\u20131367).","DOI":"10.1109\/CVPR.2013.179"},{"key":"10246_CR45","doi-asserted-by":"crossref","unstructured":"Pavllo, D., Feichtenhofer, C., Grangier, D., & Auli, M. (2019). 3D human pose estimation in video with temporal convolutions and semi-supervised training. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 7753\u20137762).","DOI":"10.1109\/CVPR.2019.00794"},{"key":"10246_CR46","doi-asserted-by":"crossref","unstructured":"Petillot, Y.R., Reed, S.R., & Bell, J.M. (2002). Real time AUV pipeline detection and tracking using side scan sonar and multi-beam echo-sounder. Proceedings of the MTS\/IEEE OCEANS (Vol.\u00a01, pp. 217\u2013222).","DOI":"10.1109\/OCEANS.2002.1193275"},{"key":"10246_CR47","unstructured":"Quigley, M., Conley, K., Gerkey, B.P., Faust, J., Foote, T., Leibs, J., & Ng, A.Y. (2009). ROS: An open-source robot operating system. ICRA Workshop on Open Source Software."},{"key":"10246_CR48","unstructured":"Recreational Scuba Training Council (RSTC) (2005). Common hand signals for recreational scuba diving. https:\/\/wrstc.com\/downloads\/12%20-%20Common%20Hand%20Signals.pdf. (Effective 2005. \u201cWhen using hand signals, divers should employ deliberate, clear movements, pausing if necessary between signals to ensure comprehension by their dive buddy.\u201d Accessed: 2025-11-01)"},{"issue":"6","key":"10246_CR49","doi-asserted-by":"publisher","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","volume":"39","author":"S Ren","year":"2017","unstructured":"Ren, S., He, K., Girshick, R., & Sun, J. (2017). Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(6), 1137\u20131149. https:\/\/doi.org\/10.1109\/TPAMI.2016.2577031","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"10246_CR50","doi-asserted-by":"publisher","unstructured":"Richarz, J., Scheidig, A., Martin, C., M\u00fcller, S., & Gross, H.-M. (2007). A monocular pointing pose estimator for gestural instruction of a mobile robot. International Journal of Advanced Robotic Systems, 4(1) https:\/\/doi.org\/10.5772\/5700","DOI":"10.5772\/5700"},{"issue":"4","key":"10246_CR51","doi-asserted-by":"publisher","first-page":"1179","DOI":"10.1109\/JOE.2021.3058416","volume":"46","author":"D Roper","year":"2021","unstructured":"Roper, D., Harris, C. A., Salavasidis, G., Pebody, M., Templeton, R., Prampart, T., et al. (2021). Autosub long range 6000: A multiple-month endurance AUV for deep-ocean monitoring and survey. IEEE Journal of Oceanic Engineering, 46(4), 1179\u20131191.","journal-title":"IEEE Journal of Oceanic Engineering"},{"key":"10246_CR52","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.cviu.2016.09.002","volume":"152","author":"N Sarafianos","year":"2016","unstructured":"Sarafianos, N., Boteanu, B., Ionescu, B., & Kakadiaris, I. A. (2016). 3D human pose estimation: A review of the literature and analysis of covariates. Computer Vision and Image Understanding, 152, 1\u201320. https:\/\/doi.org\/10.1016\/j.cviu.2016.09.002","journal-title":"Computer Vision and Image Understanding"},{"key":"10246_CR53","doi-asserted-by":"crossref","unstructured":"Sattar, J., Bourque, E., Gigu\u00e8re, P., & Dudek, G. (2007). Fourier tags: Smoothly degradable fiducial markers for use in human-robot interaction. Proceedings of the Fourth Canadian Conference on Computer and Robot Vision (pp. 165\u2013174). Montr\u00e9al, QC, Canada.","DOI":"10.1109\/CRV.2007.34"},{"key":"10246_CR54","doi-asserted-by":"crossref","unstructured":"Shortis, M. (2019). Camera calibration techniques for accurate measurement underwater. In McCarthy, John K. and Benjamin, Jonathan and Winton, Trevor and van Duivenvoorde, Wendy (Ed.), 3D Recording and Interpretation for Maritime Archaeology (pp. 11\u201327). Cham: Springer International Publishing.","DOI":"10.1007\/978-3-030-03635-5_2"},{"key":"10246_CR55","doi-asserted-by":"crossref","unstructured":"Shukla, D., Erkent, O., & Piater, J. (2015). Probabilistic detection of pointing directions for human-robot interaction. Proceedings of the International Conference on Digital Image Computing: Techniques and Applications (DICTA) (pp. 1\u20138).","DOI":"10.1109\/DICTA.2015.7371296"},{"key":"10246_CR56","unstructured":"StereoLabs (2025). Zed-mini stereo camera.https:\/\/store.stereolabs.com\/products\/zed-mini. (Accessed: 2025-10-25)"},{"key":"10246_CR57","doi-asserted-by":"publisher","unstructured":"Stilinovic, N., Markovic, M., Miskovic, N., Vukic, Z., & Vasilijevic, A. (2016). Mechanical design of an autonomous marine robotic system for interaction with divers. Brodogradnja, 67, , https:\/\/doi.org\/10.21278\/brod67305","DOI":"10.21278\/brod67305"},{"key":"10246_CR58","doi-asserted-by":"crossref","unstructured":"Stokey, R.P., Freitag, L.E., & Grund, M.D. (2005). A compact control language for AUV acoustic communication. Proceedings of the MTS\/IEEE OCEANS (Vol.\u00a02, pp. 1133\u20131137).","DOI":"10.1109\/OCEANSE.2005.1513217"},{"key":"10246_CR59","doi-asserted-by":"crossref","unstructured":"Szeliski, R. (2022). Computer vision: Algorithms and applications. Springer Nature.","DOI":"10.1007\/978-3-030-34372-9"},{"key":"10246_CR60","volume-title":"Foundations of computer vision","author":"A Torralba","year":"2024","unstructured":"Torralba, A., Isola, P., & Freeman, W. T. (2024). Foundations of computer vision. MIT Press."},{"key":"10246_CR61","doi-asserted-by":"publisher","unstructured":"T\u00f6lgyessy, M., Dekan, M., Duchon, F., Rodina, J., Hubinsk\u00fd, P., & Chovanec, L. (2017). Foundations of visual linear human-robot interaction via pointing gesture navigation. International Journal of Social Robotics,9(4), 509\u2013523. https:\/\/doi.org\/10.1007\/s12369-017-0408-9","DOI":"10.1007\/s12369-017-0408-9"},{"key":"10246_CR62","doi-asserted-by":"crossref","unstructured":"Van\u00a0den Bergh, M., Carton, D., De\u00a0Nijs, R., Mitsou, N., Landsiedel, C., Kuehnlenz, K., & Buss, M. (2011). Real-time 3D hand gesture interaction with a robot for understanding directions from humans. Proceedings of the International Conference on Robot and Human Interactive Communication (RO-MAN) (pp. 357\u2013362).","DOI":"10.1109\/ROMAN.2011.6005195"},{"key":"10246_CR63","unstructured":"Walker, A.M. (2021). Towards natural underwater human-robot interaction: Pointing gesture recognition for autonomous underwater vehicles (Unpublished master\u2019s thesis). University of Minnesota."},{"key":"10246_CR64","doi-asserted-by":"crossref","unstructured":"Widhalm, D., Ohnsted, C., Knutson, C., Kutzke, D., Singh, S., Mukherjee, R., & Sattar, J. (2025). Design and development of the MeCO open-source autonomous underwater vehicle. Forthcoming in the Proceedings of the IEEE International Conference on Robotics & Systems (IROS).","DOI":"10.1109\/IROS60139.2025.11247179"},{"issue":"5","key":"10246_CR65","doi-asserted-by":"publisher","first-page":"5002","DOI":"10.1109\/LRA.2025.3557235","volume":"10","author":"Y-K Wu","year":"2025","unstructured":"Wu, Y.-K., & Sattar, J. (2025a). Stereo-based 3d human pose estimation for underwater robots without 3d supervision. IEEE Robotics and Automation Letters, 10(5), 5002\u20135009. https:\/\/doi.org\/10.1109\/LRA.2025.3557235","journal-title":"IEEE Robotics and Automation Letters"},{"key":"10246_CR66","doi-asserted-by":"crossref","unstructured":"Wu, Y.-K., & Sattar, J. (2025b). Stereo-based 3D human pose estimation for underwater robots without 3D supervision. IEEE Robotics and Automation Letters,","DOI":"10.1109\/LRA.2025.3557235"}],"container-title":["Autonomous Robots"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10514-026-10246-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10514-026-10246-7","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10514-026-10246-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,20]],"date-time":"2026-04-20T07:59:58Z","timestamp":1776671998000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10514-026-10246-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,4,20]]},"references-count":66,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2026,6]]}},"alternative-id":["10246"],"URL":"https:\/\/doi.org\/10.1007\/s10514-026-10246-7","relation":{},"ISSN":["0929-5593","1573-7527"],"issn-type":[{"value":"0929-5593","type":"print"},{"value":"1573-7527","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,4,20]]},"assertion":[{"value":"30 June 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 November 2025","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 March 2026","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 April 2026","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"23"}}