{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,30]],"date-time":"2026-01-30T00:35:54Z","timestamp":1769733354837,"version":"3.49.0"},"reference-count":77,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2022,7,1]],"date-time":"2022-07-01T00:00:00Z","timestamp":1656633600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"crossref","award":["2018AAA0102200"],"award-info":[{"award-number":["2018AAA0102200"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Guangdong Laboratory of Artificial Intelligence and Digital Economy"},{"name":"GD Talent Plan","award":["2019JC05X328"],"award-info":[{"award-number":["2019JC05X328"]}]},{"DOI":"10.13039\/501100001809","name":"NSFC","doi-asserted-by":"crossref","award":["61872250, 62132021, U2001206, U21B2023, 62161146005"],"award-info":[{"award-number":["61872250, 62132021, U2001206, U21B2023, 62161146005"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"DEGP Key Project","award":["2018KZDXM058, 2020SFKC059"],"award-info":[{"award-number":["2018KZDXM058, 2020SFKC059"]}]},{"name":"GD Natural Science Foundation","award":["2021B1515020085"],"award-info":[{"award-number":["2021B1515020085"]}]},{"name":"Shenzhen Science and Technology Program","award":["RCYX20210609103121030, RCJC20200714114435012, JCYJ20210324120213036"],"award-info":[{"award-number":["RCYX20210609103121030, RCJC20200714114435012, JCYJ20210324120213036"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2022,7]]},"abstract":"<jats:p>We approach the problem of high-DOF reaching-and-grasping via learning joint planning of grasp and motion with deep reinforcement learning. To resolve the sample efficiency issue in learning the high-dimensional and complex control of dexterous grasping, we propose an effective representation of grasping state characterizing the spatial interaction between the gripper and the target object. To represent gripper-object interaction, we adopt Interaction Bisector Surface (IBS) which is the Voronoi diagram between two close by 3D geometric objects and has been successfully applied in characterizing spatial relations between 3D objects. We found that IBS is surprisingly effective as a state representation since it well informs the finegrained control of each finger with spatial relation against the target object. This novel grasp representation, together with several technical contributions including a fast IBS approximation, a novel vector-based reward and an effective training strategy, facilitate learning a strong control model of high-DOF grasping with good sample efficiency, dynamic adaptability, and cross-category generality. Experiments show that it generates high-quality dexterous grasp for complex shapes with smooth grasping motions. Code and data for this paper are at https:\/\/github.com\/qijinshe\/IBS-Grasping.<\/jats:p>","DOI":"10.1145\/3528223.3530091","type":"journal-article","created":{"date-parts":[[2022,7,22]],"date-time":"2022-07-22T21:06:27Z","timestamp":1658523987000},"page":"1-14","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":81,"title":["Learning high-DOF reaching-and-grasping via dynamic representation of gripper-object interaction"],"prefix":"10.1145","volume":"41","author":[{"given":"Qijin","family":"She","sequence":"first","affiliation":[{"name":"National University of Defense Technology and Shenzhen University, China"}]},{"given":"Ruizhen","family":"Hu","sequence":"additional","affiliation":[{"name":"Shenzhen University, China"}]},{"given":"Juzhan","family":"Xu","sequence":"additional","affiliation":[{"name":"Shenzhen University, China"}]},{"given":"Min","family":"Liu","sequence":"additional","affiliation":[{"name":"Chinese Academy of Military Science, China"}]},{"given":"Kai","family":"Xu","sequence":"additional","affiliation":[{"name":"National University of Defense Technology, China"}]},{"given":"Hui","family":"Huang","sequence":"additional","affiliation":[{"name":"Shenzhen University, China"}]}],"member":"320","published-online":{"date-parts":[[2022,7,22]]},"reference":[{"key":"e_1_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2011.07.022"},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1177\/0278364919887447"},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/TRO.2013.2289018"},{"key":"e_1_2_2_4_1","volume-title":"Proceedings, Part XIII 16","author":"Brahmbhatt Samarth","year":"2020","unstructured":"Samarth Brahmbhatt, Chengcheng Tang, Christopher D Twigg, Charles C Kemp, and James Hays. 2020. ContactPose: A dataset of grasps with object contact and hand pose. In Computer Vision-ECCV 2020: 16th European Conference, Glasgow, UK, August 23--28, 2020, Proceedings, Part XIII 16. Springer, 361--378."},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1177\/0278364917700714"},{"key":"e_1_2_2_6_1","volume-title":"Finding antipodal point grasps on irregularly shaped objects","author":"Chen Ming","year":"1993","unstructured":"I-Ming Chen and Joel W Burdick. 1993. Finding antipodal point grasps on irregularly shaped objects. IEEE transactions on Robotics and Automation 9, 4 (1993), 507--512."},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ROBOT.2010.5509615"},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2019.2896485"},{"key":"e_1_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2018.8593950"},{"key":"e_1_2_2_10_1","volume-title":"Advances in robot kinematics: Analysis and design","author":"Gregorio Raffaele Di","unstructured":"Raffaele Di Gregorio. 2008. A novel point of view to define the distance between two rigid-body poses. In Advances in robot kinematics: Analysis and design. Springer, 361--369."},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1177\/0278364914559753"},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8461041"},{"key":"e_1_2_2_13_1","first-page":"2290","article-title":"Planning optimal grasps","volume":"3","author":"Ferrari Carlo","year":"1992","unstructured":"Carlo Ferrari and John F Canny. 1992. Planning optimal grasps.. In ICRA, Vol. 3. 2290--2295.","journal-title":"ICRA"},{"key":"e_1_2_2_14_1","volume-title":"Vision-based grasp learning of an anthropomorphic hand-arm system in a synergy-based control framework. Science robotics 4, 26","author":"Ficuciello F","year":"2019","unstructured":"F Ficuciello, A Migliozzi, G Laudante, P Falco, and B Siciliano. 2019. Vision-based grasp learning of an anthropomorphic hand-arm system in a synergy-based control framework. Science robotics 4, 26 (2019)."},{"key":"e_1_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2016.7759306"},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/HUMANOIDS.2014.7041469"},{"key":"e_1_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2016.7759114"},{"key":"e_1_2_2_18_1","unstructured":"Tuomas Haarnoja Aurick Zhou Kristian Hartikainen George Tucker Sehoon Ha Jie Tan Vikash Kumar Henry Zhu Abhishek Gupta Pieter Abbeel et al. 2018. Soft actor-critic algorithms and applications. arXiv preprint arXiv:1812.05905 (2018)."},{"key":"e_1_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2017.2651381"},{"key":"e_1_2_2_20_1","first-page":"1","article-title":"Interaction context (ICON) towards a geometric functionality descriptor","volume":"34","author":"Hu Ruizhen","year":"2015","unstructured":"Ruizhen Hu, Chenyang Zhu, Oliver van Kaick, Ligang Liu, Ariel Shamir, and Hao Zhang. 2015. Interaction context (ICON) towards a geometric functionality descriptor. ACM Transactions on Graphics 34, 4 (2015), 1--12.","journal-title":"ACM Transactions on Graphics"},{"key":"e_1_2_2_21_1","volume-title":"Learning Deep Visuomotor Policies for Dexterous Hand Manipulation. In International Conference on Robotics and Automation (ICRA).","author":"Jain Divye","year":"2019","unstructured":"Divye Jain, Andrew Li, Shivam Singhal, Aravind Rajeswaran, Vikash Kumar, and Emanuel Todorov. 2019. Learning Deep Visuomotor Policies for Dexterous Hand Manipulation. In International Conference on Robotics and Automation (ICRA)."},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.01291"},{"key":"e_1_2_2_23_1","volume-title":"Qt-opt: Scalable deep reinforcement learning for vision-based robotic manipulation. arXiv preprint arXiv:1806.10293","author":"Kalashnikov Dmitry","year":"2018","unstructured":"Dmitry Kalashnikov, Alex Irpan, Peter Pastor, Julian Ibarz, Alexander Herzog, Eric Jang, Deirdre Quillen, Ethan Holly, Mrinal Kalakrishnan, Vincent Vanhoucke, et al. 2018. Qt-opt: Scalable deep reinforcement learning for vision-based robotic manipulation. arXiv preprint arXiv:1806.10293 (2018)."},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2015.7139793"},{"key":"e_1_2_2_25_1","volume-title":"Grasping Field: Learning Implicit Representations for Human Grasps. arXiv preprint arXiv:2008.04451","author":"Karunratanakul Korrawe","year":"2020","unstructured":"Korrawe Karunratanakul, Jinlong Yang, Yan Zhang, Michael Black, Krikamol Muandet, and Siyu Tang. 2020. Grasping Field: Learning Implicit Representations for Human Grasps. arXiv preprint arXiv:2008.04451 (2020)."},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1177\/0278364912445831"},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/70.508439"},{"key":"e_1_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-34995-0_8"},{"key":"e_1_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cad.2006.07.007"},{"key":"e_1_2_2_30_1","volume-title":"A survey on learning-based robotic grasping. Current Robotics Reports","author":"Kleeberger Kilian","year":"2020","unstructured":"Kilian Kleeberger, Richard Bormann, Werner Kraus, and Marco F Huber. 2020. A survey on learning-based robotic grasping. Current Robotics Reports (2020), 1--11."},{"key":"e_1_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1177\/0278364917710318"},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/1531326.1531365"},{"key":"e_1_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS40897.2019.8968115"},{"key":"e_1_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.15607\/RSS.2020.XVI.066"},{"key":"e_1_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2020.3003280"},{"key":"e_1_2_2_36_1","volume-title":"Robotics Research","author":"Lu Qingkai","unstructured":"Qingkai Lu, Kautilya Chenna, Balakumar Sundaralingam, and Tucker Hermans. 2020a. Planning multi-fingered grasps as probabilistic inference in a learned deep network. In Robotics Research. Springer, 455--472."},{"key":"e_1_2_2_37_1","volume-title":"Balakumar Sundaralingam, and Tucker Hermans.","author":"Lu Qingkai","year":"2020","unstructured":"Qingkai Lu, Mark Van der Merwe, Balakumar Sundaralingam, and Tucker Hermans. 2020b. Multifingered Grasp Planning via Inference in Deep Neural Networks: Outperforming Sampling by Learning Differentiable Models. IEEE Robotics & Automation Magazine (2020)."},{"key":"e_1_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.15607\/RSS.2017.XIII.058"},{"key":"e_1_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8460887"},{"key":"e_1_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2016.7487342"},{"key":"e_1_2_2_41_1","volume-title":"International Conference on Intelligent Robots and Systems. IEEE, 2586--2591","author":"Maldonado Alexis","year":"2010","unstructured":"Alexis Maldonado, Ulrich Klank, and Michael Beetz. 2010. Robotic grasping of un-modeled objects using time-of-flight range data and finger torque information. In International Conference on Intelligent Robots and Systems. IEEE, 2586--2591."},{"key":"e_1_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA48506.2021.9561802"},{"key":"e_1_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1115\/IMECE2000-2439"},{"key":"e_1_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/MRA.2004.1371616"},{"key":"e_1_2_2_45_1","doi-asserted-by":"crossref","unstructured":"Volodymyr Mnih Koray Kavukcuoglu David Silver Andrei A Rusu Joel Veness Marc G Bellemare Alex Graves Martin Riedmiller Andreas K Fidjeland Georg Ostrovski et al. 2015. Human-level control through deep reinforcement learning. nature 518 7540 (2015) 529--533.","DOI":"10.1038\/nature14236"},{"key":"e_1_2_2_46_1","volume-title":"Human Friendly Robotics","author":"Monforte Marco","unstructured":"Marco Monforte, Fanny Ficuciello, and Bruno Siciliano. 2019. Multifunctional principal component analysis for human-like grasping. In Human Friendly Robotics. Springer, 47--58."},{"key":"e_1_2_2_47_1","volume-title":"Closing the loop for robotic grasping: A real-time, generative grasp synthesis approach. arXiv preprint arXiv:1804.05172","author":"Morrison Douglas","year":"2018","unstructured":"Douglas Morrison, Peter Corke, and J\u00fcrgen Leitner. 2018. Closing the loop for robotic grasping: A real-time, generative grasp synthesis approach. arXiv preprint arXiv:1804.05172 (2018)."},{"key":"e_1_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1177\/027836498800700301"},{"key":"e_1_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2021.3130376"},{"key":"e_1_2_2_50_1","first-page":"18050","article-title":"Effective diversity in population based reinforcement learning","volume":"33","author":"Parker-Holder Jack","year":"2020","unstructured":"Jack Parker-Holder, Aldo Pacchiano, Krzysztof M Choromanski, and Stephen J Roberts. 2020. Effective diversity in population based reinforcement learning. Advances in Neural Information Processing Systems 33 (2020), 18050--18062.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8460528"},{"key":"e_1_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3083725"},{"key":"e_1_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/1073368.1073413"},{"key":"e_1_2_2_54_1","volume-title":"Proc. IEEE Conf. on Computer Vision & Pattern Recognition. 652--660","author":"Qi Charles R","year":"2017","unstructured":"Charles R Qi, Hao Su, Kaichun Mo, and Leonidas J Guibas. 2017. Pointnet: Deep learning on point sets for 3d classification and segmentation. In Proc. IEEE Conf. on Computer Vision & Pattern Recognition. 652--660."},{"key":"e_1_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8461039"},{"key":"e_1_2_2_56_1","volume-title":"Learning complex dexterous manipulation with deep reinforcement learning and demonstrations. arXiv preprint arXiv:1709.10087","author":"Rajeswaran Aravind","year":"2017","unstructured":"Aravind Rajeswaran, Vikash Kumar, Abhishek Gupta, Giulia Vezzani, John Schulman, Emanuel Todorov, and Sergey Levine. 2017. Learning complex dexterous manipulation with deep reinforcement learning and demonstrations. arXiv preprint arXiv:1709.10087 (2017)."},{"key":"e_1_2_2_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/TRO.2009.2020351"},{"key":"e_1_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2011.07.016"},{"key":"e_1_2_2_59_1","doi-asserted-by":"crossref","unstructured":"Ashutosh Saxena Justin Driemeyer Justin Kearns and Andrew Y Ng. 2007. Robotic grasping of novel objects. In Advances in neural information processing systems. 1209--1216.","DOI":"10.7551\/mitpress\/7503.003.0156"},{"key":"e_1_2_2_60_1","volume-title":"Robotics research","author":"Saxena Ashutosh","unstructured":"Ashutosh Saxena, Lawson Wong, Morgan Quigley, and Andrew Y Ng. 2010. A vision-based system for grasping novel objects in cluttered environments. In Robotics research. Springer, 337--348."},{"key":"e_1_2_2_61_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2014.6906903"},{"key":"e_1_2_2_62_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2020.3004787"},{"key":"e_1_2_2_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/3355089.3356505"},{"key":"e_1_2_2_64_1","doi-asserted-by":"publisher","DOI":"10.1109\/MRA.2012.2205651"},{"key":"e_1_2_2_65_1","volume-title":"Reinforcement learning: An introduction","author":"Sutton Richard S","unstructured":"Richard S Sutton and Andrew G Barto. 2018. Reinforcement learning: An introduction. MIT press."},{"key":"e_1_2_2_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00651"},{"key":"e_1_2_2_67_1","volume-title":"Learning Continuous 3D Reconstructions for Geometrically Aware Grasping. arXiv preprint arXiv:1910.00983","author":"der Merwe Mark Van","year":"2019","unstructured":"Mark Van der Merwe, Qingkai Lu, Balakumar Sundaralingam, Martin Matak, and Tucker Hermans. 2019. Learning Continuous 3D Reconstructions for Geometrically Aware Grasping. arXiv preprint arXiv:1910.00983 (2019)."},{"key":"e_1_2_2_68_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2017.8206060"},{"key":"e_1_2_2_69_1","volume-title":"Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards. arXiv preprint arXiv:1707.08817","author":"Vecerik Mel","year":"2017","unstructured":"Mel Vecerik, Todd Hester, Jonathan Scholz, Fumin Wang, Olivier Pietquin, Bilal Piot, Nicolas Heess, Thomas Roth\u00f6rl, Thomas Lampe, and Martin Riedmiller. 2017. Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards. arXiv preprint arXiv:1707.08817 (2017)."},{"key":"e_1_2_2_70_1","volume-title":"Conference on Robot Learning. PMLR, 291--300","author":"Viereck Ulrich","year":"2017","unstructured":"Ulrich Viereck, Andreas Pas, Kate Saenko, and Robert Platt. 2017. Learning a visuo-motor controller for real world robotic grasping using simulated depth images. In Conference on Robot Learning. PMLR, 291--300."},{"key":"e_1_2_2_71_1","volume-title":"Conference on Robot Learning. PMLR, 169--179","author":"Walker Nick","year":"2022","unstructured":"Nick Walker, Christoforos Mavrogiannis, Siddhartha Srinivasa, and Maya Cakmak. 2022. Influencing Behavioral Attributions to Robot Motion During Task Execution. In Conference on Robot Learning. PMLR, 169--179."},{"key":"e_1_2_2_72_1","volume-title":"Manipulation trajectory optimization with online grasp synthesis and selection. arXiv preprint arXiv:1911.10280","author":"Wang Lirui","year":"2019","unstructured":"Lirui Wang, Yu Xiang, and Dieter Fox. 2019. Manipulation trajectory optimization with online grasp synthesis and selection. arXiv preprint arXiv:1911.10280 (2019)."},{"key":"e_1_2_2_73_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2012.6225116"},{"key":"e_1_2_2_74_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00985"},{"key":"e_1_2_2_75_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA48506.2021.9560833"},{"key":"e_1_2_2_76_1","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.13112"},{"key":"e_1_2_2_77_1","doi-asserted-by":"publisher","DOI":"10.1145\/2574860"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3528223.3530091","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3528223.3530091","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:26Z","timestamp":1750186946000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3528223.3530091"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7]]},"references-count":77,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2022,7]]}},"alternative-id":["10.1145\/3528223.3530091"],"URL":"https:\/\/doi.org\/10.1145\/3528223.3530091","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,7]]},"assertion":[{"value":"2022-07-22","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}