{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,19]],"date-time":"2025-12-19T12:32:55Z","timestamp":1766147575196,"version":"3.48.0"},"reference-count":65,"publisher":"Association for Computing Machinery (ACM)","issue":"2","funder":[{"name":"Council of Hong Kong","award":["17210222"],"award-info":[{"award-number":["17210222"]}]},{"name":"Innovation and Technology Commission of the HKSAR Government","award":["ITS\/335\/23FP"],"award-info":[{"award-number":["ITS\/335\/23FP"]}]},{"name":"InnoHK initiative"},{"name":"The Hong Kong Jockey Club Charities Trust"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2026,4,30]]},"abstract":"<jats:p>Animating human-scene interactions such as picking and placing a wide range of objects with different geometries is a challenging task, especially in a cluttered environment where interactions with complex articulated containers are involved. The main difficulty lies in the sparsity of the motion data compared to the wide variation of the objects and environments, as well as the poor availability of transition motions between different actions, increasing the complexity of the generalization to arbitrary conditions. To cope with this issue, we develop a system that tackles the interaction synthesis problem as a hierarchical goal-driven task. Firstly, we develop a bimanual scheduler that plans a set of keyframes for simultaneously controlling the two hands to efficiently achieve the pick-and-place task from an abstract goal signal such as the target object selected by the user. Next, we develop a neural implicit planner that generates hand trajectories to guide reaching and leaving motions across diverse object shapes\/types and obstacle layouts. Finally, we propose a linear dynamic model for our DeepPhase controller that incorporates a Kalman filter to enable smooth transitions in the frequency domain, resulting in a more realistic and effective multi-objective control of the character. Our system can synthesize a rich variety of natural pick-and-place movements that adapt to different object geometries, container articulations, and scene layouts.<\/jats:p>","DOI":"10.1145\/3770746","type":"journal-article","created":{"date-parts":[[2025,10,2]],"date-time":"2025-10-02T09:08:37Z","timestamp":1759396117000},"page":"1-18","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["CHOICE: Coordinated Human-Object Interaction in Cluttered Environments for Pick-and-Place Actions"],"prefix":"10.1145","volume":"45","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4141-6710","authenticated-orcid":false,"given":"Jintao","family":"Lu","sequence":"first","affiliation":[{"name":"Computer Science, The University of Hong Kong","place":["Hong Kong, Hong Kong"]}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-8607-9441","authenticated-orcid":false,"given":"He","family":"Zhang","sequence":"additional","affiliation":[{"name":"RoboticsX, Tencent","place":["Shenzhen, China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2643-7457","authenticated-orcid":false,"given":"Yuting","family":"Ye","sequence":"additional","affiliation":[{"name":"Meta Reality Labs","place":["Seattle, United States"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1012-415X","authenticated-orcid":false,"given":"Takaaki","family":"Shiratori","sequence":"additional","affiliation":[{"name":"Meta Reality Labs","place":["Pittsburgh, United States"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4519-4326","authenticated-orcid":false,"given":"Sebastian","family":"Starke","sequence":"additional","affiliation":[{"name":"Meta Reality Labs","place":["Seattle, United States"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2729-5860","authenticated-orcid":false,"given":"Taku","family":"Komura","sequence":"additional","affiliation":[{"name":"Computer Science, The University of Hong Kong","place":["Hong Kong, Hong Kong"]}]}],"member":"320","published-online":{"date-parts":[[2025,12,19]]},"reference":[{"key":"e_1_3_3_2_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Araujo Joao Pedro","year":"2023","unstructured":"Joao Pedro Araujo, Jiaman Li, Karthik Vetrivel, Rishi Agarwal, Deepak Gopinath, Jiajun Wu, Alexander Clegg, and C. Karen Liu. 2023. CIRCLE: Capture in rich contextual environments. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition."},{"key":"e_1_3_3_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3588432.3591487"},{"key":"e_1_3_3_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01547"},{"key":"e_1_3_3_5_1","volume-title":"Proceedings of the International Conference on 3D Vision (3DV)","author":"Braun Jona","year":"2024","unstructured":"Jona Braun, Sammy Christen, Muhammed Kocabas, Emre Aksan, and Otmar Hilliges. 2024. Physically plausible full-body hand-object interaction synthesis. In Proceedings of the International Conference on 3D Vision (3DV)."},{"key":"e_1_3_3_6_1","article-title":"Learning deep SDF maps online for robot navigation and exploration","author":"Camps Gadiel Sznaier","year":"2022","unstructured":"Gadiel Sznaier Camps, Robert Dyro, Marco Pavone, and Mac Schwager. 2022. Learning deep SDF maps online for robot navigation and exploration. arXiv:2207.10782. Retrieved from https:\/\/arxiv.org\/abs\/2207.10782","journal-title":"a"},{"key":"e_1_3_3_7_1","first-page":"1855","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Cen Zhi","year":"2024","unstructured":"Zhi Cen, Huaijin Pi, Sida Peng, Zehong Shen, Minghui Yang, Shuai Zhu, Hujun Bao, and Xiaowei Zhou. 2024. Generating human motion in 3D scenes from text descriptions. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1855\u20131866."},{"key":"e_1_3_3_8_1","article-title":"Neural motion fields: Encoding grasp trajectories as implicit value functions","author":"Chen Yun-Chun","year":"2022","unstructured":"Yun-Chun Chen, Adithyavairavan Murali, Balakumar Sundaralingam, Wei Yang, Animesh Garg, and Dieter Fox. 2022. Neural motion fields: Encoding grasp trajectories as implicit value functions. Robotics Science and Systems (RSS) Workshop on Implicit Representations for Robotic Manipulation (2022).","journal-title":"Robotics Science and Systems (RSS) Workshop on Implicit Representations for Robotic Manipulation"},{"key":"e_1_3_3_9_1","doi-asserted-by":"publisher","DOI":"10.1177\/02783649241273668"},{"key":"e_1_3_3_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01992"},{"key":"e_1_3_3_11_1","unstructured":"Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing Data using t-SNE. Journal of Machine Learning Research 9 86 (2008) 2579\u20132605. http:\/\/jmlr.org\/papers\/v9\/vandermaaten08a.html"},{"key":"e_1_3_3_12_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Fan Zicong","year":"2023","unstructured":"Zicong Fan, Omid Taheri, Dimitrios Tzionas, Muhammed Kocabas, Manuel Kaufmann, Michael J. Black, and Otmar Hilliges. 2023. ARCTIC: A dataset for dexterous bimanual hand-object manipulation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_3_13_1","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV)","author":"Ghorbani Nima","year":"2021","unstructured":"Nima Ghorbani and Michael J. Black. 2021. SOMA: Solving optical marker-based MoCap automatically. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV)."},{"key":"e_1_3_3_14_1","first-page":"1","volume-title":"Proceedings of the Computer Graphics Forum","volume":"42","author":"Ghosh Anindita","year":"2023","unstructured":"Anindita Ghosh, Rishabh Dabral, Vladislav Golyanik, Christian Theobalt, and Philipp Slusallek. 2023. IMoS: Intent-driven full-body motion synthesis for human-object interactions. In Proceedings of the Computer Graphics Forum, Vol. 42. Wiley Online Library, 1\u201312."},{"key":"e_1_3_3_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01118"},{"key":"e_1_3_3_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00237"},{"key":"e_1_3_3_17_1","doi-asserted-by":"publisher","unstructured":"Mohamed Hassan Yunrong Guo Tingwu Wang Michael Black Sanja Fidler and Xue Bin Peng. 2023. Synthesizing physical character-scene interactions. In ACM SIGGRAPH 2023 Conference Proceedings (SIGGRAPH\u201923). Association for Computing Machinery New York NY USA Article 63 1\u20139. 10.1145\/3588432.3591525","DOI":"10.1145\/3588432.3591525"},{"key":"e_1_3_3_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073663"},{"key":"e_1_3_3_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3641519.3657505"},{"key":"e_1_3_3_20_1","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1109\/3DV50981.2020.00043","volume-title":"Proceedings of the 2020 International Conference on 3D Vision (3DV)","author":"Karunratanakul Korrawe","year":"2020","unstructured":"Korrawe Karunratanakul, Jinlong Yang, Yan Zhang, Michael J. Black, Krikamol Muandet, and Siyu Tang. 2020. Grasping field: Learning implicit representations for human grasps. In Proceedings of the 2020 International Conference on 3D Vision (3DV). IEEE, 333\u2013344."},{"key":"e_1_3_3_21_1","doi-asserted-by":"crossref","unstructured":"Jiye Lee and Hanbyul Joo. 2023. Locomotion-action-manipulation: Synthesizing human-scene interactions in complex 3D environments. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 9663\u20139674.","DOI":"10.1109\/ICCV51070.2023.00886"},{"key":"e_1_3_3_22_1","first-page":"54","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Li Jiaman","year":"2024","unstructured":"Jiaman Li, Alexander Clegg, Roozbeh Mottaghi, Jiajun Wu, Xavier Puig, and C. Karen Liu. 2024. Controllable human-object interaction synthesis. In Proceedings of the European Conference on Computer Vision. Springer, 54\u201372."},{"key":"e_1_3_3_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/3618333"},{"key":"e_1_3_3_24_1","unstructured":"Xueting Li Shalini De Mello Xiaolong Wang Ming-Hsuan Yang Jan Kautz and Sifei Liu. 2022. Learning continuous environment fields via implicit functions. In Proceedings of the International Conference on Learning Representations (ICLR\u201922). arXiv:2111.13997."},{"key":"e_1_3_3_25_1","doi-asserted-by":"crossref","first-page":"851","DOI":"10.1145\/3596711.3596800","volume-title":"Proceedings of the Seminal Graphics Papers: Pushing the Boundaries, Volume 2","author":"Loper Matthew","year":"2023","unstructured":"Matthew Loper, Naureen Mahmood, Javier Romero, Gerard Pons-Moll, and Michael J. Black. 2023. SMPL: A skinned multi-person linear model. In Proceedings of the Seminal Graphics Papers: Pushing the Boundaries, Volume 2. 851\u2013866."},{"key":"e_1_3_3_26_1","first-page":"2161","article-title":"Omnigrasp: Grasping diverse objects with simulated humanoids","volume":"37","author":"Luo Zhengyi","year":"2024","unstructured":"Zhengyi Luo, Jinkun Cao, Sammy Christen, Alexander Winkler, Kris Kitani, and Weipeng Xu. 2024. Omnigrasp: Grasping diverse objects with simulated humanoids. Advances in Neural Information Processing Systems 37 (2024), 2161\u20132184.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_3_27_1","unstructured":"Zhengyi Luo Jinkun Cao Josh Merel Alexander Winkler Jing Huang Kris Kitani and Weipeng Xu. 2024. Universal humanoid motion representations for physics-based control. In Proceedings of the International Conference on Learning Representations (ICLR\u201924). https:\/\/openreview.net\/forum?id=OrOd8PxOO2"},{"key":"e_1_3_3_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00554"},{"key":"e_1_3_3_29_1","first-page":"132","volume-title":"Proceedings of the International Symposium of Robotics Research","author":"Manuelli Lucas","year":"2019","unstructured":"Lucas Manuelli, Wei Gao, Peter Florence, and Russ Tedrake. 2019. KPAM: Keypoint affordances for category-level robotic manipulation. In Proceedings of the International Symposium of Robotics Research. Springer, 132\u2013157."},{"key":"e_1_3_3_30_1","first-page":"1","volume-title":"Proceedings of the SIGGRAPH Asia 2023 Conference Papers","author":"Marschner Zo\u00eb","year":"2023","unstructured":"Zo\u00eb Marschner, Silvia Sell\u00e1n, Hsueh-Ti Derek Liu, and Alec Jacobson. 2023. Constructive solid geometry on neural signed distance fields. In Proceedings of the SIGGRAPH Asia 2023 Conference Papers. 1\u201312."},{"issue":"4","key":"e_1_3_3_31_1","article-title":"Catch & carry: Reusable neural controllers for vision-guided whole-body tasks","volume":"39","author":"Merel Josh","year":"2020","unstructured":"Josh Merel, Saran Tunyasuvunakool, Arun Ahuja, Yuval Tassa, Leonard Hasenclever, Vu Pham, Tom Erez, Greg Wayne, and Nicolas Heess. 2020. Catch & carry: Reusable neural controllers for vision-guided whole-body tasks. ACM Transactions on Graphics (TOG) 39, 4 (2020), 39\u20131.","journal-title":"ACM Transactions on Graphics (TOG)"},{"key":"e_1_3_3_32_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58452-8_24"},{"key":"e_1_3_3_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366172"},{"key":"e_1_3_3_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/1186822.1073313"},{"key":"e_1_3_3_35_1","unstructured":"Ruiqi Ni and Ahmed H. Qureshi. 2023. Ntfields: Neural time fields for physics-informed robot motion planning. In Proceedings of the 11th International Conference on Learning Representations (ICLR\u201923). https:\/\/openreview.net\/forum?id=ApF0dmi1_9K"},{"key":"e_1_3_3_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00025"},{"issue":"4","key":"e_1_3_3_37_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3528223.3530110","article-title":"Ase: Large-scale reusable adversarial skill embeddings for physically simulated characters","volume":"41","author":"Peng Xue Bin","year":"2022","unstructured":"Xue Bin Peng, Yunrong Guo, Lina Halper, Sergey Levine, and Sanja Fidler. 2022. Ase: Large-scale reusable adversarial skill embeddings for physically simulated characters. ACM Transactions On Graphics (TOG) 41, 4 (2022), 1\u201317.","journal-title":"ACM Transactions On Graphics (TOG)"},{"key":"e_1_3_3_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3450626.3459670"},{"key":"e_1_3_3_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV51070.2023.01383"},{"key":"e_1_3_3_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01129"},{"issue":"4","key":"e_1_3_3_41_1","doi-asserted-by":"crossref","first-page":"1591","DOI":"10.1073\/pnas.93.4.1591","article-title":"A fast marching level set method for monotonically advancing fronts.","volume":"93","author":"Sethian James A.","year":"1996","unstructured":"James A. Sethian. 1996. A fast marching level set method for monotonically advancing fronts. proceedings of the National Academy of Sciences 93, 4 (1996), 1591\u20131595.","journal-title":"proceedings of the National Academy of Sciences"},{"key":"e_1_3_3_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA46639.2022.9812146"},{"issue":"4","key":"e_1_3_3_43_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3528223.3530178","article-title":"Deepphase: Periodic autoencoders for learning motion phase manifolds","volume":"41","author":"Starke Sebastian","year":"2022","unstructured":"Sebastian Starke, Ian Mason, and Taku Komura. 2022. Deepphase: Periodic autoencoders for learning motion phase manifolds. ACM Transactions on Graphics (TOG) 41, 4 (2022), 1\u201313.","journal-title":"ACM Transactions on Graphics (TOG)"},{"issue":"6","key":"e_1_3_3_44_1","article-title":"Neural state machine for character-scene interactions.","volume":"38","author":"Starke Sebastian","year":"2019","unstructured":"Sebastian Starke, He Zhang, Taku Komura, and Jun Saito. 2019. Neural state machine for character-scene interactions. ACM Trans. Graph. 38, 6 (2019), 209\u20131.","journal-title":"ACM Trans. Graph."},{"issue":"4","key":"e_1_3_3_45_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3450626.3459881","article-title":"Neural animation layering for synthesizing martial arts movements","volume":"40","author":"Starke Sebastian","year":"2021","unstructured":"Sebastian Starke, Yiwei Zhao, Fabio Zinno, and Taku Komura. 2021. Neural animation layering for synthesizing martial arts movements. ACM Transactions on Graphics (TOG) 40, 4 (2021), 1\u201316.","journal-title":"ACM Transactions on Graphics (TOG)"},{"key":"e_1_3_3_46_1","first-page":"8112","volume-title":"Proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA)","author":"Sundaralingam Balakumar","year":"2023","unstructured":"Balakumar Sundaralingam, Siva Kumar Sastry Hari, Adam Fishman, Caelan Garrett, Karl Van Wyk, Valts Blukis, Alexander Millane, Helen Oleynikova, Ankur Handa, Fabio Ramos, et\u00a0al. 2023. Curobo: Parallelized collision-free robot motion generation. In Proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 8112\u20138119."},{"key":"e_1_3_3_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01291"},{"key":"e_1_3_3_48_1","doi-asserted-by":"crossref","first-page":"581","DOI":"10.1007\/978-3-030-58548-8_34","volume-title":"Proceedings of the Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020, Proceedings, Part IV 16","author":"Taheri Omid","year":"2020","unstructured":"Omid Taheri, Nima Ghorbani, Michael J. Black, and Dimitrios Tzionas. 2020. GRAB: A dataset of whole-body human grasping of objects. In Proceedings of the Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020, Proceedings, Part IV 16. Springer, 581\u2013600."},{"key":"e_1_3_3_49_1","volume-title":"Proceedings of the International Conference on 3D Vision (3DV)","author":"Taheri Omid","year":"2024","unstructured":"Omid Taheri, Yi Zhou, Dimitrios Tzionas, Yang Zhou, Duygu Ceylan, Soren Pirk, and Michael J. Black. 2024. GRIP: Generating interaction poses using latent consistency and spatial cues. In Proceedings of the International Conference on 3D Vision (3DV). Retrieved from https:\/\/grip.is.tue.mpg.de"},{"key":"e_1_3_3_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/3687951"},{"key":"e_1_3_3_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/9.412624"},{"key":"e_1_3_3_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/3658181"},{"key":"e_1_3_3_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/TRO.2022.3160022"},{"key":"e_1_3_3_54_1","first-page":"257","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Wu Yan","year":"2022","unstructured":"Yan Wu, Jiahao Wang, Yan Zhang, Siwei Zhang, Otmar Hilliges, Fisher Yu, and Siyu Tang. 2022. Saga: Stochastic whole-body grasping with contact. In Proceedings of the European Conference on Computer Vision. Springer, 257\u2013274."},{"key":"e_1_3_3_55_1","volume-title":"Proceedings of the International Conference on Computer Vision (ICCV)","author":"Xu Sirui","year":"2023","unstructured":"Sirui Xu, Zhengyuan Li, Yu-Xiong Wang, and Liang-Yan Gui. 2023. InterDiff: Generating 3D human-object interactions with physics-informed diffusion. In Proceedings of the International Conference on Computer Vision (ICCV)."},{"key":"e_1_3_3_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.02028"},{"key":"e_1_3_3_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/2185520.2185537"},{"key":"e_1_3_3_58_1","volume-title":"Proceedings of the International Conference on 3D Vision (3DV)","author":"Zhang Hui","year":"2024","unstructured":"Hui Zhang, Sammy Christen, Zicong Fan, Luocheng Zheng, Jemin Hwangbo, Jie Song, and Otmar Hilliges. 2024. ArtiGrasp: Physically plausible synthesis of Bi-manual dexterous grasping and articulation. In Proceedings of the International Conference on 3D Vision (3DV)."},{"key":"e_1_3_3_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/3272127.3275090"},{"issue":"4","key":"e_1_3_3_60_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3450626.3459830","article-title":"Manipnet: Neural manipulation synthesis with a hand-object spatial representation","volume":"40","author":"Zhang He","year":"2021","unstructured":"He Zhang, Yuting Ye, Takaaki Shiratori, and Taku Komura. 2021. Manipnet: Neural manipulation synthesis with a hand-object spatial representation. ACM Transactions on Graphics (ToG) 40, 4 (2021), 1\u201314.","journal-title":"ACM Transactions on Graphics (ToG)"},{"key":"e_1_3_3_61_1","doi-asserted-by":"publisher","unstructured":"Xiaohan Zhang Bharat Lal Bhatnagar Sebastian Starke Vladimir Guzov and Gerard Pons-Moll. 2022. COUCH: Towards controllable human-chair interactions. (October2022). In Computer Vision ECCV 2022: 17th European Conference Tel Aviv Israel October 2327 2022 Proceedings Part V. Springer-Verlag Berlin Heidelberg 518\u2013535. 10.1007\/978-3-031-20065-6_30","DOI":"10.1007\/978-3-031-20065-6_30"},{"key":"e_1_3_3_62_1","volume-title":"Proceedings of the European Conference on Computer Vision (ECCV)","author":"Zhao Kaifeng","year":"2022","unstructured":"Kaifeng Zhao, Shaofei Wang, Yan Zhang, Thabo Beeler, and Siyu Tang. 2022. Compositional human-scene interaction synthesis with semantic control. In Proceedings of the European Conference on Computer Vision (ECCV)."},{"key":"e_1_3_3_63_1","volume-title":"Proceedings of the International Conference on Computer Vision (ICCV)","author":"Zhao Kaifeng","year":"2023","unstructured":"Kaifeng Zhao, Yan Zhang, Shaofei Wang, Thabo Beeler, and Siyu Tang. 2023a. DIMOS: Synthesizing diverse human motions in 3D indoor scenes. In Proceedings of the International Conference on Computer Vision (ICCV)."},{"key":"e_1_3_3_64_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-96530-3"},{"key":"e_1_3_3_65_1","first-page":"585","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Zheng Juntian","year":"2023","unstructured":"Juntian Zheng, Qingyuan Zheng, Lixing Fang, Yun Liu, and Li Yi. 2023. CAMS: CAnonicalized Manipulation Spaces for Category-Level Functional Hand-Object Manipulation Synthesis. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 585\u2013594."},{"key":"e_1_3_3_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2019.2927938"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3770746","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,19]],"date-time":"2025-12-19T12:27:57Z","timestamp":1766147277000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3770746"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,12,19]]},"references-count":65,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2026,4,30]]}},"alternative-id":["10.1145\/3770746"],"URL":"https:\/\/doi.org\/10.1145\/3770746","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"type":"print","value":"0730-0301"},{"type":"electronic","value":"1557-7368"}],"subject":[],"published":{"date-parts":[[2025,12,19]]},"assertion":[{"value":"2024-12-05","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-09-23","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-12-19","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}