{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T12:53:02Z","timestamp":1776084782219,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":28,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,11,3]],"date-time":"2022-11-03T00:00:00Z","timestamp":1667433600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,11,3]]},"DOI":"10.1145\/3561975.3562953","type":"proceedings-article","created":{"date-parts":[[2022,10,11]],"date-time":"2022-10-11T22:10:57Z","timestamp":1665526257000},"page":"1-7","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["A Tool for Extracting 3D Avatar-Ready Gesture Animations from Monocular Videos"],"prefix":"10.1145","author":[{"given":"Andrew","family":"Feng","sequence":"first","affiliation":[{"name":"University of Southern California, Institute for Creative Technologies, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Samuel","family":"Shin","sequence":"additional","affiliation":[{"name":"University of Southern California, Institute for Creative Technologies, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Youngwoo","family":"Yoon","sequence":"additional","affiliation":[{"name":"ETRI, Republic of Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,11,3]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"Style Transfer for Co-Speech Gesture Animation: A Multi-Speaker Conditional-Mixture Approach. In European Conference on Computer Vision.","author":"Ahuja Chaitanya","year":"2020","unstructured":"Chaitanya Ahuja , Dong\u00a0Won Lee , Yukiko\u00a0 I Nakano , and Louis-Philippe Morency . 2020 . Style Transfer for Co-Speech Gesture Animation: A Multi-Speaker Conditional-Mixture Approach. In European Conference on Computer Vision. Chaitanya Ahuja, Dong\u00a0Won Lee, Yukiko\u00a0I Nakano, and Louis-Philippe Morency. 2020. Style Transfer for Co-Speech Gesture Animation: A Multi-Speaker Conditional-Mixture Approach. In European Conference on Computer Vision."},{"key":"e_1_3_2_2_2_1","volume-title":"Computer Vision \u2013 ECCV 2016(Lecture Notes in Computer Science)","author":"Bogo Federica","unstructured":"Federica Bogo , Angjoo Kanazawa , Christoph Lassner , Peter Gehler , Javier Romero , and Michael\u00a0 J. Black . 2016. Keep it SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image . In Computer Vision \u2013 ECCV 2016(Lecture Notes in Computer Science) . Springer International Publishing . Federica Bogo, Angjoo Kanazawa, Christoph Lassner, Peter Gehler, Javier Romero, and Michael\u00a0J. Black. 2016. Keep it SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image. In Computer Vision \u2013 ECCV 2016(Lecture Notes in Computer Science). Springer International Publishing."},{"key":"e_1_3_2_2_3_1","volume-title":"OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields","author":"Cao Z.","year":"2019","unstructured":"Z. Cao , G. Hidalgo Martinez , T. Simon , S. Wei , and Y.\u00a0 A. Sheikh . 2019. OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields . IEEE Transactions on Pattern Analysis and Machine Intelligence ( 2019 ). Z. Cao, G. Hidalgo Martinez, T. Simon, S. Wei, and Y.\u00a0A. Sheikh. 2019. OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields. IEEE Transactions on Pattern Analysis and Machine Intelligence (2019)."},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00200"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58607-2_2"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1002\/cav.1560"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DV53792.2021.00088"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3267851.3267898"},{"key":"e_1_3_2_2_9_1","unstructured":"Epic Games. 2021. Unreal Engine. https:\/\/www.unrealengine.com  Epic Games. 2021. Unreal Engine. https:\/\/www.unrealengine.com"},{"key":"e_1_3_2_2_10_1","volume-title":"Learning Individual Styles of Conversational Gesture. 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-June (6 2019","author":"Ginosar Shiry","year":"2019","unstructured":"Shiry Ginosar , Amir Bar , Gefen Kohavi , Caroline Chan , Andrew Owens , and Jitendra Malik . 2019 . Learning Individual Styles of Conversational Gesture. 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-June (6 2019 ), 3492\u20133501. https:\/\/doi.org\/10.1109\/CVPR.2019.00361 10.1109\/CVPR.2019.00361 Shiry Ginosar, Amir Bar, Gefen Kohavi, Caroline Chan, Andrew Owens, and Jitendra Malik. 2019. Learning Individual Styles of Conversational Gesture. 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-June (6 2019), 3492\u20133501. https:\/\/doi.org\/10.1109\/CVPR.2019.00361"},{"key":"e_1_3_2_2_11_1","volume-title":"Hao\u00a0Liu and Yaser Sheikh","author":"Lin Gui Bart Nabbe Lei Tan","year":"2015","unstructured":"Lei Tan Lin Gui Bart Nabbe Iain Matthews Takeo Kanade Shohei\u00a0Nobuhara Hanbyul\u00a0Joo , Hao\u00a0Liu and Yaser Sheikh . 2015 . Panoptic Studio : A Massively Multiview System for Social Motion Capture . (2015). Lei Tan Lin Gui Bart Nabbe Iain Matthews Takeo Kanade Shohei\u00a0Nobuhara Hanbyul\u00a0Joo, Hao\u00a0Liu and Yaser Sheikh. 2015. Panoptic Studio: A Massively Multiview System for Social Motion Capture. (2015)."},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"crossref","unstructured":"Angjoo Kanazawa Jason\u00a0Y. Zhang Panna Felsen and Jitendra Malik. 2019. Learning 3D Human Dynamics from Video. In Computer Vision and Pattern Recognition (CVPR).  Angjoo Kanazawa Jason\u00a0Y. Zhang Panna Felsen and Jitendra Malik. 2019. Learning 3D Human Dynamics from Video. In Computer Vision and Pattern Recognition (CVPR).","DOI":"10.1109\/CVPR.2019.00576"},{"key":"e_1_3_2_2_13_1","volume-title":"Auto-Encoding Variational Bayes. 2nd International Conference on Learning Representations, ICLR 2014, Banff, AB, Canada, April 14-16, 2014, Conference Track Proceedings.","author":"Kingma P","year":"2014","unstructured":"Diederik\u00a0 P Kingma and Max Welling . 2014 . Auto-Encoding Variational Bayes. 2nd International Conference on Learning Representations, ICLR 2014, Banff, AB, Canada, April 14-16, 2014, Conference Track Proceedings. Diederik\u00a0P Kingma and Max Welling. 2014. Auto-Encoding Variational Bayes. 2nd International Conference on Learning Representations, ICLR 2014, Banff, AB, Canada, April 14-16, 2014, Conference Track Proceedings."},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3397481.3450692"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00085"},{"key":"e_1_3_2_2_16_1","volume-title":"BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis. arXiv preprint arXiv:2203.05297(2022).","author":"Liu Haiyang","year":"2022","unstructured":"Haiyang Liu , Zihao Zhu , Naoya Iwamoto , Yichen Peng , Zhengqing Li , You Zhou , Elif Bozkurt , and Bo Zheng . 2022 . BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis. arXiv preprint arXiv:2203.05297(2022). Haiyang Liu, Zihao Zhu, Naoya Iwamoto, Yichen Peng, Zhengqing Li, You Zhou, Elif Bozkurt, and Bo Zheng. 2022. BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis. arXiv preprint arXiv:2203.05297(2022)."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3472307.3484167"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00554"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58565-5_33"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01169"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.01123"},{"key":"e_1_3_2_2_22_1","unstructured":"Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. https:\/\/github.com\/  Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. https:\/\/github.com\/"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01249-6_37"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/THMS.2022.3149173"},{"key":"e_1_3_2_2_25_1","volume-title":"Speech gesture generation from the trimodal context of text, audio, and speaker identity. ACM Transactions on Graphics 39 (11","author":"Yoon Youngwoo","year":"2020","unstructured":"Youngwoo Yoon , Bok Cha , Joo\u00a0Haeng Lee , Minsu Jang , Jaeyeon Lee , Jaehong Kim , and Geehyuk Lee . 2020. Speech gesture generation from the trimodal context of text, audio, and speaker identity. ACM Transactions on Graphics 39 (11 2020 ). Issue 6. https:\/\/doi.org\/10.1145\/3414685.3417838 10.1145\/3414685.3417838 Youngwoo Yoon, Bok Cha, Joo\u00a0Haeng Lee, Minsu Jang, Jaeyeon Lee, Jaehong Kim, and Geehyuk Lee. 2020. Speech gesture generation from the trimodal context of text, audio, and speaker identity. ACM Transactions on Graphics 39 (11 2020). Issue 6. https:\/\/doi.org\/10.1145\/3414685.3417838"},{"key":"e_1_3_2_2_26_1","volume-title":"Robots Learn Social Skills: End-to-End Learning of Co-Speech Gesture Generation for Humanoid Robots. In International Conference on Robotics and Automation. IEEE, 4303\u20134309","author":"Yoon Youngwoo","year":"2019","unstructured":"Youngwoo Yoon , Woo-Ri Ko , Minsu Jang , Jaeyeon Lee , Jaehong Kim , and Geehyuk Lee . 2019 . Robots Learn Social Skills: End-to-End Learning of Co-Speech Gesture Generation for Humanoid Robots. In International Conference on Robotics and Automation. IEEE, 4303\u20134309 . Youngwoo Yoon, Woo-Ri Ko, Minsu Jang, Jaeyeon Lee, Jaehong Kim, and Geehyuk Lee. 2019. Robots Learn Social Skills: End-to-End Learning of Co-Speech Gesture Generation for Humanoid Robots. In International Conference on Robotics and Automation. IEEE, 4303\u20134309."},{"key":"e_1_3_2_2_27_1","unstructured":"Yifu Zhang Peize Sun Yi Jiang Dongdong Yu Zehuan Yuan Ping Luo Wenyu Liu and Xinggang Wang. 2021. ByteTrack: Multi-Object Tracking by Associating Every Detection Box. CoRR abs\/2110.06864(2021). https:\/\/arxiv.org\/abs\/2110.06864  Yifu Zhang Peize Sun Yi Jiang Dongdong Yu Zehuan Yuan Ping Luo Wenyu Liu and Xinggang Wang. 2021. ByteTrack: Multi-Object Tracking by Associating Every Detection Box. CoRR abs\/2110.06864(2021). https:\/\/arxiv.org\/abs\/2110.06864"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00589"}],"event":{"name":"MIG '22: ACM SIGGRAPH Conference on Motion, Interaction and Games","location":"Guanajuato Mexico","acronym":"MIG '22","sponsor":["SIGGRAPH ACM Special Interest Group on Computer Graphics and Interactive Techniques"]},"container-title":["Proceedings of the 15th ACM SIGGRAPH Conference on Motion, Interaction and Games"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3561975.3562953","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3561975.3562953","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:49:07Z","timestamp":1750182547000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3561975.3562953"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,11,3]]},"references-count":28,"alternative-id":["10.1145\/3561975.3562953","10.1145\/3561975"],"URL":"https:\/\/doi.org\/10.1145\/3561975.3562953","relation":{},"subject":[],"published":{"date-parts":[[2022,11,3]]},"assertion":[{"value":"2022-11-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}