{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,2]],"date-time":"2026-06-02T09:25:16Z","timestamp":1780392316809,"version":"3.54.1"},"reference-count":77,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2021,7,19]],"date-time":"2021-07-19T00:00:00Z","timestamp":1626652800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2021,8,31]]},"abstract":"<jats:p>We present a learning-based method for building driving-signal aware full-body avatars. Our model is a conditional variational autoencoder that can be animated with incomplete driving signals, such as human pose and facial keypoints, and produces a high-quality representation of human geometry and view-dependent appearance. The core intuition behind our method is that better drivability and generalization can be achieved by disentangling the driving signals and remaining generative factors, which are not available during animation. To this end, we explicitly account for information deficiency in the driving signal by introducing a latent space that exclusively captures the remaining information, thus enabling the imputation of the missing factors required during full-body animation, while remaining faithful to the driving signal. We also propose a learnable localized compression for the driving signal which promotes better generalization, and helps minimize the influence of global chance-correlations often found in real datasets. For a given driving signal, the resulting variational model produces a compact space of uncertainty for missing factors that allows for an imputation strategy best suited to a particular application. We demonstrate the efficacy of our approach on the challenging problem of full-body animation for virtual telepresence with driving signals acquired from minimal sensors placed in the environment and mounted on a VR-headset.<\/jats:p>","DOI":"10.1145\/3450626.3459850","type":"journal-article","created":{"date-parts":[[2021,7,20]],"date-time":"2021-07-20T00:04:26Z","timestamp":1626739466000},"page":"1-17","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":84,"title":["Driving-signal aware full-body avatars"],"prefix":"10.1145","volume":"40","author":[{"given":"Timur","family":"Bagautdinov","sequence":"first","affiliation":[{"name":"Facebook Reality Labs"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Chenglei","family":"Wu","sequence":"additional","affiliation":[{"name":"Facebook Reality Labs"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Tomas","family":"Simon","sequence":"additional","affiliation":[{"name":"Facebook Reality Labs"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Fabi\u00e1n","family":"Prada","sequence":"additional","affiliation":[{"name":"Facebook Reality Labs"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Takaaki","family":"Shiratori","sequence":"additional","affiliation":[{"name":"Facebook Reality Labs"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Shih-En","family":"Wei","sequence":"additional","affiliation":[{"name":"Facebook Reality Labs"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Weipeng","family":"Xu","sequence":"additional","affiliation":[{"name":"Facebook Reality Labs"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yaser","family":"Sheikh","sequence":"additional","affiliation":[{"name":"Facebook Reality Labs"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jason","family":"Saragih","sequence":"additional","affiliation":[{"name":"Facebook Reality Labs"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2021,7,19]]},"reference":[{"key":"e_1_2_2_1_1","volume-title":"Computer Graphics Forum","author":"Aberman Kfir","unstructured":"Kfir Aberman , Mingyi Shi , Jing Liao , Dani Lischinski , Baoquan Chen , and Daniel Cohen-Or . 2019. Deep video-based performance cloning . In Computer Graphics Forum , Vol. 38 . Wiley Online Library , 219--233. Kfir Aberman, Mingyi Shi, Jing Liao, Dani Lischinski, Baoquan Chen, and Daniel Cohen-Or. 2019. Deep video-based performance cloning. In Computer Graphics Forum, Vol. 38. Wiley Online Library, 219--233."},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCG.2010.65"},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.13946"},{"key":"e_1_2_2_4_1","volume-title":"Mathieu Salzmann, Lars Peters-son, and Stephen Gould.","author":"Aliakbarian Mohammad Sadegh","year":"2019","unstructured":"Mohammad Sadegh Aliakbarian , Fatemeh Sadat Saleh , Mathieu Salzmann, Lars Peters-son, and Stephen Gould. 2019 . Mitigating Posterior Collapse in Strongly Conditioned Variational Autoencoders . (2019). Mohammad Sadegh Aliakbarian, Fatemeh Sadat Saleh, Mathieu Salzmann, Lars Peters-son, and Stephen Gould. 2019. Mitigating Posterior Collapse in Strongly Conditioned Variational Autoencoders. (2019)."},{"key":"e_1_2_2_5_1","volume-title":"Proceedings of International Conference on 3D Vision (3DV). 98--109","author":"Alldieck T.","unstructured":"T. Alldieck , M. Magnor , W. Xu , C. Theobalt , and G. Pons-Moll . 2018. Detailed Human Avatars from Monocular Video . In Proceedings of International Conference on 3D Vision (3DV). 98--109 . T. Alldieck, M. Magnor, W. Xu, C. Theobalt, and G. Pons-Moll. 2018. Detailed Human Avatars from Monocular Video. In Proceedings of International Conference on 3D Vision (3DV). 98--109."},{"key":"e_1_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00875"},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1073204.1073207"},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00408"},{"key":"e_1_2_2_9_1","volume-title":"Mine: mutual information neural estimation. arXiv preprint arXiv:1801.04062","author":"Belghazi Mohamed Ishmael","year":"2018","unstructured":"Mohamed Ishmael Belghazi , Aristide Baratin , Sai Rajeswar , Sherjil Ozair , Yoshua Bengio , Aaron Courville , and R Devon Hjelm . 2018. Mine: mutual information neural estimation. arXiv preprint arXiv:1801.04062 ( 2018 ). Mohamed Ishmael Belghazi, Aristide Baratin, Sai Rajeswar, Sherjil Ozair, Yoshua Bengio, Aaron Courville, and R Devon Hjelm. 2018. Mine: mutual information neural estimation. arXiv preprint arXiv:1801.04062 (2018)."},{"key":"e_1_2_2_10_1","volume-title":"Representation learning: A review and new perspectives","author":"Bengio Yoshua","year":"2013","unstructured":"Yoshua Bengio , Aaron Courville , and Pascal Vincent . 2013. Representation learning: A review and new perspectives . IEEE transactions on pattern analysis and machine intelligence 35, 8 ( 2013 ), 1798--1828. Yoshua Bengio, Aaron Courville, and Pascal Vincent. 2013. Representation learning: A review and new perspectives. IEEE transactions on pattern analysis and machine intelligence 35, 8 (2013), 1798--1828."},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/311535.311556"},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46454-1_34"},{"key":"e_1_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2007.1054"},{"key":"e_1_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSP.2017.2693418"},{"key":"e_1_2_2_15_1","volume-title":"Understanding disentangling in beta-VAE. arXiv preprint arXiv:1804.03599","author":"Burgess Christopher P","year":"2018","unstructured":"Christopher P Burgess , Irina Higgins , Arka Pal , Loic Matthey , Nick Watters , Guillaume Desjardins , and Alexander Lerchner . 2018. Understanding disentangling in beta-VAE. arXiv preprint arXiv:1804.03599 ( 2018 ). Christopher P Burgess, Irina Higgins, Arka Pal, Loic Matthey, Nick Watters, Guillaume Desjardins, and Alexander Lerchner. 2018. Understanding disentangling in beta-VAE. arXiv preprint arXiv:1804.03599 (2018)."},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00603"},{"key":"e_1_2_2_17_1","volume-title":"Proceedings of the European Conference on Computer Vision (ECCV) Workshops. 0--0.","author":"Esser Patrick","year":"2018","unstructured":"Patrick Esser , Johannes Haux , Timo Milbich , 2018 . Towards learning a realistic rendering of human behavior . In Proceedings of the European Conference on Computer Vision (ECCV) Workshops. 0--0. Patrick Esser, Johannes Haux, Timo Milbich, et al. 2018. Towards learning a realistic rendering of human behavior. In Proceedings of the European Conference on Computer Vision (ECCV) Workshops. 0--0."},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206755"},{"key":"e_1_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.106"},{"key":"e_1_2_2_20_1","doi-asserted-by":"crossref","unstructured":"S. Ginosar A. Bar G. Kohavi C. Chan A. Owens and J. Malik. 2019. Learning Individual Styles of Conversational Gesture. In Computer Vision and Pattern Recognition (CVPR).  S. Ginosar A. Bar G. Kohavi C. Chan A. Owens and J. Malik. 2019. Learning Individual Styles of Conversational Gesture. In Computer Vision and Pattern Recognition (CVPR).","DOI":"10.1109\/CVPR.2019.00361"},{"key":"e_1_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00030"},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/2185520.2185531"},{"key":"e_1_2_2_23_1","unstructured":"Irina Higgins Loic Matthey Arka Pal Christopher Burgess Xavier Glorot Matthew Botvinick Shakir Mohamed and Alexander Lerchner. 2016. beta-vae: Learning basic visual concepts with a constrained variational framework. (2016).  Irina Higgins Loic Matthey Arka Pal Christopher Burgess Xavier Glorot Matthew Botvinick Shakir Mohamed and Alexander Lerchner. 2016. beta-vae: Learning basic visual concepts with a constrained variational framework. (2016)."},{"key":"e_1_2_2_24_1","volume-title":"Physically Based Shading in Theory and Practice. In ACM SIGGRAPH 2020 Courses.","author":"Hill Stephen","year":"2020","unstructured":"Stephen Hill , Stephen McAuley , Laurent Belcour , Will Earl , Niklas Harrysson , S\u00e9bastien Hillaire , Naty Hoffman , Lee Kerley , Jasmin Patry , Rob Piek\u00e9 , Igor Skliar , Jonathan Stone , Pascal Barla , M\u00e9gane Bati , and Iliyan Georgiev . 2020 . Physically Based Shading in Theory and Practice. In ACM SIGGRAPH 2020 Courses. Stephen Hill, Stephen McAuley, Laurent Belcour, Will Earl, Niklas Harrysson, S\u00e9bastien Hillaire, Naty Hoffman, Lee Kerley, Jasmin Patry, Rob Piek\u00e9, Igor Skliar, Jonathan Stone, Pascal Barla, M\u00e9gane Bati, and Iliyan Georgiev. 2020. Physically Based Shading in Theory and Practice. In ACM SIGGRAPH 2020 Courses."},{"key":"e_1_2_2_25_1","volume-title":"Stretchable and Twistable Bones for Skeletal Shape Deformation. ACM Transactions on Graphics (proceedings of ACM SIGGRAPH ASIA) 30, 6","author":"Jacobson Alec","year":"2011","unstructured":"Alec Jacobson and Olga Sorkine . 2011. Stretchable and Twistable Bones for Skeletal Shape Deformation. ACM Transactions on Graphics (proceedings of ACM SIGGRAPH ASIA) 30, 6 ( 2011 ), 165:1--165:8. Alec Jacobson and Olga Sorkine. 2011. Stretchable and Twistable Bones for Skeletal Shape Deformation. ACM Transactions on Graphics (proceedings of ACM SIGGRAPH ASIA) 30, 6 (2011), 165:1--165:8."},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2020.2988476"},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00868"},{"key":"e_1_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/1409625.1409627"},{"key":"e_1_2_2_29_1","volume-title":"Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114","author":"Kingma Diederik P","year":"2013","unstructured":"Diederik P Kingma and Max Welling . 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 ( 2013 ). Diederik P Kingma and Max Welling. 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 (2013)."},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00982"},{"key":"e_1_2_2_31_1","first-page":"5967","article-title":"Fader Networks:Manipulating Images by Sliding Attributes","volume":"30","author":"Lample Guillaume","year":"2017","unstructured":"Guillaume Lample , Neil Zeghidour , Nicolas Usunier , Antoine Bordes , Ludovic DENOYER, and Marc' Aurelio Ranzato . 2017 . Fader Networks:Manipulating Images by Sliding Attributes . In Advances in Neural Information Processing Systems , Vol. 30. 5967 -- 5976 . Guillaume Lample, Neil Zeghidour, Nicolas Usunier, Antoine Bordes, Ludovic DENOYER, and Marc' Aurelio Ranzato. 2017. Fader Networks:Manipulating Images by Sliding Attributes. In Advances in Neural Information Processing Systems, Vol. 30. 5967--5976.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/1640443.1640446"},{"key":"e_1_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.2312\/egst.20141042"},{"key":"e_1_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/344779.344862"},{"key":"e_1_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/3333002"},{"key":"e_1_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00780"},{"key":"e_1_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00600"},{"key":"e_1_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3197517.3201401"},{"key":"e_1_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/2816795.2818013"},{"key":"e_1_2_2_40_1","volume-title":"Black","author":"Ma Qianli","year":"2020","unstructured":"Qianli Ma , Jinlong Yang , Anurag Ranjan , Sergi Pujades , Gerard Pons-Moll , Siyu Tang , and Michael J . Black . 2020 . Learning to Dress 3D People in Generative Clothing. In Computer Vision and Pattern Recognition (CVPR). IEEE , 6468--6477. Qianli Ma, Jinlong Yang, Anurag Ranjan, Sergi Pujades, Gerard Pons-Moll, Siyu Tang, and Michael J. Black. 2020. Learning to Dress 3D People in Generative Clothing. In Computer Vision and Pattern Recognition (CVPR). IEEE, 6468--6477."},{"key":"e_1_2_2_41_1","volume-title":"Proceedings on Graphics Interface '88 (Edmonton","author":"Magnenat-Thalmann N.","unstructured":"N. Magnenat-Thalmann , R. Laperri\u00e8re , and D. Thalmann . 1989. Joint-Dependent Local Deformations for Hand Animation and Object Grasping . In Proceedings on Graphics Interface '88 (Edmonton , Alberta, Canada). Canadian Information Processing Society, CAN, 26--33. N. Magnenat-Thalmann, R. Laperri\u00e8re, and D. Thalmann. 1989. Joint-Dependent Local Deformations for Hand Animation and Object Grasping. In Proceedings on Graphics Interface '88 (Edmonton, Alberta, Canada). Canadian Information Processing Society, CAN, 26--33."},{"key":"e_1_2_2_42_1","volume-title":"International Conference on Machine Learning. PMLR, 4413--4423","author":"Mattei Pierre-Alexandre","year":"2019","unstructured":"Pierre-Alexandre Mattei and Jes Frellsen . 2019 . MIWAE: Deep generative modelling and imputation of incomplete data sets . In International Conference on Machine Learning. PMLR, 4413--4423 . Pierre-Alexandre Mattei and Jes Frellsen. 2019. MIWAE: Deep generative modelling and imputation of incomplete data sets. In International Conference on Machine Learning. PMLR, 4413--4423."},{"key":"e_1_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/192161.192244"},{"key":"e_1_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58536-5_26"},{"key":"e_1_2_2_45_1","volume-title":"Body2Hands: Learning to Infer 3D Hands from Conversational Gesture Body Dynamics. arXiv preprint arXiv:2007.12287","author":"Ng Evonne","year":"2020","unstructured":"Evonne Ng , Hanbyul Joo , Shiry Ginosar , and Trevor Darrell . 2020. Body2Hands: Learning to Infer 3D Hands from Conversational Gesture Body Dynamics. arXiv preprint arXiv:2007.12287 ( 2020 ). Evonne Ng, Hanbyul Joo, Shiry Ginosar, and Trevor Darrell. 2020. Body2Hands: Learning to Infer 3D Hands from Conversational Gesture Body Dynamics. arXiv preprint arXiv:2007.12287 (2020)."},{"key":"e_1_2_2_46_1","volume-title":"STAR: A Sparse Trained Articulated Human Body Regressor. In European Conference on Computer Vision (ECCV). https:\/\/star.is.tue.mpg.de","author":"Osman Ahmed A A","unstructured":"Ahmed A A Osman , Timo Bolkart , and Michael J. Black . 2020 . STAR: A Sparse Trained Articulated Human Body Regressor. In European Conference on Computer Vision (ECCV). https:\/\/star.is.tue.mpg.de Ahmed A A Osman, Timo Bolkart, and Michael J. Black. 2020. STAR: A Sparse Trained Articulated Human Body Regressor. In European Conference on Computer Vision (ECCV). https:\/\/star.is.tue.mpg.de"},{"key":"e_1_2_2_47_1","volume-title":"NPMs: Neural Parametric Models for 3D Deformable Shapes. arXiv preprint arXiv:2104.00702","author":"Palafox Pablo","year":"2021","unstructured":"Pablo Palafox , Alja\u017e Bo\u017ei\u010d , Justus Thies , Matthias Nie\u00dfner , and Angela Dai . 2021. NPMs: Neural Parametric Models for 3D Deformable Shapes. arXiv preprint arXiv:2104.00702 ( 2021 ). Pablo Palafox, Alja\u017e Bo\u017ei\u010d, Justus Thies, Matthias Nie\u00dfner, and Angela Dai. 2021. NPMs: Neural Parametric Models for 3D Deformable Shapes. arXiv preprint arXiv:2104.00702 (2021)."},{"key":"e_1_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00025"},{"key":"e_1_2_2_49_1","volume-title":"Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR). 10975--10985","author":"Pavlakos Georgios","unstructured":"Georgios Pavlakos , Vasileios Choutas , Nima Ghorbani , Timo Bolkart , Ahmed A. A. Osman , Dimitrios Tzionas , and Michael J. Black . 2019. Expressive Body Capture: 3D Hands, Face, and Body from a Single Image . In Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR). 10975--10985 . http:\/\/smpl-x.is.tue.mpg.de Georgios Pavlakos, Vasileios Choutas, Nima Ghorbani, Timo Bolkart, Ahmed A. A. Osman, Dimitrios Tzionas, and Michael J. Black. 2019. Expressive Body Capture: 3D Hands, Face, and Body from a Single Image. In Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR). 10975--10985. http:\/\/smpl-x.is.tue.mpg.de"},{"key":"e_1_2_2_50_1","volume-title":"Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans. In CVPR.","author":"Peng Sida","year":"2021","unstructured":"Sida Peng , Yuanqing Zhang , Yinghao Xu , Qianqian Wang , Qing Shuai , Hujun Bao , and Xiaowei Zhou . 2021 . Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans. In CVPR. Sida Peng, Yuanqing Zhang, Yinghao Xu, Qianqian Wang, Qing Shuai, Hujun Bao, and Xiaowei Zhou. 2021. Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans. In CVPR."},{"key":"e_1_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/WACV48630.2021.00185"},{"key":"e_1_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00899"},{"key":"e_1_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58621-8_4"},{"key":"e_1_2_2_54_1","volume-title":"European Conference on Computer Vision (ECCV)","volume":"11207","author":"Ranjan Anurag","unstructured":"Anurag Ranjan , Timo Bolkart , Soubhik Sanyal , and Michael J. Black . 2018. Generating 3D Faces using Convolutional Mesh Autoencoders . In European Conference on Computer Vision (ECCV) , Vol. Lecture Notes in Computer Science, vol 11207 . Springer, Cham, 725--741. Anurag Ranjan, Timo Bolkart, Soubhik Sanyal, and Michael J. Black. 2018. Generating 3D Faces using Convolutional Mesh Autoencoders. In European Conference on Computer Vision (ECCV), Vol. Lecture Notes in Computer Science, vol 11207. Springer, Cham, 725--741."},{"key":"e_1_2_2_55_1","volume-title":"MeshSDF: Differentiable Iso-Surface Extraction. Neural Information Processing Systems (NeurIPS)","author":"Remelli Edoardo","year":"2020","unstructured":"Edoardo Remelli , Artem Lukoianov , Stephan R Richter , Beno\u00eet Guillard , Timur Bagautdinov , Pierre Baque , and Pascal Fua . 2020. MeshSDF: Differentiable Iso-Surface Extraction. Neural Information Processing Systems (NeurIPS) ( 2020 ). Edoardo Remelli, Artem Lukoianov, Stephan R Richter, Beno\u00eet Guillard, Timur Bagautdinov, Pierre Baque, and Pascal Fua. 2020. MeshSDF: Differentiable Iso-Surface Extraction. Neural Information Processing Systems (NeurIPS) (2020)."},{"key":"e_1_2_2_56_1","volume-title":"Black","author":"Romero Javier","year":"2017","unstructured":"Javier Romero , Dimitrios Tzionas , and Michael J . Black . 2017 . Embodied Hands : Modeling and Capturing Hands and Bodies Together. ACM Transactions on Graphics, (Proc. SIGGRAPH Asia) 36, 6 (Nov. 2017). Javier Romero, Dimitrios Tzionas, and Michael J. Black. 2017. Embodied Hands: Modeling and Capturing Hands and Bodies Together. ACM Transactions on Graphics, (Proc. SIGGRAPH Asia) 36, 6 (Nov. 2017)."},{"key":"e_1_2_2_57_1","doi-asserted-by":"crossref","unstructured":"O. Ronneberger P.Fischer and T. Brox. 2015. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Medical Image Computing and Computer-Assisted Intervention (MICCAI). Springer 234--241.  O. Ronneberger P.Fischer and T. Brox. 2015. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Medical Image Computing and Computer-Assisted Intervention (MICCAI). Springer 234--241.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"e_1_2_2_58_1","volume-title":"Proceedings IEEE\/CVF Conf. on Computer Vision and Pattern Recognition (CVPR).","author":"Saito Shunsuke","unstructured":"Shunsuke Saito , Jinlong Yang , Qianli Ma , and Michael J. Black . 2021. SCANimate: Weakly Supervised Learning of Skinned Clothed Avatar Networks . In Proceedings IEEE\/CVF Conf. on Computer Vision and Pattern Recognition (CVPR). Shunsuke Saito, Jinlong Yang, Qianli Ma, and Michael J. Black. 2021. SCANimate: Weakly Supervised Learning of Skinned Clothed Avatar Networks. In Proceedings IEEE\/CVF Conf. on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_2_2_59_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58621-8_35"},{"key":"e_1_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/3386569.3392493"},{"key":"e_1_2_2_61_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00249"},{"key":"e_1_2_2_62_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00020"},{"key":"e_1_2_2_63_1","volume-title":"Garnett (Eds.)","volume":"28","author":"Sohn Kihyuk","year":"2015","unstructured":"Kihyuk Sohn , Honglak Lee , and Xinchen Yan . 2015 . Learning Structured Output Representation using Deep Conditional Generative Models. In Advances in Neural Information Processing Systems, C. Cortes, N. Lawrence, D. Lee, M. Sugiyama, and R . Garnett (Eds.) , Vol. 28 . Curran Associates, Inc., 3483--3491. https:\/\/proceedings.neurips.cc\/paper\/ 2015\/file\/8d55a249e6baa5c06772297520da2051-Paper.pdf Kihyuk Sohn, Honglak Lee, and Xinchen Yan. 2015. Learning Structured Output Representation using Deep Conditional Generative Models. In Advances in Neural Information Processing Systems, C. Cortes, N. Lawrence, D. Lee, M. Sugiyama, and R. Garnett (Eds.), Vol. 28. Curran Associates, Inc., 3483--3491. https:\/\/proceedings.neurips.cc\/paper\/2015\/file\/8d55a249e6baa5c06772297520da2051-Paper.pdf"},{"key":"e_1_2_2_64_1","doi-asserted-by":"publisher","DOI":"10.1145\/1057432.1057456"},{"key":"e_1_2_2_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/1866158.1866161"},{"key":"e_1_2_2_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01079"},{"key":"e_1_2_2_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/2010324.1964971"},{"key":"e_1_2_2_68_1","doi-asserted-by":"publisher","DOI":"10.1145\/3306346.3323035"},{"key":"e_1_2_2_69_1","volume-title":"NVAE: A Deep Hierarchical Variational Autoencoder. In Neural Information Processing Systems (NeurIPS).","author":"Vahdat Arash","year":"2020","unstructured":"Arash Vahdat and Jan Kautz . 2020 . NVAE: A Deep Hierarchical Variational Autoencoder. In Neural Information Processing Systems (NeurIPS). Arash Vahdat and Jan Kautz. 2020. NVAE: A Deep Hierarchical Variational Autoencoder. In Neural Information Processing Systems (NeurIPS)."},{"key":"e_1_2_2_70_1","article-title":"Visualizing data using t-SNE","volume":"9","author":"der Maaten Laurens Van","year":"2008","unstructured":"Laurens Van der Maaten and Geoffrey Hinton . 2008 . Visualizing data using t-SNE . Journal of machine learning research 9 , 11 (2008). Laurens Van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of machine learning research 9, 11 (2008).","journal-title":"Journal of machine learning research"},{"key":"e_1_2_2_71_1","doi-asserted-by":"publisher","DOI":"10.1145\/1073204.1073209"},{"key":"e_1_2_2_72_1","volume-title":"Proceedings of the 32nd International Conference on Neural Information Processing Systems. 1152--1164","author":"Wang Ting-Chun","year":"2018","unstructured":"Ting-Chun Wang , Ming-Yu Liu , Jun-Yan Zhu , Guilin Liu , Andrew Tao , Jan Kautz , and Bryan Catanzaro . 2018 . Video-to-video synthesis . In Proceedings of the 32nd International Conference on Neural Information Processing Systems. 1152--1164 . Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Guilin Liu, Andrew Tao, Jan Kautz, and Bryan Catanzaro. 2018. Video-to-video synthesis. In Proceedings of the 32nd International Conference on Neural Information Processing Systems. 1152--1164."},{"key":"e_1_2_2_73_1","doi-asserted-by":"publisher","DOI":"10.1145\/3306346.3323030"},{"key":"e_1_2_2_74_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897824.2925882"},{"key":"e_1_2_2_75_1","doi-asserted-by":"publisher","DOI":"10.1145\/3414685.3417838"},{"key":"e_1_2_2_76_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58542-6_21"},{"key":"e_1_2_2_77_1","unstructured":"Yi Zhou Chenglei Wu Zimo Li Chen Cao Yuting Ye Jason Saragih Hao Li and Yaser Sheikh. 2020b. Fully Convolutional Mesh Autoencoder using Efficient Spatially Varying Kernels. In Advances in Neural Information Processing Systems.  Yi Zhou Chenglei Wu Zimo Li Chen Cao Yuting Ye Jason Saragih Hao Li and Yaser Sheikh. 2020b. Fully Convolutional Mesh Autoencoder using Efficient Spatially Varying Kernels. In Advances in Neural Information Processing Systems."}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3450626.3459850","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3450626.3459850","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:17:21Z","timestamp":1750191441000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3450626.3459850"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,7,19]]},"references-count":77,"aliases":["10.1145\/3476576.3476721"],"journal-issue":{"issue":"4","published-print":{"date-parts":[[2021,8,31]]}},"alternative-id":["10.1145\/3450626.3459850"],"URL":"https:\/\/doi.org\/10.1145\/3450626.3459850","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,7,19]]},"assertion":[{"value":"2021-07-19","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}