{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,20]],"date-time":"2026-05-20T16:45:01Z","timestamp":1779295501627,"version":"3.51.4"},"reference-count":106,"publisher":"Association for Computing Machinery (ACM)","issue":"6","license":[{"start":{"date-parts":[[2024,11,19]],"date-time":"2024-11-19T00:00:00Z","timestamp":1731974400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"crossref","award":["2022YFF0902301"],"award-info":[{"award-number":["2022YFF0902301"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"crossref"}]},{"name":"NSFC programs","award":["61976138"],"award-info":[{"award-number":["61976138"]}]},{"DOI":"10.13039\/501100003399","name":"STCSM","doi-asserted-by":"crossref","award":["2015F0203-000-06"],"award-info":[{"award-number":["2015F0203-000-06"]}],"id":[{"id":"10.13039\/501100003399","id-type":"DOI","asserted-by":"crossref"}]},{"name":"SHMEC","award":["2019-01-07-00-01-E00003"],"award-info":[{"award-number":["2019-01-07-00-01-E00003"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2024,12,19]]},"abstract":"<jats:p>\n            Volumetric video represents a transformative advancement in visual media, enabling users to freely navigate immersive virtual experiences and narrowing the gap between digital and real worlds. However, the need for extensive manual intervention to stabilize mesh sequences and the generation of excessively large assets in existing workflows impedes broader adoption. In this paper, we present a novel Gaussian-based approach, dubbed\n            <jats:italic>DualGS<\/jats:italic>\n            , for real-time and high-fidelity playback of complex human performance with excellent compression ratios. Our key idea in DualGS is to separately represent motion and appearance using the corresponding skin and joint Gaussians. Such an explicit disentanglement can significantly reduce motion redundancy and enhance temporal coherence. We begin by initializing the DualGS and anchoring skin Gaussians to joint Gaussians at the first frame. Subsequently, we employ a coarse-to-fine training strategy for frame-by-frame human performance modeling. It includes a coarse alignment phase for overall motion prediction as well as a fine-grained optimization for robust tracking and high-fidelity rendering. To integrate volumetric video seamlessly into VR environments, we efficiently compress motion using entropy encoding and appearance using codec compression coupled with a persistent codebook. Our approach achieves a compression ratio of up to 120 times, only requiring approximately 350KB of storage per frame. We demonstrate the efficacy of our representation through photo-realistic, free-view experiences on VR headsets, enabling users to immersively watch musicians in performance and feel the rhythm of the notes at the performers' fingertips. Project page: https:\/\/nowheretrix.github.io\/DualGS\/.\n          <\/jats:p>","DOI":"10.1145\/3687926","type":"journal-article","created":{"date-parts":[[2024,11,19]],"date-time":"2024-11-19T15:46:04Z","timestamp":1732031164000},"page":"1-15","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":25,"title":["Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos"],"prefix":"10.1145","volume":"43","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8121-0015","authenticated-orcid":false,"given":"Yuheng","family":"Jiang","sequence":"first","affiliation":[{"name":"ShanghaiTech University, Shanghai, China"},{"name":"NeuDim Digital Technology, Shanghai, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-8933-0385","authenticated-orcid":false,"given":"Zhehao","family":"Shen","sequence":"additional","affiliation":[{"name":"ShanghaiTech University, Shanghai, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-1831-7652","authenticated-orcid":false,"given":"Yu","family":"Hong","sequence":"additional","affiliation":[{"name":"ShanghaiTech University, Shanghai, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-2122-7001","authenticated-orcid":false,"given":"Chengcheng","family":"Guo","sequence":"additional","affiliation":[{"name":"ShanghaiTech University, Shanghai, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-0131-114X","authenticated-orcid":false,"given":"Yize","family":"Wu","sequence":"additional","affiliation":[{"name":"ShanghaiTech University, Shanghai, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0594-7549","authenticated-orcid":false,"given":"Yingliang","family":"Zhang","sequence":"additional","affiliation":[{"name":"DGene Inc., Shanghai, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9198-6853","authenticated-orcid":false,"given":"Jingyi","family":"Yu","sequence":"additional","affiliation":[{"name":"ShanghaiTech University, Shanghai, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8807-7787","authenticated-orcid":false,"given":"Lan","family":"Xu","sequence":"additional","affiliation":[{"name":"ShanghaiTech University, Shanghai, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,11,19]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1111\/1467-8659.00433"},{"key":"e_1_2_1_2_1","volume-title":"b0nes164","author":"JasonDeacutis","year":"2024","unstructured":"hybridherbst pastasfuture JasonDeacutis aras p, b0nes164. 2024. UnityGaussianSplatting. https:\/\/github.com\/aras-p\/UnityGaussianSplatting."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-19824-3_20"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01139"},{"key":"e_1_2_1_5_1","volume-title":"ACM SIGGRAPH 2024 Conference Papers. 1--9.","author":"Chen Yufan","year":"2024","unstructured":"Yufan Chen, Lizhen Wang, Qijing Li, Hongjiang Xiao, Shengping Zhang, Hongxun Yao, and Yebin Liu. 2024a. Monogaussianavatar: Monocular gaussian point-based head avatar. In ACM SIGGRAPH 2024 Conference Papers. 1--9."},{"key":"e_1_2_1_6_1","volume-title":"HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression. arXiv preprint arXiv:2403.14530","author":"Chen Yihang","year":"2024","unstructured":"Yihang Chen, Qianyi Wu, Jianfei Cai, Mehrtash Harandi, and Weiyao Lin. 2024b. HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression. arXiv preprint arXiv:2403.14530 (2024)."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2766945"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3130800.3130801"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897824.2925969"},{"key":"e_1_2_1_10_1","volume-title":"Lightgaussian: Unbounded 3d gaussian compression with 15x reduction and 200+ fps. arXiv preprint arXiv:2311.17245","author":"Fan Zhiwen","year":"2023","unstructured":"Zhiwen Fan, Kevin Wang, Kairun Wen, Zehao Zhu, Dejia Xu, and Zhangyang Wang. 2023. Lightgaussian: Unbounded 3d gaussian compression with 15x reduction and 200+ fps. arXiv preprint arXiv:2311.17245 (2023)."},{"key":"e_1_2_1_11_1","volume-title":"Eagles: Efficient accelerated 3d gaussians with lightweight encodings. arXiv preprint arXiv:2312.04564","author":"Girish Sharath","year":"2023","unstructured":"Sharath Girish, Kamal Gupta, and Abhinav Shrivastava. 2023. Eagles: Efficient accelerated 3d gaussians with lightweight encodings. arXiv preprint arXiv:2312.04564 (2023)."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1006\/cviu.2002.0987"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3606927"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3450626.3459749"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3450626.3459749"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3311970"},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 20418--20431","author":"Hu Shoukang","year":"2024","unstructured":"Shoukang Hu, Tao Hu, and Ziwei Liu. 2024. Gauhuman: Articulated gaussian splatting from monocular human videos. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 20418--20431."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV51070.2023.01811"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3592415"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46484-8_22"},{"key":"e_1_2_1_22_1","volume-title":"Siddharth Choudhary, Brandon Smith, Pratik Chaudhari, and James Gee.","author":"Jena Rohit","year":"2023","unstructured":"Rohit Jena, Ganesh Subramanian Iyer, Siddharth Choudhary, Brandon Smith, Pratik Chaudhari, and James Gee. 2023. SplatArmor: Articulated Gaussian splatting for animatable humans from monocular RGB videos. arXiv preprint arXiv:2311.10812 (2023)."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52729.2023.01623"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00606"},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 19734--19745","author":"Jiang Yuheng","year":"2024","unstructured":"Yuheng Jiang, Zhehao Shen, Penghao Wang, Zhuo Su, Yu Hong, Yingliang Zhang, Jingyi Yu, and Lan Xu. 2024. Hifi4g: High-fidelity human performance rendering via compact gaussian splatting. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 19734--19745."},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 595--605","author":"Jiang Yuheng","year":"2023","unstructured":"Yuheng Jiang, Kaixin Yao, Zhuo Su, Zhehao Shen, Haimin Luo, and Lan Xu. 2023b. Instant-NVR: Instant Neural Volumetric Rendering for Human-Object Interactions From Monocular RGBD Stream. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 595--605."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3592433"},{"key":"e_1_2_1_28_1","volume-title":"Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 505--515","author":"Kocabas Muhammed","year":"2024","unstructured":"Muhammed Kocabas, Jen-Hao Rick Chang, James Gabriel, Oncel Tuzel, and Anurag Ranjan. 2024. Hugs: Human gaussian splats. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 505--515."},{"key":"e_1_2_1_29_1","volume-title":"Deliffas: Deformable light fields for fast avatar synthesis. Advances in Neural Information Processing Systems 36","author":"Kwon Youngjoong","year":"2024","unstructured":"Youngjoong Kwon, Lingjie Liu, Henry Fuchs, Marc Habermann, and Christian Theobalt. 2024. Deliffas: Deformable light fields for fast avatar synthesis. Advances in Neural Information Processing Systems 36 (2024)."},{"key":"e_1_2_1_30_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 21719--21728","author":"Lee Joo Chan","year":"2024","unstructured":"Joo Chan Lee, Daniel Rho, Xiangyu Sun, Jong Hwan Ko, and Eunbyung Park. 2024. Compact 3D Gaussian Representation for Radiance Field. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 21719--21728."},{"key":"e_1_2_1_31_1","volume-title":"Gaussianbody: Clothed human reconstruction via 3d gaussian splatting. arXiv preprint arXiv:2401.09720","author":"Li Mengtian","year":"2024","unstructured":"Mengtian Li, Shengxiang Yao, Zhifeng Xie, Keyu Chen, and Yu-Gang Jiang. 2024b. Gaussianbody: Clothed human reconstruction via 3d gaussian splatting. arXiv preprint arXiv:2401.09720 (2024)."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-19824-3_25"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/3130800.3130813"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DV53792.2021.00047"},{"key":"e_1_2_1_35_1","first-page":"2","article-title":"Toward a practical perceptual video quality metric","volume":"6","author":"Li Zhi","year":"2016","unstructured":"Zhi Li, Anne Aaron, Ioannis Katsavounidis, Anush Moorthy, Megha Manohara, et al. 2016. Toward a practical perceptual video quality metric. The Netflix Tech Blog 6, 2 (2016), 2.","journal-title":"The Netflix Tech Blog"},{"key":"e_1_2_1_36_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8508--8520","author":"Li Zhan","year":"2024","unstructured":"Zhan Li, Zhang Chen, Zhong Li, and Yi Xu. 2024a. Spacetime gaussian feature splatting for real-time dynamic view synthesis. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8508--8520."},{"key":"e_1_2_1_37_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 19711--19722","author":"Li Zhe","year":"2024","unstructured":"Zhe Li, Zerong Zheng, Lizhen Wang, and Yebin Liu. 2024c. Animatable gaussians: Learning pose-dependent gaussian maps for high-fidelity human avatar modeling. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 19711--19722."},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3610548.3618142"},{"key":"e_1_2_1_39_1","volume-title":"Efficient Neural Radiance Fields for Interactive Free-viewpoint Video. In SIGGRAPH Asia Conference Proceedings.","author":"Lin Haotong","year":"2022","unstructured":"Haotong Lin, Sida Peng, Zhen Xu, Yunzhi Yan, Qing Shuai, Hujun Bao, and Xiaowei Zhou. 2022. Efficient Neural Radiance Fields for Interactive Free-viewpoint Video. In SIGGRAPH Asia Conference Proceedings."},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00865"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3478513.3480528"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2020.2996594"},{"key":"e_1_2_1_43_1","volume-title":"CompGS: Efficient 3D Scene Representation via Compressed Gaussian Splatting. arXiv preprint arXiv:2404.09458","author":"Liu Xiangrui","year":"2024","unstructured":"Xiangrui Liu, Xinju Wu, Pingping Zhang, Shiqi Wang, Zhu Li, and Sam Kwong. 2024. CompGS: Efficient 3D Scene Representation via Compressed Gaussian Splatting. arXiv preprint arXiv:2404.09458 (2024)."},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/2816795.2818013"},{"key":"e_1_2_1_45_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 20654--20664","author":"Lu Tao","year":"2024","unstructured":"Tao Lu, Mulin Yu, Linning Xu, Yuanbo Xiangli, Limin Wang, Dahua Lin, and Bo Dai. 2024. Scaffold-gs: Structured 3d gaussians for view-adaptive rendering. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 20654--20664."},{"key":"e_1_2_1_46_1","doi-asserted-by":"crossref","unstructured":"Jonathon Luiten Georgios Kopanas Bastian Leibe and Deva Ramanan. 2024. Dynamic 3D Gaussians: Tracking by Persistent Dynamic View Synthesis. In 3DV.","DOI":"10.1109\/3DV62453.2024.00044"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1002\/cav.1522"},{"key":"e_1_2_1_48_1","volume-title":"Relightable Neural Actor with Intrinsic Decomposition and Pose Control. arXiv preprint arXiv:2312.11587","author":"Luvizon Diogo","year":"2023","unstructured":"Diogo Luvizon, Vladislav Golyanik, Adam Kortylewski, Marc Habermann, and Christian Theobalt. 2023. Relightable Neural Actor with Intrinsic Decomposition and Pose Control. arXiv preprint arXiv:2312.11587 (2023)."},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1002\/cav.319"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58452-8_24"},{"key":"e_1_2_1_51_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 788--798","author":"Moreau Arthur","year":"2024","unstructured":"Arthur Moreau, Jifei Song, Helisa Dhamo, Richard Shaw, Yiren Zhou, and Eduardo P\u00e9rez-Pellitero. 2024. Human gaussian splatting: Real-time rendering of animatable avatars. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 788--798."},{"key":"e_1_2_1_52_1","volume-title":"Compact 3D Scene Representation via Self-Organizing Gaussian Grids. arXiv preprint arXiv:2312.13299","author":"Morgenstern Wieland","year":"2023","unstructured":"Wieland Morgenstern, Florian Barthel, Anna Hilsmann, and Peter Eisert. 2023. Compact 3D Scene Representation via Self-Organizing Gaussian Grids. arXiv preprint arXiv:2312.13299 (2023)."},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/3528223.3530127"},{"key":"e_1_2_1_54_1","volume-title":"Soroush Abbasi Koohpayegani, and Hamed Pirsiavash.","author":"Navaneet KL","year":"2023","unstructured":"KL Navaneet, Kossar Pourahmadi Meibodi, Soroush Abbasi Koohpayegani, and Hamed Pirsiavash. 2023. Compact3D: Compressing Gaussian Splat Radiance Field Models with Vector Quantization. arXiv preprint arXiv:2311.18159 (2023)."},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298631"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.00985"},{"key":"e_1_2_1_57_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 10349--10358","author":"Niedermayr Simon","year":"2024","unstructured":"Simon Niedermayr, Josef Stumpfegger, and R\u00fcdiger Westermann. 2024b. Compressed 3d gaussian splatting for accelerated novel view synthesis. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 10349--10358."},{"key":"e_1_2_1_58_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1165--1175","author":"Pang Haokai","year":"2024","unstructured":"Haokai Pang, Heming Zhu, Adam Kortylewski, Christian Theobalt, and Marc Habermann. 2024. Ash: Animatable gaussian splats for efficient and photoreal human rendering. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1165--1175."},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/3651282"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01018"},{"key":"e_1_2_1_61_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 20299--20309","author":"Qian Shenhan","year":"2024","unstructured":"Shenhan Qian, Tobias Kirschstein, Liam Schoneveld, Davide Davoli, Simon Giebenhain, and Matthias Nie\u00dfner. 2024a. Gaussianavatars: Photorealistic head avatars with rigged 3d gaussians. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 20299--20309."},{"key":"e_1_2_1_62_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 5020--5030","author":"Qian Zhiyin","year":"2024","unstructured":"Zhiyin Qian, Shaofei Wang, Marko Mihajlovic, Andreas Geiger, and Siyu Tang. 2024b. 3dgs-avatar: Animatable avatars via deformable 3d gaussian splatting. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 5020--5030."},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/3592426"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1145\/3550469.3555409"},{"key":"e_1_2_1_65_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 16911--16921","author":"Shen Kaiyue","year":"2023","unstructured":"Kaiyue Shen, Chen Guo, Manuel Kaufmann, Juan Jose Zarate, Julien Valentin, Jie Song, and Otmar Hilliges. 2023. X-avatar: Expressive human avatars. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 16911--16921."},{"key":"e_1_2_1_66_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1206--1215","author":"Shetty Ashwath","year":"2024","unstructured":"Ashwath Shetty, Marc Habermann, Guoxing Sun, Diogo Luvizon, Vladislav Golyanik, and Christian Theobalt. 2024. Holoported Characters: Real-time Free-viewpoint Rendering of Humans from Sparse RGB Cameras. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1206--1215."},{"key":"e_1_2_1_67_1","unstructured":"cedric-chedaleux shg8. 2024. 3DGS.cpp. https:\/\/github.com\/shg8\/3DGS.cpp."},{"key":"e_1_2_1_68_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1386--1395","author":"Slavcheva Miroslava","year":"2017","unstructured":"Miroslava Slavcheva, Maximilian Baust, Daniel Cremers, and Slobodan Ilic. 2017. Killing-fusion: Non-rigid 3d reconstruction without correspondences. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1386--1395."},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00280"},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2023.3247082"},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58548-8_15"},{"key":"e_1_2_1_72_1","volume-title":"Robustfusion: Robust volumetric performance reconstruction under human-object interactions from monocular rgbd stream","author":"Su Zhuo","year":"2022","unstructured":"Zhuo Su, Lan Xu, Dawei Zhong, Zhong Li, Fan Deng, Shuxue Quan, and Lu Fang. 2022. Robustfusion: Robust volumetric performance reconstruction under human-object interactions from monocular rgbd stream. IEEE Transactions on Pattern Analysis and Machine Intelligence (2022)."},{"key":"e_1_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1145\/1276377.1276478"},{"key":"e_1_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475442"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00616"},{"key":"e_1_2_1_76_1","first-page":"14798","article-title":"Compressible-composable nerf via rank-residual decomposition","volume":"35","author":"Tang Jiaxiang","year":"2022","unstructured":"Jiaxiang Tang, Xiaokang Chen, Jingbo Wang, and Gang Zeng. 2022. Compressible-composable nerf via rank-residual decomposition. Advances in Neural Information Processing Systems 35 (2022), 14798--14809.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_2_1_77_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01272"},{"key":"e_1_2_1_78_1","volume-title":"Coddyac: Connectivity driven dynamic mesh compression. In 2007 3DTV Conference","author":"Vasa Libor","year":"2007","unstructured":"Libor Vasa and V\u00e1clav Skala. 2007. Coddyac: Connectivity driven dynamic mesh compression. In 2007 3DTV Conference. IEEE, 1--4."},{"key":"e_1_2_1_79_1","doi-asserted-by":"crossref","unstructured":"Daniel Vlasic Pieter Peers Ilya Baran Paul Debevec Jovan Popovi\u0107 Szymon Rusinkiewicz and Wojciech Matusik. 2009. Dynamic shape capture using multi-view photometric stereo. In ACM SIGGRAPH Asia 2009 papers. 1--11.","DOI":"10.1145\/1661412.1618520"},{"key":"e_1_2_1_80_1","volume-title":"End-to-End Rate-Distortion Optimized 3D Gaussian Representation. arXiv preprint arXiv:2406.01597","author":"Wang Henan","year":"2024","unstructured":"Henan Wang, Hanxin Zhu, Tianyu He, Runsen Feng, Jiajun Deng, Jiang Bian, and Zhibo Chen. 2024b. End-to-End Rate-Distortion Optimized 3D Gaussian Representation. arXiv preprint arXiv:2406.01597 (2024)."},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52729.2023.00016"},{"key":"e_1_2_1_82_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 470--481","author":"Wang Liao","year":"2024","unstructured":"Liao Wang, Kaixin Yao, Chengcheng Guo, Zhirui Zhang, Qiang Hu, Jingyi Yu, Lan Xu, and Minye Wu. 2024a. VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 470--481."},{"key":"e_1_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01316"},{"key":"e_1_2_1_84_1","volume-title":"Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Wang Shaofei","year":"2021","unstructured":"Shaofei Wang, Andreas Geiger, and Siyu Tang. 2021. Locally Aware Piecewise Transformation Fields for 3D Human Mesh Registration. In Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_2_1_85_1","volume-title":"ARAH: Animatable","author":"Wang Shaofei","year":"2022","unstructured":"Shaofei Wang, Katja Schwarz, Andreas Geiger, and Siyu Tang. 2022a. ARAH: Animatable Volume Rendering of Articulated Human SDFs. In European Conference on Computer Vision."},{"key":"e_1_2_1_86_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV51070.2023.00305"},{"key":"e_1_2_1_87_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01573"},{"key":"e_1_2_1_88_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 20310--20320","author":"Wu Guanjun","year":"2024","unstructured":"Guanjun Wu, Taoran Yi, Jiemin Fang, Lingxi Xie, Xiaopeng Zhang, Wei Wei, Wenyu Liu, Qi Tian, and Xinggang Wang. 2024b. 4d gaussian splatting for real-time dynamic scene rendering. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 20310--20320."},{"key":"e_1_2_1_89_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 6487--6496","author":"Wu Minye","year":"2024","unstructured":"Minye Wu, Zehao Wang, Georgios Kouros, and Tinne Tuytelaars. 2024a. TeTriRF: Temporal Tri-Plane Radiance Fields for Efficient Free-Viewpoint Video. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 6487--6496."},{"key":"e_1_2_1_90_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00930"},{"key":"e_1_2_1_91_1","doi-asserted-by":"publisher","DOI":"10.1145\/3550454.3555456"},{"key":"e_1_2_1_92_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DV50981.2020.00042"},{"key":"e_1_2_1_93_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 4389--4398","author":"Xie Tianyi","year":"2024","unstructured":"Tianyi Xie, Zeshun Zong, Yuxing Qiu, Xuan Li, Yutao Feng, Yin Yang, and Chenfanfu Jiang. 2024. Physgaussian: Physics-integrated 3d gaussians for generative dynamics. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 4389--4398."},{"key":"e_1_2_1_94_1","volume-title":"Flyfusion: Realtime dynamic scene reconstruction using a flying depth camera","author":"Xu Lan","year":"2019","unstructured":"Lan Xu, Wei Cheng, Kaiwen Guo, Lei Han, Yebin Liu, and Lu Fang. 2019a. Flyfusion: Realtime dynamic scene reconstruction using a flying depth camera. IEEE transactions on visualization and computer graphics 27, 1 (2019), 68--82."},{"key":"e_1_2_1_95_1","volume-title":"UnstructuredFusion: realtime 4D geometry and texture reconstruction using commercial RGBD cameras","author":"Xu Lan","year":"2019","unstructured":"Lan Xu, Zhuo Su, Lei Han, Tao Yu, Yebin Liu, and Lu Fang. 2019b. UnstructuredFusion: realtime 4D geometry and texture reconstruction using commercial RGBD cameras. IEEE transactions on pattern analysis and machine intelligence 42, 10 (2019), 2508--2522."},{"key":"e_1_2_1_96_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 20029--20040","author":"Xu Zhen","year":"2024","unstructured":"Zhen Xu, Sida Peng, Haotong Lin, Guangzhao He, Jiaming Sun, Yujun Shen, Hujun Bao, and Xiaowei Zhou. 2024. 4k4d: Real-time 4d view synthesis at 4k resolution. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 20029--20040."},{"key":"e_1_2_1_97_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 20331--20341","author":"Yang Ziyi","year":"2024","unstructured":"Ziyi Yang, Xinyu Gao, Wen Zhou, Shaohui Jiao, Yuqing Zhang, and Xiaogang Jin. 2024. Deformable 3d gaussians for high-fidelity monocular dynamic scene reconstruction. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 20331--20341."},{"key":"e_1_2_1_98_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00570"},{"key":"e_1_2_1_99_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00569"},{"key":"e_1_2_1_100_1","volume-title":"DoubleFusion: Real-time Capture of Human Performances with Inner Body Shapes from a Single Depth Sensor. Transactions on Pattern Analysis and Machine Intelligence (TPAMI)","author":"Yu Tao","year":"2019","unstructured":"Tao Yu, Zerong Zheng, Kaiwen Guo, Jianhui Zhao, Qionghai Dai, Hao Li, Gerard Pons-Moll, and Yebin Liu. 2019. DoubleFusion: Real-time Capture of Human Performances with Inner Body Shapes from a Single Depth Sensor. Transactions on Pattern Analysis and Machine Intelligence (TPAMI) (2019)."},{"key":"e_1_2_1_101_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.","author":"Zhang Hongwen","year":"2023","unstructured":"Hongwen Zhang, Siyou Lin, Ruizhi Shao, Yuxiang Zhang, Zerong Zheng, Han Huang, Yandong Guo, and Yebin Liu. 2023. CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition."},{"key":"e_1_2_1_102_1","volume-title":"NeuVV: Neural Volumetric Videos with Immersive Rendering and Editing. arXiv preprint arXiv:2202.06088","author":"Zhang Jiakai","year":"2022","unstructured":"Jiakai Zhang, Liao Wang, Xinhang Liu, Fuqiang Zhao, Minzhang Li, Haizhao Dai, Boyuan Zhang, Wei Yang, Lan Xu, and Jingyi Yu. 2022. NeuVV: Neural Volumetric Videos with Immersive Rendering and Editing. arXiv preprint arXiv:2202.06088 (2022)."},{"key":"e_1_2_1_103_1","doi-asserted-by":"publisher","DOI":"10.1145\/3550454.3555451"},{"key":"e_1_2_1_104_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00759"},{"key":"e_1_2_1_105_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 19680--19690","author":"Zheng Shunyuan","year":"2024","unstructured":"Shunyuan Zheng, Boyao Zhou, Ruizhi Shao, Boning Liu, Shengping Zhang, Liqiang Nie, and Yebin Liu. 2024. Gps-gaussian: Generalizable pixel-wise 3d gaussian splatting for real-time human novel view synthesis. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 19680--19690."},{"key":"e_1_2_1_106_1","volume-title":"TriHuman: A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis. arXiv preprint arXiv:2312.05161","author":"Zhu Heming","year":"2023","unstructured":"Heming Zhu, Fangneng Zhan, Christian Theobalt, and Marc Habermann. 2023. TriHuman: A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis. arXiv preprint arXiv:2312.05161 (2023)."},{"key":"e_1_2_1_107_1","volume-title":"Drivable 3d gaussian avatars. arXiv preprint arXiv:2311.08581","author":"Zielonka Wojciech","year":"2023","unstructured":"Wojciech Zielonka, Timur Bagautdinov, Shunsuke Saito, Michael Zollh\u00f6fer, Justus Thies, and Javier Romero. 2023. Drivable 3d gaussian avatars. arXiv preprint arXiv:2311.08581 (2023)."}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3687926","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3687926","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T01:09:57Z","timestamp":1750295397000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3687926"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,11,19]]},"references-count":106,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2024,12,19]]}},"alternative-id":["10.1145\/3687926"],"URL":"https:\/\/doi.org\/10.1145\/3687926","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,11,19]]},"assertion":[{"value":"2024-11-19","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}