{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T20:26:25Z","timestamp":1776111985886,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":99,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,12,15]]},"DOI":"10.1145\/3757377.3763845","type":"proceedings-article","created":{"date-parts":[[2025,12,8]],"date-time":"2025-12-08T16:30:41Z","timestamp":1765211441000},"page":"1-13","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["FreeArt3D: Training-Free Articulated Object Generation using 3D Diffusion"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0004-4544-9378","authenticated-orcid":false,"given":"Chuhao","family":"Chen","sequence":"first","affiliation":[{"name":"University of California San Diego, La Jolla, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2882-2383","authenticated-orcid":false,"given":"Isabella","family":"Liu","sequence":"additional","affiliation":[{"name":"University of California San Diego, La Jolla, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9466-6836","authenticated-orcid":false,"given":"Xinyue","family":"Wei","sequence":"additional","affiliation":[{"name":"University of California San Diego, La Jolla, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1796-2682","authenticated-orcid":false,"given":"Hao","family":"Su","sequence":"additional","affiliation":[{"name":"University of California San Diego, La Jolla, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6413-023X","authenticated-orcid":false,"given":"Minghua","family":"Liu","sequence":"additional","affiliation":[{"name":"Hillbot, Sunnyvale, USA"}]}],"member":"320","published-online":{"date-parts":[[2025,12,14]]},"reference":[{"key":"e_1_3_3_2_2_1","volume-title":"Proceedings of the 3rd 
Conference on Robot Learning","author":"Abbatematteo Ben","year":"2019","unstructured":"Ben Abbatematteo, Stefanie Tellex, and George Konidaris. 2019. Learning to generalize kinematic models to novel objects. In Proceedings of the 3rd Conference on Robot Learning."},{"key":"e_1_3_3_2_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.00764"},{"key":"e_1_3_3_2_4_1","first-page":"72","volume-title":"European Conference on Computer Vision","author":"Che Yuchen","year":"2024","unstructured":"Yuchen Che, Ryo Furukawa, and Asako Kanezaki. 2024. OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation. In European Conference on Computer Vision. Springer, 72\u201388."},{"key":"e_1_3_3_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV51070.2023.02033"},{"key":"e_1_3_3_2_6_1","unstructured":"Yiwen Chen Tong He Di Huang Weicai Ye Sijin Chen Jiaxiang Tang Xin Chen Zhongang Cai Lei Yang Gang Yu et\u00a0al. 2024a. Meshanything: Artist-created mesh generation with autoregressive transformers. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2406.10163 (2024)."},{"key":"e_1_3_3_2_7_1","doi-asserted-by":"crossref","unstructured":"Zoey Chen Aaron Walsman Marius Memmel Kaichun Mo Alex Fang Karthikeya Vemuri Alan Wu Dieter Fox and Abhishek Gupta. 2024b. Urdformer: A pipeline for constructing articulated simulation environments from real-world images. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2405.11656 (2024).","DOI":"10.15607\/RSS.2024.XX.124"},{"key":"e_1_3_3_2_8_1","unstructured":"Tianyuan Dai Josiah Wong Yunfan Jiang Chen Wang Cem Gokmen Ruohan Zhang Jiajun Wu and Li Fei-Fei. 2024. Automated creation of digital cousins for robust policy learning. 
arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2410.07408 (2024)."},{"key":"e_1_3_3_2_9_1","doi-asserted-by":"crossref","unstructured":"Matt Deitke Ruoshi Liu Matthew Wallingford Huong Ngo Oscar Michel Aditya Kusupati Alan Fan Christian Laforte Vikram Voleti Samir\u00a0Yitzhak Gadre et\u00a0al. 2023a. Objaverse-xl: A universe of 10m+ 3d objects. Advances in Neural Information Processing Systems 36 (2023) 35799\u201335813.","DOI":"10.1109\/CVPR52729.2023.01263"},{"key":"e_1_3_3_2_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52729.2023.01263"},{"key":"e_1_3_3_2_11_1","unstructured":"Jianning Deng Kartic Subr and Hakan Bilen. 2024. Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesis. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2406.16623 (2024)."},{"key":"e_1_3_3_2_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.264"},{"key":"e_1_3_3_2_13_1","unstructured":"Lian Fu Ryoichi Ishikawa Yoshihiro Sato and Takeshi Oishi. 2024. CAPT: Category-level Articulation Estimation from a Single Point Cloud Using Transformer. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2402.17360 (2024)."},{"key":"e_1_3_3_2_14_1","unstructured":"Akshay Gadi\u00a0Patil Yiming Qian Shan Yang Brian Jackson Eric Bennett and Hao Zhang. 2023. RoSI: Recovering 3D Shape Interiors from Few Articulation Images. arXiv e-prints (2023) arXiv\u20132304."},{"key":"e_1_3_3_2_15_1","unstructured":"Daoyi Gao Yawar Siddiqui Lei Li and Angela Dai. 2024. MeshArt: Generating Articulated Meshes with Structure-guided Transformers. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2412.11596 (2024)."},{"key":"e_1_3_3_2_16_1","unstructured":"Mingju Gao Yike Pan Huan-ang Gao Zongzheng Zhang Wenyi Li Hao Dong Hao Tang Li Yi and Hao Zhao. 2025. PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model. 
arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2503.19913 (2025)."},{"key":"e_1_3_3_2_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52729.2023.02031"},{"key":"e_1_3_3_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS47612.2022.9982029"},{"key":"e_1_3_3_2_19_1","unstructured":"Yicong Hong Kai Zhang Jiuxiang Gu Sai Bi Yang Zhou Difan Liu Feng Liu Kalyan Sunkavalli Trung Bui and Hao Tan. 2023. Lrm: Large reconstruction model for single image to 3d. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2311.04400 (2023)."},{"key":"e_1_3_3_2_20_1","doi-asserted-by":"crossref","unstructured":"Ruizhen Hu Wenchao Li Oliver Van\u00a0Kaick Ariel Shamir Hao Zhang and Hui Huang. 2017. Learning to predict part mobility from a single static snapshot. ACM Transactions On Graphics (TOG) 36 6 (2017) 1\u201313.","DOI":"10.1145\/3130800.3130811"},{"key":"e_1_3_3_2_21_1","unstructured":"Siyuan Huang Haonan Chang Yuhan Liu Yimeng Zhu Hao Dong Peng Gao Abdeslam Boularias and Hongsheng Li. 2024. A3vlm: Actionable articulation-aware vision language model. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2406.07549 (2024)."},{"key":"e_1_3_3_2_22_1","first-page":"1611","volume-title":"Conference on Robot Learning","author":"Jain Ajinkya","year":"2022","unstructured":"Ajinkya Jain, Stephen Giguere, Rudolf Lioutikov, and Scott Niekum. 2022a. Distributional depth-based estimation of object articulation models. In Conference on Robot Learning. PMLR, 1611\u20131621."},{"key":"e_1_3_3_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA48506.2021.9561132"},{"key":"e_1_3_3_2_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00094"},{"key":"e_1_3_3_2_25_1","doi-asserted-by":"crossref","unstructured":"Hanxiao Jiang Yongsen Mao Manolis Savva and Angel\u00a0X Chang. 2022b. OPD: Single-view 3D Openable Part Detection. 
arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2203.16421 (2022).","DOI":"10.1007\/978-3-031-19842-7_24"},{"key":"e_1_3_3_2_26_1","unstructured":"Yanqin Jiang Li Zhang Jin Gao Weimin Hu and Yao Yao. 2023. Consistent4d: Consistent 360 {\\ deg} dynamic object generation from monocular video. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2311.02848 (2023)."},{"key":"e_1_3_3_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00553"},{"key":"e_1_3_3_2_28_1","unstructured":"Yuki Kawana and Tatsuya Harada. 2023. Detection based part-level articulated object reconstruction from single RGBD image. Advances in Neural Information Processing Systems 36 (2023) 18444\u201318473."},{"key":"e_1_3_3_2_29_1","unstructured":"Diederik\u00a0P Kingma. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/1412.6980 (2014)."},{"key":"e_1_3_3_2_30_1","unstructured":"Long Le Jason Xie William Liang Hung-Ju Wang Yue Yang Yecheng\u00a0Jason Ma Kyle Vedder Arjun Krishna Dinesh Jayaraman and Eric Eaton. 2024. Articulate-anything: Automatic modeling of articulated objects via a vision-language foundation model. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2410.13882 (2024)."},{"key":"e_1_3_3_2_31_1","unstructured":"Jiahui Lei Congyue Deng Bokui Shen Leonidas Guibas and Kostas Daniilidis. 2023. Nap: Neural 3d articulation prior. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2305.16315 (2023)."},{"key":"e_1_3_3_2_32_1","unstructured":"Jiahao Li Hao Tan Kai Zhang Zexiang Xu Fujun Luan Yinghao Xu Yicong Hong Kalyan Sunkavalli Greg Shakhnarovich and Sai Bi. 2023. Instant3d: Fast text-to-3d with sparse-view generation and large reconstruction model. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2311.06214 (2023)."},{"key":"e_1_3_3_2_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00376"},{"key":"e_1_3_3_2_34_1","doi-asserted-by":"crossref","unstructured":"Zhiqi Li Yiming Chen and Peidong Liu. 2024. 
Dreammesh4d: Video-to-4d generation with sparse-controlled gaussian-mesh hybrid representation. Advances in Neural Information Processing Systems 37 (2024) 21377\u201321400.","DOI":"10.52202\/079017-0674"},{"key":"e_1_3_3_2_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52729.2023.00037"},{"key":"e_1_3_3_2_36_1","unstructured":"Jiayi Liu Denys Iliash Angel\u00a0X Chang Manolis Savva and Ali Mahdavi-Amiri. 2024a. SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2410.16499 (2024)."},{"key":"e_1_3_3_2_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV51070.2023.00039"},{"key":"e_1_3_3_2_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.01693"},{"key":"e_1_3_3_2_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3581783.3611852"},{"key":"e_1_3_3_2_40_1","doi-asserted-by":"crossref","unstructured":"Liu Liu Han Xue Wenqiang Xu Haoyuan Fu and Cewu Lu. 2022. Toward real-world category-level articulation pose estimation. IEEE Transactions on Image Processing 31 (2022) 1072\u20131083.","DOI":"10.1109\/TIP.2021.3138644"},{"key":"e_1_3_3_2_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.00960"},{"key":"e_1_3_3_2_42_1","unstructured":"Minghua Liu Chao Xu Haian Jin Linghao Chen Mukund Varma\u00a0T Zexiang Xu and Hao Su. 2023f. One-2-3-45: Any single image to 3d mesh in 45 seconds without per-shape optimization. Advances in Neural Information Processing Systems 36 (2023) 22226\u201322246."},{"key":"e_1_3_3_2_43_1","unstructured":"Minghua Liu Chong Zeng Xinyue Wei Ruoxi Shi Linghao Chen Chao Xu Mengqi Zhang Zhaoning Wang Xiaoshuai Zhang Isabella Liu et\u00a0al. 2024d. Meshformer: High-quality mesh generation with 3d-guided reconstruction model. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2408.10198 (2024)."},{"key":"e_1_3_3_2_44_1","unstructured":"Qihao Liu Weichao Qiu Weiyao Wang Gregory\u00a0D Hager and Alan\u00a0L Yuille. 2020. 
Nothing but geometric constraints: A model-free method for articulated object pose estimation. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2012.00088 (2020)."},{"key":"e_1_3_3_2_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV51070.2023.00853"},{"key":"e_1_3_3_2_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52729.2023.02025"},{"key":"e_1_3_3_2_47_1","unstructured":"Yuan Liu Cheng Lin Zijiao Zeng Xiaoxiao Long Lingjie Liu Taku Komura and Wenping Wang. 2023c. Syncdreamer: Generating multiview-consistent images from a single-view image. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2309.03453 (2023)."},{"key":"e_1_3_3_2_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.00951"},{"key":"e_1_3_3_2_49_1","unstructured":"Rundong Luo Haoran Geng Congyue Deng Puhao Li Zan Wang Baoxiong Jia Leonidas Guibas and Siyuan Huang. 2024. Physpart: Physically plausible part completion for interactable objects. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2408.13724 (2024)."},{"key":"e_1_3_3_2_50_1","unstructured":"Zhao Mandi Yijia Weng Dominik Bauer and Shuran Song. 2024. Real2code: Reconstruct articulated objects via code generation. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2406.08474 (2024)."},{"key":"e_1_3_3_2_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01276"},{"key":"e_1_3_3_2_52_1","volume-title":"tiny-cuda-nn","author":"M\u00fcller Thomas","year":"2021","unstructured":"Thomas M\u00fcller. 2021. tiny-cuda-nn. https:\/\/github.com\/NVlabs\/tiny-cuda-nn"},{"key":"e_1_3_3_2_53_1","unstructured":"Ben Poole Ajay Jain Jonathan\u00a0T Barron and Ben Mildenhall. 2022. Dreamfusion: Text-to-3d using 2d diffusion. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2209.14988 (2022)."},{"key":"e_1_3_3_2_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00165"},{"key":"e_1_3_3_2_55_1","unstructured":"Xiaowen Qiu Jincheng Yang Yian Wang Zhehuan Chen Yufei Wang Tsun-Hsuan Wang Zhou Xian and Chuang Gan. 2025. 
Articulate anymesh: Open-vocabulary 3d articulated objects modeling. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2502.02590 (2025)."},{"key":"e_1_3_3_2_56_1","first-page":"8748","volume-title":"International conference on machine learning","author":"Radford Alec","year":"2021","unstructured":"Alec Radford, Jong\u00a0Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et\u00a0al. 2021. Learning transferable visual models from natural language supervision. In International conference on machine learning. PMLR, 8748\u20138763."},{"key":"e_1_3_3_2_57_1","unstructured":"Jiawei Ren Liang Pan Jiaxiang Tang Chi Zhang Ang Cao Gang Zeng and Ziwei Liu. 2023. Dreamgaussian4d: Generative 4d gaussian splatting. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2312.17142 (2023)."},{"key":"e_1_3_3_2_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01042"},{"key":"e_1_3_3_2_59_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01805"},{"key":"e_1_3_3_2_60_1","unstructured":"Ruoxi Shi Hansheng Chen Zhuoyang Zhang Minghua Liu Chao Xu Xinyue Wei Linghao Chen Chong Zeng and Hao Su. 2023a. Zero123++: a single image to consistent multi-view diffusion base model. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2310.15110 (2023)."},{"key":"e_1_3_3_2_61_1","unstructured":"Yichun Shi Peng Wang Jianglong Ye Mai Long Kejie Li and Xiao Yang. 2023b. Mvdream: Multi-view diffusion for 3d generation. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2308.16512 (2023)."},{"key":"e_1_3_3_2_62_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.01855"},{"key":"e_1_3_3_2_63_1","unstructured":"Uriel Singer Shelly Sheynin Adam Polyak Oron Ashual Iurii Makarov Filippos Kokkinos Naman Goyal Andrea Vedaldi Devi Parikh Justin Johnson et\u00a0al. 2023. Text-to-4d dynamic scene generation. 
arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2301.11280 (2023)."},{"key":"e_1_3_3_2_64_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.00515"},{"key":"e_1_3_3_2_65_1","unstructured":"Jiayi Su Youhe Feng Zheng Li Jinhua Song Yangfan He Botao Ren and Botian Xu. 2024. Artformer: Controllable generation of diverse 3d articulated objects. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2412.07237 (2024)."},{"key":"e_1_3_3_2_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00881"},{"key":"e_1_3_3_2_67_1","unstructured":"Qi Sun Zhiyang Guo Ziyu Wan Jing\u00a0Nathan Yan Shengming Yin Wengang Zhou Jing Liao and Houqiang Li. 2024. Eg4d: Explicit generation of 4d object without score distillation. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2405.18132 (2024)."},{"key":"e_1_3_3_2_68_1","unstructured":"Xiaohao Sun Hanxiao Jiang Manolis Savva and Angel\u00a0Xuan Chang. 2023. OPDMulti: Openable Part Detection for Multiple Objects. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2303.14087 (2023)."},{"key":"e_1_3_3_2_69_1","first-page":"210","volume-title":"European Conference on Computer Vision","author":"Swaminathan Archana","year":"2024","unstructured":"Archana Swaminathan, Anubhav Gupta, Kamal Gupta, Shishira\u00a0R Maiya, Vatsal Agarwal, and Abhinav Shrivastava. 2024. Leia: Latent view-invariant embeddings for implicit 3d articulation. In European Conference on Computer Vision. Springer, 210\u2013227."},{"key":"e_1_3_3_2_70_1","first-page":"1","volume-title":"European Conference on Computer Vision","author":"Tang Jiaxiang","year":"2024","unstructured":"Jiaxiang Tang, Zhaoxi Chen, Xiaokang Chen, Tengfei Wang, Gang Zeng, and Ziwei Liu. 2024. Lgm: Large multi-view gaussian model for high-resolution 3d content creation. In European Conference on Computer Vision. 
Springer, 1\u201318."},{"key":"e_1_3_3_2_71_1","unstructured":"Dmitry Tochilkin David Pankratz Zexiang Liu Zixuan Huang Adam Letts Yangguang Li Ding Liang Christian Laforte Varun Jampani and Yan-Pei Cao. 2024. Triposr: Fast 3d object reconstruction from a single image. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2403.02151 (2024)."},{"key":"e_1_3_3_2_72_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA46639.2022.9812272"},{"key":"e_1_3_3_2_73_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52729.2023.01214"},{"key":"e_1_3_3_2_74_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA57147.2024.10610171"},{"key":"e_1_3_3_2_75_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01252-6_4"},{"key":"e_1_3_3_2_76_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00908"},{"key":"e_1_3_3_2_77_1","unstructured":"Zhengyi Wang Cheng Lu Yikai Wang Fan Bao Chongxuan Li Hang Su and Jun Zhu. 2023b. Prolificdreamer: High-fidelity and diverse text-to-3d generation with variational score distillation. Advances in Neural Information Processing Systems 36 (2023) 8406\u20138441."},{"key":"e_1_3_3_2_78_1","first-page":"57","volume-title":"European Conference on Computer Vision","author":"Wang Zhengyi","year":"2024","unstructured":"Zhengyi Wang, Yikai Wang, Yifei Chen, Chendong Xiang, Shuo Chen, Dajiang Yu, Chongxuan Li, Hang Su, and Jun Zhu. 2024a. Crm: Single image to 3d textured mesh with convolutional reconstruction model. In European Conference on Computer Vision. Springer, 57\u201374."},{"key":"e_1_3_3_2_79_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01536"},{"key":"e_1_3_3_2_80_1","unstructured":"Xinyue Wei Kai Zhang Sai Bi Hao Tan Fujun Luan Valentin Deschaintre Kalyan Sunkavalli Hao Su and Zexiang Xu. 2024. MeshLRM: Large Reconstruction Model for High-Quality Meshes. 
arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2404.12385 (2024)."},{"key":"e_1_3_3_2_81_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01296"},{"key":"e_1_3_3_2_82_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.00303"},{"key":"e_1_3_3_2_83_1","unstructured":"Di Wu Liu Liu Zhou Linli Anran Huang Liangtu Song Qiaojun Yu Qi Wu and Cewu Lu. 2025. REArtGS: Reconstructing and Generating Articulated Objects via 3D Gaussian Splatting with Geometric and Motion Constraints. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2503.06677 (2025)."},{"key":"e_1_3_3_2_84_1","volume-title":"The Thirty-eighth Annual Conference on Neural Information Processing Systems","author":"Wu Kailu","year":"2024","unstructured":"Kailu Wu, Fangfu Liu, Zhihan Cai, Runjie Yan, Hanyang Wang, Yating Hu, Yueqi Duan, and Kaisheng Ma. 2024b. Unique3d: High-quality and efficient 3d mesh generation from a single image. In The Thirty-eighth Annual Conference on Neural Information Processing Systems."},{"key":"e_1_3_3_2_85_1","unstructured":"Shuang Wu Youtian Lin Feihu Zhang Yifei Zeng Jingxi Xu Philip Torr Xun Cao and Yao Yao. 2024a. Direct3d: Scalable image-to-3d generation via 3d latent diffusion transformer. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2405.14832 (2024)."},{"key":"e_1_3_3_2_86_1","unstructured":"Tianhao Wu Fangcheng Zhong Andrea Tagliasacchi Forrester Cole and Cengiz Oztireli. 2022. D^ 2nerf: Self-supervised decoupling of dynamic and static objects from a monocular video. Advances in neural information processing systems 35 (2022) 32653\u201332666."},{"key":"e_1_3_3_2_87_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01111"},{"key":"e_1_3_3_2_88_1","doi-asserted-by":"crossref","unstructured":"Jianfeng Xiang Zelong Lv Sicheng Xu Yu Deng Ruicheng Wang Bowen Zhang Dong Chen Xin Tong and Jiaolong Yang. 2024. Structured 3d latents for scalable and versatile 3d generation. 
arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2412.01506 (2024).","DOI":"10.1109\/CVPR52734.2025.02000"},{"key":"e_1_3_3_2_89_1","unstructured":"Jiale Xu Weihao Cheng Yiming Gao Xintao Wang Shenghua Gao and Ying Shan. 2024. Instantmesh: Efficient 3d mesh generation from a single image with sparse-view large reconstruction models. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2404.07191 (2024)."},{"key":"e_1_3_3_2_90_1","doi-asserted-by":"publisher","DOI":"10.1145\/3528233.3530742"},{"key":"e_1_3_3_2_91_1","unstructured":"Zihao Yan Ruizhen Hu Xingguang Yan Luanmin Chen Oliver Van\u00a0Kaick Hao Zhang and Hui Huang. 2020. RPM-Net: recurrent prediction of motion and parts from point cloud. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2006.14865 (2020)."},{"key":"e_1_3_3_2_92_1","doi-asserted-by":"crossref","unstructured":"Zihao Yan Fubao Su Mingyang Wang Ruizhen Hu Hao Zhang and Hui Huang. 2023. Interaction-Driven Active 3D Reconstruction with Object Interiors. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2310.14700 (2023).","DOI":"10.1145\/3618327"},{"key":"e_1_3_3_2_93_1","unstructured":"Hongliang Zeng Ping Zhang Chengjiong Wu Jiahua Wang Tingyu Ye and Fang Li. 2024. MARS: multimodal active robotic sensing for articulated characterization. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2407.01191 (2024)."},{"key":"e_1_3_3_2_94_1","unstructured":"Ge Zhang Or Litany Srinath Sridhar and Leonidas Guibas. 2021. Strobenet: Category-level multiview reconstruction of articulated objects. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2105.08016 (2021)."},{"key":"e_1_3_3_2_95_1","doi-asserted-by":"crossref","unstructured":"Haiyu Zhang Xinyuan Chen Yaohui Wang Xihui Liu Yunhong Wang and Yu Qiao. 2024a. 4diffusion: Multi-view video diffusion model for 4d generation. 
Advances in Neural Information Processing Systems 37 (2024) 15272\u201315295.","DOI":"10.52202\/079017-0488"},{"key":"e_1_3_3_2_96_1","doi-asserted-by":"crossref","unstructured":"Longwen Zhang Ziyu Wang Qixuan Zhang Qiwei Qiu Anqi Pang Haoran Jiang Wei Yang Lan Xu and Jingyi Yu. 2024b. CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets. ACM Transactions on Graphics (TOG) 43 4 (2024) 1\u201320.","DOI":"10.1145\/3658146"},{"key":"e_1_3_3_2_97_1","unstructured":"Yuyang Zhao Zhiwen Yan Enze Xie Lanqing Hong Zhenguo Li and Gim\u00a0Hee Lee. 2023. Animate124: Animating one image to 4d dynamic scene. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2311.14603 (2023)."},{"key":"e_1_3_3_2_98_1","unstructured":"Zibo Zhao Zeqiang Lai Qingxiang Lin Yunfei Zhao Haolin Liu Shuhui Yang Yifei Feng Mingxin Yang Sheng Zhang Xianghui Yang et\u00a0al. 2025. Hunyuan3d 2.0: Scaling diffusion models for high resolution textured 3d assets generation. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2501.12202 (2025)."},{"key":"e_1_3_3_2_99_1","unstructured":"Junzhe Zhu and Peiye Zhuang. 2023. HiFA: High-fidelity Text-to-3D Generation with Advanced Diffusion Guidance. 
arxiv:https:\/\/arXiv.org\/abs\/2305.18766\u00a0[cs.CV]"},{"key":"e_1_3_3_2_100_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.00983"}],"event":{"name":"SA Conference Papers '25: SIGGRAPH Asia 2025 Conference Papers","location":"Hong Kong Hong Kong","acronym":"SA Conference Papers '25","sponsor":["SIGGRAPH ACM Special Interest Group on Computer Graphics and Interactive Techniques"]},"container-title":["Proceedings of the SIGGRAPH Asia 2025 Conference Papers"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3757377.3763845","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,9]],"date-time":"2025-12-09T03:28:24Z","timestamp":1765250904000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3757377.3763845"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,12,14]]},"references-count":99,"alternative-id":["10.1145\/3757377.3763845","10.1145\/3757377"],"URL":"https:\/\/doi.org\/10.1145\/3757377.3763845","relation":{},"subject":[],"published":{"date-parts":[[2025,12,14]]},"assertion":[{"value":"2025-12-14","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}