{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T18:57:15Z","timestamp":1774637835877,"version":"3.50.1"},"reference-count":111,"publisher":"Association for Computing Machinery (ACM)","issue":"4","funder":[{"name":"The Research Grants Council of the Hong Kong Special Administrative Region","award":["Project T45-401\/22-N"],"award-info":[{"award-number":["Project T45-401\/22-N"]}]},{"name":"The Research Grants Council of the Hong Kong Special Administrative Region","award":["No. CUHK 14201321"],"award-info":[{"award-number":["No. CUHK 14201321"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2025,8,1]]},"abstract":"<jats:p>Hand shadow art is a captivating art form, creatively using hand shadows to reproduce expressive shapes on the wall. In this work, we study an inverse problem: given a target shape, find the poses of left and right hands that together best produce a shadow resembling the input. This problem is nontrivial, since the design space of 3D hand poses is huge while being restrictive due to anatomical constraints. Also, we need to attend to the input's shape and crucial features, though the input is colorless and textureless. To meet these challenges, we design Hand-Shadow Poser, a three-stage pipeline, to decouple the anatomical constraints (by hand) and semantic constraints (by shadow shape): (i) a generative hand assignment module to explore diverse but reasonable left\/right-hand shape hypotheses; (ii) a generalized hand-shadow alignment module to infer coarse hand poses with a similarity-driven strategy for selecting hypotheses; and (iii) a shadow-feature-aware refinement module to optimize the hand poses for physical plausibility and shadow feature preservation. Further, we design our pipeline to be trainable on generic public hand data, thus avoiding the need for any specialized training dataset. For method validation, we build a benchmark of 210 diverse shadow shapes of varying complexity and a comprehensive set of metrics, including a novel DINOv2-based evaluation metric. Through extensive comparisons with multiple baselines and user studies, our approach is demonstrated to effectively generate bimanual hand poses for a large variety of hand shapes for over 85% of the benchmark cases.<\/jats:p>","DOI":"10.1145\/3730836","type":"journal-article","created":{"date-parts":[[2025,7,27]],"date-time":"2025-07-27T04:02:41Z","timestamp":1753588961000},"page":"1-16","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Hand-Shadow Poser"],"prefix":"10.1145","volume":"44","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3676-5737","authenticated-orcid":false,"given":"Hao","family":"Xu","sequence":"first","affiliation":[{"name":"The Chinese University of Hong Kong, Hong Kong, Hong Kong"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6099-206X","authenticated-orcid":false,"given":"Yinqiao","family":"Wang","sequence":"additional","affiliation":[{"name":"The Chinese University of Hong Kong, Hong Kong, Hong Kong"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2597-0914","authenticated-orcid":false,"given":"Niloy J.","family":"Mitra","sequence":"additional","affiliation":[{"name":"University College London (UCL), London, United Kingdom"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8815-5335","authenticated-orcid":false,"given":"Shuaicheng","family":"Liu","sequence":"additional","affiliation":[{"name":"University of Electronic Science and Technology of China, Chengdu, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3055-5034","authenticated-orcid":false,"given":"Pheng-Ann","family":"Heng","sequence":"additional","affiliation":[{"name":"The Chinese University of Hong Kong, Hong Kong, Hong Kong"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5238-593X","authenticated-orcid":false,"given":"Chi-Wing","family":"Fu","sequence":"additional","affiliation":[{"name":"The Chinese University of Hong Kong, Hong Kong, Hong Kong"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,7,27]]},"reference":[{"key":"e_1_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2004.1315258"},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1778765.1778797"},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cag.2012.02.008"},{"key":"e_1_2_2_4_1","volume-title":"Albert","author":"Y.","year":"1970","unstructured":"Y. ill Almoznino, Albert; Pinas. 1970. The art of hand shadows. Stravon Educational Press, New York."},{"key":"e_1_2_2_5_1","volume-title":"Luc Van Gool, and Marc Pollefeys","author":"Ballan Luca","year":"2012","unstructured":"Luca Ballan, Aparna Taneja, J\u00fcrgen Gall, Luc Van Gool, and Marc Pollefeys. 2012. Motion capture of hands in action using discriminative salient points. In ECCV. Springer, 640\u2013653."},{"key":"e_1_2_2_6_1","volume-title":"Computer Graphics Forum","author":"Baran Ilya","unstructured":"Ilya Baran, Philipp Keller, Derek Bradley, Stelian Coros, Wojciech Jarosz, Derek Nowrouzezahrai, and Markus Gross. 2012. Manufacturing layered attenuators for multiple prescribed shadow images. In Computer Graphics Forum, Vol. 31. Wiley Online Library, 603\u2013610."},{"key":"e_1_2_2_7_1","volume-title":"SHADWOPIX: Multiple images from self shadowing. In Computer Graphics Forum","author":"Bermano Amit","year":"2012","unstructured":"Amit Bermano, Ilya Baran, Marc Alexa, and Wojciech Matusk. 2012. SHADWOPIX: Multiple images from self shadowing. In Computer Graphics Forum, Vol. 31. Wiley Online Library, 593\u2013602."},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3680528.3687570"},{"key":"e_1_2_2_9_1","unstructured":"Blender 2019. Blender. http:\/\/www.blender.org."},{"key":"e_1_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/3641519.3657500"},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3528233.3530715"},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/APSIPAASC58517.2023.10317159"},{"key":"e_1_2_2_13_1","volume-title":"Umar Iqbal, Stan Birchfield, et al.","author":"Chao Yu-Wei","year":"2021","unstructured":"Yu-Wei Chao, Wei Yang, Yu Xiang, Pavlo Molchanov, Ankur Handa, Jonathan Tremblay, Yashraj S Narang, Karl Van Wyk, Umar Iqbal, Stan Birchfield, et al. 2021. DexYCB: A benchmark for capturing hand grasping of objects. In CVPR. 9044\u20139053."},{"key":"e_1_2_2_14_1","volume-title":"Proceedings of the 43rd Graphics Interface Conference (GI '17)","author":"Chen Xiaozhong","unstructured":"Xiaozhong Chen, Sheldon Andrews, Derek Nowrouzezahrai, and Paul G. Kry. 2017. Ballistic shadow art. In Proceedings of the 43rd Graphics Interface Conference (GI '17). Canadian Human-Computer Communications Society, Waterloo, CAN, 190\u2013198."},{"key":"e_1_2_2_15_1","doi-asserted-by":"crossref","unstructured":"Xingyu Chen Yufeng Liu Yajiao Dong Xiong Zhang Chongyang Ma Yanmin Xiong Yuan Zhang and Xiaoyan Guo. 2022. MobRecon: Mobile-friendly hand mesh reconstruction from monocular image. In CVPR. 20544\u201320554.","DOI":"10.1109\/CVPR52688.2022.01989"},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1360612.1360661"},{"key":"e_1_2_2_17_1","volume-title":"Computer Graphics Forum","author":"Chiu Chun-Chia","unstructured":"Chun-Chia Chiu, Yi-Hsiang Lo, Ruen-Rone Lee, and Hung-Kuo Chu. 2015. Tone- and feature-aware circular scribble art. In Computer Graphics Forum, Vol. 34. Wiley Online Library, 225\u2013234."},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1778765.1778788"},{"key":"e_1_2_2_19_1","volume-title":"Seeing is deceiving: The psychology of visual illusions","author":"Coren J.","unstructured":"J. Coren, S.; Girgus. 1978. Seeing is deceiving: The psychology of visual illusions. Routledge, London."},{"key":"e_1_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.584"},{"key":"e_1_2_2_21_1","volume-title":"An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929","author":"Dosovitskiy Alexey","year":"2020","unstructured":"Alexey Dosovitskiy. 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)."},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2022.3222784"},{"key":"e_1_2_2_23_1","volume-title":"International conference on machine learning. PMLR, 1126\u20131135","author":"Finn Chelsea","year":"2017","unstructured":"Chelsea Finn, Pieter Abbeel, and Sergey Levine. 2017. Model-agnostic meta-learning for fast adaptation of deep networks. In International conference on machine learning. PMLR, 1126\u20131135."},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/1274871.1274873"},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.15138"},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.2312\/pg.20231279"},{"key":"e_1_2_2_27_1","doi-asserted-by":"crossref","unstructured":"Daniel Geng Inbum Park and Andrew Owens. 2024. Visual anagrams: Generating multi-view optical illusions with diffusion models. In CVPR. 24154\u201324163.","DOI":"10.1109\/CVPR52733.2024.02280"},{"key":"e_1_2_2_28_1","volume-title":"Factorized diffusion: Perceptual illusions by noise decomposition","author":"Geng Daniel","unstructured":"Daniel Geng, Inbum Park, and Andrew Owens. 2025. Factorized diffusion: Perceptual illusions by noise decomposition. In ECCV. Springer, 366\u2013384."},{"key":"e_1_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2018.12.001"},{"key":"e_1_2_2_30_1","volume-title":"Honnotate: A method for 3D annotation of hand and object poses. In CVPR. 3196\u20133206.","author":"Hampali Shreyas","year":"2020","unstructured":"Shreyas Hampali, Mahdi Rad, Markus Oberweger, and Vincent Lepetit. 2020. Honnotate: A method for 3D annotation of hand and object poses. In CVPR. 3196\u20133206."},{"key":"e_1_2_2_31_1","doi-asserted-by":"crossref","unstructured":"Ayaan Haque Matthew Tancik Alexei A. Efros Aleksander Holynski and Angjoo Kanazawa. 2023. Instruct-NeRF2NeRF: Editing 3D scenes with instructions. In ICCV. 19740\u201319750.","DOI":"10.1109\/ICCV51070.2023.01808"},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1364\/OE.27.027637"},{"key":"e_1_2_2_33_1","first-page":"6840","article-title":"Denoising diffusion probabilistic models","volume":"33","author":"Ho Jonathan","year":"2020","unstructured":"Jonathan Ho, Ajay Jain, and Pieter Abbeel. 2020. Denoising diffusion probabilistic models. In NeurIPS, Vol. 33. 6840\u20136851.","journal-title":"NeurIPS"},{"key":"e_1_2_2_34_1","volume-title":"Classifier-free diffusion guidance. arXiv preprint arXiv:2207.12598","author":"Ho Jonathan","year":"2022","unstructured":"Jonathan Ho and Tim Salimans. 2022. Classifier-free diffusion guidance. arXiv preprint arXiv:2207.12598 (2022)."},{"key":"e_1_2_2_35_1","volume-title":"Meta-learning in neural networks: A survey","author":"Hospedales Timothy","year":"2021","unstructured":"Timothy Hospedales, Antreas Antoniou, Paul Micaelli, and Amos Storkey. 2021. Meta-learning in neural networks: A survey. IEEE transactions on pattern analysis and machine intelligence 44, 9 (2021), 5149\u20135169."},{"key":"e_1_2_2_36_1","volume-title":"Lora Oehlberg, and Joshua Taron.","author":"Hosseini Seyed Vahab","year":"2020","unstructured":"Seyed Vahab Hosseini, Usman Alim, Ali Mahdavi Amiri, Lora Oehlberg, and Joshua Taron. 2020. Portal: Design and fabrication of incidence-driven screens. International society of the arts, mathematics, and architecture, summer (2020), 31\u201346."},{"key":"e_1_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3272127.3275070"},{"key":"e_1_2_2_38_1","doi-asserted-by":"crossref","unstructured":"Lin Huang Chung-Ching Lin Kevin Lin Lin Liang Lijuan Wang Junsong Yuan and Zicheng Liu. 2023. Neural voting field for camera-space 3D hand pose estimation. In CVPR. 8969\u20138978.","DOI":"10.1109\/CVPR52729.2023.00866"},{"key":"e_1_2_2_39_1","doi-asserted-by":"crossref","unstructured":"Umar Iqbal Pavlo Molchanov Thomas Breuel Juergen Gall and Jan Kautz. 2018. Hand pose estimation via latent 2.5D heatmap regression. In ECCV. 118\u2013134.","DOI":"10.1007\/978-3-030-01252-6_8"},{"key":"e_1_2_2_40_1","volume-title":"Fun with hand shadows","author":"Jacobs Frank","unstructured":"Frank Jacobs. 1996. Fun with hand shadows. Dover Publications, Mineola, N.Y."},{"key":"e_1_2_2_41_1","doi-asserted-by":"crossref","unstructured":"Hanwen Jiang Shaowei Liu Jiashun Wang and Xiaolong Wang. 2021. Hand-object contact consistency reasoning for human grasps generation. In ICCV. 11107\u201311116.","DOI":"10.1109\/ICCV48922.2021.01092"},{"key":"e_1_2_2_42_1","volume-title":"Whole-body human pose estimation in the wild","author":"Jin Sheng","unstructured":"Sheng Jin, Lumin Xu, Jin Xu, Can Wang, Wentao Liu, Chen Qian, Wanli Ouyang, and Ping Luo. 2020. Whole-body human pose estimation in the wild. In ECCV. Springer, 196\u2013214."},{"key":"e_1_2_2_43_1","first-page":"26565","article-title":"Elucidating the design space of diffusion-based generative models","volume":"35","author":"Karras Tero","year":"2022","unstructured":"Tero Karras, Miika Aittala, Timo Aila, and Samuli Laine. 2022. Elucidating the design space of diffusion-based generative models. NeurIPS 35 (2022), 26565\u201326577.","journal-title":"NeurIPS"},{"key":"e_1_2_2_44_1","volume-title":"Abstracts 25th European Workshop on Computational Geometry (EuroCG'09","author":"Keiren JJA","year":"2009","unstructured":"JJA Keiren, Freek van Walderveen, and Alexander Wolff. 2009. Constructability of triplets. In Abstracts 25th European Workshop on Computational Geometry (EuroCG'09, Brussels, Belgium, March 16\u201318, 2009). 251\u2013254."},{"key":"e_1_2_2_45_1","volume-title":"Kingma and Jimmy Ba","author":"Diederik","year":"2015","unstructured":"Diederik P. Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In ICLR (Poster). http:\/\/dblp.uni-trier.de\/db\/conf\/iclr\/iclr2015.html#KingmaB14 Johannes Kopf and Dani Lischinski. 2011. Depixelizing pixel art. In ACM SIGGRAPH 2011 papers. 1\u20138."},{"key":"e_1_2_2_46_1","volume-title":"ImageNet classification with deep convolutional neural networks. NeurIPS 25","author":"Krizhevsky Alex","year":"2012","unstructured":"Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. ImageNet classification with deep convolutional neural networks. NeurIPS 25 (2012)."},{"key":"e_1_2_2_47_1","unstructured":"Hae Beom Lee Hayeon Lee Donghyun Na Saehoon Kim Minseop Park Eunho Yang and Sung Ju Hwang. 2020. Learning to balance: Bayesian meta-learning for imbalanced and out-of-distribution tasks. In ICLR."},{"key":"e_1_2_2_48_1","volume-title":"Silhouette-Net: 3D hand pose estimation from silhouettes. arXiv preprint arXiv:1912.12436","author":"Lee Kuo-Wei","year":"2019","unstructured":"Kuo-Wei Lee, Shih-Hung Liu, Hwann-Tzong Chen, and Koichi Ito. 2019. Silhouette-Net: 3D hand pose estimation from silhouettes. arXiv preprint arXiv:1912.12436 (2019)."},{"key":"e_1_2_2_49_1","doi-asserted-by":"crossref","unstructured":"Lijun Li Linrui Tian Xindi Zhang Qi Wang Bang Zhang Liefeng Bo Mengyuan Liu and Chen Chen. 2023. RenderIH: A large-scale synthetic dataset for 3D interacting hand pose estimation. In ICCV. 20395\u201320405.","DOI":"10.1109\/ICCV51070.2023.01865"},{"key":"e_1_2_2_50_1","volume-title":"A sketching interface for sitting pose design in the virtual environment","author":"Lin Juncong","year":"2012","unstructured":"Juncong Lin, Takeo Igarashi, Jun Mitani, Minghong Liao, and Ying He. 2012. A sketching interface for sitting pose design in the virtual environment. IEEE transactions on visualization and computer graphics 18, 11 (2012), 1979\u20131991."},{"key":"e_1_2_2_51_1","doi-asserted-by":"crossref","unstructured":"Kunhao Liu Fangneng Zhan Muyu Xu Christian Theobalt Ling Shao and Shijian Lu. 2024. StyleGaussian: Instant 3D style transfer with gaussian splatting. In SIGGRAPH Asia 2024 Technical Communications. 1\u20134.","DOI":"10.1145\/3681758.3698002"},{"key":"e_1_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073682"},{"key":"e_1_2_2_53_1","doi-asserted-by":"crossref","unstructured":"Xinyu Liu Houwen Peng Ningxin Zheng Yuqing Yang Han Hu and Yixuan Yuan. 2023. EfficientViT: Memory efficient vision transformer with cascaded group attention. In CVPR. 14420\u201314430.","DOI":"10.1109\/CVPR52729.2023.01386"},{"key":"e_1_2_2_54_1","volume-title":"Computer Graphics Forum","author":"Mattausch Oliver","unstructured":"Oliver Mattausch, Takeo Igarashi, and Michael Wimmer. 2013. Freeform shadow boundary editing. In Computer Graphics Forum, Vol. 32. Wiley Online Library, 175\u2013184."},{"key":"e_1_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/1618452.1618474"},{"key":"e_1_2_2_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/3092912.3092915"},{"key":"e_1_2_2_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/1618452.1618502"},{"key":"e_1_2_2_58_1","doi-asserted-by":"crossref","unstructured":"Gyeongsik Moon and Kyoung Mu Lee. 2020. I2L-MeshNet: Image-to-lixel prediction network for accurate 3D human pose and mesh estimation from a single RGB image. In ECCV. 752\u2013768.","DOI":"10.1007\/978-3-030-58571-6_44"},{"key":"e_1_2_2_59_1","volume-title":"A dataset and baseline for 3D interacting hand pose estimation from a single RGB image","author":"Moon Gyeongsik","unstructured":"Gyeongsik Moon, Shoou-I Yu, He Wen, Takaaki Shiratori, and Kyoung Mu Lee. 2020. InterHand2.6M: A dataset and baseline for 3D interacting hand pose estimation from a single RGB image. In ECCV. Springer, 548\u2013564."},{"key":"e_1_2_2_60_1","unstructured":"Louis Nikola. 1913. Hand shadows: The complete art of shadowgraphy. C. Arthur Pearson LTD. Henrietta Street W.C."},{"key":"e_1_2_2_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/1141911.1141919"},{"key":"e_1_2_2_62_1","unstructured":"Maxime Oquab Timoth\u00e9e Darcet Th\u00e9o Moutakanni Huy Vo Marc Szafraniec Vasil Khalidov Pierre Fernandez Daniel Haziza Francisco Massa Alaaeldin El-Nouby et al. 2023. DINOv2: Learning robust visual features without supervision. arXiv preprint arXiv:2304.07193 (2023)."},{"key":"e_1_2_2_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366205"},{"key":"e_1_2_2_64_1","unstructured":"Adam Paszke Sam Gross Francisco Massa Adam Lerer James Bradbury Gregory Chanan Trevor Killeen Zeming Lin Natalia Gimelshein Luca Antiga et al. 2019. PyTorch: An imperative style high-performance deep learning library. NeurIPS 32 (2019)."},{"key":"e_1_2_2_65_1","volume-title":"Black","author":"Pavlakos Georgios","year":"2019","unstructured":"Georgios Pavlakos, Vasileios Choutas, Nima Ghorbani, Timo Bolkart, Ahmed A. A. Osman, Dimitrios Tzionas, and Michael J. Black. 2019. Expressive body capture: 3D hands, face, and body from a single image. In CVPR."},{"key":"e_1_2_2_66_1","doi-asserted-by":"crossref","unstructured":"Georgios Pavlakos Dandan Shan Ilija Radosavovic Angjoo Kanazawa David Fouhey and Jitendra Malik. 2024. Reconstructing hands in 3D with transformers. In CVPR. 9826\u20139836.","DOI":"10.1109\/CVPR52733.2024.00938"},{"key":"e_1_2_2_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/566654.566617"},{"key":"e_1_2_2_68_1","doi-asserted-by":"publisher","DOI":"10.1145\/3588432.3591526"},{"key":"e_1_2_2_69_1","doi-asserted-by":"crossref","unstructured":"Zhiyu Qu Lan Yang Honggang Zhang Tao Xiang Kaiyue Pang and Yi-Zhe Song. 2024. Wired perspectives: Multi-view wire art embraces generative AI. In CVPR. 6149\u20136158.","DOI":"10.1109\/CVPR52733.2024.00588"},{"key":"e_1_2_2_70_1","volume-title":"International conference on machine learning. PMLR, 8748\u20138763","author":"Radford Alec","year":"2021","unstructured":"Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et al. 2021. Learning transferable visual models from natural language supervision. In International conference on machine learning. PMLR, 8748\u20138763."},{"key":"e_1_2_2_71_1","volume-title":"Meta-learning with implicit gradients. NeurIPS 32","author":"Rajeswaran Aravind","year":"2019","unstructured":"Aravind Rajeswaran, Chelsea Finn, Sham M Kakade, and Sergey Levine. 2019. Meta-learning with implicit gradients. NeurIPS 32 (2019)."},{"key":"e_1_2_2_72_1","volume-title":"Reconstructing 3D human pose from 2D image landmarks","author":"Ramakrishna Varun","unstructured":"Varun Ramakrishna, Takeo Kanade, and Yaser Sheikh. 2012. Reconstructing 3D human pose from 2D image landmarks. In ECCV. Springer, 573\u2013586."},{"key":"e_1_2_2_73_1","volume-title":"Embodied hands: Modeling and capturing hands and bodies together. arXiv preprint arXiv:2201.02610","author":"Romero Javier","year":"2022","unstructured":"Javier Romero, Dimitrios Tzionas, and Michael J Black. 2022. Embodied hands: Modeling and capturing hands and bodies together. arXiv preprint arXiv:2201.02610 (2022)."},{"key":"e_1_2_2_74_1","doi-asserted-by":"crossref","unstructured":"Kaustubh Sadekar Ashish Tiwari and Shanmuganathan Raman. 2022. Shadow art revisited: A differentiable rendering based approach. In WACV. 29\u201337.","DOI":"10.1109\/WACV51458.2022.00070"},{"key":"e_1_2_2_75_1","doi-asserted-by":"publisher","DOI":"10.1145\/3197517.3201400"},{"key":"e_1_2_2_76_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661229.2661267"},{"key":"e_1_2_2_77_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00371-006-0095-2"},{"key":"e_1_2_2_78_1","doi-asserted-by":"publisher","DOI":"10.1145\/3592142"},{"key":"e_1_2_2_79_1","doi-asserted-by":"publisher","DOI":"10.1109\/76.927422"},{"key":"e_1_2_2_80_1","doi-asserted-by":"crossref","unstructured":"Tomas Simon Hanbyul Joo Iain Matthews and Yaser Sheikh. 2017. Hand keypoint detection in single images using multiview bootstrapping. In CVPR. 1145\u20131153.","DOI":"10.1109\/CVPR.2017.494"},{"key":"e_1_2_2_81_1","volume-title":"Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502","author":"Song Jiaming","year":"2020","unstructured":"Jiaming Song, Chenlin Meng, and Stefano Ermon. 2020. Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502 (2020)."},{"key":"e_1_2_2_82_1","doi-asserted-by":"publisher","DOI":"10.5555\/2592304.2592480"},{"key":"e_1_2_2_83_1","doi-asserted-by":"publisher","DOI":"10.1145\/3641519.3657453"},{"key":"e_1_2_2_84_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-016-0895-4"},{"key":"e_1_2_2_85_1","volume-title":"NeurIPS","volume":"30","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, \u0141 ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In NeurIPS, Vol. 30. Curran Associates, Inc. https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2017\/file\/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf"},{"key":"e_1_2_2_86_1","volume-title":"Diffusion-based visual art creation: A survey and new perspectives. arXiv preprint arXiv:2408.12128","author":"Wang Bingyuan","year":"2024","unstructured":"Bingyuan Wang, Qifeng Chen, and Zeyu Wang. 2024. Diffusion-based visual art creation: A survey and new perspectives. arXiv preprint arXiv:2408.12128 (2024)."},{"key":"e_1_2_2_87_1","unstructured":"Caoliwen Wang and Bailin Deng. 2024. Neural shadow art. (2024). arXiv:2411.19161 [cs.CV] https:\/\/arxiv.org\/abs\/2411.19161"},{"key":"e_1_2_2_88_1","doi-asserted-by":"publisher","DOI":"10.1145\/1531326.1531338"},{"key":"e_1_2_2_89_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897824.2925869"},{"key":"e_1_2_2_90_1","doi-asserted-by":"publisher","DOI":"10.1145\/3517120"},{"key":"e_1_2_2_91_1","doi-asserted-by":"publisher","DOI":"10.1145\/1731047.1731051"},{"key":"e_1_2_2_92_1","doi-asserted-by":"crossref","unstructured":"Donglai Xiang Hanbyul Joo and Yaser Sheikh. 2019. Monocular total capture: Posing face body and hands in the wild. In CVPR. 10965\u201310974.","DOI":"10.1109\/CVPR.2019.01122"},{"key":"e_1_2_2_93_1","doi-asserted-by":"crossref","unstructured":"Hao Xu Tianyu Wang Xiao Tang and Chi-Wing Fu. 2023. H2ONet: Hand-occlusion-and-orientation-aware network for real-time 3D hand mesh reconstruction. In CVPR. 17048\u201317058.","DOI":"10.1109\/CVPR52729.2023.01635"},{"key":"e_1_2_2_94_1","volume-title":"CPF: Learning a contact potential field to model the hand-object interaction. In ICCV.","author":"Yang Lixin","year":"2021","unstructured":"Lixin Yang, Xinyu Zhan, Kailin Li, Wenqiang Xu, Jiefeng Li, and Cewu Lu. 2021b. CPF: Learning a contact potential field to model the hand-object interaction. In ICCV."},{"key":"e_1_2_2_95_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3450626.3459796","article-title":"WireRoom: Model-guided explorative design of abstract wire art","volume":"40","author":"Yang Zhijin","year":"2021","unstructured":"Zhijin Yang, Pengfei Xu, Hongbo Fu, and Hui Huang. 2021a. WireRoom: Model-guided explorative design of abstract wire art. ACM Transactions on Graphics (TOG) 40, 4 (2021), 1\u201313.","journal-title":"ACM Transactions on Graphics (TOG)"},{"key":"e_1_2_2_96_1","volume-title":"Computer Graphics Forum","author":"Yue Yonghao","unstructured":"Yonghao Yue, Kei Iwasaki, Bing-Yu Chen, Yoshinori Dobashi, and Tomoyuki Nishita. 2012. Pixel art with refracted light by rearrangeable sticks. In Computer Graphics Forum, Vol. 31. Wiley Online Library, 575\u2013582."},{"key":"e_1_2_2_97_1","doi-asserted-by":"publisher","DOI":"10.1145\/3472749.3474815"},{"key":"e_1_2_2_98_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2017.8296428"},{"key":"e_1_2_2_99_1","volume-title":"ARF: Artistic radiance fields","author":"Zhang Kai","year":"2022","unstructured":"Kai Zhang, Nick Kolkin, Sai Bi, Fujun Luan, Zexiang Xu, Eli Shechtman, and Noah Snavely. 2022. ARF: Artistic radiance fields. In ECCV. Springer, 717\u2013733."},{"key":"e_1_2_2_100_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i07.6981"},{"key":"e_1_2_2_101_1","volume-title":"CVPR. Xiong Zhang, Hongsheng Huang, Jianchao Tan, Hongmin Xu, Cheng Yang, Guozhu Peng, Lei Wang, and Ji Liu.","author":"Zhang Richard","year":"2021","unstructured":"Richard Zhang, Phillip Isola, Alexei A Efros, Eli Shechtman, and Oliver Wang. 2018. The unreasonable effectiveness of deep features as a perceptual metric. In CVPR. Xiong Zhang, Hongsheng Huang, Jianchao Tan, Hongmin Xu, Cheng Yang, Guozhu Peng, Lei Wang, and Ji Liu. 2021. Hand image understanding via deep multi-task learning. In ICCV. 11281\u201311292."},{"key":"e_1_2_2_102_1","doi-asserted-by":"crossref","unstructured":"Xiong Zhang Qiang Li Hong Mo Wenbo Zhang and Wen Zheng. 2019. End-to-end hand mesh recovery from a monocular RGB image. In ICCV. 2354\u20132364.","DOI":"10.1109\/ICCV.2019.00244"},{"key":"e_1_2_2_103_1","doi-asserted-by":"publisher","DOI":"10.1145\/3059454.3078858"},{"key":"e_1_2_2_104_1","doi-asserted-by":"publisher","DOI":"10.1145\/2907049"},{"key":"e_1_2_2_105_1","doi-asserted-by":"crossref","unstructured":"Pancheng Zhao Peng Xu Pengda Qin Deng-Ping Fan Zhicheng Zhang Guoli Jia Bowen Zhou and Jufeng Yang. 2024. LAKE-RED: Camouflaged images generation by latent background knowledge retrieval-augmented diffusion. In CVPR. 4092\u20134101.","DOI":"10.1109\/CVPR52733.2024.00392"},{"key":"e_1_2_2_106_1","doi-asserted-by":"crossref","unstructured":"Yuxiao Zhou Marc Habermann Weipeng Xu Ikhsanul Habibie Christian Theobalt and Feng Xu. 2020. Monocular real-time hand shape and motion capture using multi-modal data. In CVPR. 5346\u20135355.","DOI":"10.1109\/CVPR42600.2020.00539"},{"key":"e_1_2_2_107_1","doi-asserted-by":"crossref","unstructured":"Zhishan Zhou Shihao Zhou Zhi Lv Minqiang Zou Yao Tang and Jiajun Liang. 2024. A simple baseline for efficient hand mesh reconstruction. In CVPR. 1367\u20131376.","DOI":"10.1109\/CVPR52733.2024.00136"},{"key":"e_1_2_2_108_1","doi-asserted-by":"publisher","DOI":"10.1145\/3658231"},{"key":"e_1_2_2_109_1","doi-asserted-by":"crossref","unstructured":"Christian Zimmermann and Thomas Brox. 2017. Learning to estimate 3D hand pose from single RGB images. In ICCV. 4903\u20134911.","DOI":"10.1109\/ICCV.2017.525"},{"key":"e_1_2_2_110_1","doi-asserted-by":"crossref","unstructured":"Christian Zimmermann Duygu Ceylan Jimei Yang Bryan Russell Max Argus and Thomas Brox. 2019. FreiHAND: A dataset for markerless capture of hand pose and shape from single RGB images. In ICCV. 813\u2013822.","DOI":"10.1109\/ICCV.2019.00090"},{"key":"e_1_2_2_111_1","unstructured":"Binghui Zuo Zimeng Zhao Wenqian Sun Wei Xie Zhou Xue and Yangang Wang. 2023. Reconstructing interacting hands with interaction prior from monocular images. In ICCV. 9054\u20139064."}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3730836","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T17:59:39Z","timestamp":1774634379000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3730836"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,27]]},"references-count":111,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,8,1]]}},"alternative-id":["10.1145\/3730836"],"URL":"https:\/\/doi.org\/10.1145\/3730836","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,7,27]]},"assertion":[{"value":"2025-01-23","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-03-29","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-07-27","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}