{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,26]],"date-time":"2026-01-26T20:08:36Z","timestamp":1769458116099,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":61,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T00:00:00Z","timestamp":1665360000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,10]]},"DOI":"10.1145\/3503161.3548351","type":"proceedings-article","created":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T15:43:12Z","timestamp":1665416592000},"page":"2483-2494","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["CrossHuman"],"prefix":"10.1145","author":[{"given":"Liliang","family":"Chen","sequence":"first","affiliation":[{"name":"OPPO Research Institute, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jiaqi","family":"Li","sequence":"additional","affiliation":[{"name":"Beihang University, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Han","family":"Huang","sequence":"additional","affiliation":[{"name":"OPPO Research Institute, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yandong","family":"Guo","sequence":"additional","affiliation":[{"name":"OPPO Research Institute, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,10,10]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"Proceedings, Part XXII 16","author":"Aliev Kara-Ali","year":"2020","unstructured":"Kara-Ali Aliev , Artem Sevastopolsky , Maria Kolos , Dmitry Ulyanov , and Victor Lempitsky . 2020 . Neural point-based graphics. In Computer Vision--ECCV 2020: 16th European Conference, Glasgow, UK, August 23--28, 2020 , Proceedings, Part XXII 16 . Springer, 696--712. Kara-Ali Aliev, Artem Sevastopolsky, Maria Kolos, Dmitry Ulyanov, and Victor Lempitsky. 2020. Neural point-based graphics. In Computer Vision--ECCV 2020: 16th European Conference, Glasgow, UK, August 23--28, 2020, Proceedings, Part XXII 16. Springer, 696--712."},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00127"},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DV.2018.00022"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00875"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00238"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58536-5_19"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46454-1_34"},{"key":"e_1_3_2_2_8_1","unstructured":"Z. Cao G. Hidalgo Martinez T. Simon S. Wei and Y. A. Sheikh. 2019. OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields. IEEE Transactions on Pattern Analysis and Machine Intelligence (2019).  Z. Cao G. Hidalgo Martinez T. Simon S. Wei and Y. A. Sheikh. 2019. OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields. IEEE Transactions on Pattern Analysis and Machine Intelligence (2019)."},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00609"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00700"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.264"},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206755"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00491"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46466-4_29"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00961"},{"key":"e_1_3_2_2_16_1","first-page":"1","article-title":"The relightables: Volumetric performance capture of humans with realistic relighting","volume":"38","author":"Guo Kaiwen","year":"2019","unstructured":"Kaiwen Guo , Peter Lincoln , Philip Davidson , Jay Busch , Xueming Yu , Matt Whalen , Geoff Harvey , Sergio Orts-Escolano , Rohit Pandey , Jason Dourgarian , 2019 . The relightables: Volumetric performance capture of humans with realistic relighting . ACM Transactions on Graphics (TOG) 38 , 6 (2019), 1 -- 19 . Kaiwen Guo, Peter Lincoln, Philip Davidson, Jay Busch, Xueming Yu, Matt Whalen, Geoff Harvey, Sergio Orts-Escolano, Rohit Pandey, Jason Dourgarian, et al. 2019. The relightables: Volumetric performance capture of humans with realistic relighting. ACM Transactions on Graphics (TOG) 38, 6 (2019), 1--19.","journal-title":"ACM Transactions on Graphics (TOG)"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3311970"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00510"},{"key":"e_1_3_2_2_19_1","volume-title":"Geo-pifu: Geom- etry and pixel aligned implicit functions for single-view human reconstruction. arXiv preprint arXiv:2006.08072","author":"He Tong","year":"2020","unstructured":"Tong He , John Collomosse , Hailin Jin , and Stefano Soatto . 2020 . Geo-pifu: Geom- etry and pixel aligned implicit functions for single-view human reconstruction. arXiv preprint arXiv:2006.08072 (2020). Tong He, John Collomosse, Hailin Jin, and Stefano Soatto. 2020. Geo-pifu: Geom- etry and pixel aligned implicit functions for single-view human reconstruction. arXiv preprint arXiv:2006.08072 (2020)."},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01086"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00316"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00604"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00744"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00530"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00234"},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DV.2019.00076"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.336"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01394"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.12278"},{"key":"e_1_3_2_2_30_1","volume-title":"Real-Time High-Resolution Background Matting. arXiv","author":"Lin Shanchuan","year":"2020","unstructured":"Shanchuan Lin , Andrey Ryabtsev , Soumyadip Sengupta , Brian Curless , Steve Seitz , and Ira Kemelmacher-Shlizerman . 2020. Real-Time High-Resolution Background Matting. arXiv ( 2020 ), arXiv--2012. Shanchuan Lin, Andrey Ryabtsev, Soumyadip Sengupta, Brian Curless, Steve Seitz, and Ira Kemelmacher-Shlizerman. 2020. Real-Time High-Resolution Background Matting. arXiv (2020), arXiv--2012."},{"key":"e_1_3_2_2_31_1","volume-title":"Markerless motion capture of multiple characters using multiview image segmentation","author":"Liu Yebin","year":"2013","unstructured":"Yebin Liu , Juergen Gall , Carsten Stoll , Qionghai Dai , Hans-Peter Seidel , and Christian Theobalt . 2013. Markerless motion capture of multiple characters using multiview image segmentation . IEEE transactions on pattern analysis and machine intelligence 35, 11 ( 2013 ), 2720--2735. Yebin Liu, Juergen Gall, Carsten Stoll, Qionghai Dai, Hans-Peter Seidel, and Christian Theobalt. 2013. Markerless motion capture of multiple characters using multiview image segmentation. IEEE transactions on pattern analysis and machine intelligence 35, 11 (2013), 2720--2735."},{"key":"e_1_3_2_2_32_1","volume-title":"Neural volumes: Learning dynamic renderable volumes from images. arXiv preprint arXiv:1906.07751","author":"Lombardi Stephen","year":"2019","unstructured":"Stephen Lombardi , Tomas Simon , Jason Saragih , Gabriel Schwartz , Andreas Lehrmann , and Yaser Sheikh . 2019. Neural volumes: Learning dynamic renderable volumes from images. arXiv preprint arXiv:1906.07751 ( 2019 ). Stephen Lombardi, Tomas Simon, Jason Saragih, Gabriel Schwartz, Andreas Lehrmann, and Yaser Sheikh. 2019. Neural volumes: Learning dynamic renderable volumes from images. arXiv preprint arXiv:1906.07751 (2019)."},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2816795.2818013"},{"key":"e_1_3_2_2_34_1","volume-title":"Marching cubes: A high resolution 3D surface construction algorithm. ACM siggraph computer graphics 21, 4","author":"Lorensen William E","year":"1987","unstructured":"William E Lorensen and Harvey E Cline . 1987. Marching cubes: A high resolution 3D surface construction algorithm. ACM siggraph computer graphics 21, 4 ( 1987 ), 163--169. William E Lorensen and Harvey E Cline. 1987. Marching cubes: A high resolution 3D surface construction algorithm. ACM siggraph computer graphics 21, 4 (1987), 163--169."},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00650"},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.109"},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298631"},{"key":"e_1_3_2_2_38_1","volume-title":"Star: Sparse trained articulated human body regressor. In Computer Vision--ECCV 2020: 16th European Conference","author":"Osman Ahmed AA","year":"2020","unstructured":"Ahmed AA Osman , Timo Bolkart , and Michael J Black . 2020 . Star: Sparse trained articulated human body regressor. In Computer Vision--ECCV 2020: 16th European Conference , Glasgow, UK , August 23-28, 2020, Proceedings, Part VI 16. Springer , 598--613. Ahmed AA Osman, Timo Bolkart, and Michael J Black. 2020. Star: Sparse trained articulated human body regressor. In Computer Vision--ECCV 2020: 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part VI 16. Springer, 598--613."},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00025"},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58580-8_31"},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00894"},{"key":"e_1_3_2_2_42_1","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition. 652--660","author":"Qi Charles R","year":"2017","unstructured":"Charles R Qi , Hao Su , Kaichun Mo , and Leonidas J Guibas . 2017 . Pointnet: Deep learning on point sets for 3d classification and segmentation . In Proceedings of the IEEE conference on computer vision and pattern recognition. 652--660 . Charles R Qi, Hao Su, Kaichun Mo, and Leonidas J Guibas. 2017. Pointnet: Deep learning on point sets for 3d classification and segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition. 652--660."},{"key":"e_1_3_2_2_43_1","unstructured":"renderpeople. 2000. https:\/\/renderpeople.com\/.  renderpeople. 2000. https:\/\/renderpeople.com\/."},{"key":"e_1_3_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00239"},{"key":"e_1_3_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00016"},{"key":"e_1_3_2_2_46_1","volume-title":"DoubleField: Bridging the Neural Surface and Radiance Fields for High-fidelity Human Rendering. arXiv preprint arXiv:2106.03798","author":"Shao Ruizhi","year":"2021","unstructured":"Ruizhi Shao , Hongwen Zhang , He Zhang , Yanpei Cao , Tao Yu , and Yebin Liu . 2021. DoubleField: Bridging the Neural Surface and Radiance Fields for High-fidelity Human Rendering. arXiv preprint arXiv:2106.03798 ( 2021 ). Ruizhi Shao, Hongwen Zhang, He Zhang, Yanpei Cao, Tao Yu, and Yebin Liu. 2021. DoubleField: Bridging the Neural Surface and Radiance Fields for High-fidelity Human Rendering. arXiv preprint arXiv:2106.03798 (2021)."},{"key":"e_1_3_2_2_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01054"},{"key":"e_1_3_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00254"},{"key":"e_1_3_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00584"},{"key":"e_1_3_2_2_50_1","volume-title":"Flyfusion: Realtime dynamic scene reconstruction using a flying depth camera","author":"Xu Lan","year":"2019","unstructured":"Lan Xu , Wei Cheng , Kaiwen Guo , Lei Han , Yebin Liu , and Lu Fang . 2019 . Flyfusion: Realtime dynamic scene reconstruction using a flying depth camera . IEEE transactions on visualization and computer graphics 27, 1 (2019), 68--82. Lan Xu, Wei Cheng, Kaiwen Guo, Lei Han, Yebin Liu, and Lu Fang. 2019. Flyfusion: Realtime dynamic scene reconstruction using a flying depth camera. IEEE transactions on visualization and computer graphics 27, 1 (2019), 68--82."},{"key":"e_1_3_2_2_51_1","volume-title":"Per- spective transformer nets: Learning single-view 3d object reconstruction without 3d supervision. arXiv preprint arXiv:1612.00814","author":"Yan Xinchen","year":"2016","unstructured":"Xinchen Yan , Jimei Yang , Ersin Yumer , Yijie Guo , and Honglak Lee . 2016. Per- spective transformer nets: Learning single-view 3d object reconstruction without 3d supervision. arXiv preprint arXiv:1612.00814 ( 2016 ). Xinchen Yan, Jimei Yang, Ersin Yumer, Yijie Guo, and Honglak Lee. 2016. Per- spective transformer nets: Learning single-view 3d object reconstruction without 3d supervision. arXiv preprint arXiv:1612.00814 (2016)."},{"key":"e_1_3_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00455"},{"key":"e_1_3_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.104"},{"key":"e_1_3_2_2_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00569"},{"key":"e_1_3_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00761"},{"key":"e_1_3_2_2_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00708"},{"key":"e_1_3_2_2_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.582"},{"key":"e_1_3_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01125"},{"key":"e_1_3_2_2_59_1","volume-title":"DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras. arXiv preprint arXiv:2105.00261","author":"Zheng Yang","year":"2021","unstructured":"Yang Zheng , Ruizhi Shao , Yuxiang Zhang , Tao Yu , Zerong Zheng , Qionghai Dai , and Yebin Liu . 2021. DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras. arXiv preprint arXiv:2105.00261 ( 2021 ). Yang Zheng, Ruizhi Shao, Yuxiang Zhang, Tao Yu, Zerong Zheng, Qionghai Dai, and Yebin Liu. 2021. DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras. arXiv preprint arXiv:2105.00261 (2021)."},{"key":"e_1_3_2_2_60_1","volume-title":"Pamir: Parametric model-conditioned implicit representation for image-based human reconstruc- tion","author":"Zheng Zerong","year":"2021","unstructured":"Zerong Zheng , Tao Yu , Yebin Liu , and Qionghai Dai . 2021 . Pamir: Parametric model-conditioned implicit representation for image-based human reconstruc- tion . IEEE Transactions on Pattern Analysis and Machine Intelligence ( 2021). Zerong Zheng, Tao Yu, Yebin Liu, and Qionghai Dai. 2021. Pamir: Parametric model-conditioned implicit representation for image-based human reconstruc- tion. IEEE Transactions on Pattern Analysis and Machine Intelligence (2021)."},{"key":"e_1_3_2_2_61_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58607-2_29"}],"event":{"name":"MM '22: The 30th ACM International Conference on Multimedia","location":"Lisboa Portugal","acronym":"MM '22","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 30th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3548351","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3503161.3548351","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:43Z","timestamp":1750186843000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3548351"}},"subtitle":["Learning Cross-guidance from Multi-frame Images for Human Reconstruction"],"short-title":[],"issued":{"date-parts":[[2022,10,10]]},"references-count":61,"alternative-id":["10.1145\/3503161.3548351","10.1145\/3503161"],"URL":"https:\/\/doi.org\/10.1145\/3503161.3548351","relation":{},"subject":[],"published":{"date-parts":[[2022,10,10]]},"assertion":[{"value":"2022-10-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}