{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,28]],"date-time":"2026-01-28T22:15:15Z","timestamp":1769638515013,"version":"3.49.0"},"reference-count":40,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2011,7,1]],"date-time":"2011-07-01T00:00:00Z","timestamp":1309478400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["6.10E+15"],"award-info":[{"award-number":["6.10E+15"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002855","name":"Ministry of Science and Technology of the People's Republic of China","doi-asserted-by":"publisher","award":["2009AA01Z327"],"award-info":[{"award-number":["2009AA01Z327"]}],"id":[{"id":"10.13039\/501100002855","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002855","name":"Ministry of Science and Technology of the People's Republic of China","doi-asserted-by":"publisher","award":["2010CB731800"],"award-info":[{"award-number":["2010CB731800"]}],"id":[{"id":"10.13039\/501100002855","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2011,7]]},"abstract":"<jats:p>We present a method to synthesize plausible video sequences of humans according to user-defined body motions and viewpoints. We first capture a small database of multi-view video sequences of an actor performing various basic motions. This database needs to be captured only once and serves as the input to our synthesis algorithm. We then apply a marker-less model-based performance capture approach to the entire database to obtain pose and geometry of the actor in each database frame. To create novel video sequences of the actor from the database, a user animates a 3D human skeleton with novel motion and viewpoints. Our technique then synthesizes a realistic video sequence of the actor performing the specified motion based only on the initial database. The first key component of our approach is a new efficient retrieval strategy to find appropriate spatio-temporally coherent database frames from which to synthesize target video frames. The second key component is a warping-based texture synthesis approach that uses the retrieved most-similar database frames to synthesize spatio-temporally coherent target video frames. For instance, this enables us to easily create video sequences of actors performing dangerous stunts without them being placed in harm's way. We show through a variety of result videos and a user study that we can synthesize realistic videos of people, even if the target motions and camera views are different from the database content.<\/jats:p>","DOI":"10.1145\/2010324.1964927","type":"journal-article","created":{"date-parts":[[2011,7,26]],"date-time":"2011-07-26T14:17:46Z","timestamp":1311689866000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":54,"title":["Video-based characters"],"prefix":"10.1145","volume":"30","author":[{"given":"Feng","family":"Xu","sequence":"first","affiliation":[{"name":"TNList, Tsinghua University, China"}]},{"given":"Yebin","family":"Liu","sequence":"additional","affiliation":[{"name":"MPI Informatik, Germany"}]},{"given":"Carsten","family":"Stoll","sequence":"additional","affiliation":[{"name":"MPI Informatik, Germany"}]},{"given":"James","family":"Tompkin","sequence":"additional","affiliation":[{"name":"University College London, UK"}]},{"given":"Gaurav","family":"Bharaj","sequence":"additional","affiliation":[{"name":"MPI Informatik, Germany"}]},{"given":"Qionghai","family":"Dai","sequence":"additional","affiliation":[{"name":"TNList, Tsinghua University, China"}]},{"given":"Hans-Peter","family":"Seidel","sequence":"additional","affiliation":[{"name":"MPI Informatik, Germany"}]},{"given":"Jan","family":"Kautz","sequence":"additional","affiliation":[{"name":"University College London, UK"}]},{"given":"Christian","family":"Theobalt","sequence":"additional","affiliation":[{"name":"MPI Informatik, Germany"}]}],"member":"320","published-online":{"date-parts":[[2011,7,25]]},"reference":[{"key":"e_1_2_2_1_1","unstructured":"Ballan L. and Cortelazzo G. M. 2008. Marker-less motion capture of skinned models in a four camera set-up using optical flow and silhouettes. In 3DPVT."},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","unstructured":"Ballan L. Brostow G. J. Puwein J. and Pollefeys M. 2010. Unstructured video-based rendering: Interactive exploration of casually captured videos. ACM TOG (Proc. SIGGRAPH) 1--11. 10.1145\/1833349.1778824","DOI":"10.1145\/1833349.1778824"},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/1276377.1276467"},{"key":"e_1_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1360612.1360698"},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","unstructured":"Buehler C. Bosse M. McMillan L. Gortler S. and Cohen M. 2001. Unstructured lumigraph rendering. In SIGGRAPH 425--432. 10.1145\/383259.383309","DOI":"10.1145\/383259.383309"},{"key":"e_1_2_2_6_1","volume-title":"Proc. IEEE CVPR, 1--8.","author":"Cagniart C.","unstructured":"Cagniart, C., Boyer, E., and Ilic, S. 2010. Free-form mesh tracking: a patch-based approach. In Proc. IEEE CVPR, 1--8."},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1201775.882309"},{"key":"e_1_2_2_8_1","volume-title":"Proc. of CASA, 331--338","author":"Celly B.","unstructured":"Celly, B., and Zordan, V. 2004. Animated people textures. In Proc. of CASA, 331--338."},{"key":"e_1_2_2_10_1","doi-asserted-by":"crossref","unstructured":"Cobzas D. Yerex K. and Jagersand M. 2002. Dynamic textures for image-based rendering of fine-scale 3d structure and animation of non-rigid motion. In In Eurographics 1067--7055.","DOI":"10.1111\/1467-8659.00609"},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1360612.1360697"},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","unstructured":"Debevec P. E. Taylor C. J. and Malik J. 1996. Modeling and rendering architecture from photographs: A hybrid geometry- and image-based approach. In SIGGRAPH 11--20. 10.1145\/237170.237191","DOI":"10.1145\/237170.237191"},{"key":"e_1_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.2312\/EGWR\/EGSR06\/183-194"},{"key":"e_1_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1507149.1507182"},{"key":"e_1_2_2_15_1","volume-title":"-P","author":"Gall J.","year":"2009","unstructured":"Gall, J., Stoll, C., Aguiar, E., Theobalt, C., Rosenhahn, B., and Seidel, H.-P. 2009. Motion capture using joint skeleton tracking and surface estimation. In Proc. IEEE CVPR, 1746--1753."},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/280814.280820"},{"key":"e_1_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1467-8659.2009.01416.x"},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","unstructured":"Hornung A. Dekkers E. and Kobbelt L. 2007. Character animation from 2d pictures and 3d motion data. ACM TOG 26 1 1:1--1:9. 10.1145\/1189762.1189763","DOI":"10.1145\/1189762.1189763"},{"key":"e_1_2_2_19_1","volume-title":"Proc. CVPR, 1478--1485","author":"Huang P.","unstructured":"Huang, P., Hilton, A., and Starck, J. 2009. Human motion synthesis from 3d video. In Proc. CVPR, 1478--1485."},{"key":"e_1_2_2_20_1","doi-asserted-by":"publisher","unstructured":"Jain A. Thorm\u00e4hlen T. Seidel H.-P. and Theobalt C. 2010. Moviereshape: tracking and reshaping of humans in videos. ACM TOG (Proc. SIGGRAPH Asia) 29 148:1--148:10. 10.1145\/1882261.1866174","DOI":"10.1145\/1882261.1866174"},{"key":"e_1_2_2_21_1","doi-asserted-by":"publisher","unstructured":"Jimenez J. Scully T. Barbosa N. Donner C. Alvarez X. Vieira T. Matts P. Orvalho V. Gutierrez D. and Weyrich T. 2010. A practical appearance model for dynamic facial color. ACM TOG (Proc. SIGGRAPH Asia) 29 141:1--141:10. 10.1145\/1882261.1866167","DOI":"10.1145\/1882261.1866167"},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.5555\/1886063.1886090"},{"key":"e_1_2_2_23_1","doi-asserted-by":"publisher","unstructured":"Leyvand T. Cohen-Or D. Dror G. and Lischinski D. 2008. Data-driven enhancement of facial attractiveness. ACM TOG (Proc. SIGGRAPH) 27 3 38:1--38:9. 10.1145\/1360612.1360637","DOI":"10.1145\/1360612.1360637"},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/344779.344951"},{"key":"e_1_2_2_25_1","unstructured":"Mori G. Berg A. Efros A. Eden A. and Malik J. 2004. Video based motion synthesis by splicing and morphing. UC Berkeley Technical Reports No. UCB\/CSD-4-1337."},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.5555\/938978.939185"},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/1141911.1141920"},{"key":"e_1_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/545261.545281"},{"key":"e_1_2_2_29_1","doi-asserted-by":"publisher","unstructured":"Sch\u00f6dl A. Szeliski R. Salesin D. H. and Essa I. 2000. Video textures. In SIGGRAPH 489--498. 10.1145\/344779.345012","DOI":"10.1145\/344779.345012"},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/1073368.1073375"},{"key":"e_1_2_2_31_1","first-page":"7","article-title":"View and Time Interpolation in Image Space","volume":"27","author":"Stich T.","year":"2008","unstructured":"Stich, T., Linz, C., Albuquerque, G., and Magnor, M. 2008. View and Time Interpolation in Image Space. Computer Graphics Forum (Proc. Pacific Graphics) 27, 7.","journal-title":"Computer Graphics Forum (Proc. Pacific Graphics)"},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","unstructured":"Stoll C. Gall J. de Aguiar E. Thrun S. and Theobalt C. 2010. Video-based reconstruction of animatable human characters. ACM TOG (Proc. SIGGRAPH Asia) 29 139:1--139:10. 10.1145\/1882261.1866161","DOI":"10.1145\/1882261.1866161"},{"key":"e_1_2_2_33_1","unstructured":"Theobalt C. Wuermlin S. de Aguiar E. and Nieder-berger C. 2007. New trends in 3d video. In Eurographics Courses."},{"key":"e_1_2_2_34_1","volume-title":"Proc. IEEE ICCV, 1709--1716","author":"Tung T.","unstructured":"Tung, T., Nobuhara, S., and Matsuyama, T. 2009. Complete multi-view reconstruction of dynamic scenes from probabilistic fusion of narrow and wide baseline stereo. In Proc. IEEE ICCV, 1709--1716."},{"key":"e_1_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/1399504.1360696"},{"key":"e_1_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/1661412.1618520"},{"key":"e_1_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00371-006-0053-z"},{"key":"e_1_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2005.14"},{"key":"e_1_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/1073204.1073259"},{"key":"e_1_2_2_40_1","doi-asserted-by":"publisher","unstructured":"Zhou S. Fu H. Liu L. Cohen-Or D. and Han X. 2010. Parametric reshaping of human bodies in images. ACM TOG (Proc. SIGGRAPH) 29 4 126:1--126:10. 10.1145\/1778765.1778863","DOI":"10.1145\/1778765.1778863"},{"key":"e_1_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/1015706.1015766"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2010324.1964927","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2010324.1964927","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T11:22:23Z","timestamp":1750245743000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2010324.1964927"}},"subtitle":["creating new human performances from a multi-view video database"],"short-title":[],"issued":{"date-parts":[[2011,7]]},"references-count":40,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2011,7]]}},"alternative-id":["10.1145\/2010324.1964927"],"URL":"https:\/\/doi.org\/10.1145\/2010324.1964927","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,7]]},"assertion":[{"value":"2011-07-25","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}