{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,25]],"date-time":"2025-09-25T18:09:13Z","timestamp":1758823753513,"version":"3.41.0"},"reference-count":63,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2010,7,26]],"date-time":"2010-07-26T00:00:00Z","timestamp":1280102400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100004963","name":"Seventh Framework Programme","doi-asserted-by":"publisher","award":["FP7\/2007-2013210806"],"award-info":[{"award-number":["FP7\/2007-2013210806"]}],"id":[{"id":"10.13039\/501100004963","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2010,7,26]]},"abstract":"<jats:p>We present an algorithm designed for navigating around a performance that was filmed as a \"casual\" multi-view video collection: real-world footage captured on hand held cameras by a few audience members. The objective is to easily navigate in 3D, generating a video-based rendering (VBR) of a performance filmed with widely separated cameras. Casually filmed events are especially challenging because they yield footage with complicated backgrounds and camera motion. Such challenging conditions preclude the use of most algorithms that depend on correlation-based stereo or 3D shape-from-silhouettes.<\/jats:p>\n          <jats:p>Our algorithm builds on the concepts developed for the exploration of photo-collections of empty scenes. Interactive performer-specific view-interpolation is now possible through innovations in interactive rendering and offline-matting relating to i) modeling the foreground subject as video-sprites on billboards, ii) modeling the background geometry with adaptive view-dependent textures, and iii) view interpolation that follows a performer. The billboards are embedded in a simple but realistic reconstruction of the environment. The reconstructed environment provides very effective visual cues for spatial navigation as the user transitions between viewpoints. The prototype is tested on footage from several challenging events, and demonstrates the editorial utility of the whole system and the particular value of our new inter-billboard optimization.<\/jats:p>","DOI":"10.1145\/1778765.1778824","type":"journal-article","created":{"date-parts":[[2010,7,15]],"date-time":"2010-07-15T12:48:46Z","timestamp":1279198126000},"page":"1-11","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":61,"title":["Unstructured video-based rendering"],"prefix":"10.1145","volume":"29","author":[{"given":"Luca","family":"Ballan","sequence":"first","affiliation":[{"name":"ETH Z\u00fcrich"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Gabriel J.","family":"Brostow","sequence":"additional","affiliation":[{"name":"University College London"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jens","family":"Puwein","sequence":"additional","affiliation":[{"name":"ETH Z\u00fcrich"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Marc","family":"Pollefeys","sequence":"additional","affiliation":[{"name":"ETH Z\u00fcrich"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2010,7,26]]},"reference":[{"key":"e_1_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/78.978374"},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1531326.1531376"},{"key":"e_1_2_2_3_1","unstructured":"Ballan L. and Cortelazzo G. M. 2008. Marker-less motion capture of skinned models in a four camera set-up using optical flow and silhouettes. In 3DPVT.  Ballan L. and Cortelazzo G. M. 2008. Marker-less motion capture of skinned models in a four camera set-up using optical flow and silhouettes. In 3DPVT."},{"key":"e_1_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2004.60"},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/383259.383309"},{"key":"e_1_2_2_6_1","volume-title":"18th British Machine Vision Conference","volume":"1","author":"Campbell N. D.","unstructured":"Campbell , N. D. , Vogiatzis , G. , Hern\u00e1ndez , C. , and Cipolla , R . 2007. Automatic 3d object segmentation in multiple views using volumetric graph-cuts . In 18th British Machine Vision Conference , vol. 1 , 530--539. Campbell, N. D., Vogiatzis, G., Hern\u00e1ndez, C., and Cipolla, R. 2007. Automatic 3d object segmentation in multiple views using volumetric graph-cuts. In 18th British Machine Vision Conference, vol. 1, 530--539."},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/882262.882309"},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/166117.166153"},{"key":"e_1_2_2_9_1","volume-title":"Proceedings of IEEE CVPR","volume":"2","author":"Chuang Y.-Y.","year":"2001","unstructured":"Chuang , Y.-Y. , Curless , B. , Salesin , D. H. , and Szeliski , R . 2001. A bayesian approach to digital matting . In Proceedings of IEEE CVPR 2001 , vol. 2 , 264--271. Chuang, Y.-Y., Curless, B., Salesin, D. H., and Szeliski, R. 2001. A bayesian approach to digital matting. In Proceedings of IEEE CVPR 2001, vol. 2, 264--271."},{"key":"e_1_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/566654.566572"},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1360612.1360697"},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/237170.237191"},{"volume-title":"9th Eurographics Workshop on Rendering.","author":"Debevec P.","key":"e_1_2_2_13_1","unstructured":"Debevec , P. , Borshukov , G. , and Yu , Y . 1998. Efficient view-dependent image-based rendering with projective texture-mapping . In 9th Eurographics Workshop on Rendering. Debevec, P., Borshukov, G., and Yu, Y. 1998. Efficient view-dependent image-based rendering with projective texture-mapping. In 9th Eurographics Workshop on Rendering."},{"key":"e_1_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1357054.1357096"},{"volume-title":"Floating Textures. Computer Graphics Forum (Proc. Eurographics EG'08)","author":"Eisemann M.","key":"e_1_2_2_15_1","unstructured":"Eisemann , M. , Decker , B. D. , Magnor , M. , Bekaert , P. , de Aguiar , E. , Ahmed , N. , Theobalt , C. , and Sellent , A . 2008 . Floating Textures. Computer Graphics Forum (Proc. Eurographics EG'08) 27, 2 (4), 409--418. Eisemann, M., Decker, B. D., Magnor, M., Bekaert, P., de Aguiar, E., Ahmed, N., Theobalt, C., and Sellent, A. 2008. Floating Textures. Computer Graphics Forum (Proc. Eurographics EG'08) 27, 2 (4), 409--418."},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2005.105"},{"key":"e_1_2_2_17_1","doi-asserted-by":"crossref","unstructured":"Goesele M. Snavely N. Curless B. Hoppe H. and Seitz S. M. 2007. Multi-view stereo for community photo collections. In ICCV 1--8.  Goesele M. Snavely N. Curless B. Hoppe H. and Seitz S. M. 2007. Multi-view stereo for community photo collections. In ICCV 1--8.","DOI":"10.1109\/ICCV.2007.4408933"},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1449715.1449719"},{"key":"e_1_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/237170.237200"},{"volume-title":"Proceedings of EUROGRAPHICS, Computer Graphics Forum, 577--586","author":"Grundland M.","key":"e_1_2_2_21_1","unstructured":"Grundland , M. , Vohra , R. , Williams , G. P. , and Dodgson , N. A . 2006. Cross dissolve without cross fade: Preserving contrast, color and salience in image compositing . In Proceedings of EUROGRAPHICS, Computer Graphics Forum, 577--586 . Grundland, M., Vohra, R., Williams, G. P., and Dodgson, N. A. 2006. Cross dissolve without cross fade: Preserving contrast, color and salience in image compositing. In Proceedings of EUROGRAPHICS, Computer Graphics Forum, 577--586."},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DIM.2007.3"},{"key":"e_1_2_2_23_1","volume-title":"Proc. International Conference on Computer Vision (ICCV","author":"Guillemaut J.-Y.","year":"2009","unstructured":"Guillemaut , J.-Y. , Kilner , J. , and Hilton , A . 2009. Robust graph-cut scene segmentation and reconstruction for free-viewpoint video of complex dynamic scenes . In Proc. International Conference on Computer Vision (ICCV 2009 ). Guillemaut, J.-Y., Kilner, J., and Hilton, A. 2009. Robust graph-cut scene segmentation and reconstruction for free-viewpoint video of complex dynamic scenes. In Proc. International Conference on Computer Vision (ICCV 2009)."},{"key":"e_1_2_2_24_1","unstructured":"Hartley R. I. and Zisserman A. 2000. Multiple View Geometry in Computer Vision. Cambridge University Press ISBN: 0521623049.   Hartley R. I. and Zisserman A. 2000. Multiple View Geometry in Computer Vision. Cambridge University Press ISBN: 0521623049."},{"key":"e_1_2_2_25_1","volume-title":"-P","author":"Hasler N.","year":"2009","unstructured":"Hasler , N. , Rosenhahn , B. , Thorm\u00e4hlen , T. , Wand , M. , Gall , J. , and Seidel , H . -P . 2009 . Markerless motion capture with unsynchronized moving cameras. In CVPR , 224--231. Hasler, N., Rosenhahn, B., Thorm\u00e4hlen, T., Wand, M., Gall, J., and Seidel, H.-P. 2009. Markerless motion capture with unsynchronized moving cameras. In CVPR, 224--231."},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGIV.2006.83"},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/1276377.1276382"},{"key":"e_1_2_2_28_1","first-page":"21","article-title":"Plenoptic modeling and rendering from image sequences taken by hand-held camera","volume":"1999","author":"Heigl B.","year":"1999","unstructured":"Heigl , B. , Koch , R. , Pollefeys , M. , Denzler , J. , and Van Gool , L. 1999 . Plenoptic modeling and rendering from image sequences taken by hand-held camera . In Patter Recognition 1999 , 21 . DAGM-Symposium, 94--101. Heigl, B., Koch, R., Pollefeys, M., Denzler, J., and Van Gool, L. 1999. Plenoptic modeling and rendering from image sequences taken by hand-held camera. In Patter Recognition 1999, 21. DAGM-Symposium, 94--101.","journal-title":"Patter Recognition"},{"key":"e_1_2_2_29_1","unstructured":"Kanade T. 2001. Carnegie mellon goes to the superbowl. http:\/\/www.ri.cmu.edu\/events\/sb35\/tksuperbowl.html.  Kanade T. 2001. Carnegie mellon goes to the superbowl. http:\/\/www.ri.cmu.edu\/events\/sb35\/tksuperbowl.html."},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/1357054.1357097"},{"volume-title":"European Conference on Visual Media Production (CVMP).","author":"Kilner J.","key":"e_1_2_2_31_1","unstructured":"Kilner , J. , Starck , J. , and Hilton , A . 2006. A comparative study of free-viewpoint video techniques for sports events . European Conference on Visual Media Production (CVMP). Kilner, J., Starck, J., and Hilton, A. 2006. A comparative study of free-viewpoint video techniques for sports events. European Conference on Visual Media Production (CVMP)."},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DIM.2007.22"},{"key":"e_1_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/1409060.1409069"},{"key":"e_1_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/237170.237199"},{"key":"e_1_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2005.44"},{"key":"e_1_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/1576246.1531350"},{"key":"e_1_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000029664.99615.94"},{"key":"e_1_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/344779.344951"},{"key":"e_1_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000025798.50602.3a"},{"key":"e_1_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/1360612.1360616"},{"key":"e_1_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/1111411.1111431"},{"key":"e_1_2_2_42_1","doi-asserted-by":"crossref","unstructured":"Schindler G. and Dellaert F. 2010. Probabilistic temporal inference on reconstructed 3D scenes. In CVPR 1--8.  Schindler G. and Dellaert F. 2010. Probabilistic temporal inference on reconstructed 3D scenes. In CVPR 1--8.","DOI":"10.1109\/CVPR.2010.5539803"},{"key":"e_1_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/344779.345012"},{"key":"e_1_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF02289451"},{"key":"e_1_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/237170.237196"},{"key":"e_1_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2006.19"},{"key":"e_1_2_2_47_1","doi-asserted-by":"publisher","DOI":"10.5555\/1018427.1020452"},{"key":"e_1_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/1409060.1409112"},{"key":"e_1_2_2_49_1","volume-title":"Proceedings of the International Conference on Computer Vision","volume":"2","author":"Sivic J.","unstructured":"Sivic , J. , and Zisserman , A . 2003. Video Google: A text retrieval approach to object matching in videos . In Proceedings of the International Conference on Computer Vision , vol. 2 , 1470--1477. Sivic, J., and Zisserman, A. 2003. Video Google: A text retrieval approach to object matching in videos. In Proceedings of the International Conference on Computer Vision, vol. 2, 1470--1477."},{"key":"e_1_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/1179352.1141964"},{"key":"e_1_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/1360612.1360614"},{"key":"e_1_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCG.2007.68"},{"key":"e_1_2_2_53_1","first-page":"7","article-title":"View and time interpolation in image space","volume":"27","author":"Stich T.","year":"2008","unstructured":"Stich , T. , Linz , C. , Albuquerque , G. , and Magnor , M. 2008 . View and time interpolation in image space . Computer Graphics Forum (Proc. Pacific Graphics) 27 , 7 . Stich, T., Linz, C., Albuquerque, G., and Magnor, M. 2008. View and time interpolation in image space. Computer Graphics Forum (Proc. Pacific Graphics) 27, 7.","journal-title":"Computer Graphics Forum (Proc. Pacific Graphics)"},{"key":"e_1_2_2_54_1","doi-asserted-by":"publisher","DOI":"10.1007\/11744047_48"},{"key":"e_1_2_2_55_1","volume-title":"IEEE Computer Society Conference on 1, 762--768","author":"Tuytelaars T.","year":"2004","unstructured":"Tuytelaars , T. , and Van Gool , L. 2004 . Synchronizing video sequences. Computer Vision and Pattern Recognition , IEEE Computer Society Conference on 1, 762--768 . Tuytelaars, T., and Van Gool, L. 2004. Synchronizing video sequences. Computer Vision and Pattern Recognition, IEEE Computer Society Conference on 1, 762--768."},{"key":"e_1_2_2_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/1276377.1276485"},{"key":"e_1_2_2_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/1061347.1061351"},{"key":"e_1_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/1360612.1360696"},{"key":"e_1_2_2_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/1330511.1330512"},{"key":"e_1_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/1073204.1073233"},{"volume-title":"Computer Graphics Forum (Proc. Eurographics EG'07)","author":"Waschb\u00fcsch M.","key":"e_1_2_2_61_1","unstructured":"Waschb\u00fcsch , M. , W\u00fcrmlin , S. , and Gross , M. H . 2007. 3d video billboard clouds . Computer Graphics Forum (Proc. Eurographics EG'07) 26, 3, 561--569. Waschb\u00fcsch, M., W\u00fcrmlin, S., and Gross, M. H. 2007. 3d video billboard clouds. Computer Graphics Forum (Proc. Eurographics EG'07) 26, 3, 561--569."},{"key":"e_1_2_2_62_1","unstructured":"W\u00fcrmlin S. and Niederberger C. 2010. Realistic virtual replays for sports broadcasts. http:\/\/www.liberovision.com\/.  W\u00fcrmlin S. and Niederberger C. 2010. Realistic virtual replays for sports broadcasts. http:\/\/www.liberovision.com\/."},{"volume-title":"IEEE International Conference on Computer Vision (ICCV).","author":"Zach C.","key":"e_1_2_2_63_1","unstructured":"Zach , C. , Pock , T. , and Bischof , H . 2007. A globally optimal algorithm for robust tv-11 range image integration . In IEEE International Conference on Computer Vision (ICCV). Zach, C., Pock, T., and Bischof, H. 2007. A globally optimal algorithm for robust tv-11 range image integration. In IEEE International Conference on Computer Vision (ICCV)."},{"key":"e_1_2_2_64_1","doi-asserted-by":"publisher","DOI":"10.1145\/1015706.1015766"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1778765.1778824","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1778765.1778824","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T20:25:53Z","timestamp":1750278353000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1778765.1778824"}},"subtitle":["interactive exploration of casually captured videos"],"short-title":[],"issued":{"date-parts":[[2010,7,26]]},"references-count":63,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2010,7,26]]}},"alternative-id":["10.1145\/1778765.1778824"],"URL":"https:\/\/doi.org\/10.1145\/1778765.1778824","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"type":"print","value":"0730-0301"},{"type":"electronic","value":"1557-7368"}],"subject":[],"published":{"date-parts":[[2010,7,26]]},"assertion":[{"value":"2010-07-26","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}