{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,21]],"date-time":"2026-03-21T02:20:20Z","timestamp":1774059620861,"version":"3.50.1"},"reference-count":36,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2012,7,1]],"date-time":"2012-07-01T00:00:00Z","timestamp":1341100800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000143","name":"Division of Computing and Communication Foundations","doi-asserted-by":"publisher","award":["CCF-0643552"],"award-info":[{"award-number":["CCF-0643552"]}],"id":[{"id":"10.13039\/100000143","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2012,8,5]]},"abstract":"<jats:p>We present a set of tools designed to help editors place cuts and create transitions in interview video. To help place cuts, our interface links a text transcript of the video to the corresponding locations in the raw footage. It also visualizes the suitability of cut locations by analyzing the audio\/visual features of the raw footage to find frames where the speaker is relatively quiet and still. With these tools editors can directly highlight segments of text, check if the endpoints are suitable cut locations and if so, simply delete the text to make the edit. For each cut our system generates visible (e.g. jump-cut, fade, etc.) and seamless, hidden transitions. We present a hierarchical, graph-based algorithm for efficiently generating hidden transitions that considers visual features specific to interview footage. We also describe a new data-driven technique for setting the timing of the hidden transition. Finally, our tools offer a one click method for seamlessly removing 'ums' and repeated words as well as inserting natural-looking pauses to emphasize semantic content. We apply our tools to edit a variety of interviews and also show how they can be used to quickly compose multiple takes of an actor narrating a story.<\/jats:p>","DOI":"10.1145\/2185520.2185563","type":"journal-article","created":{"date-parts":[[2012,8,6]],"date-time":"2012-08-06T18:11:37Z","timestamp":1344276697000},"page":"1-8","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":73,"title":["Tools for placing cuts and transitions in interview video"],"prefix":"10.1145","volume":"31","author":[{"given":"Floraine","family":"Berthouzoz","sequence":"first","affiliation":[{"name":"University of California, Berkeley"}]},{"given":"Wilmot","family":"Li","sequence":"additional","affiliation":[{"name":"Adobe Systems"}]},{"given":"Maneesh","family":"Agrawala","sequence":"additional","affiliation":[{"name":"University of California, Berkeley"}]}],"member":"320","published-online":{"date-parts":[[2012,7]]},"reference":[{"key":"e_1_2_2_1_1","volume-title":"Radio: An illustrated guide","author":"Abel J.","year":"1999","unstructured":"Abel , J. , and Glass , I . 1999 . Radio: An illustrated guide . WBEZ Alliance Inc . Abel, J., and Glass, I. 1999. Radio: An illustrated guide. WBEZ Alliance Inc."},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1186822.1073268"},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/293347.293348"},{"key":"e_1_2_2_4_1","first-page":"268","article-title":"BoostMap: A method for efficient approximate similarity rankings","author":"Athitsos V.","year":"2004","unstructured":"Athitsos , V. , Alon , J. , Sclaroff , S. , and Kollios , G. 2004 . BoostMap: A method for efficient approximate similarity rankings . Proc. CVPR , II : 268 -- 2II :275. Athitsos, V., Alon, J., Sclaroff, S., and Kollios, G. 2004. BoostMap: A method for efficient approximate similarity rankings. Proc. CVPR, II:268--II:275.","journal-title":"Proc. CVPR"},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/311535.311556"},{"key":"e_1_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1117\/12.238675"},{"key":"e_1_2_2_7_1","volume-title":"Proc. ICCV, 494--499","author":"Bregler C.","unstructured":"Bregler , C. , and Omohundro , S . 1995. Nonlinear manifold learning for visual speech recognition . Proc. ICCV, 494--499 . Bregler, C., and Omohundro, S. 1995. Nonlinear manifold learning for visual speech recognition. Proc. ICCV, 494--499."},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/258734.258880"},{"key":"e_1_2_2_9_1","volume-title":"Proc. ECCV, 25--36","author":"Brox T.","unstructured":"Brox , T. , Bruhn , A. , Papenberg , N. , and Weickert , J . 2004. High accuracy optical flow estimation based on a theory for warping . Proc. ECCV, 25--36 . Brox, T., Bruhn, A., Papenberg, N., and Weickert, J. 2004. High accuracy optical flow estimation based on a theory for warping. Proc. ECCV, 25--36."},{"key":"e_1_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/778712.778737"},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2005.177"},{"key":"e_1_2_2_12_1","volume-title":"Proc. SIGGRAPH ASIA 30","author":"Dale K.","unstructured":"Dale , K. , Sunkavalli , K. , Johnson , M. , Vlasic , D. , Matusik , W. , and Pfister , H . 2011. Video face replacement . Proc. SIGGRAPH ASIA 30 , 6, 130:1--130:10. Dale, K., Sunkavalli, K., Johnson, M., Vlasic, D., Matusik, W., and Pfister, H. 2011. Video face replacement. Proc. SIGGRAPH ASIA 30, 6, 130:1--130:10."},{"key":"e_1_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1357054.1357096"},{"key":"e_1_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2004.1262185"},{"key":"e_1_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/354401.354415"},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1449715.1449719"},{"key":"e_1_2_2_17_1","volume-title":"Warping and morphing of graphical objects","author":"Gomes J.","unstructured":"Gomes , J. 1999. Warping and morphing of graphical objects , vol. 1 . Morgan Kaufmann . Gomes, J. 1999. Warping and morphing of graphical objects, vol. 1. Morgan Kaufmann."},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1357054.1357097"},{"key":"e_1_2_2_19_1","volume-title":"Proc. ECCV, 341--353","author":"Kemelmacher-Shlizerman I.","unstructured":"Kemelmacher-Shlizerman , I. , Sankar , A. , Shechtman , E. , and Seitz , S . 2010. Being John Malkovich . Proc. ECCV, 341--353 . Kemelmacher-Shlizerman, I., Sankar, A., Shechtman, E., and Seitz, S. 2010. Being John Malkovich. Proc. ECCV, 341--353."},{"key":"e_1_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2010324.1964956"},{"key":"e_1_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1201775.882264"},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/1576246.1531348"},{"key":"e_1_2_2_23_1","volume-title":"The Invisible Cut: How Editors","author":"O'Steen B.","unstructured":"O'Steen , B. 2009. The Invisible Cut: How Editors Make Movie Magic. Michael Wiese Productions . O'Steen, B. 2009. The Invisible Cut: How Editors Make Movie Magic. Michael Wiese Productions."},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/280814.280825"},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2003.817150"},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/1357054.1357095"},{"key":"e_1_2_2_27_1","doi-asserted-by":"crossref","unstructured":"Saragih J. Lucey S. and Cohn J. 2009. Face alignment through subspace constrained mean-shifts. ICCV 1034--1041.  Saragih J. Lucey S. and Cohn J. 2009. Face alignment through subspace constrained mean-shifts. ICCV 1034--1041.","DOI":"10.1109\/ICCV.2009.5459377"},{"key":"e_1_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/545261.545281"},{"key":"e_1_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/344779.345012"},{"key":"e_1_2_2_30_1","volume-title":"Proc. CVPR, 615--622","author":"Shechtman E.","unstructured":"Shechtman , E. , Rav-Acha , A. , Irani , M. , and Seitz , S . 2010. Regenerative morphing . Proc. CVPR, 615--622 . Shechtman, E., Rav-Acha, A., Irani, M., and Seitz, S. 2010. Regenerative morphing. Proc. CVPR, 615--622."},{"key":"e_1_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1198302.1198305"},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/108844.108939"},{"key":"e_1_2_2_33_1","unstructured":"Virage. Audio analysis. http:\/\/www.virage.com\/.  Virage. Audio analysis. http:\/\/www.virage.com\/."},{"key":"e_1_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2007.60"},{"key":"e_1_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/217279.215068"},{"key":"e_1_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/1186562.1015759"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2185520.2185563","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2185520.2185563","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T10:06:47Z","timestamp":1750241207000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2185520.2185563"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,7]]},"references-count":36,"aliases":["10.1145\/2185520.2335418"],"journal-issue":{"issue":"4","published-print":{"date-parts":[[2012,8,5]]}},"alternative-id":["10.1145\/2185520.2185563"],"URL":"https:\/\/doi.org\/10.1145\/2185520.2185563","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,7]]},"assertion":[{"value":"2012-07-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}