{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,27]],"date-time":"2025-10-27T10:39:17Z","timestamp":1761561557390,"version":"3.41.0"},"reference-count":30,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2011,7,1]],"date-time":"2011-07-01T00:00:00Z","timestamp":1309478400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Landmark Program of the NCKU Top University Project","award":["B0008"],"award-info":[{"award-number":["B0008"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2011,7]]},"abstract":"<jats:p>The key to high-quality video resizing is preserving the shape and motion of visually salient objects while remaining temporally-coherent. These spatial and temporal requirements are difficult to reconcile, typically leading existing video retargeting methods to sacrifice one of them and causing distortion or waving artifacts. Recent work enforces temporal coherence of content-aware video warping by solving a global optimization problem over the entire video cube. This significantly improves the results but does not scale well with the resolution and length of the input video and quickly becomes intractable. We propose a new method that solves the scalability problem without compromising the resizing quality. Our method factors the problem into spatial and time\/motion components: we first resize each frame independently to preserve the shape of salient regions, and then we optimize their motion using a reduced model for each pathline of the optical flow. This factorization decomposes the optimization of the video cube into sets of sub-problems whose size is proportional to a single frame's resolution and which can be solved in parallel. We also show how to incorporate cropping into our optimization, which is useful for scenes with numerous salient objects where warping alone would degenerate to linear scaling. Our results match the quality of state-of-the-art retargeting methods while dramatically reducing the computation time and memory consumption, making content-aware video resizing scalable and practical.<\/jats:p>","DOI":"10.1145\/2010324.1964983","type":"journal-article","created":{"date-parts":[[2011,7,26]],"date-time":"2011-07-26T14:17:46Z","timestamp":1311689866000},"page":"1-8","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":36,"title":["Scalable and coherent video resizing with per-frame optimization"],"prefix":"10.1145","volume":"30","author":[{"given":"Yu-Shuen","family":"Wang","sequence":"first","affiliation":[{"name":"National Chiao Tung University and National Cheng Kung University"}]},{"given":"Jen-Hung","family":"Hsiao","sequence":"additional","affiliation":[{"name":"National Cheng Kung University"}]},{"given":"Olga","family":"Sorkine","sequence":"additional","affiliation":[{"name":"New York University and ETH Zurich"}]},{"given":"Tong-Yee","family":"Lee","sequence":"additional","affiliation":[{"name":"National Cheng Kung University"}]}],"member":"320","published-online":{"date-parts":[[2011,7,25]]},"reference":[{"doi-asserted-by":"publisher","key":"e_1_2_2_1_1","DOI":"10.1145\/1276377.1276390"},{"doi-asserted-by":"publisher","key":"e_1_2_2_2_1","DOI":"10.1145\/1531326.1531330"},{"doi-asserted-by":"publisher","key":"e_1_2_2_3_1","DOI":"10.1007\/s00530-003-0105-4"},{"volume-title":"Proc. CVPR '08","author":"Cho T. S.","unstructured":"Cho, T. S., Butman, M., Avidan, S., and Freeman, W. T. 2008. The patch transform and its applications to image editing. In Proc. CVPR '08.","key":"e_1_2_2_4_1"},{"volume-title":"CVPR '08","author":"Deselaers T.","unstructured":"Deselaers, T., Dreuw, P., and Ney, H. 2008. Pan, zoom, scan -- time-coherent, trained automatic video cropping. In CVPR '08.","key":"e_1_2_2_5_1"},{"doi-asserted-by":"publisher","key":"e_1_2_2_6_1","DOI":"10.1145\/1618452.1618471"},{"doi-asserted-by":"publisher","key":"e_1_2_2_7_1","DOI":"10.5555\/2383894.2383931"},{"doi-asserted-by":"publisher","key":"e_1_2_2_8_1","DOI":"10.1145\/1404880.1404882"},{"doi-asserted-by":"publisher","key":"e_1_2_2_9_1","DOI":"10.5555\/1735603.1735605"},{"doi-asserted-by":"publisher","key":"e_1_2_2_10_1","DOI":"10.1145\/1618452.1618472"},{"doi-asserted-by":"publisher","key":"e_1_2_2_11_1","DOI":"10.1145\/1180639.1180702"},{"doi-asserted-by":"publisher","key":"e_1_2_2_12_1","DOI":"10.1145\/957013.957045"},{"volume-title":"Proc. CVPR, 537--544","author":"Niu Y.","unstructured":"Niu, Y., Liu, F., Li, X., and Gleicher, M. 2010. Warp propagation for video resizing. In Proc. CVPR, 537--544.","key":"e_1_2_2_13_1"},{"volume-title":"Proc. ICCV'09","author":"Pritch Y.","unstructured":"Pritch, Y., Kav-Venaki, E., and Peleg, S. 2009. Shift-map image editing. In Proc. ICCV'09.","key":"e_1_2_2_14_1"},{"volume-title":"Proc. CVPR, II-343-8.","author":"Rasheed Z.","unstructured":"Rasheed, Z., and Shah, M. 2003. Scene detection in Hollywood movies and TV shows. In Proc. CVPR, II-343-8.","key":"e_1_2_2_15_1"},{"doi-asserted-by":"publisher","key":"e_1_2_2_16_1","DOI":"10.1145\/1360612.1360615"},{"doi-asserted-by":"publisher","key":"e_1_2_2_17_1","DOI":"10.1145\/1531326.1531329"},{"doi-asserted-by":"publisher","key":"e_1_2_2_18_1","DOI":"10.1145\/1882261.1866186"},{"doi-asserted-by":"publisher","key":"e_1_2_2_19_1","DOI":"10.1145\/1124772.1124886"},{"doi-asserted-by":"publisher","unstructured":"Shamir A. and Sorkine O. 2009. Visual media retargeting. In ACM SIGGRAPH Asia Courses. 10.1145\/1665817.1665828","key":"e_1_2_2_20_1","DOI":"10.1145\/1665817.1665828"},{"volume-title":"Proc. CVPR '08","author":"Simakov D.","unstructured":"Simakov, D., Caspi, Y., Shechtman, E., and Irani, M. 2008. Summarizing visual data using bidirectional similarity. In Proc. CVPR '08.","key":"e_1_2_2_21_1"},{"doi-asserted-by":"publisher","key":"e_1_2_2_22_1","DOI":"10.1145\/964696.964707"},{"doi-asserted-by":"publisher","key":"e_1_2_2_23_1","DOI":"10.1145\/1409060.1409071"},{"doi-asserted-by":"publisher","key":"e_1_2_2_24_1","DOI":"10.1145\/1618452.1618473"},{"doi-asserted-by":"publisher","key":"e_1_2_2_25_1","DOI":"10.1145\/1778765.1778827"},{"volume-title":"Proc. British Machine Vision Conference (BMVC).","author":"Werlberger M.","unstructured":"Werlberger, M., Trobin, W., Pock, T., Wedel, A., Cremers, D., and Bischof, H. 2009. Anisotropic Huber-L1 optical flow. In Proc. British Machine Vision Conference (BMVC).","key":"e_1_2_2_26_1"},{"volume-title":"ICCV '07","author":"Wolf L.","unstructured":"Wolf, L., Guttmann, M., and Cohen-Or, D. 2007. Non-homogeneous content-driven video-retargeting. In ICCV '07.","key":"e_1_2_2_27_1"},{"doi-asserted-by":"publisher","key":"e_1_2_2_28_1","DOI":"10.1145\/1882261.1866185"},{"volume-title":"Proc. PG '08","author":"Zhang Y.-F.","unstructured":"Zhang, Y.-F., Hu, S.-M., and Martin, R. R. 2008. Shrinkability maps for content-aware video resizing. In Proc. PG '08.","key":"e_1_2_2_29_1"},{"doi-asserted-by":"publisher","key":"e_1_2_2_30_1","DOI":"10.1111\/j.1467-8659.2009.01568.x"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2010324.1964983","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2010324.1964983","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T11:06:23Z","timestamp":1750244783000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2010324.1964983"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,7]]},"references-count":30,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2011,7]]}},"alternative-id":["10.1145\/2010324.1964983"],"URL":"https:\/\/doi.org\/10.1145\/2010324.1964983","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"type":"print","value":"0730-0301"},{"type":"electronic","value":"1557-7368"}],"subject":[],"published":{"date-parts":[[2011,7]]},"assertion":[{"value":"2011-07-25","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}