{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:24:57Z","timestamp":1750307097188,"version":"3.41.0"},"reference-count":23,"publisher":"Association for Computing Machinery (ACM)","issue":"6","license":[{"start":{"date-parts":[[2012,11,1]],"date-time":"2012-11-01T00:00:00Z","timestamp":1351728000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100002855","name":"Ministry of Science and Technology of the People's Republic of China","doi-asserted-by":"publisher","award":["2009CB320802"],"award-info":[{"award-number":["2009CB320802"]}],"id":[{"id":"10.13039\/501100002855","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002858","name":"China Postdoctoral Science Foundation","doi-asserted-by":"publisher","award":["2012M511509"],"award-info":[{"award-number":["2012M511509"]}],"id":[{"id":"10.13039\/501100002858","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100017445","name":"Natural Science Fund for Distinguished Young Scholars of Shandong Province","doi-asserted-by":"crossref","award":["JQ200920"],"award-info":[{"award-number":["JQ200920"]}],"id":[{"id":"10.13039\/100017445","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["U10350046120214961173070"],"award-info":[{"award-number":["U10350046120214961173070"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2012,11]]},"abstract":"<jats:p>Existing video object cutout systems can only deal with limited cases. They usually require detailed user interactions to segment real-life videos, which often suffer from both inseparable statistics (similar appearance between foreground and background) and temporal discontinuities (e.g. large movements, newly-exposed regions following disocclusion or topology change).<\/jats:p><jats:p>In this paper, we present an efficient video cutout system to meet this challenge. A novel directional classifier is proposed to handle temporal discontinuities robustly, and then multiple classifiers are incorporated to cover a variety of cases. The outputs of these classifiers are integrated via another classifier, which is learnt from real examples. The foreground matte is solved by a coherent matting procedure, and remaining errors can be removed easily by additive spatio-temporal local editing. Experiments demonstrate that our system performs more robustly and more intelligently than existing systems in dealing with various input types, thus saving a lot of user labor and time.<\/jats:p>","DOI":"10.1145\/2366145.2366194","type":"journal-article","created":{"date-parts":[[2012,11,14]],"date-time":"2012-11-14T20:36:17Z","timestamp":1352925377000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":28,"title":["Discontinuity-aware video object cutout"],"prefix":"10.1145","volume":"31","author":[{"given":"Fan","family":"Zhong","sequence":"first","affiliation":[{"name":"Shandong University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xueying","family":"Qin","sequence":"additional","affiliation":[{"name":"Shandong University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qunsheng","family":"Peng","sequence":"additional","affiliation":[{"name":"Zhejiang University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiangxu","family":"Meng","sequence":"additional","affiliation":[{"name":"Shandong Provincial Key Laboratory of Software Engineering"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2012,11]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1015706.1015764"},{"key":"e_1_2_1_2_1","doi-asserted-by":"crossref","unstructured":"Apostoloff N. and Fitzgibbon A. 2004. Bayesian Video Matting Using Learnt Image Priors. In CVPR 407--414. Apostoloff N. and Fitzgibbon A. 2004. Bayesian Video Matting Using Learnt Image Priors. In CVPR 407--414.","DOI":"10.1109\/CVPR.2004.1315061"},{"key":"e_1_2_1_3_1","doi-asserted-by":"crossref","unstructured":"Bai X. and Sapiro G. 2007. A geodesic framework for fast interactive image and video segmentation and matting. In ICCV 1--8. Bai X. and Sapiro G. 2007. A geodesic framework for fast interactive image and video segmentation and matting. In ICCV 1--8.","DOI":"10.1109\/ICCV.2007.4408931"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1531326.1531376"},{"key":"e_1_2_1_5_1","doi-asserted-by":"crossref","unstructured":"Bai X. Wang J. and Sapiro G. 2010. Dynamic color flow: a motion-adaptive color model for object segmentation in video. In ECCV 617--630. Bai X. Wang J. and Sapiro G. 2010. Dynamic color flow: a motion-adaptive color model for object segmentation in video. In ECCV 617--630.","DOI":"10.1007\/978-3-642-15555-0_45"},{"key":"e_1_2_1_6_1","unstructured":"Bilmes J. 1998. A gentle tutorial of the em algorithm and its application to parameter estimation for gaussian mixture and hidden markov models. Tech. rep. International Computer Science Institute Berkeley. Bilmes J. 1998. A gentle tutorial of the em algorithm and its application to parameter estimation for gaussian mixture and hidden markov models. Tech. rep. International Computer Science Institute Berkeley."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/566654.566572"},{"key":"e_1_2_1_8_1","unstructured":"Gong M. Wang L. Yang R. and Yang Y.-H. 2010. Real-Time Video matting using Multichannel Poisson Equations. In Graphics Interface 89--96. Gong M. Wang L. Yang R. and Yang Y.-H. 2010. Real-Time Video matting using Multichannel Poisson Equations. In Graphics Interface 89--96."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1276377.1276497"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2007.1177"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1073204.1073234"},{"key":"e_1_2_1_12_1","unstructured":"Mortensen E. N. and Barrett W. A. 1999. Toboggan-Based Intelligent Scissors with a Four Parameter Edge Model. In CVPR. Mortensen E. N. and Barrett W. A. 1999. Toboggan-Based Intelligent Scissors with a Four Parameter Edge Model. In CVPR ."},{"volume-title":"Livecut: Learning-based interactive video segmentation by evaluation of multiple propagated cues. In ICCV, 779--786.","year":"2009","author":"Price B.","key":"e_1_2_1_13_1"},{"key":"e_1_2_1_14_1","doi-asserted-by":"crossref","unstructured":"Rhemann C. Rother C. Rav-Acha A. and Sharp T. 2008. High resolution matting via interactive trimap segmentation. In CVPR 1--8. Rhemann C. Rother C. Rav-Acha A. and Sharp T. 2008. High resolution matting via interactive trimap segmentation. In CVPR 1--8.","DOI":"10.1109\/CVPR.2008.4587441"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-007-0075-7"},{"key":"e_1_2_1_16_1","first-page":"1","article-title":"Non-Parametric patch based video matting. In BMVC","volume":"98","author":"Sarim M.","year":"2009","journal-title":"British Machine Vision Association"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00371-011-0598-3"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1467-8659.2011.02038.x"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1019956318069"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1073204.1073233"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1966394.1966401"},{"key":"e_1_2_1_22_1","doi-asserted-by":"crossref","unstructured":"Yin P. Criminisi A. Winn J. and Essa I. 2007. Tree-based classifiers for bilayer video segmentation. In CVPR 1--8. Yin P. Criminisi A. Winn J. and Essa I. 2007. Tree-based classifiers for bilayer video segmentation. In CVPR 1--8.","DOI":"10.1109\/CVPR.2007.383008"},{"key":"e_1_2_1_23_1","doi-asserted-by":"crossref","unstructured":"Zhong F. Qin X. and Peng Q. 2010. Transductive segmentation of live video with non-stationary background. In CVPR 2189--2196. Zhong F. Qin X. and Peng Q. 2010. Transductive segmentation of live video with non-stationary background. In CVPR 2189--2196.","DOI":"10.1109\/CVPR.2010.5539899"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2366145.2366194","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2366145.2366194","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T09:34:44Z","timestamp":1750239284000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2366145.2366194"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,11]]},"references-count":23,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2012,11]]}},"alternative-id":["10.1145\/2366145.2366194"],"URL":"https:\/\/doi.org\/10.1145\/2366145.2366194","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"type":"print","value":"0730-0301"},{"type":"electronic","value":"1557-7368"}],"subject":[],"published":{"date-parts":[[2012,11]]},"assertion":[{"value":"2012-11-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}