{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T17:30:26Z","timestamp":1777656626746,"version":"3.51.4"},"reference-count":64,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2023,7,26]],"date-time":"2023-07-26T00:00:00Z","timestamp":1690329600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2023,8]]},"abstract":"<jats:p>\n            We present an interactive system to ease the creation of so-called\n            <jats:italic>video doodles<\/jats:italic>\n            - videos on which artists insert hand-drawn animations for entertainment or educational purposes. Video doodles are challenging to create because to be convincing, the inserted drawings must appear as if they were part of the captured scene. In particular, the drawings should undergo tracking, perspective deformations and occlusions as they move with respect to the camera and to other objects in the scene - visual effects that are difficult to reproduce with existing 2D video editing software. Our system supports these effects by relying on planar canvases that users position in a 3D scene reconstructed from the video. Furthermore, we present a custom tracking algorithm that allows users to anchor canvases to static or dynamic objects in the scene, such that the canvases move and rotate to follow the position and direction of these objects. 
When testing our system, novices could create a variety of short animated clips in a dozen minutes, while professionals praised its speed and ease of use compared to existing tools.\n          <\/jats:p>","DOI":"10.1145\/3592413","type":"journal-article","created":{"date-parts":[[2023,7,26]],"date-time":"2023-07-26T15:47:45Z","timestamp":1690386465000},"page":"1-12","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["VideoDoodles: Hand-Drawn Animations on Videos with Scene-Aware Canvases"],"prefix":"10.1145","volume":"42","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0170-200X","authenticated-orcid":false,"given":"Emilie","family":"Yu","sequence":"first","affiliation":[{"name":"Inria Centre d'Universit\u00e9 C\u00f4te d'Azur, Sophia-Antipolis, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-7891-4603","authenticated-orcid":false,"given":"Kevin","family":"Blackburn-Matzen","sequence":"additional","affiliation":[{"name":"Adobe Research, Seattle, United States of America"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9234-9960","authenticated-orcid":false,"given":"Cuong","family":"Nguyen","sequence":"additional","affiliation":[{"name":"Adobe Research, Seattle, United States of America"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2839-7153","authenticated-orcid":false,"given":"Oliver","family":"Wang","sequence":"additional","affiliation":[{"name":"Adobe Research, Seattle, United States of America"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-9781-0032","authenticated-orcid":false,"given":"Rubaiat","family":"Habib Kazi","sequence":"additional","affiliation":[{"name":"Adobe Research, Seattle, United States of 
America"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8003-9575","authenticated-orcid":false,"given":"Adrien","family":"Bousseau","sequence":"additional","affiliation":[{"name":"Inria Centre d'Universit\u00e9 C\u00f4te d'Azur, Sophia-Antipolis, France"},{"name":"TU Delft, Sophia-Antipolis, France"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,7,26]]},"reference":[{"key":"e_1_2_2_1_1","unstructured":"Adobe. 2022. After Effects. https:\/\/www.adobe.com\/products\/aftereffects.html."},{"key":"e_1_2_2_2_1","volume-title":"Keyframe-based tracking for rotoscoping and animation. ACM Transactions on Graphics (Proc. SIGGRAPH) 23, 3","author":"Agarwala Aseem","year":"2004","unstructured":"Aseem Agarwala, Aaron Hertzmann, David H Salesin, and Steven M Seitz. 2004. Keyframe-based tracking for rotoscoping and animation. ACM Transactions on Graphics (Proc. SIGGRAPH) 23, 3 (2004)."},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995341"},{"key":"e_1_2_2_4_1","unstructured":"Apple. 2022. ARKit. https:\/\/developer.apple.com\/augmented-reality\/arkit\/."},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3173574.3173759"},{"key":"e_1_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1449715.1449740"},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00640"},{"key":"e_1_2_2_8_1","unstructured":"Blender. 2022a. Blender Grease Pencil. https:\/\/www.blender.org\/features\/story-artist\/."},{"key":"e_1_2_2_9_1","unstructured":"Blender. 2022b. Blender Motion Tracking. https:\/\/docs.blender.org\/manual\/en\/latest\/movie_clip\/tracking\/index.html."},{"key":"e_1_2_2_10_1","unstructured":"BorisFX. 2022. Mocha Pro. https:\/\/borisfx.com\/products\/mocha-pro\/."},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-40020-9_37"},{"key":"e_1_2_2_12_1","volume-title":"Proc. 
IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Buchanan Aeron","year":"2006","unstructured":"Aeron Buchanan and Andrew Fitzgibbon. 2006. Interactive feature tracking using kd trees and dynamic programming. In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_2_2_13_1","unstructured":"Mental Canvas. 2022. Mental Canvas Application. https:\/\/mentalcanvas.com\/."},{"key":"e_1_2_2_14_1","unstructured":"Z. Cao G. Hidalgo Martinez T. Simon S. Wei and Y. A. Sheikh. 2019. OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields. IEEE Transactions on Pattern Analysis and Machine Intelligence (2019)."},{"key":"e_1_2_2_15_1","volume-title":"Lucas Smaira, Yusuf Aytar, Joao Carreira, Andrew Zisserman, and Yi Yang.","author":"Doersch Carl","year":"2022","unstructured":"Carl Doersch, Ankush Gupta, Larisa Markeeva, Adria Recasens Continente, Lucas Smaira, Yusuf Aytar, Joao Carreira, Andrew Zisserman, and Yi Yang. 2022. TAP-Vid: A Benchmark for Tracking Any Point in a Video. In NeurIPS Datasets Track."},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/PG.2007.64"},{"key":"e_1_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/1357054.1357096"},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3379337.3415881"},{"key":"e_1_2_2_19_1","unstructured":"Foundry. 2022. Nuke. https:\/\/www.foundry.com\/products\/nuke-family\/nuke."},{"key":"e_1_2_2_20_1","volume-title":"Schematic storyboarding for video visualization and editing. ACM Transactions on Graphics (Proc. SIGGRAPH) 25, 3","author":"Goldman Dan B","year":"2006","unstructured":"Dan B Goldman, Brian Curless, David Salesin, and Steven M Seitz. 2006. Schematic storyboarding for video visualization and editing. ACM Transactions on Graphics (Proc. 
SIGGRAPH) 25, 3 (2006)."},{"key":"e_1_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1449715.1449719"},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00762"},{"key":"e_1_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2014.2345390"},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/302979.303030"},{"key":"e_1_2_2_25_1","doi-asserted-by":"crossref","unstructured":"Riwano Ikeda and Issei Fujishiro. 2021. SpiCa: Stereoscopic Effect Design with 3D Pottery Wheel-Type Transparent Canvas. In ACM SIGGRAPH Asia 2021 Technical Communications.","DOI":"10.1145\/3478512.3488606"},{"key":"e_1_2_2_26_1","volume-title":"Space-Time Correspondence as a Contrastive Random Walk. Advances in Neural Information Processing Systems","author":"Jabri Allan","year":"2020","unstructured":"Allan Jabri, Andrew Owens, and Alexei A Efros. 2020. Space-Time Correspondence as a Contrastive Random Walk. Advances in Neural Information Processing Systems (2020)."},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3478513.3480546"},{"key":"e_1_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/2556288.2556987"},{"key":"e_1_2_2_29_1","unstructured":"KeenTools. 2022. GeoTracker. https:\/\/keentools.io\/products\/geotracker-for-after-effects."},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2766920"},{"key":"e_1_2_2_31_1","volume-title":"First-person Hyper-lapse videos. ACM Transactions on Graphics (Proc. SIGGRAPH) 33, 4","author":"Kopf Johannes","year":"2014","unstructured":"Johannes Kopf, Michael F. Cohen, and Richard Szeliski. 2014. First-person Hyper-lapse videos. ACM Transactions on Graphics (Proc. 
SIGGRAPH) 33, 4 (2014)."},{"key":"e_1_2_2_32_1","volume-title":"Francis Ge, Yangming Chong, Josh Patterson, Jan-Michael Frahm, Shu Wu, Matthew Yu, et al.","author":"Kopf Johannes","year":"2020","unstructured":"Johannes Kopf, Kevin Matzen, Suhib Alsisan, Ocean Quigley, Francis Ge, Yangming Chong, Josh Patterson, Jan-Michael Frahm, Shu Wu, Matthew Yu, et al. 2020. One shot 3D photography. ACM Transactions on Graphics (Proc. SIGGRAPH) 39, 4 (2020)."},{"key":"e_1_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00166"},{"key":"e_1_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/3313831.3376160"},{"key":"e_1_2_2_35_1","volume-title":"ACM Transactions on Graphics (Proc. SIGGRAPH) 35, 4","author":"Li Wenbin","year":"2016","unstructured":"Wenbin Li, Fabio Viola, Jonathan Starck, Gabriel J Brostow, and Neill DF Campbell. 2016. Roto++ accelerating professional rotoscoping using shape manifolds. ACM Transactions on Graphics (Proc. SIGGRAPH) 35, 4 (2016)."},{"key":"e_1_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/3126594.3126611"},{"key":"e_1_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8461037"},{"key":"e_1_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3526113.3545702"},{"key":"e_1_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3379337.3415822"},{"key":"e_1_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00328"},{"key":"e_1_2_2_41_1","volume-title":"Computer Graphics Forum","author":"Liu Sean J","unstructured":"Sean J Liu, Maneesh Agrawala, Stephen DiVerdi, and Aaron Hertzmann. 2022a. ZoomShop: Depth-Aware Editing of Photographic Composition. In Computer Graphics Forum, Vol. 41. Wiley Online Library, 57--70."},{"key":"e_1_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/3386569.3392377"},{"key":"e_1_2_2_43_1","volume-title":"MotionViz: Artistic Visualization of Human Motion on Mobile Devices. 
In ACM SIGGRAPH 2021 Appy Hour.","author":"Mayer Maximilian","year":"2021","unstructured":"Maximilian Mayer, Philipp Trenz, Sebastian Pasewaldt, Mandy Klingbeil, J\u00fcrgen D\u00f6llner, Matthias Trapp, and Amir Semmo. 2021. MotionViz: Artistic Visualization of Human Motion on Mobile Devices. In ACM SIGGRAPH 2021 Appy Hour."},{"key":"e_1_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/TRO.2015.2463671"},{"key":"e_1_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/2470654.2466150"},{"key":"e_1_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00770"},{"key":"e_1_2_2_47_1","doi-asserted-by":"crossref","unstructured":"F. Perazzi J. Pont-Tuset B. McWilliams L. Van Gool M. Gross and A. Sorkine-Hornung. 2016. A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation. In Computer Vision and Pattern Recognition.","DOI":"10.1109\/CVPR.2016.85"},{"key":"e_1_2_2_48_1","volume-title":"The 2017 DAVIS Challenge on Video Object Segmentation. arXiv:1704.00675","author":"Pont-Tuset Jordi","year":"2017","unstructured":"Jordi Pont-Tuset, Federico Perazzi, Sergi Caelles, Pablo Arbel\u00e1ez, Alexander Sorkine-Hornung, and Luc Van Gool. 2017. The 2017 DAVIS Challenge on Video Object Segmentation. arXiv:1704.00675 (2017)."},{"key":"e_1_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/1399504.1360616"},{"key":"e_1_2_2_50_1","unstructured":"Runway. 2022. RunwayML. https:\/\/app.runwayml.com\/."},{"key":"e_1_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/3290605.3300852"},{"key":"e_1_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/1572741.1572765"},{"key":"e_1_2_2_53_1","volume-title":"Structure-from-Motion Revisited. In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Sch\u00f6nberger Johannes Lutz","year":"2016","unstructured":"Johannes Lutz Sch\u00f6nberger and Jan-Michael Frahm. 2016. Structure-from-Motion Revisited. In Proc. 
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_2_2_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/1124728.1124739"},{"key":"e_1_2_2_55_1","volume-title":"Shape from sensors: Curve networks on surfaces from 3D orientations. Computers & Graphics (Proc. SMI) 66","author":"Stanko Tibor","year":"2017","unstructured":"Tibor Stanko, Stefanie Hahmann, Georges-Pierre Bonneau, and Nathalie Saguin-Sprynski. 2017. Shape from sensors: Curve networks on surfaces from 3D orientations. Computers & Graphics (Proc. SMI) 66 (2017)."},{"key":"e_1_2_2_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/3173574.3174236"},{"key":"e_1_2_2_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/3379337.3415892"},{"key":"e_1_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58536-5_24"},{"key":"e_1_2_2_59_1","first-page":"1","article-title":"Pymanopt: A Python Toolbox for Optimization on Manifolds using Automatic Differentiation","volume":"17","author":"Townsend James","year":"2016","unstructured":"James Townsend, Niklas Koep, and Sebastian Weichwald. 2016. Pymanopt: A Python Toolbox for Optimization on Manifolds using Automatic Differentiation. Journal of Machine Learning Research 17, 137 (2016), 1--5.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/3272127.3275041"},{"key":"e_1_2_2_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/3126594.3126641"},{"key":"e_1_2_2_62_1","volume-title":"PoseCNN: A Convolutional Neural Network for 6D Object Pose Estimation in Cluttered Scenes","author":"Xiang Yu","year":"2018","unstructured":"Yu Xiang, Tanner Schmidt, Venkatraman Narayanan, and Dieter Fox. 2018. PoseCNN: A Convolutional Neural Network for 6D Object Pose Estimation in Cluttered Scenes. 
Robotics: Science and Systems (RSS) (2018)."},{"key":"e_1_2_2_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/3242587.3242592"},{"key":"e_1_2_2_64_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3450626.3459871","article-title":"Consistent depth of moving objects in video","volume":"40","author":"Zhang Zhoutong","year":"2021","unstructured":"Zhoutong Zhang, Forrester Cole, Richard Tucker, William T Freeman, and Tali Dekel. 2021. Consistent depth of moving objects in video. ACM Transactions on Graphics (Proc. SIGGRAPH) 40, 4 (2021), 1--12.","journal-title":"ACM Transactions on Graphics (Proc. SIGGRAPH)"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3592413","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3592413","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:48:59Z","timestamp":1750182539000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3592413"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,26]]},"references-count":64,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2023,8]]}},"alternative-id":["10.1145\/3592413"],"URL":"https:\/\/doi.org\/10.1145\/3592413","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,7,26]]},"assertion":[{"value":"2023-07-26","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}