{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,15]],"date-time":"2026-04-15T16:24:26Z","timestamp":1776270266112,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":61,"publisher":"ACM","funder":[{"name":"Israel Science Foundation (ISF)","award":["3441\/21"],"award-info":[{"award-number":["3441\/21"]}]},{"name":"National Natural Science Foundation of China (NSFC)","award":["3077\/23"],"award-info":[{"award-number":["3077\/23"]}]},{"DOI":"10.13039\/501100004375","name":"Tel Aviv University","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100004375","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,8,10]]},"DOI":"10.1145\/3721238.3730612","type":"proceedings-article","created":{"date-parts":[[2025,7,23]],"date-time":"2025-07-23T08:40:47Z","timestamp":1753260047000},"page":"1-12","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0004-8471-3709","authenticated-orcid":false,"given":"Ellie","family":"Arar","sequence":"first","affiliation":[{"name":"Tel Aviv University, Tel Aviv, Israel"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-4248-9421","authenticated-orcid":false,"given":"Yarden","family":"Frenkel","sequence":"additional","affiliation":[{"name":"Tel Aviv University, Tel Aviv, Israel"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6777-7445","authenticated-orcid":false,"given":"Daniel","family":"Cohen-Or","sequence":"additional","affiliation":[{"name":"Tel Aviv University, Tel Aviv, Israel"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7082-7845","authenticated-orcid":false,"given":"Ariel","family":"Shamir","sequence":"additional","affiliation":[{"name":"Reichman University, Herzliya, Israel"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4402-7267","authenticated-orcid":false,"given":"Yael","family":"Vinker","sequence":"additional","affiliation":[{"name":"Computer Science and Artificial Intelligence Laboratory (CSAIL), Massachusetts Institute of Technology (MIT), Cambridge, USA"}]}],"member":"320","published-online":{"date-parts":[[2025,7,27]]},"reference":[{"key":"e_1_3_3_2_2_1","doi-asserted-by":"publisher","unstructured":"Pablo Arbelaez Michael Maire Charless Fowlkes and Jitendra Malik. 2011. Contour Detection and Hierarchical Image Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 33 5 (May 2011) 898\u2013916. 10.1109\/TPAMI.2010.161","DOI":"10.1109\/TPAMI.2010.161"},{"key":"e_1_3_3_2_3_1","volume-title":"The Twelfth International Conference on Learning Representations","author":"Ashcroft Alexander","year":"2024","unstructured":"Alexander Ashcroft, Ayan Das, Yulia Gryaditskaya, Zhiyu Qu, and Yi-Zhe Song. 2024. Modelling complex vector drawings with stroke-clouds. In The Twelfth International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=O2jyuo89CK"},{"key":"e_1_3_3_2_4_1","doi-asserted-by":"crossref","unstructured":"Itamar Berger Ariel Shamir Moshe Mahler Elizabeth\u00a0Jeanne Carter and Jessica\u00a0K. Hodgins. 2013. Style and abstraction in portrait sketching. ACM Transactions on Graphics (TOG) 32 (2013) 1 \u2013 12. https:\/\/api.semanticscholar.org\/CorpusID:17238299","DOI":"10.1145\/2461912.2461964"},{"key":"e_1_3_3_2_5_1","doi-asserted-by":"crossref","unstructured":"Kumar Bhunia Umar\u00a0Ayan Das Riaz Muhammad Yongxin Yang Timothy\u00a0M. Hospedales Tao Xiang Yulia Gryaditskaya and Yi-Zhe Song. 2020. Edinburgh Research Explorer Pixelor: A Competitive Sketching AI Agent. so You Think You Can Sketch?https:\/\/api.semanticscholar.org\/CorpusID:266903640","DOI":"10.1145\/3414685.3417840"},{"key":"e_1_3_3_2_6_1","unstructured":"Reiner Birkl Diana Wofk and Matthias M\u00fcller. 2023. MiDaS v3.1 \u2013 A Model Zoo for Robust Monocular Relative Depth Estimation. arxiv:https:\/\/arXiv.org\/abs\/2307.14460\u00a0[cs.CV] https:\/\/arxiv.org\/abs\/2307.14460"},{"key":"e_1_3_3_2_7_1","doi-asserted-by":"crossref","unstructured":"Caroline Chan Fr\u00e9do Durand and Phillip Isola. 2022. Learning to generate line drawings that convey geometry and semantics. 2022 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022) 7905\u20137915. https:\/\/api.semanticscholar.org\/CorpusID:247628105","DOI":"10.1109\/CVPR52688.2022.00776"},{"key":"e_1_3_3_2_8_1","doi-asserted-by":"crossref","unstructured":"Shoufa Chen Pei Sun Yibing Song and Ping Luo. 2022. DiffusionDet: Diffusion Model for Object Detection. 2023 IEEE\/CVF International Conference on Computer Vision (ICCV) (2022) 19773\u201319786. https:\/\/api.semanticscholar.org\/CorpusID:253581633","DOI":"10.1109\/ICCV51070.2023.01816"},{"key":"e_1_3_3_2_9_1","unstructured":"Yajing Chen Shikui Tu Yuqi Yi and Lei Xu. 2017. Sketch-pix2seq: a Model to Generate Sketches of Multiple Categories. ArXiv abs\/1709.04121 (2017). https:\/\/api.semanticscholar.org\/CorpusID:4809276"},{"key":"e_1_3_3_2_10_1","unstructured":"Chenxwh. 2025. BRIA Background Removal v1.4 Model. https:\/\/github.com\/chenxwh\/cog-RMBG"},{"key":"e_1_3_3_2_11_1","doi-asserted-by":"crossref","unstructured":"Mathias Eitz James Hays and Marc Alexa. 2012. How do humans sketch objects? ACM Transactions on Graphics (TOG) 31 (2012) 1 \u2013 10. https:\/\/api.semanticscholar.org\/CorpusID:207194178","DOI":"10.1145\/2185520.2335395"},{"key":"e_1_3_3_2_12_1","unstructured":"Kevin Frans Lisa\u00a0B. Soros and Olaf Witkowski. 2021. CLIPDraw: Exploring Text-to-Drawing Synthesis through Language-Image Encoders. ArXiv abs\/2106.14843 (2021). https:\/\/api.semanticscholar.org\/CorpusID:235658147"},{"key":"e_1_3_3_2_13_1","doi-asserted-by":"crossref","unstructured":"Yarden Frenkel Yael Vinker Ariel Shamir and Daniel Cohen-Or. 2024. Implicit Style-Content Separation using B-LoRA. arxiv:https:\/\/arXiv.org\/abs\/2403.14572\u00a0[cs.CV]","DOI":"10.1007\/978-3-031-72684-2_11"},{"key":"e_1_3_3_2_14_1","first-page":"50742","volume-title":"Advances in Neural Information Processing Systems","author":"Fu Stephanie","year":"2023","unstructured":"Stephanie Fu, Netanel Tamir, Shobhita Sundaram, Lucy Chai, Richard Zhang, Tali Dekel, and Phillip Isola. 2023. DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data. In Advances in Neural Information Processing Systems , Vol.\u00a036. 50742\u201350768."},{"key":"e_1_3_3_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00522"},{"key":"e_1_3_3_2_16_1","doi-asserted-by":"crossref","unstructured":"Yulia Gryaditskaya Mark Sypesteyn Jan\u00a0Willem Hoftijzer Sylvia\u00a0C. Pont Fr\u00e9do Durand and Adrien Bousseau. 2019. OpenSketch. ACM Transactions on Graphics (TOG) 38 (2019) 1 \u2013 16. https:\/\/api.semanticscholar.org\/CorpusID:203182013","DOI":"10.1145\/3355089.3356533"},{"key":"e_1_3_3_2_17_1","unstructured":"David Ha and Douglas Eck. 2017. A Neural Representation of Sketch Drawings. CoRR abs\/1704.03477 (2017). arXiv:https:\/\/arXiv.org\/abs\/1704.03477http:\/\/arxiv.org\/abs\/1704.03477"},{"key":"e_1_3_3_2_18_1","unstructured":"Yue Han Jiangning Zhang Junwei Zhu Xiangtai Li Yanhao Ge Wei Li Chengjie Wang Yong Liu Xiaoming Liu and Ying Tai. 2023. A Generalist FaceX via Learning Unified Facial Representation. ArXiv abs\/2401.00551 (2023). https:\/\/api.semanticscholar.org\/CorpusID:266693482"},{"key":"e_1_3_3_2_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.00457"},{"key":"e_1_3_3_2_20_1","unstructured":"Jonathan Ho. 2022. Classifier-Free Diffusion Guidance. ArXiv abs\/2207.12598 (2022). https:\/\/api.semanticscholar.org\/CorpusID:249145348"},{"key":"e_1_3_3_2_21_1","series-title":"(NIPS \u201920)","volume-title":"Proceedings of the 34th International Conference on Neural Information Processing Systems","author":"Ho Jonathan","year":"2020","unstructured":"Jonathan Ho, Ajay Jain, and Pieter Abbeel. 2020. Denoising diffusion probabilistic models. In Proceedings of the 34th International Conference on Neural Information Processing Systems (Vancouver, BC, Canada) (NIPS \u201920). Curran Associates Inc., Red Hook, NY, USA, Article 574, 12\u00a0pages."},{"key":"e_1_3_3_2_22_1","doi-asserted-by":"crossref","unstructured":"Zixuan Huang Mark Boss Aaryaman Vasishta James\u00a0M. Rehg and Varun Jampani. 2025. SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images. https:\/\/api.semanticscholar.org\/CorpusID:275357723","DOI":"10.1109\/CVPR52734.2025.01571"},{"key":"e_1_3_3_2_23_1","doi-asserted-by":"crossref","unstructured":"Ajay Jain Amber Xie and Pieter Abbeel. 2022. VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models. arXiv (2022).","DOI":"10.1109\/CVPR52729.2023.00190"},{"key":"e_1_3_3_2_24_1","doi-asserted-by":"crossref","unstructured":"Moritz Kampelm\u00fchler and Axel Pinz. 2020. Synthesizing human-like sketches from natural images using a conditional convolutional decoder. 2020 IEEE Winter Conference on Applications of Computer Vision (WACV) (2020) 3192\u20133200. https:\/\/api.semanticscholar.org\/CorpusID:211830916","DOI":"10.1109\/WACV45572.2020.9093440"},{"key":"e_1_3_3_2_25_1","unstructured":"Junnan Li Dongxu Li Silvio Savarese and Steven Hoi. 2023. BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models. arxiv:https:\/\/arXiv.org\/abs\/2301.12597\u00a0[cs.CV] https:\/\/arxiv.org\/abs\/2301.12597"},{"key":"e_1_3_3_2_26_1","doi-asserted-by":"crossref","unstructured":"Mengtian Li Zhe\u00a0L. Lin Radom\u00edr M\u011bch Ersin Yumer and Deva Ramanan. 2019. Photo-Sketching: Inferring Contour Drawings From Images. 2019 IEEE Winter Conference on Applications of Computer Vision (WACV) (2019) 1403\u20131412. https:\/\/api.semanticscholar.org\/CorpusID:57375706","DOI":"10.1109\/WACV.2019.00154"},{"key":"e_1_3_3_2_27_1","doi-asserted-by":"crossref","unstructured":"Tzu-Mao Li Michal Luk\u00e1c Micha\u00ebl Gharbi and Jonathan Ragan-Kelley. 2020. Differentiable vector graphics rasterization for editing and learning. ACM Transactions on Graphics (TOG) 39 (2020) 1 \u2013 15. https:\/\/api.semanticscholar.org\/CorpusID:221686970","DOI":"10.1145\/3414685.3417871"},{"key":"e_1_3_3_2_28_1","doi-asserted-by":"crossref","unstructured":"Hangyu Lin Yanwei Fu Yu-Gang Jiang and X. Xue. 2020. Sketch-BERT: Learning Sketch Bidirectional Encoder Representation From Transformers by Self-Supervised Learning of Sketch Gestalt. 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2020) 6757\u20136766. https:\/\/api.semanticscholar.org\/CorpusID:218684600","DOI":"10.1109\/CVPR42600.2020.00679"},{"key":"e_1_3_3_2_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01394"},{"key":"e_1_3_3_2_30_1","doi-asserted-by":"crossref","unstructured":"Shitong Luo and Wei Hu. 2021. Diffusion Probabilistic Models for 3D Point Cloud Generation. 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021) 2836\u20132844. https:\/\/api.semanticscholar.org\/CorpusID:232092778","DOI":"10.1109\/CVPR46437.2021.00286"},{"key":"e_1_3_3_2_31_1","doi-asserted-by":"crossref","unstructured":"Umar\u00a0Riaz Muhammad Yongxin Yang Yi-Zhe Song Tao Xiang and Timothy\u00a0M. Hospedales. 2018. Learning Deep Sketch Abstraction. 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (2018) 8014\u20138023. https:\/\/api.semanticscholar.org\/CorpusID:4865391","DOI":"10.1109\/CVPR.2018.00836"},{"key":"e_1_3_3_2_32_1","unstructured":"Kushin Mukherjee Holly Huey Xuanchen Lu Yael Vinker Rio Aguina-Kang Ariel Shamir and Judith\u00a0E. Fan. 2023. SEVA: Leveraging sketches to evaluate alignment between human and machine visual abstraction. ArXiv abs\/2312.03035 (2023). https:\/\/api.semanticscholar.org\/CorpusID:265720453"},{"key":"e_1_3_3_2_33_1","unstructured":"Alex Nichol and Prafulla Dhariwal. 2021. Improved Denoising Diffusion Probabilistic Models. ArXiv abs\/2102.09672 (2021). https:\/\/api.semanticscholar.org\/CorpusID:231979499"},{"key":"e_1_3_3_2_34_1","unstructured":"Dustin Podell Zion English Kyle Lacey Andreas Blattmann Tim Dockhorn Jonas M\u00fcller Joe Penna and Robin Rombach. 2023. SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis. arxiv:https:\/\/arXiv.org\/abs\/2307.01952\u00a0[cs.CV] https:\/\/arxiv.org\/abs\/2307.01952"},{"key":"e_1_3_3_2_35_1","unstructured":"Sagi Polaczek Yuval Alaluf Elad Richardson Yael Vinker and Daniel Cohen-Or. 2025. NeuralSVG: An Implicit Representation for Text-to-Vector Generation. arxiv:https:\/\/arXiv.org\/abs\/2501.03992\u00a0[cs.CV] https:\/\/arxiv.org\/abs\/2501.03992"},{"key":"e_1_3_3_2_36_1","unstructured":"Ben Poole Ajay Jain Jonathan\u00a0T. Barron and Ben Mildenhall. 2022. DreamFusion: Text-to-3D using 2D Diffusion. ArXiv abs\/2209.14988 (2022). https:\/\/api.semanticscholar.org\/CorpusID:252596091"},{"key":"e_1_3_3_2_37_1","doi-asserted-by":"crossref","unstructured":"Yonggang Qi Guoyao Su Pinaki\u00a0Nath Chowdhury Mingkang Li and Yi-Zhe Song. 2021. SketchLattice: Latticed Representation for Sketch Manipulation. 2021 IEEE\/CVF International Conference on Computer Vision (ICCV) (2021) 933\u2013941. https:\/\/api.semanticscholar.org\/CorpusID:237304123","DOI":"10.1109\/ICCV48922.2021.00099"},{"key":"e_1_3_3_2_38_1","unstructured":"Alec Radford Jong\u00a0Wook Kim Chris Hallacy Aditya Ramesh Gabriel Goh Sandhini Agarwal Girish Sastry Amanda Askell Pamela Mishkin Jack Clark Gretchen Krueger and Ilya Sutskever. 2021. Learning Transferable Visual Models From Natural Language Supervision. CoRR abs\/2103.00020 (2021). arXiv:https:\/\/arXiv.org\/abs\/2103.00020https:\/\/arxiv.org\/abs\/2103.00020"},{"key":"e_1_3_3_2_39_1","doi-asserted-by":"crossref","unstructured":"Leo Sampaio\u00a0Ferraz Ribeiro Tu Bui John\u00a0P. Collomosse and Moacir\u00a0Antonelli Ponti. 2020. Sketchformer: Transformer-Based Representation for Sketched Structure. 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2020) 14141\u201314150. https:\/\/api.semanticscholar.org\/CorpusID:211258599","DOI":"10.1109\/CVPR42600.2020.01416"},{"key":"e_1_3_3_2_40_1","doi-asserted-by":"crossref","unstructured":"Robin Rombach Andreas Blattmann Dominik Lorenz Patrick Esser and Bj\u00f6rn Ommer. 2022. High-Resolution Image Synthesis with Latent Diffusion Models. arxiv:https:\/\/arXiv.org\/abs\/2112.10752\u00a0[cs.CV]","DOI":"10.1109\/CVPR52688.2022.01042"},{"key":"e_1_3_3_2_41_1","doi-asserted-by":"crossref","unstructured":"Chitwan Saharia William Chan Saurabh Saxena Lala Li Jay Whang Emily\u00a0L Denton Kamyar Ghasemipour Raphael Gontijo\u00a0Lopes Burcu Karagol\u00a0Ayan Tim Salimans et\u00a0al. 2022. Photorealistic text-to-image diffusion models with deep language understanding. Advances in Neural Information Processing Systems 35 (2022) 36479\u201336494.","DOI":"10.52202\/068431-2643"},{"key":"e_1_3_3_2_42_1","doi-asserted-by":"publisher","unstructured":"Patsorn Sangkloy Nathan Burnell Cusuh Ham and James Hays. 2016. The sketchy database: learning to retrieve badly drawn bunnies. ACM Trans. Graph. 35 4 Article 119 (July 2016) 12\u00a0pages. 10.1145\/2897824.2925954","DOI":"10.1145\/2897824.2925954"},{"key":"e_1_3_3_2_43_1","volume-title":"International Conference on Learning Representations","author":"Song Jiaming","year":"2021","unstructured":"Jiaming Song, Chenlin Meng, and Stefano Ermon. 2021. Denoising Diffusion Implicit Models. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=St1giarCHLP"},{"key":"e_1_3_3_2_44_1","doi-asserted-by":"crossref","unstructured":"Jifei Song Kaiyue Pang Yi-Zhe Song Tao Xiang and Timothy\u00a0M. Hospedales. 2018. Learning to Sketch with Shortcut Cycle Consistency. 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (2018) 801\u2013810. https:\/\/api.semanticscholar.org\/CorpusID:25434894","DOI":"10.1109\/CVPR.2018.00090"},{"key":"e_1_3_3_2_45_1","volume-title":"The Eleventh International Conference on Learning Representations","author":"Tevet Guy","year":"2023","unstructured":"Guy Tevet, Sigal Raab, Brian Gordon, Yoni Shafir, Daniel Cohen-or, and Amit\u00a0Haim Bermano. 2023. Human Motion Diffusion Model. In The Eleventh International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=SJ1kSyO2jwu"},{"key":"e_1_3_3_2_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.00759"},{"key":"e_1_3_3_2_47_1","unstructured":"Varshaneya V Balasubramanian S and Vineeth\u00a0N. Balasubramanian. 2019. Teaching GANs to sketch in vector format. Proceedings of the Twelfth Indian Conference on Computer Vision Graphics and Image Processing (2019). https:\/\/api.semanticscholar.org\/CorpusID:102352561"},{"key":"e_1_3_3_2_48_1","doi-asserted-by":"crossref","unstructured":"Yael Vinker Yuval Alaluf Daniel Cohen-Or and Ariel Shamir. 2022a. CLIPascene: Scene Sketching with Different Types and Levels of Abstraction. 2023 IEEE\/CVF International Conference on Computer Vision (ICCV) (2022) 4123\u20134133. https:\/\/api.semanticscholar.org\/CorpusID:254096295","DOI":"10.1109\/ICCV51070.2023.00383"},{"key":"e_1_3_3_2_49_1","doi-asserted-by":"publisher","unstructured":"Yael Vinker Ehsan Pajouheshgar Jessica\u00a0Y. Bo Roman\u00a0Christian Bachmann Amit\u00a0Haim Bermano Daniel Cohen-Or Amir Zamir and Ariel Shamir. 2022b. CLIPasso: Semantically-Aware Object Sketching. ACM Trans. Graph. 41 4 Article 86 (jul 2022) 11\u00a0pages. 10.1145\/3528223.3530068","DOI":"10.1145\/3528223.3530068"},{"key":"e_1_3_3_2_50_1","unstructured":"Haofan Wang Matteo Spinelli Qixun Wang Xu Bai Zekui Qin and Anthony Chen. 2024. InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation. ArXiv abs\/2404.02733 (2024). https:\/\/api.semanticscholar.org\/CorpusID:268876474"},{"key":"e_1_3_3_2_51_1","volume-title":"The Eleventh International Conference on Learning Representations","author":"Wang Qiang","year":"2023","unstructured":"Qiang Wang, Haoge Deng, Yonggang Qi, Da Li, and Yi-Zhe Song. 2023. SketchKnitter: Vectorized Sketch Generation with Diffusion Models. In The Eleventh International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=4eJ43EN2g6l"},{"key":"e_1_3_3_2_52_1","doi-asserted-by":"publisher","unstructured":"Zeyu Wang Sherry Qiu Nicole Feng Holly Rushmeier Leonard McMillan and Julie Dorsey. 2021. Tracing Versus Freehand for Evaluating Computer-Generated Drawings. ACM Trans. Graph. 40 4 (Aug. 2021) 12\u00a0pages. 10.1145\/3450626.3459819","DOI":"10.1145\/3450626.3459819"},{"key":"e_1_3_3_2_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACSSC.2003.1292216"},{"key":"e_1_3_3_2_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/2024676.2024700"},{"key":"e_1_3_3_2_55_1","doi-asserted-by":"crossref","unstructured":"Chufeng Xiao Wanchao Su Jing Liao Zhouhui Lian Yi-Zhe Song and Hongbo Fu. 2022. DifferSketching: How Differently Do People Sketch 3D Objects? ACM Transactions on Graphics (Proceedings of ACM SIGGRAPH Asia 2022) 41 4 (2022) 1\u201316.","DOI":"10.1145\/3550454.3555493"},{"key":"e_1_3_3_2_56_1","unstructured":"Ximing Xing Chuan Wang Haitao Zhou Jing Zhang Qian Yu and Dong Xu. 2023. DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models. ArXiv abs\/2306.14685 (2023). https:\/\/api.semanticscholar.org\/CorpusID:259252217"},{"key":"e_1_3_3_2_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.00435"},{"key":"e_1_3_3_2_58_1","unstructured":"Peng Xu Timothy\u00a0M. Hospedales Qiyue Yin Yi-Zhe Song Tao Xiang and Liang Wang. 2020. Deep Learning for Free-Hand Sketch: A Survey and A Toolbox. arxiv:https:\/\/arXiv.org\/abs\/2001.02600\u00a0[cs.CV]"},{"key":"e_1_3_3_2_59_1","unstructured":"Jingyi Zhang Jiaxing Huang Sheng Jin and Shijian Lu. 2024. Vision-Language Models for Vision Tasks: A Survey. arxiv:https:\/\/arXiv.org\/abs\/2304.00685\u00a0[cs.CV] https:\/\/arxiv.org\/abs\/2304.00685"},{"key":"e_1_3_3_2_60_1","doi-asserted-by":"crossref","unstructured":"Lvmin Zhang Anyi Rao and Maneesh Agrawala. 2023. Adding Conditional Control to Text-to-Image Diffusion Models. 2023 IEEE\/CVF International Conference on Computer Vision (ICCV) (2023) 3813\u20133824. https:\/\/api.semanticscholar.org\/CorpusID:256827727","DOI":"10.1109\/ICCV51070.2023.00355"},{"key":"e_1_3_3_2_61_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00068"},{"key":"e_1_3_3_2_62_1","volume-title":"British Machine Vision Conference","author":"Zhou Tao","year":"2018","unstructured":"Tao Zhou, Chen Fang, Zhaowen Wang, Jimei Yang, Byungmoon Kim, Zhili Chen, Jonathan Brandt, and Demetri Terzopoulos. 2018. Learning to Doodle with Stroke Demonstrations and Deep Q-Networks. In British Machine Vision Conference. https:\/\/api.semanticscholar.org\/CorpusID:53113988"}],"event":{"name":"SIGGRAPH Conference Papers '25: Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers","location":"Vancouver BC Canada","acronym":"SIGGRAPH Conference Papers '25","sponsor":["SIGGRAPH ACM Special Interest Group on Computer Graphics and Interactive Techniques"]},"container-title":["Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3721238.3730612","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,20]],"date-time":"2026-03-20T14:58:09Z","timestamp":1774018689000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3721238.3730612"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,27]]},"references-count":61,"alternative-id":["10.1145\/3721238.3730612","10.1145\/3721238"],"URL":"https:\/\/doi.org\/10.1145\/3721238.3730612","relation":{},"subject":[],"published":{"date-parts":[[2025,7,27]]},"assertion":[{"value":"2025-07-27","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}