{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T17:09:59Z","timestamp":1777655399154,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":64,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,7,23]],"date-time":"2023-07-23T00:00:00Z","timestamp":1690070400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,7,23]]},"DOI":"10.1145\/3588432.3591568","type":"proceedings-article","created":{"date-parts":[[2023,7,19]],"date-time":"2023-07-19T13:34:52Z","timestamp":1689773692000},"page":"1-9","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":15,"title":["FashionTex: Controllable Virtual Try-on with Text and Texture"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0006-2550-598X","authenticated-orcid":false,"given":"Anran","family":"Lin","sequence":"first","affiliation":[{"name":"The Chinese University of Hong Kong, Shenzhen, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4007-2776","authenticated-orcid":false,"given":"Nanxuan","family":"Zhao","sequence":"additional","affiliation":[{"name":"Adobe Inc, India, United States of America"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5383-7221","authenticated-orcid":false,"given":"Shuliang","family":"Ning","sequence":"additional","affiliation":[{"name":"The Chinese University of Hong Kong, Shenzhen, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-1257-4271","authenticated-orcid":false,"given":"Yuda","family":"Qiu","sequence":"additional","affiliation":[{"name":"The Chinese University of Hong Kong, Shenzhen, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8268-7517","authenticated-orcid":false,"given":"Baoyuan","family":"Wang","sequence":"additional","affiliation":[{"name":"Xiaobing.AI, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0162-3296","authenticated-orcid":false,"given":"Xiaoguang","family":"Han","sequence":"additional","affiliation":[{"name":"The Chinese University of Hong Kong, Shenzhen, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,7,23]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"Amazon Statistics","year":"2022","unstructured":"2022. Amazon Statistics ( 2022 ). 2022. Amazon Statistics (2022)."},{"key":"e_1_3_2_2_2_1","unstructured":"Kenan\u00a0Emir Ak Joo\u00a0Hwee Lim Jo\u00a0Yew Tham and Ashraf Kassim. 2019a. Semantically consistent hierarchical text to fashion image synthesis with an enhanced-attentional generative adversarial network. In ICCVW.  Kenan\u00a0Emir Ak Joo\u00a0Hwee Lim Jo\u00a0Yew Tham and Ashraf Kassim. 2019a. Semantically consistent hierarchical text to fashion image synthesis with an enhanced-attentional generative adversarial network. In ICCVW."},{"key":"e_1_3_2_2_3_1","unstructured":"Kenan\u00a0E Ak Joo\u00a0Hwee Lim Jo\u00a0Yew Tham and Ashraf\u00a0A Kassim. 2019b. Attribute manipulation generative adversarial networks for fashion images. In CVPR.  Kenan\u00a0E Ak Joo\u00a0Hwee Lim Jo\u00a0Yew Tham and Ashraf\u00a0A Kassim. 2019b. Attribute manipulation generative adversarial networks for fashion images. In CVPR."},{"key":"e_1_3_2_2_4_1","volume-title":"Proceedings of the IEEE\/CVF international conference on computer vision. 9016\u20139025","author":"AlBahar Badour","year":"2019","unstructured":"Badour AlBahar and Jia-Bin Huang . 2019 . Guided image-to-image translation with bi-directional feature transformation . In Proceedings of the IEEE\/CVF international conference on computer vision. 9016\u20139025 . Badour AlBahar and Jia-Bin Huang. 2019. Guided image-to-image translation with bi-directional feature transformation. In Proceedings of the IEEE\/CVF international conference on computer vision. 9016\u20139025."},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3478513.3480559"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"crossref","unstructured":"Andrew Brown Cheng-Yang Fu Omkar Parkhi Tamara\u00a0L. Berg and Andrea Vedaldi. 2022. End-to-End Visual Editing with a Generatively Pre-Trained Artist. In ECCV.  Andrew Brown Cheng-Yang Fu Omkar Parkhi Tamara\u00a0L. Berg and Andrea Vedaldi. 2022. End-to-End Visual Editing with a Generatively Pre-Trained Artist. In ECCV.","DOI":"10.1007\/978-3-031-19784-0_2"},{"key":"e_1_3_2_2_7_1","volume-title":"Tailorgan: Making user-defined fashion designs. In WACV.","author":"Chen Lele","year":"2020","unstructured":"Lele Chen , Justin Tian , Guo Li , Cheng-Haw Wu , Erh-Kan King , Kuan-Ting Chen , Shao-Hang Hsieh , and Chenliang Xu . 2020 . Tailorgan: Making user-defined fashion designs. In WACV. Lele Chen, Justin Tian, Guo Li, Cheng-Haw Wu, Erh-Kan King, Kuan-Ting Chen, Shao-Hang Hsieh, and Chenliang Xu. 2020. Tailorgan: Making user-defined fashion designs. In WACV."},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"crossref","unstructured":"Guillaume Couairon Asya Grechka Jakob Verbeek Holger Schwenk and Matthieu Cord. 2022. FlexIT: Towards Flexible Semantic Image Translation. In CVPR.  Guillaume Couairon Asya Grechka Jakob Verbeek Holger Schwenk and Matthieu Cord. 2022. FlexIT: Towards Flexible Semantic Image Translation. In CVPR.","DOI":"10.1109\/CVPR52688.2022.01773"},{"key":"e_1_3_2_2_9_1","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision. 14638\u201314647","author":"Cui Aiyu","year":"2021","unstructured":"Aiyu Cui , Daniel McKee , and Svetlana Lazebnik . 2021 . Dressing in order: Recurrent person image generation for pose transfer, virtual try-on and outfit editing . In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 14638\u201314647 . Aiyu Cui, Daniel McKee, and Svetlana Lazebnik. 2021. Dressing in order: Recurrent person image generation for pose transfer, virtual try-on and outfit editing. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 14638\u201314647."},{"key":"e_1_3_2_2_10_1","volume-title":"Arcface: Additive angular margin loss for deep face recognition. In CVPR.","author":"Deng Jiankang","year":"2019","unstructured":"Jiankang Deng , Jia Guo , Niannan Xue , and Stefanos Zafeiriou . 2019 . Arcface: Additive angular margin loss for deep face recognition. In CVPR. Jiankang Deng, Jia Guo, Niannan Xue, and Stefanos Zafeiriou. 2019. Arcface: Additive angular margin loss for deep face recognition. In CVPR."},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"crossref","unstructured":"Patrick Esser Robin Rombach and Bjorn Ommer. 2021. Taming transformers for high-resolution image synthesis. In CVPR.  Patrick Esser Robin Rombach and Bjorn Ommer. 2021. Taming transformers for high-resolution image synthesis. In CVPR.","DOI":"10.1109\/CVPR46437.2021.01268"},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"crossref","unstructured":"Anna Fr\u00fchst\u00fcck Krishna\u00a0Kumar Singh Eli Shechtman Niloy\u00a0J Mitra Peter Wonka and Jingwan Lu. 2022. InsetGAN for Full-Body Image Generation. In CVPR.  Anna Fr\u00fchst\u00fcck Krishna\u00a0Kumar Singh Eli Shechtman Niloy\u00a0J Mitra Peter Wonka and Jingwan Lu. 2022. InsetGAN for Full-Body Image Generation. In CVPR.","DOI":"10.1109\/CVPR52688.2022.00757"},{"key":"e_1_3_2_2_13_1","unstructured":"Jianglin Fu Shikai Li Yuming Jiang Kwan-Yee Lin Chen Qian Chen\u00a0Change Loy Wayne Wu and Ziwei Liu. 2022. StyleGAN-Human: A Data-Centric Odyssey of Human Generation. In ECCV.  Jianglin Fu Shikai Li Yuming Jiang Kwan-Yee Lin Chen Qian Chen\u00a0Change Loy Wayne Wu and Ziwei Liu. 2022. StyleGAN-Human: A Data-Centric Odyssey of Human Generation. In ECCV."},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3422622"},{"key":"e_1_3_2_2_15_1","volume-title":"Language guided fashion image manipulation with feature-wise transformations. arXiv preprint arXiv:1808.04000","author":"G\u00fcnel Mehmet","year":"2018","unstructured":"Mehmet G\u00fcnel , Erkut Erdem , and Aykut Erdem . 2018. Language guided fashion image manipulation with feature-wise transformations. arXiv preprint arXiv:1808.04000 ( 2018 ). Mehmet G\u00fcnel, Erkut Erdem, and Aykut Erdem. 2018. Language guided fashion image manipulation with feature-wise transformations. arXiv preprint arXiv:1808.04000 (2018)."},{"key":"e_1_3_2_2_16_1","volume-title":"Viton: An image-based virtual try-on network. In CVPR.","author":"Han Xintong","year":"2018","unstructured":"Xintong Han , Zuxuan Wu , Zhe Wu , Ruichi Yu , and Larry\u00a0 S Davis . 2018 . Viton: An image-based virtual try-on network. In CVPR. Xintong Han, Zuxuan Wu, Zhe Wu, Ruichi Yu, and Larry\u00a0S Davis. 2018. Viton: An image-based virtual try-on network. In CVPR."},{"key":"e_1_3_2_2_17_1","volume-title":"Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in neural information processing systems 30","author":"Heusel Martin","year":"2017","unstructured":"Martin Heusel , Hubert Ramsauer , Thomas Unterthiner , Bernhard Nessler , and Sepp Hochreiter . 2017. Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in neural information processing systems 30 ( 2017 ). Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler, and Sepp Hochreiter. 2017. Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in neural information processing systems 30 (2017)."},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"crossref","unstructured":"Xun Huang and Serge Belongie. 2017. Arbitrary style transfer in real-time with adaptive instance normalization. In ICCV.  Xun Huang and Serge Belongie. 2017. Arbitrary style transfer in real-time with adaptive instance normalization. In ICCV.","DOI":"10.1109\/ICCV.2017.167"},{"key":"e_1_3_2_2_19_1","volume-title":"Image-to-Image Translation with Conditional Adversarial Networks. CVPR","author":"Isola Phillip","year":"2017","unstructured":"Phillip Isola , Jun-Yan Zhu , Tinghui Zhou , and Alexei\u00a0 A Efros . 2017. Image-to-Image Translation with Conditional Adversarial Networks. CVPR ( 2017 ). Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, and Alexei\u00a0A Efros. 2017. Image-to-Image Translation with Conditional Adversarial Networks. CVPR (2017)."},{"key":"e_1_3_2_2_20_1","volume-title":"a generative model for image editing. arXiv preprint arXiv:2111.15264","author":"Issenhuth Thibaut","year":"2021","unstructured":"Thibaut Issenhuth , Ugo Tanielian , J\u00e9r\u00e9mie Mary , and David Picard . 2021. EdiBERT , a generative model for image editing. arXiv preprint arXiv:2111.15264 ( 2021 ). Thibaut Issenhuth, Ugo Tanielian, J\u00e9r\u00e9mie Mary, and David Picard. 2021. EdiBERT, a generative model for image editing. arXiv preprint arXiv:2111.15264 (2021)."},{"key":"e_1_3_2_2_21_1","unstructured":"Chao Jia Yinfei Yang Ye Xia Yi-Ting Chen Zarana Parekh Hieu Pham Quoc Le Yun-Hsuan Sung Zhen Li and Tom Duerig. 2021. Scaling up visual and vision-language representation learning with noisy text supervision. In ICML.  Chao Jia Yinfei Yang Ye Xia Yi-Ting Chen Zarana Parekh Hieu Pham Quoc Le Yun-Hsuan Sung Zhen Li and Tom Duerig. 2021. Scaling up visual and vision-language representation learning with noisy text supervision. In ICML."},{"key":"e_1_3_2_2_22_1","volume-title":"Text2human: Text-driven controllable human image generation. ACM TOG","author":"Jiang Yuming","year":"2022","unstructured":"Yuming Jiang , Shuai Yang , Haonan Qju , Wayne Wu , Chen\u00a0Change Loy , and Ziwei Liu . 2022. Text2human: Text-driven controllable human image generation. ACM TOG ( 2022 ). Yuming Jiang, Shuai Yang, Haonan Qju, Wayne Wu, Chen\u00a0Change Loy, and Ziwei Liu. 2022. Text2human: Text-driven controllable human image generation. ACM TOG (2022)."},{"key":"e_1_3_2_2_23_1","volume-title":"Progressive growing of gans for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196","author":"Karras Tero","year":"2017","unstructured":"Tero Karras , Timo Aila , Samuli Laine , and Jaakko Lehtinen . 2017. Progressive growing of gans for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196 ( 2017 ). Tero Karras, Timo Aila, Samuli Laine, and Jaakko Lehtinen. 2017. Progressive growing of gans for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196 (2017)."},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"crossref","unstructured":"Tero Karras Samuli Laine and Timo Aila. 2019. A style-based generator architecture for generative adversarial networks. In CVPR.  Tero Karras Samuli Laine and Timo Aila. 2019. A style-based generator architecture for generative adversarial networks. In CVPR.","DOI":"10.1109\/CVPR.2019.00453"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"crossref","unstructured":"Tero Karras Samuli Laine Miika Aittala Janne Hellsten Jaakko Lehtinen and Timo Aila. 2020. Analyzing and improving the image quality of stylegan. In CVPR.  Tero Karras Samuli Laine Miika Aittala Janne Hellsten Jaakko Lehtinen and Timo Aila. 2020. Analyzing and improving the image quality of stylegan. In CVPR.","DOI":"10.1109\/CVPR42600.2020.00813"},{"key":"e_1_3_2_2_26_1","unstructured":"Gwanghyun Kim Taesung Kwon and Jong\u00a0Chul Ye. 2022. DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation. In CVPR.  Gwanghyun Kim Taesung Kwon and Jong\u00a0Chul Ye. 2022. DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation. In CVPR."},{"key":"e_1_3_2_2_27_1","volume-title":"Clipstyler: Image style transfer with a single text condition. In CVPR.","author":"Kwon Gihyun","year":"2022","unstructured":"Gihyun Kwon and Jong\u00a0Chul Ye . 2022 . Clipstyler: Image style transfer with a single text condition. In CVPR. Gihyun Kwon and Jong\u00a0Chul Ye. 2022. Clipstyler: Image style transfer with a single text condition. In CVPR."},{"key":"e_1_3_2_2_28_1","unstructured":"Sangyun Lee Gyojung Gu Sunghyun Park Seunghwan Choi and Jaegul Choo. 2022. High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions. In ECCV.  Sangyun Lee Gyojung Gu Sunghyun Park Seunghwan Choi and Jaegul Choo. 2022. High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions. In ECCV."},{"key":"e_1_3_2_2_29_1","volume-title":"Tryongan: Body-aware try-on via layered interpolation. ACM TOG","author":"Lewis M","year":"2021","unstructured":"Kathleen\u00a0 M Lewis , Srivatsan Varadharajan , and Ira Kemelmacher-Shlizerman . 2021 . Tryongan: Body-aware try-on via layered interpolation. ACM TOG (2021). Kathleen\u00a0M Lewis, Srivatsan Varadharajan, and Ira Kemelmacher-Shlizerman. 2021. Tryongan: Body-aware try-on via layered interpolation. ACM TOG (2021)."},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"crossref","unstructured":"Ziwei Liu Ping Luo Shi Qiu Xiaogang Wang and Xiaoou Tang. 2016. DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations. In CVPR.  Ziwei Liu Ping Luo Shi Qiu Xiaogang Wang and Xiaoou Tang. 2016. DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations. In CVPR.","DOI":"10.1109\/CVPR.2016.124"},{"key":"e_1_3_2_2_31_1","volume-title":"Computer Vision and Pattern Recognition (CVPR), 2020 IEEE Conference on.","author":"Men Yifang","year":"2020","unstructured":"Yifang Men , Yiming Mao , Yuning Jiang , Wei-Ying Ma , and Zhouhui Lian . 2020 . Controllable Person Image Synthesis with Attribute-Decomposed GAN . In Computer Vision and Pattern Recognition (CVPR), 2020 IEEE Conference on. Yifang Men, Yiming Mao, Yuning Jiang, Wei-Ying Ma, and Zhouhui Lian. 2020. Controllable Person Image Synthesis with Attribute-Decomposed GAN. In Computer Vision and Pattern Recognition (CVPR), 2020 IEEE Conference on."},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"crossref","unstructured":"Assaf Neuberger Eran Borenstein Bar Hilleli Eduard Oks and Sharon Alpert. 2020. Image based virtual try-on network from unpaired data. In CVPR.  Assaf Neuberger Eran Borenstein Bar Hilleli Eduard Oks and Sharon Alpert. 2020. Image based virtual try-on network from unpaired data. In CVPR.","DOI":"10.1109\/CVPR42600.2020.00523"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"crossref","unstructured":"Taesung Park Ming-Yu Liu Ting-Chun Wang and Jun-Yan Zhu. 2019. Semantic image synthesis with spatially-adaptive normalization. In CVPR.  Taesung Park Ming-Yu Liu Ting-Chun Wang and Jun-Yan Zhu. 2019. Semantic image synthesis with spatially-adaptive normalization. In CVPR.","DOI":"10.1109\/CVPR.2019.00244"},{"key":"e_1_3_2_2_34_1","volume-title":"Styleclip: Text-driven manipulation of stylegan imagery. In CVPR.","author":"Patashnik Or","year":"2021","unstructured":"Or Patashnik , Zongze Wu , Eli Shechtman , Daniel Cohen-Or , and Dani Lischinski . 2021 . Styleclip: Text-driven manipulation of stylegan imagery. In CVPR. Or Patashnik, Zongze Wu, Eli Shechtman, Daniel Cohen-Or, and Dani Lischinski. 2021. Styleclip: Text-driven manipulation of stylegan imagery. In CVPR."},{"key":"e_1_3_2_2_35_1","volume-title":"International journal of computer vision","author":"Portilla Javier","year":"2000","unstructured":"Javier Portilla and Eero\u00a0 P Simoncelli . 2000. A parametric texture model based on joint statistics of complex wavelet coefficients . International journal of computer vision ( 2000 ). Javier Portilla and Eero\u00a0P Simoncelli. 2000. A parametric texture model based on joint statistics of complex wavelet coefficients. International journal of computer vision (2000)."},{"key":"e_1_3_2_2_36_1","unstructured":"Alec Radford Jong\u00a0Wook Kim Chris Hallacy Aditya Ramesh Gabriel Goh Sandhini Agarwal Girish Sastry Amanda Askell Pamela Mishkin Jack Clark 2021. Learning transferable visual models from natural language supervision. In ICML.  Alec Radford Jong\u00a0Wook Kim Chris Hallacy Aditya Ramesh Gabriel Goh Sandhini Agarwal Girish Sastry Amanda Askell Pamela Mishkin Jack Clark 2021. Learning transferable visual models from natural language supervision. In ICML."},{"key":"e_1_3_2_2_37_1","volume-title":"Swapnet: Image based garment transfer. In ECCV.","author":"Raj Amit","year":"2018","unstructured":"Amit Raj , Patsorn Sangkloy , Huiwen Chang , James Hays , Duygu Ceylan , and Jingwan Lu . 2018 . Swapnet: Image based garment transfer. In ECCV. Amit Raj, Patsorn Sangkloy, Huiwen Chang, James Hays, Duygu Ceylan, and Jingwan Lu. 2018. Swapnet: Image based garment transfer. In ECCV."},{"key":"e_1_3_2_2_38_1","volume-title":"Hierarchical text-conditional image generation with clip latents. arXiv preprint arXiv:2204.06125","author":"Ramesh Aditya","year":"2022","unstructured":"Aditya Ramesh , Prafulla Dhariwal , Alex Nichol , Casey Chu , and Mark Chen . 2022. Hierarchical text-conditional image generation with clip latents. arXiv preprint arXiv:2204.06125 ( 2022 ). Aditya Ramesh, Prafulla Dhariwal, Alex Nichol, Casey Chu, and Mark Chen. 2022. Hierarchical text-conditional image generation with clip latents. arXiv preprint arXiv:2204.06125 (2022)."},{"key":"e_1_3_2_2_39_1","volume-title":"Denseclip: Language-guided dense prediction with context-aware prompting. In CVPR.","author":"Rao Yongming","year":"2022","unstructured":"Yongming Rao , Wenliang Zhao , Guangyi Chen , Yansong Tang , Zheng Zhu , Guan Huang , Jie Zhou , and Jiwen Lu . 2022 . Denseclip: Language-guided dense prediction with context-aware prompting. In CVPR. Yongming Rao, Wenliang Zhao, Guangyi Chen, Yansong Tang, Zheng Zhu, Guan Huang, Jie Zhou, and Jiwen Lu. 2022. Denseclip: Language-guided dense prediction with context-aware prompting. In CVPR."},{"key":"e_1_3_2_2_40_1","volume-title":"Pivotal tuning for latent-based editing of real images. ACM TOG","author":"Roich Daniel","year":"2022","unstructured":"Daniel Roich , Ron Mokady , Amit\u00a0 H Bermano , and Daniel Cohen-Or . 2022. Pivotal tuning for latent-based editing of real images. ACM TOG ( 2022 ). Daniel Roich, Ron Mokady, Amit\u00a0H Bermano, and Daniel Cohen-Or. 2022. Pivotal tuning for latent-based editing of real images. ACM TOG (2022)."},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"crossref","unstructured":"Robin Rombach Andreas Blattmann Dominik Lorenz Patrick Esser and Bj\u00f6rn Ommer. 2022. High-Resolution Image Synthesis with Latent Diffusion Models. In CVPR.  Robin Rombach Andreas Blattmann Dominik Lorenz Patrick Esser and Bj\u00f6rn Ommer. 2022. High-Resolution Image Synthesis with Latent Diffusion Models. In CVPR.","DOI":"10.1109\/CVPR52688.2022.01042"},{"key":"e_1_3_2_2_42_1","volume-title":"Style and pose control for image synthesis of humans from a single monocular view. arXiv preprint arXiv:2102.11263","author":"Sarkar Kripasindhu","year":"2021","unstructured":"Kripasindhu Sarkar , Vladislav Golyanik , Lingjie Liu , and Christian Theobalt . 2021. Style and pose control for image synthesis of humans from a single monocular view. arXiv preprint arXiv:2102.11263 ( 2021 ). Kripasindhu Sarkar, Vladislav Golyanik, Lingjie Liu, and Christian Theobalt. 2021. Style and pose control for image synthesis of humans from a single monocular view. arXiv preprint arXiv:2102.11263 (2021)."},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"crossref","unstructured":"Kripasindhu Sarkar Dushyant Mehta Weipeng Xu Vladislav Golyanik and Christian Theobalt. 2020. Neural re-rendering of humans from a single image. In ECCV.  Kripasindhu Sarkar Dushyant Mehta Weipeng Xu Vladislav Golyanik and Christian Theobalt. 2020. Neural re-rendering of humans from a single image. In ECCV.","DOI":"10.1007\/978-3-030-58621-8_35"},{"key":"e_1_3_2_2_44_1","volume-title":"Interfacegan: Interpreting the disentangled face representation learned by gans. PAMI","author":"Shen Yujun","year":"2020","unstructured":"Yujun Shen , Ceyuan Yang , Xiaoou Tang , and Bolei Zhou . 2020 . Interfacegan: Interpreting the disentangled face representation learned by gans. PAMI (2020). Yujun Shen, Ceyuan Yang, Xiaoou Tang, and Bolei Zhou. 2020. Interfacegan: Interpreting the disentangled face representation learned by gans. PAMI (2020)."},{"key":"e_1_3_2_2_45_1","doi-asserted-by":"crossref","unstructured":"Yujun Shen and Bolei Zhou. 2021. Closed-form factorization of latent semantics in gans. In CVPR.  Yujun Shen and Bolei Zhou. 2021. Closed-form factorization of latent semantics in gans. In CVPR.","DOI":"10.1109\/CVPR46437.2021.00158"},{"key":"e_1_3_2_2_46_1","volume-title":"Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman . 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 ( 2014 ). Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)."},{"key":"e_1_3_2_2_47_1","volume-title":"Efficient semantic image synthesis via class-adaptive normalization","author":"Tan Zhentao","year":"2021","unstructured":"Zhentao Tan , Dongdong Chen , Qi Chu , Menglei Chai , Jing Liao , Mingming He , Lu Yuan , Gang Hua , and Nenghai Yu. 2021. Efficient semantic image synthesis via class-adaptive normalization . IEEE TPAMI ( 2021 ). Zhentao Tan, Dongdong Chen, Qi Chu, Menglei Chai, Jing Liao, Mingming He, Lu Yuan, Gang Hua, and Nenghai Yu. 2021. Efficient semantic image synthesis via class-adaptive normalization. IEEE TPAMI (2021)."},{"key":"e_1_3_2_2_48_1","volume-title":"Designing an encoder for stylegan image manipulation. ACM TOG","author":"Tov Omer","year":"2021","unstructured":"Omer Tov , Yuval Alaluf , Yotam Nitzan , Or Patashnik , and Daniel Cohen-Or . 2021. Designing an encoder for stylegan image manipulation. ACM TOG ( 2021 ). Omer Tov, Yuval Alaluf, Yotam Nitzan, Or Patashnik, and Daniel Cohen-Or. 2021. Designing an encoder for stylegan image manipulation. ACM TOG (2021)."},{"key":"e_1_3_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/3550469.3555382"},{"key":"e_1_3_2_2_50_1","doi-asserted-by":"crossref","unstructured":"Bochao Wang Huabin Zheng Xiaodan Liang Yimin Chen Liang Lin and Meng Yang. 2018. Toward characteristic-preserving image-based virtual try-on network. In ECCV.  Bochao Wang Huabin Zheng Xiaodan Liang Yimin Chen Liang Lin and Meng Yang. 2018. Toward characteristic-preserving image-based virtual try-on network. In ECCV.","DOI":"10.1007\/978-3-030-01261-8_36"},{"key":"e_1_3_2_2_51_1","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence.","author":"Wang Zhizhong","year":"2022","unstructured":"Zhizhong Wang , Lei Zhao , Haibo Chen , Ailin Li , Zhiwen Zuo , Wei Xing , and Dongming Lu . 2022 . Texture Reformer: Towards Fast and Universal Interactive Texture Transfer . In Proceedings of the AAAI Conference on Artificial Intelligence. Zhizhong Wang, Lei Zhao, Haibo Chen, Ailin Li, Zhiwen Zuo, Wei Xing, and Dongming Lu. 2022. Texture Reformer: Towards Fast and Universal Interactive Texture Transfer. In Proceedings of the AAAI Conference on Artificial Intelligence."},{"key":"e_1_3_2_2_52_1","volume-title":"Hairclip: Design your hair by text and reference image. In CVPR.","author":"Wei Tianyi","year":"2022","unstructured":"Tianyi Wei , Dongdong Chen , Wenbo Zhou , Jing Liao , Zhentao Tan , Lu Yuan , Weiming Zhang , and Nenghai Yu . 2022 . Hairclip: Design your hair by text and reference image. In CVPR. Tianyi Wei, Dongdong Chen, Wenbo Zhou, Jing Liao, Zhentao Tan, Lu Yuan, Weiming Zhang, and Nenghai Yu. 2022. Hairclip: Design your hair by text and reference image. In CVPR."},{"key":"e_1_3_2_2_53_1","unstructured":"Zongze Wu Dani Lischinski and Eli Shechtman. 2021. Stylespace analysis: Disentangled controls for stylegan image generation. In CVPR.  Zongze Wu Dani Lischinski and Eli Shechtman. 2021. Stylespace analysis: Disentangled controls for stylegan image generation. In CVPR."},{"key":"e_1_3_2_2_54_1","volume-title":"TediGAN: Text-Guided Diverse Face Image Generation and Manipulation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Xia Weihao","year":"2021","unstructured":"Weihao Xia , Yujiu Yang , Jing-Hao Xue , and Baoyuan Wu . 2021 . TediGAN: Text-Guided Diverse Face Image Generation and Manipulation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Weihao Xia, Yujiu Yang, Jing-Hao Xue, and Baoyuan Wu. 2021. TediGAN: Text-Guided Diverse Face Image Generation and Manipulation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_2_55_1","volume-title":"Texturegan: Controlling deep image synthesis with texture patches. In CVPR.","author":"Xian Wenqi","year":"2018","unstructured":"Wenqi Xian , Patsorn Sangkloy , Varun Agrawal , Amit Raj , Jingwan Lu , Chen Fang , Fisher Yu , and James Hays . 2018 . Texturegan: Controlling deep image synthesis with texture patches. In CVPR. Wenqi Xian, Patsorn Sangkloy, Varun Agrawal, Amit Raj, Jingwan Lu, Chen Fang, Fisher Yu, and James Hays. 2018. Texturegan: Controlling deep image synthesis with texture patches. In CVPR."},{"key":"e_1_3_2_2_56_1","volume-title":"Towards Scalable Unpaired Virtual Try-On via Patch-Routed Spatially-Adaptive GAN. NeurIPS","author":"Xie Zhenyu","year":"2021","unstructured":"Zhenyu Xie , Zaiyu Huang , Fuwei Zhao , Haoye Dong , Michael Kampffmeyer , and Xiaodan Liang . 2021a. Towards Scalable Unpaired Virtual Try-On via Patch-Routed Spatially-Adaptive GAN. NeurIPS ( 2021 ). Zhenyu Xie, Zaiyu Huang, Fuwei Zhao, Haoye Dong, Michael Kampffmeyer, and Xiaodan Liang. 2021a. Towards Scalable Unpaired Virtual Try-On via Patch-Routed Spatially-Adaptive GAN. NeurIPS (2021)."},{"key":"e_1_3_2_2_57_1","volume-title":"Was-vton: Warping architecture search for virtual try-on network. In ACM MM.","author":"Xie Zhenyu","year":"2021","unstructured":"Zhenyu Xie , Xujie Zhang , Fuwei Zhao , Haoye Dong , Michael\u00a0 C Kampffmeyer , Haonan Yan , and Xiaodan Liang . 2021 b. Was-vton: Warping architecture search for virtual try-on network. In ACM MM. Zhenyu Xie, Xujie Zhang, Fuwei Zhao, Haoye Dong, Michael\u00a0C Kampffmeyer, Haonan Yan, and Xiaodan Liang. 2021b. Was-vton: Warping architecture search for virtual try-on network. In ACM MM."},{"key":"e_1_3_2_2_58_1","unstructured":"Zipeng Xu Tianwei Lin Hao Tang Fu Li Dongliang He Nicu Sebe Radu Timofte Luc Van\u00a0Gool and Errui Ding. 2022. Predict Prevent and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model. In CVPR.  Zipeng Xu Tianwei Lin Hao Tang Fu Li Dongliang He Nicu Sebe Radu Timofte Luc Van\u00a0Gool and Errui Ding. 2022. Predict Prevent and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model. In CVPR."},{"key":"e_1_3_2_2_59_1","volume-title":"Vector-quantized image modeling with improved vqgan. arXiv preprint arXiv:2110.04627","author":"Yu Jiahui","year":"2021","unstructured":"Jiahui Yu , Xin Li , Jing\u00a0Yu Koh , Han Zhang , Ruoming Pang , James Qin , Alexander Ku , Yuanzhong Xu , Jason Baldridge , and Yonghui Wu. 2021. Vector-quantized image modeling with improved vqgan. arXiv preprint arXiv:2110.04627 ( 2021 ). Jiahui Yu, Xin Li, Jing\u00a0Yu Koh, Han Zhang, Ruoming Pang, James Qin, Alexander Ku, Yuanzhong Xu, Jason Baldridge, and Yonghui Wu. 2021. Vector-quantized image modeling with improved vqgan. arXiv preprint arXiv:2110.04627 (2021)."},{"key":"e_1_3_2_2_60_1","volume-title":"Vtnfp: An image-based virtual try-on network with body and clothing feature preservation. In ICCV.","author":"Yu Ruiyun","year":"2019","unstructured":"Ruiyun Yu , Xiaoqi Wang , and Xiaohui Xie . 2019 . Vtnfp: An image-based virtual try-on network with body and clothing feature preservation. In ICCV. Ruiyun Yu, Xiaoqi Wang, and Xiaohui Xie. 2019. Vtnfp: An image-based virtual try-on network with body and clothing feature preservation. In ICCV."},{"key":"e_1_3_2_2_61_1","doi-asserted-by":"crossref","unstructured":"Richard Zhang Phillip Isola Alexei\u00a0A Efros Eli Shechtman and Oliver Wang. 2018. The unreasonable effectiveness of deep features as a perceptual metric. In CVPR.  Richard Zhang Phillip Isola Alexei\u00a0A Efros Eli Shechtman and Oliver Wang. 2018. The unreasonable effectiveness of deep features as a perceptual metric. In CVPR.","DOI":"10.1109\/CVPR.2018.00068"},{"key":"e_1_3_2_2_62_1","unstructured":"Jun-Yan Zhu Philipp Kr\u00e4henb\u00fchl Eli Shechtman and Alexei\u00a0A Efros. 2016. Generative visual manipulation on the natural image manifold. In ECCV.  Jun-Yan Zhu Philipp Kr\u00e4henb\u00fchl Eli Shechtman and Alexei\u00a0A Efros. 2016. Generative visual manipulation on the natural image manifold. In ECCV."},{"key":"e_1_3_2_2_63_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.244"},{"key":"e_1_3_2_2_64_1","unstructured":"Shizhan Zhu Raquel Urtasun Sanja Fidler Dahua Lin and Chen Change\u00a0Loy. 2017b. Be your own prada: Fashion synthesis with structural coherence. In ICCV.  Shizhan Zhu Raquel Urtasun Sanja Fidler Dahua Lin and Chen Change\u00a0Loy. 2017b. Be your own prada: Fashion synthesis with structural coherence. In ICCV."}],"event":{"name":"SIGGRAPH '23: Special Interest Group on Computer Graphics and Interactive Techniques Conference","location":"Los Angeles CA USA","acronym":"SIGGRAPH '23","sponsor":["SIGCHI ACM Special Interest Group on Computer-Human Interaction"]},"container-title":["Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Proceedings"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3588432.3591568","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:47:12Z","timestamp":1750178832000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3588432.3591568"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,23]]},"references-count":64,"alternative-id":["10.1145\/3588432.3591568","10.1145\/3588432"],"URL":"https:\/\/doi.org\/10.1145\/3588432.3591568","relation":{},"subject":[],"published":{"date-parts":[[2023,7,23]]},"assertion":[{"value":"2023-07-23","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}