{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:15:52Z","timestamp":1750220152490,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":29,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,9,14]],"date-time":"2022-09-14T00:00:00Z","timestamp":1663113600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"JSPS KAKENHI","award":["21H05812, 22H00540, 22H00548 and 22K19808"],"award-info":[{"award-number":["21H05812, 22H00540, 22H00548 and 22K19808"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,9,14]]},"DOI":"10.1145\/3549555.3549556","type":"proceedings-article","created":{"date-parts":[[2022,10,7]],"date-time":"2022-10-07T16:14:01Z","timestamp":1665159241000},"page":"162-166","source":"Crossref","is-referenced-by-count":0,"title":["StyleGAN-based CLIP-guided Image Shape Manipulation"],"prefix":"10.1145","author":[{"given":"Yuchen","family":"Qian","sequence":"first","affiliation":[{"name":"Department of Informatics, The University of Electro-Communications, Japan"}]},{"given":"Kohei","family":"Yamamoto","sequence":"additional","affiliation":[{"name":"Department of Informatics, The University of Electro-Communications, Japan"}]},{"given":"Keiji","family":"Yanai","sequence":"additional","affiliation":[{"name":"Department of Informatics, The University of Electro-Communications, Japan"}]}],"member":"320","published-online":{"date-parts":[[2022,10,7]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Proc. of SIGGRAPH.","author":"Abdal Rameen","year":"2022","unstructured":"Rameen Abdal , Peihao Zhu , John Femiani , Niloy\u00a0 J. Mitra , and Peter Wonka . 2022 . CLIP2StyleGAN: Unsupervised Extraction of StyleGAN Edit Directions . In Proc. of SIGGRAPH. Rameen Abdal, Peihao Zhu, John Femiani, Niloy\u00a0J. Mitra, and Peter Wonka. 2022. CLIP2StyleGAN: Unsupervised Extraction of StyleGAN Edit Directions. In Proc. of SIGGRAPH."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3447648"},{"key":"e_1_3_2_1_3_1","unstructured":"David Bau Alex Andonian Audrey Cui YeonHwan Park Ali Jahanian Aude Oliva and Antonio Torralba. 2021. Paint by word. arXiv preprint arXiv:2103.10951(2021).  David Bau Alex Andonian Audrey Cui YeonHwan Park Ali Jahanian Aude Oliva and Antonio Torralba. 2021. Paint by word. arXiv preprint arXiv:2103.10951(2021)."},{"key":"e_1_3_2_1_4_1","volume-title":"Proceedings of the European Conference on Computer Vision.","author":"Bau David","year":"2020","unstructured":"David Bau , Steven Liu , Tongzhou Wang , Jun-Yan Zhu , and Antonio Torralba . 2020 . Rewriting a Deep Generative Model . In Proceedings of the European Conference on Computer Vision. David Bau, Steven Liu, Tongzhou Wang, Jun-Yan Zhu, and Antonio Torralba. 2020. Rewriting a Deep Generative Model. In Proceedings of the European Conference on Computer Vision."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00367"},{"key":"e_1_3_2_1_6_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8188\u20138197","author":"Choi Yunjey","year":"2020","unstructured":"Yunjey Choi , Youngjung Uh , Jaejun Yoo , and Jung-Woo Ha . 2020 . Stargan v2: Diverse image synthesis for multiple domains . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8188\u20138197 . Yunjey Choi, Youngjung Uh, Jaejun Yoo, and Jung-Woo Ha. 2020. Stargan v2: Diverse image synthesis for multiple domains. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8188\u20138197."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.608"},{"key":"e_1_3_2_1_8_1","volume-title":"StyleGAN-NADA: CLIP-Guided Domain Adaptation of Image Generators. arXiv:2108.00946","author":"Gal Rinon","year":"2021","unstructured":"Rinon Gal , Or Patashnik , Gal Maron , Haggaiand\u00a0Chechik, and Daniel Cohen-Or . 2021. StyleGAN-NADA: CLIP-Guided Domain Adaptation of Image Generators. arXiv:2108.00946 ( 2021 ). Rinon Gal, Or Patashnik, Gal Maron, Haggaiand\u00a0Chechik, and Daniel Cohen-Or. 2021. StyleGAN-NADA: CLIP-Guided Domain Adaptation of Image Generators. arXiv:2108.00946 (2021)."},{"key":"e_1_3_2_1_9_1","unstructured":"Ian Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in Neural Information Processing Systems.  Ian Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in Neural Information Processing Systems."},{"key":"e_1_3_2_1_10_1","volume-title":"GANSpace: Discovering Interpretable GAN Controls. Advances in Neural Information Processing Systems 33","author":"H\u00e4rk\u00f6nen Erik","year":"2020","unstructured":"Erik H\u00e4rk\u00f6nen , Aaron Hertzmann , Jaakko Lehtinen , and Sylvain Paris . 2020. GANSpace: Discovering Interpretable GAN Controls. Advances in Neural Information Processing Systems 33 ( 2020 ). Erik H\u00e4rk\u00f6nen, Aaron Hertzmann, Jaakko Lehtinen, and Sylvain Paris. 2020. GANSpace: Discovering Interpretable GAN Controls. Advances in Neural Information Processing Systems 33 (2020)."},{"key":"e_1_3_2_1_11_1","volume-title":"Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in Neural Information Processing Systems 30","author":"Heusel Martin","year":"2017","unstructured":"Martin Heusel , Hubert Ramsauer , Thomas Unterthiner , Bernhard Nessler , and Sepp Hochreiter . 2017. Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in Neural Information Processing Systems 30 ( 2017 ). Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler, and Sepp Hochreiter. 2017. Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in Neural Information Processing Systems 30 (2017)."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00453"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00813"},{"key":"e_1_3_2_1_14_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Kim Gwanghyun","year":"2022","unstructured":"Gwanghyun Kim , Taesung Kwon , and Jong\u00a0Chul Ye . 2022 . DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Gwanghyun Kim, Taesung Kwon, and Jong\u00a0Chul Ye. 2022. DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_1_15_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 7880\u20137889","author":"Li Bowen","year":"2020","unstructured":"Bowen Li , Xiaojuan Qi , Thomas Lukasiewicz , and Philip\u00a0 HS Torr . 2020 . Manigan: Text-guided image manipulation . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 7880\u20137889 . Bowen Li, Xiaojuan Qi, Thomas Lukasiewicz, and Philip\u00a0HS Torr. 2020. Manigan: Text-guided image manipulation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 7880\u20137889."},{"key":"e_1_3_2_1_16_1","unstructured":"Seonghyeon Nam Yunji Kim and Seon\u00a0Joo Kim. 2018. Text-adaptive generative adversarial networks: Manipulating images with natural language. arXiv preprint arXiv:1810.11919(2018).  Seonghyeon Nam Yunji Kim and Seon\u00a0Joo Kim. 2018. Text-adaptive generative adversarial networks: Manipulating images with natural language. arXiv preprint arXiv:1810.11919(2018)."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00209"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58568-6_2"},{"key":"e_1_3_2_1_19_1","unstructured":"Alec Radford Jong\u00a0Wook Kim Chris Hallacy Aditya Ramesh Gabriel Goh Sandhini Agarwal Girish Sastry Amanda Askell Pamela Mishkin Jack Clark 2021. Learning transferable visual models from natural language supervision. arXiv preprint arXiv:2103.00020(2021).  Alec Radford Jong\u00a0Wook Kim Chris Hallacy Aditya Ramesh Gabriel Goh Sandhini Agarwal Girish Sastry Amanda Askell Pamela Mishkin Jack Clark 2021. Learning transferable visual models from natural language supervision. arXiv preprint arXiv:2103.00020(2021)."},{"key":"e_1_3_2_1_20_1","volume-title":"International Conference on Learning Representations.","author":"Radford Alec","year":"2016","unstructured":"Alec Radford , Luke Metz , and Soumith Chintala . 2016 . Unsupervised representation learning with deep convolutional generative adversarial networks . In International Conference on Learning Representations. Alec Radford, Luke Metz, and Soumith Chintala. 2016. Unsupervised representation learning with deep convolutional generative adversarial networks. In International Conference on Learning Representations."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00926"},{"key":"e_1_3_2_1_22_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1532\u20131540","author":"Shen Yujun","year":"2021","unstructured":"Yujun Shen and Bolei Zhou . 2021 . Closed-form factorization of latent semantics in gans . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1532\u20131540 . Yujun Shen and Bolei Zhou. 2021. Closed-form factorization of latent semantics in gans. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1532\u20131540."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/3450626.3459838"},{"key":"e_1_3_2_1_24_1","volume-title":"International Conference on Machine Learning. 9786\u20139796","author":"Voynov Andrey","year":"2020","unstructured":"Andrey Voynov and Artem Babenko . 2020 . Unsupervised discovery of interpretable directions in the gan latent space . In International Conference on Machine Learning. 9786\u20139796 . Andrey Voynov and Artem Babenko. 2020. Unsupervised discovery of interpretable directions in the gan latent space. In International Conference on Machine Learning. 9786\u20139796."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01267"},{"key":"e_1_3_2_1_26_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 2256\u20132265","author":"Xia Weihao","year":"2021","unstructured":"Weihao Xia , Yujiu Yang , Jing-Hao Xue , and Baoyuan Wu . 2021 . TediGAN: Text-Guided Diverse Face Image Generation and Manipulation . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 2256\u20132265 . Weihao Xia, Yujiu Yang, Jing-Hao Xue, and Baoyuan Wu. 2021. TediGAN: Text-Guided Diverse Face Image Generation and Manipulation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 2256\u20132265."},{"key":"e_1_3_2_1_27_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 4432\u20134442","author":"Xu Yinghao","year":"2021","unstructured":"Yinghao Xu , Yujun Shen , Jiapeng Zhu , Ceyuan Yang , and Bolei Zhou . 2021 . Generative hierarchical features from synthesizing images . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 4432\u20134442 . Yinghao Xu, Yujun Shen, Jiapeng Zhu, Ceyuan Yang, and Bolei Zhou. 2021. Generative hierarchical features from synthesizing images. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 4432\u20134442."},{"key":"e_1_3_2_1_28_1","volume-title":"Lsun: Construction of a large-scale image dataset using deep learning with humans in the loop. arXiv preprint arXiv:1506.03365(2015).","author":"Yu Fisher","year":"2015","unstructured":"Fisher Yu , Ari Seff , Yinda Zhang , Shuran Song , Thomas Funkhouser , and Jianxiong Xiao . 2015 . Lsun: Construction of a large-scale image dataset using deep learning with humans in the loop. arXiv preprint arXiv:1506.03365(2015). Fisher Yu, Ari Seff, Yinda Zhang, Shuran Song, Thomas Funkhouser, and Jianxiong Xiao. 2015. Lsun: Construction of a large-scale image dataset using deep learning with humans in the loop. arXiv preprint arXiv:1506.03365(2015)."},{"key":"e_1_3_2_1_29_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 5104\u20135113","author":"Zhu Peihao","year":"2020","unstructured":"Peihao Zhu , Rameen Abdal , Yipeng Qin , and Peter Wonka . 2020 . Sean: Image synthesis with semantic region-adaptive normalization . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 5104\u20135113 . Peihao Zhu, Rameen Abdal, Yipeng Qin, and Peter Wonka. 2020. Sean: Image synthesis with semantic region-adaptive normalization. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 5104\u20135113."}],"event":{"name":"CBMI 2022: International Conference on Content-based Multimedia Indexing","acronym":"CBMI 2022","location":"Graz Austria"},"container-title":["International Conference on Content-based Multimedia Indexing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3549555.3549556","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3549555.3549556","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:11Z","timestamp":1750186811000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3549555.3549556"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9,14]]},"references-count":29,"alternative-id":["10.1145\/3549555.3549556","10.1145\/3549555"],"URL":"https:\/\/doi.org\/10.1145\/3549555.3549556","relation":{},"subject":[],"published":{"date-parts":[[2022,9,14]]}}}