{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T16:32:35Z","timestamp":1773246755186,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":52,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,7,23]],"date-time":"2023-07-23T00:00:00Z","timestamp":1690070400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"UKRI Future Leaders Fellowship","award":["G104084"],"award-info":[{"award-number":["G104084"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,7,23]]},"DOI":"10.1145\/3588432.3591532","type":"proceedings-article","created":{"date-parts":[[2023,7,19]],"date-time":"2023-07-19T13:34:52Z","timestamp":1689773692000},"page":"1-9","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":12,"title":["CLIP-PAE: Projection-Augmentation Embedding to Extract Relevant Features for a Disentangled, Interpretable and Controllable Text-Guided Face Manipulation"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0001-1096-1927","authenticated-orcid":false,"given":"Chenliang","family":"Zhou","sequence":"first","affiliation":[{"name":"University of Cambridge, United Kingdom"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5964-5282","authenticated-orcid":false,"given":"Fangcheng","family":"Zhong","sequence":"additional","affiliation":[{"name":"University of Cambridge, United Kingdom"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4700-2236","authenticated-orcid":false,"given":"Cengiz","family":"\u00d6ztireli","sequence":"additional","affiliation":[{"name":"University of Cambridge, United Kingdom"}]}],"member":"320","published-online":{"date-parts":[[2023,7,23]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"2021 IEEE 17th International Conference on Intelligent Computer Communication and Processing (ICCP). IEEE, 163\u2013169","author":"Antal L\u00e1szl\u00f3","year":"2021","unstructured":"L\u00e1szl\u00f3 Antal and Zal\u00e1n Bod\u00f3 . 2021 . Feature axes orthogonalization in semantic face editing . In 2021 IEEE 17th International Conference on Intelligent Computer Communication and Processing (ICCP). IEEE, 163\u2013169 . L\u00e1szl\u00f3 Antal and Zal\u00e1n Bod\u00f3. 2021. Feature axes orthogonalization in semantic face editing. In 2021 IEEE 17th International Conference on Intelligent Computer Communication and Processing (ICCP). IEEE, 163\u2013169."},{"key":"e_1_3_2_1_2_1","volume-title":"Text and Image Guided 3D Avatar Generation and Manipulation. arXiv preprint arXiv:2202.06079","author":"Canfes Zehranaz","year":"2022","unstructured":"Zehranaz Canfes , M\u00a0Furkan Atasoy , Alara Dirik , and Pinar Yanardag . 2022. Text and Image Guided 3D Avatar Generation and Manipulation. arXiv preprint arXiv:2202.06079 ( 2022 ). Zehranaz Canfes, M\u00a0Furkan Atasoy, Alara Dirik, and Pinar Yanardag. 2022. Text and Image Guided 3D Avatar Generation and Manipulation. arXiv preprint arXiv:2202.06079 (2022)."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00367"},{"key":"e_1_3_2_1_4_1","volume-title":"Inverting the generator of a generative adversarial network","author":"Creswell Antonia","year":"2018","unstructured":"Antonia Creswell and Anil\u00a0Anthony Bharath . 2018. Inverting the generator of a generative adversarial network . IEEE transactions on neural networks and learning systems 30, 7 ( 2018 ), 1967\u20131974. Antonia Creswell and Anil\u00a0Anthony Bharath. 2018. Inverting the generator of a generative adversarial network. IEEE transactions on neural networks and learning systems 30, 7 (2018), 1967\u20131974."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00482"},{"key":"e_1_3_2_1_6_1","volume-title":"Learning to Compose Diversified Prompts for Image Emotion Classification. arXiv preprint arXiv:2201.10963","author":"Deng Sinuo","year":"2022","unstructured":"Sinuo Deng , Lifang Wu , Ge Shi , Lehao Xing , and Meng Jian . 2022. Learning to Compose Diversified Prompts for Image Emotion Classification. arXiv preprint arXiv:2201.10963 ( 2022 ). Sinuo Deng, Lifang Wu, Ge Shi, Lehao Xing, and Meng Jian. 2022. Learning to Compose Diversified Prompts for Image Emotion Classification. arXiv preprint arXiv:2201.10963 (2022)."},{"key":"e_1_3_2_1_7_1","volume-title":"Diffusion models beat gans on image synthesis. Advances in Neural Information Processing Systems 34","author":"Dhariwal Prafulla","year":"2021","unstructured":"Prafulla Dhariwal and Alexander Nichol . 2021. Diffusion models beat gans on image synthesis. Advances in Neural Information Processing Systems 34 ( 2021 ). Prafulla Dhariwal and Alexander Nichol. 2021. Diffusion models beat gans on image synthesis. Advances in Neural Information Processing Systems 34 (2021)."},{"key":"e_1_3_2_1_8_1","volume-title":"Explainable and interpretable models in computer vision and machine learning","author":"Doshi-Velez Finale","unstructured":"Finale Doshi-Velez and Been Kim . 2018. Considerations for evaluation and generalization in interpretable machine learning . In Explainable and interpretable models in computer vision and machine learning . Springer , 3\u201317. Finale Doshi-Velez and Been Kim. 2018. Considerations for evaluation and generalization in interpretable machine learning. In Explainable and interpretable models in computer vision and machine learning. Springer, 3\u201317."},{"key":"e_1_3_2_1_9_1","volume-title":"An argument for basic emotions. Cognition & emotion 6, 3-4","author":"Ekman Paul","year":"1992","unstructured":"Paul Ekman . 1992. An argument for basic emotions. Cognition & emotion 6, 3-4 ( 1992 ), 169\u2013200. Paul Ekman. 1992. An argument for basic emotions. Cognition & emotion 6, 3-4 (1992), 169\u2013200."},{"key":"e_1_3_2_1_10_1","first-page":"9216","article-title":"An image is worth more than a thousand words: Towards disentanglement in the wild","volume":"34","author":"Gabbay Aviv","year":"2021","unstructured":"Aviv Gabbay , Niv Cohen , and Yedid Hoshen . 2021 . An image is worth more than a thousand words: Towards disentanglement in the wild . Advances in Neural Information Processing Systems 34 (2021), 9216 \u2013 9228 . Aviv Gabbay, Niv Cohen, and Yedid Hoshen. 2021. An image is worth more than a thousand words: Towards disentanglement in the wild. Advances in Neural Information Processing Systems 34 (2021), 9216\u20139228.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3528223.3530164"},{"key":"e_1_3_2_1_12_1","volume-title":"Understanding and Detecting Hateful Content using Contrastive Learning. arXiv preprint arXiv:2201.08387","author":"Gonz\u00e1lez-Pizarro Felipe","year":"2022","unstructured":"Felipe Gonz\u00e1lez-Pizarro and Savvas Zannettou . 2022. Understanding and Detecting Hateful Content using Contrastive Learning. arXiv preprint arXiv:2201.08387 ( 2022 ). Felipe Gonz\u00e1lez-Pizarro and Savvas Zannettou. 2022. Understanding and Detecting Hateful Content using Contrastive Learning. arXiv preprint arXiv:2201.08387 (2022)."},{"key":"e_1_3_2_1_13_1","volume-title":"Generative adversarial nets. Advances in neural information processing systems 27","author":"Goodfellow Ian","year":"2014","unstructured":"Ian Goodfellow , Jean Pouget-Abadie , Mehdi Mirza , Bing Xu , David Warde-Farley , Sherjil Ozair , Aaron Courville , and Yoshua Bengio . 2014. Generative adversarial nets. Advances in neural information processing systems 27 ( 2014 ). Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. Advances in neural information processing systems 27 (2014)."},{"key":"e_1_3_2_1_14_1","volume-title":"Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in neural information processing systems 30","author":"Heusel Martin","year":"2017","unstructured":"Martin Heusel , Hubert Ramsauer , Thomas Unterthiner , Bernhard Nessler , and Sepp Hochreiter . 2017. Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in neural information processing systems 30 ( 2017 ). Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler, and Sepp Hochreiter. 2017. Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in neural information processing systems 30 (2017)."},{"key":"e_1_3_2_1_15_1","volume-title":"Advances in Neural Information Processing Systems, H.\u00a0Larochelle, M.\u00a0Ranzato, R.\u00a0Hadsell, M.F. Balcan, and H.\u00a0Lin (Eds.). Vol.\u00a033. Curran Associates","author":"Ho Jonathan","year":"2020","unstructured":"Jonathan Ho , Ajay Jain , and Pieter Abbeel . 2020. Denoising Diffusion Probabilistic Models . In Advances in Neural Information Processing Systems, H.\u00a0Larochelle, M.\u00a0Ranzato, R.\u00a0Hadsell, M.F. Balcan, and H.\u00a0Lin (Eds.). Vol.\u00a033. Curran Associates , Inc ., 6840\u20136851. https:\/\/proceedings.neurips.cc\/paper\/ 2020 \/file\/4c5bcfec8584af0d967f1ab10179ca4b-Paper.pdf Jonathan Ho, Ajay Jain, and Pieter Abbeel. 2020. Denoising Diffusion Probabilistic Models. In Advances in Neural Information Processing Systems, H.\u00a0Larochelle, M.\u00a0Ranzato, R.\u00a0Hadsell, M.F. Balcan, and H.\u00a0Lin (Eds.). Vol.\u00a033. Curran Associates, Inc., 6840\u20136851. https:\/\/proceedings.neurips.cc\/paper\/2020\/file\/4c5bcfec8584af0d967f1ab10179ca4b-Paper.pdf"},{"key":"e_1_3_2_1_16_1","volume-title":"FEAT: Face Editing with Attention. arXiv preprint arXiv:2202.02713","author":"Hou Xianxu","year":"2022","unstructured":"Xianxu Hou , Linlin Shen , Or Patashnik , Daniel Cohen-Or , and Hui Huang . 2022 a. FEAT: Face Editing with Attention. arXiv preprint arXiv:2202.02713 (2022). Xianxu Hou, Linlin Shen, Or Patashnik, Daniel Cohen-Or, and Hui Huang. 2022a. FEAT: Face Editing with Attention. arXiv preprint arXiv:2202.02713 (2022)."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2021.10.017"},{"key":"e_1_3_2_1_18_1","volume-title":"Principal component analysis for special types of data","author":"Jolliffe T","unstructured":"Ian\u00a0 T Jolliffe . 2002. Principal component analysis for special types of data . Springer . Ian\u00a0T Jolliffe. 2002. Principal component analysis for special types of data. Springer."},{"key":"e_1_3_2_1_19_1","volume-title":"Progressive growing of gans for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196","author":"Karras Tero","year":"2017","unstructured":"Tero Karras , Timo Aila , Samuli Laine , and Jaakko Lehtinen . 2017. Progressive growing of gans for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196 ( 2017 ). Tero Karras, Timo Aila, Samuli Laine, and Jaakko Lehtinen. 2017. Progressive growing of gans for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196 (2017)."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00453"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00813"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/WACV51458.2022.00373"},{"key":"e_1_3_2_1_23_1","volume-title":"Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision. 895\u2013904","author":"Kocasari Umut","year":"2022","unstructured":"Umut Kocasari , Alara Dirik , Mert Tiftikci , and Pinar Yanardag . 2022 . StyleMC: Multi-Channel Based Fast Text-Guided Image Generation and Manipulation . In Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision. 895\u2013904 . Umut Kocasari, Alara Dirik, Mert Tiftikci, and Pinar Yanardag. 2022. StyleMC: Multi-Channel Based Fast Text-Guided Image Generation and Manipulation. In Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision. 895\u2013904."},{"key":"e_1_3_2_1_24_1","volume-title":"Controllable text-to-image generation. Advances in Neural Information Processing Systems 32","author":"Li Bowen","year":"2019","unstructured":"Bowen Li , Xiaojuan Qi , Thomas Lukasiewicz , and Philip Torr . 2019. Controllable text-to-image generation. Advances in Neural Information Processing Systems 32 ( 2019 ). Bowen Li, Xiaojuan Qi, Thomas Lukasiewicz, and Philip Torr. 2019. Controllable text-to-image generation. Advances in Neural Information Processing Systems 32 (2019)."},{"key":"e_1_3_2_1_25_1","volume-title":"Language-driven Semantic Segmentation. arXiv preprint arXiv:2201.03546","author":"Li Boyi","year":"2022","unstructured":"Boyi Li , Kilian\u00a0 Q Weinberger , Serge Belongie , Vladlen Koltun , and Ren\u00e9 Ranftl . 2022. Language-driven Semantic Segmentation. arXiv preprint arXiv:2201.03546 ( 2022 ). Boyi Li, Kilian\u00a0Q Weinberger, Serge Belongie, Vladlen Koltun, and Ren\u00e9 Ranftl. 2022. Language-driven Semantic Segmentation. arXiv preprint arXiv:2201.03546 (2022)."},{"key":"e_1_3_2_1_26_1","volume-title":"EditGAN: High-Precision Semantic Image Editing. Advances in Neural Information Processing Systems 34","author":"Ling Huan","year":"2021","unstructured":"Huan Ling , Karsten Kreis , Daiqing Li , Seung\u00a0Wook Kim , Antonio Torralba , and Sanja Fidler . 2021. EditGAN: High-Precision Semantic Image Editing. Advances in Neural Information Processing Systems 34 ( 2021 ). Huan Ling, Karsten Kreis, Daiqing Li, Seung\u00a0Wook Kim, Antonio Torralba, and Sanja Fidler. 2021. EditGAN: High-Precision Semantic Image Editing. Advances in Neural Information Processing Systems 34 (2021)."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.5555\/795661.796202"},{"key":"e_1_3_2_1_28_1","volume-title":"Explanation in artificial intelligence: Insights from the social sciences. Artificial intelligence 267","author":"Miller Tim","year":"2019","unstructured":"Tim Miller . 2019. Explanation in artificial intelligence: Insights from the social sciences. Artificial intelligence 267 ( 2019 ), 1\u201338. Tim Miller. 2019. Explanation in artificial intelligence: Insights from the social sciences. Artificial intelligence 267 (2019), 1\u201338."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/3550469.3555392"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1900654116"},{"key":"e_1_3_2_1_31_1","volume-title":"MyStyle: A Personalized Generative Prior. arXiv preprint arXiv:2203.17272","author":"Nitzan Yotam","year":"2022","unstructured":"Yotam Nitzan , Kfir Aberman , Qiurui He , Orly Liba , Michal Yarom , Yossi Gandelsman , Inbar Mosseri , Yael Pritch , and Daniel Cohen-Or . 2022. MyStyle: A Personalized Generative Prior. arXiv preprint arXiv:2203.17272 ( 2022 ). Yotam Nitzan, Kfir Aberman, Qiurui He, Orly Liba, Michal Yarom, Yossi Gandelsman, Inbar Mosseri, Yael Pritch, and Daniel Cohen-Or. 2022. MyStyle: A Personalized Generative Prior. arXiv preprint arXiv:2203.17272 (2022)."},{"key":"e_1_3_2_1_32_1","first-page":"7198","article-title":"Swapping autoencoder for deep image manipulation","volume":"33","author":"Park Taesung","year":"2020","unstructured":"Taesung Park , Jun-Yan Zhu , Oliver Wang , Jingwan Lu , Eli Shechtman , Alexei Efros , and Richard Zhang . 2020 . Swapping autoencoder for deep image manipulation . Advances in Neural Information Processing Systems 33 (2020), 7198 \u2013 7211 . Taesung Park, Jun-Yan Zhu, Oliver Wang, Jingwan Lu, Eli Shechtman, Alexei Efros, and Richard Zhang. 2020. Swapping autoencoder for deep image manipulation. Advances in Neural Information Processing Systems 33 (2020), 7198\u20137211.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00209"},{"key":"e_1_3_2_1_34_1","volume-title":"Bogdan Raducanu, and Jose\u00a0M \u00c1lvarez.","author":"Perarnau Guim","year":"2016","unstructured":"Guim Perarnau , Joost Van De\u00a0Weijer , Bogdan Raducanu, and Jose\u00a0M \u00c1lvarez. 2016 . Invertible conditional gans for image editing. arXiv preprint arXiv:1611.06355 (2016). Guim Perarnau, Joost Van De\u00a0Weijer, Bogdan Raducanu, and Jose\u00a0M \u00c1lvarez. 2016. Invertible conditional gans for image editing. arXiv preprint arXiv:1611.06355 (2016)."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01411"},{"key":"e_1_3_2_1_36_1","volume-title":"International Conference on Machine Learning. PMLR, 8748\u20138763","author":"Radford Alec","year":"2021","unstructured":"Alec Radford , Jong\u00a0Wook Kim , Chris Hallacy , Aditya Ramesh , Gabriel Goh , Sandhini Agarwal , Girish Sastry , Amanda Askell , Pamela Mishkin , Jack Clark , 2021 . Learning transferable visual models from natural language supervision . In International Conference on Machine Learning. PMLR, 8748\u20138763 . Alec Radford, Jong\u00a0Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, 2021. Learning transferable visual models from natural language supervision. In International Conference on Machine Learning. PMLR, 8748\u20138763."},{"key":"e_1_3_2_1_37_1","volume-title":"Hierarchical text-conditional image generation with clip latents. arXiv preprint arXiv:2204.06125","author":"Ramesh Aditya","year":"2022","unstructured":"Aditya Ramesh , Prafulla Dhariwal , Alex Nichol , Casey Chu , and Mark Chen . 2022. Hierarchical text-conditional image generation with clip latents. arXiv preprint arXiv:2204.06125 ( 2022 ). Aditya Ramesh, Prafulla Dhariwal, Alex Nichol, Casey Chu, and Mark Chen. 2022. Hierarchical text-conditional image generation with clip latents. arXiv preprint arXiv:2204.06125 (2022)."},{"key":"e_1_3_2_1_38_1","volume-title":"One-shot domain adaptation for semantic face editing of real world images using StyleALAE. arXiv preprint arXiv:2108.13876","author":"Reddy Ravi\u00a0Kiran","year":"2021","unstructured":"Ravi\u00a0Kiran Reddy , Kumar Shubham , Gopalakrishnan Venkatesh , Sriram Gandikota , Sarthak Khoche , Dinesh\u00a0Babu Jayagopi , and Gopalakrishnan Srinivasaraghavan . 2021. One-shot domain adaptation for semantic face editing of real world images using StyleALAE. arXiv preprint arXiv:2108.13876 ( 2021 ). Ravi\u00a0Kiran Reddy, Kumar Shubham, Gopalakrishnan Venkatesh, Sriram Gandikota, Sarthak Khoche, Dinesh\u00a0Babu Jayagopi, and Gopalakrishnan Srinivasaraghavan. 2021. One-shot domain adaptation for semantic face editing of real world images using StyleALAE. arXiv preprint arXiv:2108.13876 (2021)."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1007\/BFb0020217"},{"key":"e_1_3_2_1_40_1","volume-title":"HyperExtended LightFace: A Facial Attribute Analysis Framework. In 2021 International Conference on Engineering and Emerging Technologies (ICEET). IEEE, 1\u20134. https:\/\/doi.org\/10","author":"Serengil Sefik\u00a0Ilkin","year":"2021","unstructured":"Sefik\u00a0Ilkin Serengil and Alper Ozpinar . 2021 . HyperExtended LightFace: A Facial Attribute Analysis Framework. In 2021 International Conference on Engineering and Emerging Technologies (ICEET). IEEE, 1\u20134. https:\/\/doi.org\/10 .1109\/ICEET53442.2021.9659697 10.1109\/ICEET53442.2021.9659697 Sefik\u00a0Ilkin Serengil and Alper Ozpinar. 2021. HyperExtended LightFace: A Facial Attribute Analysis Framework. In 2021 International Conference on Engineering and Emerging Technologies (ICEET). IEEE, 1\u20134. https:\/\/doi.org\/10.1109\/ICEET53442.2021.9659697"},{"key":"e_1_3_2_1_41_1","volume-title":"Interfacegan: Interpreting the disentangled face representation learned by gans","author":"Shen Yujun","year":"2020","unstructured":"Yujun Shen , Ceyuan Yang , Xiaoou Tang , and Bolei Zhou . 2020 . Interfacegan: Interpreting the disentangled face representation learned by gans . IEEE transactions on pattern analysis and machine intelligence (2020). Yujun Shen, Ceyuan Yang, Xiaoou Tang, and Bolei Zhou. 2020. Interfacegan: Interpreting the disentangled face representation learned by gans. IEEE transactions on pattern analysis and machine intelligence (2020)."},{"key":"e_1_3_2_1_42_1","volume-title":"Unpaired Referring Expression Grounding via Bidirectional Cross-Modal Matching. arXiv preprint arXiv:2201.06686","author":"Shi Hengcan","year":"2022","unstructured":"Hengcan Shi , Munawar Hayat , and Jianfei Cai . 2022a. Unpaired Referring Expression Grounding via Bidirectional Cross-Modal Matching. arXiv preprint arXiv:2201.06686 ( 2022 ). Hengcan Shi, Munawar Hayat, and Jianfei Cai. 2022a. Unpaired Referring Expression Grounding via Bidirectional Cross-Modal Matching. arXiv preprint arXiv:2201.06686 (2022)."},{"key":"e_1_3_2_1_43_1","volume-title":"ProposalCLIP: Unsupervised Open-Category Object Proposal Generation via Exploiting CLIP Cues. arXiv preprint arXiv:2201.06696","author":"Shi Hengcan","year":"2022","unstructured":"Hengcan Shi , Munawar Hayat , Yicheng Wu , and Jianfei Cai . 2022b. ProposalCLIP: Unsupervised Open-Category Object Proposal Generation via Exploiting CLIP Cues. arXiv preprint arXiv:2201.06696 ( 2022 ). Hengcan Shi, Munawar Hayat, Yicheng Wu, and Jianfei Cai. 2022b. ProposalCLIP: Unsupervised Open-Category Object Proposal Generation via Exploiting CLIP Cues. arXiv preprint arXiv:2201.06696 (2022)."},{"key":"e_1_3_2_1_44_1","volume-title":"Visualizing data using t-SNE.Journal of machine learning research 9, 11","author":"Maaten Laurens Van\u00a0der","year":"2008","unstructured":"Laurens Van\u00a0der Maaten and Geoffrey Hinton . 2008. Visualizing data using t-SNE.Journal of machine learning research 9, 11 ( 2008 ). Laurens Van\u00a0der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE.Journal of machine learning research 9, 11 (2008)."},{"key":"e_1_3_2_1_45_1","volume-title":"Clipasso: Semantically-aware object sketching. arXiv preprint arXiv:2202.05822","author":"Vinker Yael","year":"2022","unstructured":"Yael Vinker , Ehsan Pajouheshgar , Jessica\u00a0 Y Bo , Roman\u00a0Christian Bachmann , Amit\u00a0Haim Bermano , Daniel Cohen-Or , Amir Zamir , and Ariel Shamir . 2022 . Clipasso: Semantically-aware object sketching. arXiv preprint arXiv:2202.05822 (2022). Yael Vinker, Ehsan Pajouheshgar, Jessica\u00a0Y Bo, Roman\u00a0Christian Bachmann, Amit\u00a0Haim Bermano, Daniel Cohen-Or, Amir Zamir, and Ariel Shamir. 2022. Clipasso: Semantically-aware object sketching. arXiv preprint arXiv:2202.05822 (2022)."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01267"},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00229"},{"key":"e_1_3_2_1_48_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 18229\u201318238","author":"Xu Zipeng","year":"2022","unstructured":"Zipeng Xu , Tianwei Lin , Hao Tang , Fu Li , Dongliang He , Nicu Sebe , Radu Timofte , Luc Van\u00a0Gool , and Errui Ding . 2022 . Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 18229\u201318238 . Zipeng Xu, Tianwei Lin, Hao Tang, Fu Li, Dongliang He, Nicu Sebe, Radu Timofte, Luc Van\u00a0Gool, and Errui Ding. 2022. Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 18229\u201318238."},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475206"},{"key":"e_1_3_2_1_50_1","volume-title":"GI-AEE: GAN Inversion Based Attentive Expression Embedding Network For Facial Expression Editing. In 2021 IEEE International Conference on Image Processing (ICIP). IEEE, 2453\u20132457","author":"Zhang Yun","year":"2021","unstructured":"Yun Zhang , Ruixin Liu , Yifan Pan , Dehao Wu , Yuesheng Zhu , and Zhiqiang Bai . 2021 . GI-AEE: GAN Inversion Based Attentive Expression Embedding Network For Facial Expression Editing. In 2021 IEEE International Conference on Image Processing (ICIP). IEEE, 2453\u20132457 . Yun Zhang, Ruixin Liu, Yifan Pan, Dehao Wu, Yuesheng Zhu, and Zhiqiang Bai. 2021. GI-AEE: GAN Inversion Based Attentive Expression Embedding Network For Facial Expression Editing. In 2021 IEEE International Conference on Image Processing (ICIP). IEEE, 2453\u20132457."},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46454-1_36"},{"key":"e_1_3_2_1_52_1","volume-title":"Perceptually Validated Precise Local Editing for Facial Action Units with StyleGAN. arXiv preprint arXiv:2107.12143","author":"Zindanc\u0131o\u011flu Alara","year":"2021","unstructured":"Alara Zindanc\u0131o\u011flu and T\u00a0Metin Sezgin . 2021. Perceptually Validated Precise Local Editing for Facial Action Units with StyleGAN. arXiv preprint arXiv:2107.12143 ( 2021 ). Alara Zindanc\u0131o\u011flu and T\u00a0Metin Sezgin. 2021. Perceptually Validated Precise Local Editing for Facial Action Units with StyleGAN. arXiv preprint arXiv:2107.12143 (2021)."}],"event":{"name":"SIGGRAPH '23: Special Interest Group on Computer Graphics and Interactive Techniques Conference","location":"Los Angeles CA USA","acronym":"SIGGRAPH '23","sponsor":["SIGCHI ACM Special Interest Group on Computer-Human Interaction"]},"container-title":["Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Proceedings"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3588432.3591532","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:47:11Z","timestamp":1750178831000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3588432.3591532"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,23]]},"references-count":52,"alternative-id":["10.1145\/3588432.3591532","10.1145\/3588432"],"URL":"https:\/\/doi.org\/10.1145\/3588432.3591532","relation":{},"subject":[],"published":{"date-parts":[[2023,7,23]]},"assertion":[{"value":"2023-07-23","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}