{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,4]],"date-time":"2026-06-04T05:23:37Z","timestamp":1780550617036,"version":"3.54.1"},"reference-count":81,"publisher":"Springer Science and Business Media LLC","issue":"8","license":[{"start":{"date-parts":[[2024,3,13]],"date-time":"2024-03-13T00:00:00Z","timestamp":1710288000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,3,13]],"date-time":"2024-03-13T00:00:00Z","timestamp":1710288000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100010049","name":"Kingston University","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100010049","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100010661","name":"Horizon 2020 Framework Programme","doi-asserted-by":"publisher","award":["951911"],"award-info":[{"award-number":["951911"]}],"id":[{"id":"10.13039\/100010661","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J Comput Vis"],"published-print":{"date-parts":[[2024,8]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In this paper, we present our framework for neural face\/head reenactment whose goal is to transfer the 3D head orientation and expression of a target face to a source face. Previous methods focus on learning embedding networks for identity and head pose\/expression disentanglement which proves to be a rather hard task, degrading the quality of the generated images. We take a different approach, bypassing the training of such networks, by using (fine-tuned) pre-trained GANs which have been shown capable of producing high-quality facial images. Because GANs are characterized by weak controllability, the core of our approach is a method to discover which directions in latent GAN space are responsible for controlling head pose and expression variations. We present a simple pipeline to learn such directions with the aid of a 3D shape model which, by construction, inherently captures disentangled directions for head pose, identity, and expression. Moreover, we show that by embedding real images in the GAN latent space, our method can be successfully used for the reenactment of real-world faces. Our method features several favorable properties including using a single source image (one-shot) and enabling cross-person reenactment. Extensive qualitative and quantitative results show that our approach typically produces reenacted faces of notably higher quality than those produced by state-of-the-art methods for the standard benchmarks of VoxCeleb1 &amp; 2.<\/jats:p>","DOI":"10.1007\/s11263-024-02018-6","type":"journal-article","created":{"date-parts":[[2024,3,13]],"date-time":"2024-03-13T09:06:18Z","timestamp":1710320778000},"page":"3324-3354","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":12,"title":["One-Shot Neural Face Reenactment via Finding Directions in GAN\u2019s Latent Space"],"prefix":"10.1007","volume":"132","author":[{"ORCID":"https:\/\/orcid.org\/0009-0001-3704-2162","authenticated-orcid":false,"given":"Stella","family":"Bounareli","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2036-9089","authenticated-orcid":false,"given":"Christos","family":"Tzelepis","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Vasileios","family":"Argyriou","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ioannis","family":"Patras","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Georgios","family":"Tzimiropoulos","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2024,3,13]]},"reference":[{"key":"2018_CR1","doi-asserted-by":"crossref","unstructured":"Abdal, R., Qin, Y., & Wonka, P. (2019). Image2stylegan: How to embed images into the stylegan latent space? In Proceedings of the IEEE\/CVF international conference on computer vision (pp. 4432\u20134441).","DOI":"10.1109\/ICCV.2019.00453"},{"issue":"3","key":"2018_CR2","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3447648","volume":"40","author":"R Abdal","year":"2021","unstructured":"Abdal, R., Zhu, P., Mitra, N. J., & Wonka, P. (2021). Styleflow: Attribute-conditioned exploration of stylegan-generated images using conditional continuous normalizing flows. ACM Transactions on Graphics (ToG), 40(3), 1\u201321.","journal-title":"ACM Transactions on Graphics (ToG)"},{"key":"2018_CR3","doi-asserted-by":"crossref","unstructured":"Alaluf, Y., Patashnik, O., & Cohen-Or, D. (2021). Restyle: A residual-based stylegan encoder via iterative refinement. In Proceedings of the IEEE\/CVF international conference on computer vision (pp. 6711\u20136720).","DOI":"10.1109\/ICCV48922.2021.00664"},{"key":"2018_CR4","doi-asserted-by":"crossref","unstructured":"Alaluf, Y., Tov, O., Mokady, R., Gal, R., & Bermano, A. (2022). Hyperstyle: Stylegan inversion with hypernetworks for real image editing. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 18511\u201318521).","DOI":"10.1109\/CVPR52688.2022.01796"},{"key":"2018_CR5","doi-asserted-by":"crossref","unstructured":"Bai, Q., Xu, Y., Zhu, J., Xia, W., Yang, Y., & Shen, Y. (2022). High-fidelity GAN inversion with padding space. In X. V. Part (Ed.), Computer Vision-ECCV 2022: 17th European conference (pp. 36\u201353). Springer.","DOI":"10.1007\/978-3-031-19784-0_3"},{"key":"2018_CR6","doi-asserted-by":"crossref","unstructured":"Bao, J., Chen, D., Wen, F., Li, H., & Hua, G. (2018). Towards open-set identity preserving face synthesis. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 6713\u20136722).","DOI":"10.1109\/CVPR.2018.00702"},{"key":"2018_CR7","doi-asserted-by":"crossref","unstructured":"Barattin, S., Tzelepis, C., Patras, I., & Sebe, N. (2023). Attribute-preserving face dataset anonymization via latent code optimization. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 8001\u20138010).","DOI":"10.1109\/CVPR52729.2023.00773"},{"key":"2018_CR8","doi-asserted-by":"crossref","unstructured":"Blanz, V., & Vetter, T. (1999). A morphable model for the synthesis of 3d faces. In Proceedings of the 26th annual conference on computer graphics and interactive techniques (pp. 187\u2013194).","DOI":"10.1145\/311535.311556"},{"key":"2018_CR9","unstructured":"Bounareli, S., Argyriou, V., & Tzimiropoulos, G. (2022). Finding directions in GAN\u2019s latent space for neural face reenactment. In British Machine vision conference (BMVC)"},{"key":"2018_CR10","doi-asserted-by":"crossref","unstructured":"Bounareli, S., Tzelepis, C., Argyriou, V., Patras, I., & Tzimiropoulos, G. (2023). StyleMask: Disentangling the style space of StyleGAN2 for neural face reenactment. In 2023 IEEE 17th international conference on automatic face and gesture recognition (FG) (pp. 1\u20138). IEEE.","DOI":"10.1109\/FG57933.2023.10042744"},{"key":"2018_CR11","doi-asserted-by":"crossref","unstructured":"Bulat, A., & Tzimiropoulos, G. (2017). How far are we from solving the 2d & 3d face alignment problem? (and a dataset of 230,000 3d facial landmarks). In Proceedings of the IEEE international conference on computer vision (pp. 1021\u20131030).","DOI":"10.1109\/ICCV.2017.116"},{"key":"2018_CR12","doi-asserted-by":"crossref","unstructured":"Burkov, E., Pasechnik, I., Grigorev, A., & Lempitsky, V. (2020). Neural head reenactment with latent pose descriptors. In: CVPR.","DOI":"10.1109\/CVPR42600.2020.01380"},{"key":"2018_CR13","unstructured":"Chen, X., Duan, Y., Houthooft, R., Schulman, J., Sutskever, I., & Abbeel, P. (2016). Infogan: Interpretable representation learning by information maximizing generative adversarial nets. Advances in neural Information Processing Systems, 29."},{"key":"2018_CR14","doi-asserted-by":"crossref","unstructured":"Chung, J.S., Nagrani, A., & Zisserman, A. (2018). Voxceleb2: Deep speaker recognition. In INTERSPEECH.","DOI":"10.21437\/Interspeech.2018-1929"},{"key":"2018_CR15","doi-asserted-by":"crossref","unstructured":"Deng, J., Guo, J., Xue, N., & Zafeiriou, S. (2019). Arcface: Additive angular margin loss for deep face recognition. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 4690\u20134699).","DOI":"10.1109\/CVPR.2019.00482"},{"key":"2018_CR16","doi-asserted-by":"crossref","unstructured":"Deng, Y., Yang, J., Chen, D., Wen, F., & Tong, X. (2020). Disentangled and controllable face image generation via 3d imitative-contrastive learning. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 5154\u20135163).","DOI":"10.1109\/CVPR42600.2020.00520"},{"key":"2018_CR17","doi-asserted-by":"crossref","unstructured":"Dinh, T. M., Tran, A. T., Nguyen, R., & Hua, B. S. (2022). Hyperinverter: Improving stylegan inversion via hypernetwork. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 11389\u201311398).","DOI":"10.1109\/CVPR52688.2022.01110"},{"key":"2018_CR18","doi-asserted-by":"crossref","unstructured":"Doukas, M.C., Zafeiriou, S., & Sharmanska, V. (2021). Headgan: One-shot neural head synthesis and editing. In Proceedings of the IEEE\/CVF international conference on computer vision (pp. 14398\u201314407).","DOI":"10.1109\/ICCV48922.2021.01413"},{"key":"2018_CR19","unstructured":"Durall, R., Jam, J., Strassel, D., Yap, M. H., & Keuper, J. (2021). Facialgan: Style transfer and attribute manipulation on synthetic faces. In 32nd British machine vision conference (pp. 1\u201314)."},{"issue":"4","key":"2018_CR20","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3450626.3459936","volume":"40","author":"Y Feng","year":"2021","unstructured":"Feng, Y., Feng, H., Black, M. J., & Bolkart, T. (2021). Learning an animatable detailed 3d face model from in-the-wild images. ACM Transactions on Graphics, 40(4), 1\u201313.","journal-title":"ACM Transactions on Graphics"},{"key":"2018_CR21","doi-asserted-by":"crossref","unstructured":"Ghosh, P., Gupta, P. S., Uziel, R., Ranjan, A., Black, M. J., & Bolkart, T. (2020) GIF: Generative interpretable faces. In 8th international conference on 3D vision, 3DV 2020, Virtual Event (pp. 868\u2013878). IEEE.","DOI":"10.1109\/3DV50981.2020.00097"},{"issue":"5","key":"2018_CR22","doi-asserted-by":"publisher","first-page":"807","DOI":"10.1016\/j.imavis.2009.08.002","volume":"28","author":"R Gross","year":"2010","unstructured":"Gross, R., Matthews, I., Cohn, J., Kanade, T., & Baker, S. (2010). Multi-pie. Image and Vision Computing, 28(5), 807\u2013813.","journal-title":"Image and Vision Computing"},{"key":"2018_CR23","doi-asserted-by":"crossref","unstructured":"Ha, S., Kersner, M., Kim, B., Seo, S., & Kim, D. (2020). Marionette: Few-shot face reenactment preserving identity of unseen targets. In Proceedings of the AAAI conference on artificial intelligence (pp. 10893\u201310900).","DOI":"10.1609\/aaai.v34i07.6721"},{"key":"2018_CR24","unstructured":"H\u00e4rk\u00f6nen, E., Hertzmann, A., Lehtinen, J., & Paris, S. (2020). Ganspace: Discovering interpretable gan controls. In Proc. NeurIPS."},{"key":"2018_CR25","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770\u2013778).","DOI":"10.1109\/CVPR.2016.90"},{"key":"2018_CR26","unstructured":"Heusel, M., Ramsauer, H., Unterthiner, T., Nessler, B., & Hochreiter, S. (2017). GANs trained by a two time-scale update rule converge to a local nash equilibrium. Advances in Neural Information Processing Systems, 30."},{"key":"2018_CR27","doi-asserted-by":"crossref","unstructured":"Hsu, G.S., Tsai, C.H., & Wu, H.Y. (2022). Dual-generator face reenactment. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 642\u2013650).","DOI":"10.1109\/CVPR52688.2022.00072"},{"key":"2018_CR28","doi-asserted-by":"crossref","unstructured":"Johnson, J., Alahi, A., & Fei-Fei, L. (2016). Perceptual losses for real-time style transfer and super-resolution. In European conference on computer vision (pp. 694\u2013711). Springer.","DOI":"10.1007\/978-3-319-46475-6_43"},{"key":"2018_CR29","doi-asserted-by":"crossref","unstructured":"Kang, K., Kim, S., & Cho, S. (2021). Gan inversion for out-of-range images with geometric transformations. In Proceedings of the IEEE\/CVF international conference on computer vision (pp. 13941\u201313949).","DOI":"10.1109\/ICCV48922.2021.01368"},{"key":"2018_CR30","unstructured":"Karras, T., Aittala, M., Hellsten, J., Laine, S., Lehtinen, J., & Aila, T. (2020a). Training generative adversarial networks with limited data. In H. Larochelle, M. Ranzato, R. Hadsell, et al. (Eds.), Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, virtual."},{"key":"2018_CR31","doi-asserted-by":"crossref","unstructured":"Karras, T., Laine, S., & Aila, T. (2019). A style-based generator architecture for generative adversarial networks. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 4401\u20134410).","DOI":"10.1109\/CVPR.2019.00453"},{"key":"2018_CR32","doi-asserted-by":"crossref","unstructured":"Karras, T., Laine, S., Aittala, M., Hellsten, J., Lehtinen, J., & Aila, T. (2020b). Analyzing and improving the image quality of stylegan. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 8110\u20138119).","DOI":"10.1109\/CVPR42600.2020.00813"},{"key":"2018_CR33","unstructured":"Kingma, D.P., Ba, J. (2015). Adam: A method for stochastic optimization. In Y. Bengio, Y. LeCun (Eds.) 3rd International conference on learning representations, ICLR 2015, Conference Track Proceedings."},{"key":"2018_CR34","doi-asserted-by":"crossref","unstructured":"Kowalski, M., Garbin, S.J., Estellers, V., Johnson, M., & Shotton, J. (2020). Config: Controllable neural face image generation. In European conference on computer vision (ECCV).","DOI":"10.1007\/978-3-030-58621-8_18"},{"key":"2018_CR35","doi-asserted-by":"crossref","unstructured":"Meshry, M., Suri, S., Davis, L.S., & Shrivastava, A. (2021). Learned spatial representations for few-shot talking-head synthesis. In Proceedings of the IEEE\/CVF international conference on computer vision (pp. 13829\u201313838).","DOI":"10.1109\/ICCV48922.2021.01357"},{"key":"2018_CR36","doi-asserted-by":"crossref","unstructured":"Nagrani, A., Chung, J.S., & Zisserman, A. (2017). Voxceleb: A large-scale speaker identification dataset. In INTERSPEECH.","DOI":"10.21437\/Interspeech.2017-950"},{"key":"2018_CR37","doi-asserted-by":"crossref","unstructured":"Nitzan, Y., Bermano, A., & Li, Y., & Cohen-Or, D. (2020). Face identity disentanglement via latent space mapping. Preprint at arXiv:2005.07728.","DOI":"10.1145\/3414685.3417826"},{"key":"2018_CR38","doi-asserted-by":"crossref","unstructured":"Nitzan, Y., Gal, R., & Brenner, O., & Cohen-Or, D. (2021). Large: Latent-based regression through GAN semantics. Preprint at arXiv:2107.11186.","DOI":"10.1109\/CVPR52688.2022.01864"},{"key":"2018_CR39","unstructured":"Oldfield, J., Georgopoulos, M., Panagakis, Y., Nicolaou, M. A., & Patras, I. (2021). Tensor component analysis for interpreting the latent space of GANs. In 32nd British machine vision conference 2021, BMVC 2021 (p. 222)."},{"key":"2018_CR40","unstructured":"Oldfield, J., Tzelepis, C., & Panagakis, Y., Nicolaou, M. A., & Patras, I. (2023). PandA: Unsupervised learning of parts and appearances in the feature maps of GANs. In The eleventh international conference on learning representations, ICLR 2023, OpenReview.net. https:\/\/openreview.net\/pdf?id=iUdSB2kK9GY."},{"key":"2018_CR41","doi-asserted-by":"crossref","unstructured":"Parmar, G., Li, Y., Lu, J., Zhang, R., Zhu, J. Y., & Singh, K. K. (2022). Spatially-adaptive multilayer selection for GAN inversion and editing. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 11399\u201311409).","DOI":"10.1109\/CVPR52688.2022.01111"},{"key":"2018_CR42","first-page":"8026","volume":"32","author":"A Paszke","year":"2019","unstructured":"Paszke, A., Gross, S., Massa, F., et al. (2019). Pytorch: An imperative style, high-performance deep learning library. Advances in Neural Information Processing Systems, 32, 8026\u20138037.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2018_CR43","doi-asserted-by":"crossref","unstructured":"Ren, Y., Li, G., Chen, Y., Li, T. H., & Liu, S. (2021). Pirenderer: Controllable portrait image generation via semantic neural rendering. In Proceedings of the IEEE\/CVF international conference on computer vision (pp. 13759\u201313768).","DOI":"10.1109\/ICCV48922.2021.01350"},{"key":"2018_CR44","doi-asserted-by":"crossref","unstructured":"Richardson, E., Alaluf, Y., Patashnik, O., Nitzan, Y., Azar, Y., Shapiro, S., & Cohen-Or, D. (2021). Encoding in style: A stylegan encoder for image-to-image translation. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 2287\u20132296).","DOI":"10.1109\/CVPR46437.2021.00232"},{"key":"2018_CR45","doi-asserted-by":"crossref","unstructured":"Roich, D., Mokady, R., Bermano, A. H., & Cohen-Or, D. (2021). Pivotal tuning for latent-based editing of real images. Preprint arXiv:2106.05744.","DOI":"10.1145\/3544777"},{"key":"2018_CR46","unstructured":"R\u00f6ssler, A., Cozzolino, D., & Verdoliva, L., Riess, C., Thies, J., & Nie\u00dfner, M. (2018). FaceForensics: A large-scale video dataset for forgery detection in human faces."},{"key":"2018_CR47","doi-asserted-by":"crossref","unstructured":"Sanchez, E., & Valstar, M. (2020). A recurrent cycle consistency loss for progressive face-to-face synthesis. In 2020 15th IEEE international conference on automatic face and gesture recognition (FG 2020) (pp. 53\u201360). IEEE.","DOI":"10.1109\/FG47880.2020.00015"},{"key":"2018_CR48","doi-asserted-by":"crossref","unstructured":"Shen, J., Zafeiriou, S., Chrysos, G. G., Kossaifi, J., Tzimiropoulos, G., & Pantic, M. (2015). The first facial landmark tracking in-the-wild challenge: Benchmark and results. In Proceedings of the IEEE international conference on computer vision workshops (pp. 50\u201358).","DOI":"10.1109\/ICCVW.2015.132"},{"key":"2018_CR49","unstructured":"Shen, Y., Yang, C., Tang, X., & Zhou, B. (2020). Interfacegan: Interpreting the disentangled face representation learned by GANs. In IEEE transactions on pattern analysis and machine intelligence."},{"key":"2018_CR50","doi-asserted-by":"crossref","unstructured":"Shen, Y., & Zhou, B. (2021). Closed-form factorization of latent semantics in GANs. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 1532\u20131540).","DOI":"10.1109\/CVPR46437.2021.00158"},{"key":"2018_CR51","doi-asserted-by":"crossref","unstructured":"Shoshan, A., Bhonker, N., Kviatkovsky, I., & Medioni, G. (2021). Gan-control: Explicitly controllable GANs. Preprint arXiv:2101.02477.","DOI":"10.1109\/ICCV48922.2021.01382"},{"key":"2018_CR52","first-page":"7137","volume":"32","author":"A Siarohin","year":"2019","unstructured":"Siarohin, A., Lathuili\u00e8re, S., Tulyakov, S., Ricci, E., & Sebe, N. (2019). First order motion model for image animation. Advances in Neural Information Processing Systems, 32, 7137\u20137147.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2018_CR53","doi-asserted-by":"crossref","unstructured":"Skorokhodov, I., Tulyakov, S., & Elhoseiny, M. (2022). Stylegan-v: A continuous video generator with the price, image quality and perks of stylegan2. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 3626\u20133636).","DOI":"10.1109\/CVPR52688.2022.00361"},{"issue":"6","key":"2018_CR54","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3414685.3417803","volume":"39","author":"A Tewari","year":"2020","unstructured":"Tewari, A., Elgharib, M., Bernard, F., Seidel, H. P., P\u00e9rez, P., Zollh\u00f6fer, M., & Theobalt, C. (2020). Pie: Portrait image embedding for semantic control. ACM Transactions on Graphics, 39(6), 1\u201314.","journal-title":"ACM Transactions on Graphics"},{"key":"2018_CR55","doi-asserted-by":"crossref","unstructured":"Tewari A, Elgharib M, Bharaj G, Bernard F, Seidel HP, P\u00e9rez P, Zollhofer M, Theobalt C(2020b). Stylerig: Rigging stylegan for 3d control over portrait images. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 6142\u20136151).","DOI":"10.1109\/CVPR42600.2020.00618"},{"issue":"4","key":"2018_CR56","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3450626.3459838","volume":"40","author":"O Tov","year":"2021","unstructured":"Tov, O., Alaluf, Y., Nitzan, Y., Patashnik, O., & Cohen-Or, D. (2021). Designing an encoder for stylegan image manipulation. ACM Transactions on Graphics, 40(4), 1\u201314.","journal-title":"ACM Transactions on Graphics"},{"key":"2018_CR57","doi-asserted-by":"crossref","unstructured":"Tripathy, S., Kannala, J., & Rahtu, E. (2020). Icface: Interpretable and controllable face reenactment using GANs. In Proceedings of the IEEE\/CVF winter conference on applications of computer vision (pp. 3385\u20133394).","DOI":"10.1109\/WACV45572.2020.9093474"},{"key":"2018_CR58","doi-asserted-by":"crossref","unstructured":"Tripathy, S., Kannala, J., & Rahtu, E. (2021). Facegan: Facial attribute controllable reenactment GAN. In Proceedings of the IEEE\/CVF winter conference on applications of computer vision (pp. 1329\u20131338).","DOI":"10.1109\/WACV48630.2021.00137"},{"key":"2018_CR59","unstructured":"Tzelepis, C., Oldfield, J., Tzimiropoulos, G., & Patras, I. (2022). ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences. Preprint arXiv:2206.02104"},{"key":"2018_CR60","doi-asserted-by":"crossref","unstructured":"Tzelepis, C., Tzimiropoulos, G., & Patras, I. (2021). WarpedGANSpace: Finding non-linear RBF paths in GAN latent space. In Proceedings of the IEEE\/CVF international conference on computer vision (pp. 6393\u20136402).","DOI":"10.1109\/ICCV48922.2021.00633"},{"key":"2018_CR61","unstructured":"Unterthiner, T., van Steenkiste, S., Kurach, K., Marinier, R., Michalski, M., & Gelly, S. (2018). Towards accurate generative models of video: A new metric & challenges. Preprint arXiv:1812.01717."},{"key":"2018_CR62","unstructured":"Voynov, A., & Babenko, A. (2020). Unsupervised discovery of interpretable directions in the GAN latent space. In International conference on machine learning (pp. 9786\u20139796). PMLR."},{"key":"2018_CR63","unstructured":"Wang, C., Chai, M., He, M., Chen, D., & Liao, J. (2021a). Cross-domain and disentangled face manipulation with 3d guidance. Preprint arXiv:2104.11228."},{"key":"2018_CR64","doi-asserted-by":"crossref","unstructured":"Wang, T.C., Mallya, A., & Liu, M.Y. (2021b). One-shot free-view neural talking-head synthesis for video conferencing. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 10039\u201310049).","DOI":"10.1109\/CVPR46437.2021.00991"},{"key":"2018_CR65","doi-asserted-by":"crossref","unstructured":"Wang, T., Zhang, Y., Fan, Y., Wang, J., & Chen, Q. (2022a). High-fidelity GAN inversion for image attribute editing. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 11379\u201311388).","DOI":"10.1109\/CVPR52688.2022.01109"},{"key":"2018_CR66","unstructured":"Wang, Y., Yang, D., Bremond, F., & Dantcheva, A. (2022b). Latent image animator: Learning to animate images via latent space navigation. In International conference on learning representations."},{"key":"2018_CR67","doi-asserted-by":"crossref","unstructured":"Wiles, O., Koepke, A., & Zisserman, A. (2018). X2face: A network for controlling face generation using images, audio, and pose codes. In Proceedings of the European conference on computer vision (ECCV) (pp. 670\u2013686).","DOI":"10.1007\/978-3-030-01261-8_41"},{"key":"2018_CR68","doi-asserted-by":"crossref","unstructured":"Yang, H., Chai, L., Wen, Q., Zhao, S., Sun, Z., & He, S. (2021). Discovering interpretable latent space directions of GANs beyond binary attributes. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 12177\u201312185).","DOI":"10.1109\/CVPR46437.2021.01200"},{"key":"2018_CR69","doi-asserted-by":"crossref","unstructured":"Yang, K., Chen, K., Guo, D., Zhang, S. H., Guo, Y. C., & Zhang, W. (2022). Face2face $$\\rho $$: Real-time high-resolution one-shot face reenactment. In European conference on computer vision (pp. 55\u201371). Springer.","DOI":"10.1007\/978-3-031-19778-9_4"},{"key":"2018_CR70","doi-asserted-by":"crossref","unstructured":"Yao, X., Newson, A., Gousseau, Y., & Hellier, P. (2021). A latent transformer for disentangled face editing in images and videos. In Proceedings of the IEEE\/CVF international conference on computer vision (pp. 13789\u201313798).","DOI":"10.1109\/ICCV48922.2021.01353"},{"key":"2018_CR71","doi-asserted-by":"crossref","unstructured":"Yao, X., Newson, A., Gousseau, Y., & Hellier, P. (2022a). A style-based GAN encoder for high fidelity reconstruction of images and videos. In European conference on computer vision.","DOI":"10.1007\/978-3-031-19784-0_34"},{"key":"2018_CR72","doi-asserted-by":"crossref","unstructured":"Yao, X., Newson, A., Gousseau, Y., & Hellier, P. (2022b). A style-based GAN encoder for high fidelity reconstruction of images and videos. In X. V. Part (Ed.), Computer Vision-ECCV 2022: 17th European conference, (pp. 581\u2013597). Springer.","DOI":"10.1007\/978-3-031-19784-0_34"},{"key":"2018_CR73","doi-asserted-by":"crossref","unstructured":"Yao, G., Yuan, Y., Shao, T., & Zhou, K. (2020). Mesh guided one-shot face reenactment using graph convolutional networks. In Proceedings of the 28th ACM international conference on multimedia (pp. 1773\u20131781).","DOI":"10.1145\/3394171.3413865"},{"key":"2018_CR74","doi-asserted-by":"crossref","unstructured":"Zakharov, E., Ivakhnenko, A., Shysheya, A., & Lempitsky, V. (2020). Fast bi-layer neural synthesis of one-shot realistic head avatars. In ECCV.","DOI":"10.1007\/978-3-030-58610-2_31"},{"key":"2018_CR75","doi-asserted-by":"crossref","unstructured":"Zakharov, E., Shysheya, A., Burkov, E., & Lempitsky, V. (2019). Few-shot adversarial learning of realistic neural talking head models. In Proceedings of the IEEE\/CVF international conference on computer vision (pp. 9459\u20139468).","DOI":"10.1109\/ICCV.2019.00955"},{"key":"2018_CR76","doi-asserted-by":"crossref","unstructured":"Zeng, X., Pan, Y., Wang, M., Zhang, J., & Liu, Y. (2020). Realistic face reenactment via self-supervised disentangling of identity and pose. In Proceedings of the AAAI conference on artificial intelligence (pp. 12757\u201312764).","DOI":"10.1609\/aaai.v34i07.6970"},{"key":"2018_CR77","doi-asserted-by":"crossref","unstructured":"Zhang, J., Zeng, X., Wang, M., Pan, Y., Liu, L., Liu, Y., Ding, Y., & Fan, C. (2020). Freenet: Multi-identity face reenactment. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 5326\u20135335).","DOI":"10.1109\/CVPR42600.2020.00537"},{"key":"2018_CR78","doi-asserted-by":"crossref","unstructured":"Zhang, R., Isola, P., Efros, A.A., & Shechtman, E., & Wang, O. (2018). The unreasonable effectiveness of deep features as a perceptual metric. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 586\u2013595).","DOI":"10.1109\/CVPR.2018.00068"},{"key":"2018_CR79","doi-asserted-by":"crossref","unstructured":"Zheng, Y., Yang, H., Zhang, T., Bao, J., Chen, D., Huang, Y., Yuan, L., Chen, D., Zeng, M., & Wen, F. (2022). General facial representation learning in a visual-linguistic manner. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 18697\u201318709).","DOI":"10.1109\/CVPR52688.2022.01814"},{"key":"2018_CR80","doi-asserted-by":"crossref","unstructured":"Zhou, H., Liu, J., Liu, Z., Liu, Y., & Wang, X. (2020). Rotate-and-render: Unsupervised photorealistic face rotation from single-view images. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 5911\u20135920).","DOI":"10.1109\/CVPR42600.2020.00595"},{"key":"2018_CR81","doi-asserted-by":"crossref","unstructured":"Zhu, J., Shen, Y., Zhao, D., & Zhou, B. (2020). In-domain GAN inversion for real image editing. In European conference on computer vision (pp. 592\u2013608). Springer.","DOI":"10.1007\/978-3-030-58520-4_35"}],"container-title":["International Journal of Computer Vision"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-024-02018-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11263-024-02018-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-024-02018-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,11]],"date-time":"2024-07-11T14:33:33Z","timestamp":1720708413000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11263-024-02018-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,3,13]]},"references-count":81,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2024,8]]}},"alternative-id":["2018"],"URL":"https:\/\/doi.org\/10.1007\/s11263-024-02018-6","relation":{},"ISSN":["0920-5691","1573-1405"],"issn-type":[{"value":"0920-5691","type":"print"},{"value":"1573-1405","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,3,13]]},"assertion":[{"value":"31 March 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 January 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 March 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}