{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,7]],"date-time":"2026-04-07T16:13:29Z","timestamp":1775578409527,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":39,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T00:00:00Z","timestamp":1665360000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,10]]},"DOI":"10.1145\/3503161.3547838","type":"proceedings-article","created":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T15:42:35Z","timestamp":1665416555000},"page":"2663-2671","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":69,"title":["MegaPortraits: One-shot Megapixel Neural Head Avatars"],"prefix":"10.1145","author":[{"given":"Nikita","family":"Drobyshev","sequence":"first","affiliation":[{"name":"Samsung AI Center, Moscow, Russian Fed."}]},{"given":"Jenya","family":"Chelishev","sequence":"additional","affiliation":[{"name":"Samsung AI Center, Moscow, Russian Fed."}]},{"given":"Taras","family":"Khakhulin","sequence":"additional","affiliation":[{"name":"Samsung AI Center & Skolkovo University of Science and Technology, Moscow, Russian Fed."}]},{"given":"Aleksei","family":"Ivakhnenko","sequence":"additional","affiliation":[{"name":"Samsung AI Center, Moscow, Russian Fed."}]},{"given":"Victor","family":"Lempitsky","sequence":"additional","affiliation":[{"name":"Yandex, Erevan, Armenia"}]},{"given":"Egor","family":"Zakharov","sequence":"additional","affiliation":[{"name":"Samsung AI Center & Skolkovo University of Science and Technology, Moscow, Russian 
Fed."}]}],"member":"320","published-online":{"date-parts":[[2022,10,10]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/311535.311556"},{"key":"e_1_3_2_2_2_1","volume-title":"Neural Head Reenactment with Latent Pose Descriptors. 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Burkov Egor","year":"2020","unstructured":"Egor Burkov , I. Pasechnik , Artur Grigorev , and Victor S. Lempitsky . 2020 . Neural Head Reenactment with Latent Pose Descriptors. 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) ( 2020 ), 13783--13792. Egor Burkov, I. Pasechnik, Artur Grigorev, and Victor S. Lempitsky. 2020. Neural Head Reenactment with Latent Pose Descriptors. 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2020), 13783--13792."},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"crossref","unstructured":"Joon Son Chung Arsha Nagrani and Andrew Zisserman. 2018. VoxCeleb2: Deep Speaker Recognition. In INTERSPEECH.  Joon Son Chung Arsha Nagrani and Andrew Zisserman. 2018. VoxCeleb2: Deep Speaker Recognition. In INTERSPEECH.","DOI":"10.21437\/Interspeech.2018-1929"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"crossref","unstructured":"Jia Deng Wei Dong Richard Socher Li-Jia Li K. Li and Li Fei-Fei. 2009. ImageNet: A large-scale hierarchical image database. In CVPR.  Jia Deng Wei Dong Richard Socher Li-Jia Li K. Li and Li Fei-Fei. 2009. ImageNet: A large-scale hierarchical image database. In CVPR.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_2_2_5_1","volume-title":"RetinaFace: Single-Shot Multi-Level Face Localisation in the Wild. 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Deng Jiankang","year":"2020","unstructured":"Jiankang Deng , J. Guo , Evangelos Ververas , Irene Kotsia , Stefanos Zafeiriou , and InsightFace FaceSoft . 2020 . RetinaFace: Single-Shot Multi-Level Face Localisation in the Wild. 
2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2020), 5202--5211. Jiankang Deng, J. Guo, Evangelos Ververas, Irene Kotsia, Stefanos Zafeiriou, and InsightFace FaceSoft. 2020. RetinaFace: Single-Shot Multi-Level Face Localisation in the Wild. 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2020), 5202--5211."},{"key":"e_1_3_2_2_6_1","volume-title":"HeadGAN: One-shot Neural Head Synthesis and Editing. 2021 IEEE\/CVF International Conference on Computer Vision (ICCV).","author":"Doukas Michail Christos","year":"2021","unstructured":"Michail Christos Doukas , Stefanos Zafeiriou , and Viktoriia Sharmanska . 2021 . HeadGAN: One-shot Neural Head Synthesis and Editing. 2021 IEEE\/CVF International Conference on Computer Vision (ICCV). Michail Christos Doukas, Stefanos Zafeiriou, and Viktoriia Sharmanska. 2021. HeadGAN: One-shot Neural Head Synthesis and Editing. 2021 IEEE\/CVF International Conference on Computer Vision (ICCV)."},{"key":"e_1_3_2_2_7_1","volume-title":"Hyung Jin Chang, and Y. Demiris","author":"Fischer Tobias","year":"2018","unstructured":"Tobias Fischer , Hyung Jin Chang, and Y. Demiris . 2018 . RT-GENE: Real- Time Eye Gaze Estimation in Natural Environments. In ECCV. Tobias Fischer, Hyung Jin Chang, and Y. Demiris. 2018. RT-GENE: Real-Time Eye Gaze Estimation in Natural Environments. In ECCV."},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00854"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00763"},{"key":"e_1_3_2_2_10_1","unstructured":"Sungjoo Ha Martin Kersner Beomsu Kim Seokjun Seo and Dongyoung Kim. 2020. MarioNETte: Few-shot Face Reenactment Preserving Identity of Unseen Targets. In AAAI.  Sungjoo Ha Martin Kersner Beomsu Kim Seokjun Seo and Dongyoung Kim. 2020. MarioNETte: Few-shot Face Reenactment Preserving Identity of Unseen Targets. 
In AAAI."},{"key":"e_1_3_2_2_11_1","unstructured":"Martin Heusel Hubert Ramsauer Thomas Unterthiner Bernhard Nessler and Sepp Hochreiter. 2017. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium. In Advances in Neural Information Processing Systems.  Martin Heusel Hubert Ramsauer Thomas Unterthiner Bernhard Nessler and Sepp Hochreiter. 2017. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium. In Advances in Neural Information Processing Systems."},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"crossref","unstructured":"Justin Johnson Alexandre Alahi and Li Fei-Fei. 2016. Perceptual Losses for Real-Time Style Transfer and Super-Resolution. In ECCV.  Justin Johnson Alexandre Alahi and Li Fei-Fei. 2016. Perceptual Losses for Real-Time Style Transfer and Super-Resolution. In ECCV.","DOI":"10.1007\/978-3-319-46475-6_43"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00453"},{"key":"e_1_3_2_2_14_1","volume-title":"Lau","author":"Ke Zhanghan","year":"2022","unstructured":"Zhanghan Ke , Jiayu Sun , Kaican Li , Qiong Yan , and Rynson W.H . Lau . 2022 . MODNet: Real- Time Trimap-Free Portrait Matting via Objective Decomposition. In AAAI. Zhanghan Ke, Jiayu Sun, Kaican Li, Qiong Yan, and Rynson W.H. Lau. 2022. MODNet: Real-Time Trimap-Free Portrait Matting via Objective Decomposition. In AAAI."},{"key":"e_1_3_2_2_15_1","first-page":"1","article-title":"Deep video portraits","volume":"37","author":"Kim Hyeongwoo","year":"2018","unstructured":"Hyeongwoo Kim , Pablo Garrido , Ayush Tewari , Weipeng Xu , Justus Thies , Matthias Nie\u00dfner , Patrick P\u00e9rez , Christian Richardt , Michael Zollh\u00f6fer , and Christian Theobalt . 2018 . Deep video portraits . ACM Transactions on Graphics (TOG) , Vol. 37 (2018), 1 -- 14 . 
Hyeongwoo Kim, Pablo Garrido, Ayush Tewari, Weipeng Xu, Justus Thies, Matthias Nie\u00dfner, Patrick P\u00e9rez, Christian Richardt, Michael Zollh\u00f6fer, and Christian Theobalt. 2018. Deep video portraits. ACM Transactions on Graphics (TOG), Vol. 37 (2018), 1 -- 14.","journal-title":"ACM Transactions on Graphics (TOG)"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3197517.3201401"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3306346.3323020"},{"key":"e_1_3_2_2_18_1","unstructured":"Ilya Loshchilov and Frank Hutter. 2019. Decoupled Weight Decay Regularization. In ICLR.  Ilya Loshchilov and Frank Hutter. 2019. Decoupled Weight Decay Regularization. In ICLR."},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"crossref","unstructured":"Ben Mildenhall Pratul P. Srinivasan Matthew Tancik Jonathan T. Barron Ravi Ramamoorthi and Ren Ng. 2020. NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. In ECCV.  Ben Mildenhall Pratul P. Srinivasan Matthew Tancik Jonathan T. Barron Ravi Ramamoorthi and Ren Ng. 2020. NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. In ECCV.","DOI":"10.1007\/978-3-030-58452-8_24"},{"key":"e_1_3_2_2_20_1","volume-title":"Nerfies: Deformable Neural Radiance Fields. 2021 IEEE\/CVF International Conference on Computer Vision (ICCV).","author":"Park Keunhong","year":"2021","unstructured":"Keunhong Park , U. Sinha , Jonathan T. Barron , Sofien Bouaziz , Dan B. Goldman , Steven M. Seitz , and Ricardo Martin-Brualla . 2021 a. Nerfies: Deformable Neural Radiance Fields. 2021 IEEE\/CVF International Conference on Computer Vision (ICCV). Keunhong Park, U. Sinha, Jonathan T. Barron, Sofien Bouaziz, Dan B. Goldman, Steven M. Seitz, and Ricardo Martin-Brualla. 2021a. Nerfies: Deformable Neural Radiance Fields. 
2021 IEEE\/CVF International Conference on Computer Vision (ICCV)."},{"key":"e_1_3_2_2_21_1","volume-title":"Seitz","author":"Park Keunhong","year":"2021","unstructured":"Keunhong Park , U. Sinha , Peter Hedman , Jonathan T. Barron , Sofien Bouaziz , Dan B. Goldman , Ricardo Martin-Brualla , and Steven M . Seitz . 2021 b. HyperNeRF: A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields. ArXiv . Keunhong Park, U. Sinha, Peter Hedman, Jonathan T. Barron, Sofien Bouaziz, Dan B. Goldman, Ricardo Martin-Brualla, and Steven M. Seitz. 2021b. HyperNeRF: A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields. ArXiv."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"crossref","unstructured":"Omkar M. Parkhi Andrea Vedaldi and Andrew Zisserman. 2015. Deep Face Recognition. In BMVC.  Omkar M. Parkhi Andrea Vedaldi and Andrew Zisserman. 2015. Deep Face Recognition. In BMVC.","DOI":"10.5244\/C.29.41"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00248"},{"key":"e_1_3_2_2_24_1","unstructured":"Aliaksandr Siarohin St\u00e9phane Lathuili\u00e8re S. Tulyakov Elisa Ricci and N. Sebe. 2019b. First Order Motion Model for Image Animation. ArXiv Vol. abs\/2003.00196 (2019).  Aliaksandr Siarohin St\u00e9phane Lathuili\u00e8re S. Tulyakov Elisa Ricci and N. Sebe. 2019b. First Order Motion Model for Image Animation. ArXiv Vol. abs\/2003.00196 (2019)."},{"key":"e_1_3_2_2_25_1","volume-title":"Very Deep Convolutional Networks for Large-Scale Image Recognition. CoRR","author":"Simonyan Karen","year":"2015","unstructured":"Karen Simonyan and Andrew Zisserman . 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. CoRR , Vol. abs\/ 1409 .1556 ( 2015 ). Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. CoRR, Vol. 
abs\/1409.1556 (2015)."},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00372"},{"key":"e_1_3_2_2_27_1","volume-title":"Resolution-robust Large Mask Inpainting with Fourier Convolutions. arXiv preprint arXiv:2109.07161","author":"Suvorov Roman","year":"2021","unstructured":"Roman Suvorov , Elizaveta Logacheva , Anton Mashikhin , Anastasia Remizova , Arsenii Ashukha , Aleksei Silvestrov , Naejin Kong , Harshith Goka , Kiwoong Park , and Victor Lempitsky . 2021. Resolution-robust Large Mask Inpainting with Fourier Convolutions. arXiv preprint arXiv:2109.07161 ( 2021 ). Roman Suvorov, Elizaveta Logacheva, Anton Mashikhin, Anastasia Remizova, Arsenii Ashukha, Aleksei Silvestrov, Naejin Kong, Harshith Goka, Kiwoong Park, and Victor Lempitsky. 2021. Resolution-robust Large Mask Inpainting with Fourier Convolutions. arXiv preprint arXiv:2109.07161 (2021)."},{"key":"e_1_3_2_2_28_1","volume-title":"Face2Face: real-time face capture and reenactment of RGB videos. ArXiv","author":"Thies Justus","year":"2019","unstructured":"Justus Thies , Michael Zollh\u00f6fer , Marc Stamminger , Christian Theobalt , and Matthias Nie\u00dfner . 2019. Face2Face: real-time face capture and reenactment of RGB videos. ArXiv , Vol. abs\/ 2007 .14808 ( 2019 ). Justus Thies, Michael Zollh\u00f6fer, Marc Stamminger, Christian Theobalt, and Matthias Nie\u00dfner. 2019. Face2Face: real-time face capture and reenactment of RGB videos. ArXiv, Vol. abs\/2007.14808 (2019)."},{"key":"e_1_3_2_2_29_1","volume-title":"CosFace: Large Margin Cosine Loss for Deep Face Recognition. 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Wang H.","year":"2018","unstructured":"H. Wang , Yitong Wang , Zheng Zhou , Xing Ji , Zhifeng Li , Dihong Gong , Jin Zhou , and Wenyu Liu . 2018 c. CosFace: Large Margin Cosine Loss for Deep Face Recognition. 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (2018), 5265--5274. H. 
Wang, Yitong Wang, Zheng Zhou, Xing Ji, Zhifeng Li, Dihong Gong, Jin Zhou, and Wenyu Liu. 2018c. CosFace: Large Margin Cosine Loss for Deep Face Recognition. 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (2018), 5265--5274."},{"key":"e_1_3_2_2_30_1","unstructured":"Ting-Chun Wang Ming-Yu Liu Jun-Yan Zhu Guilin Liu Andrew Tao Jan Kautz and Bryan Catanzaro. 2018a. Video-to-Video Synthesis. In Advances in Neural Information Processing Systems (NeurIPS).  Ting-Chun Wang Ming-Yu Liu Jun-Yan Zhu Guilin Liu Andrew Tao Jan Kautz and Bryan Catanzaro. 2018a. Video-to-Video Synthesis. In Advances in Neural Information Processing Systems (NeurIPS)."},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00917"},{"key":"e_1_3_2_2_32_1","volume-title":"One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing. 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Wang Ting-Chun","year":"2021","unstructured":"Ting-Chun Wang , Arun Mallya , and Ming-Yu Liu . 2021 . One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing. 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Ting-Chun Wang, Arun Mallya, and Ming-Yu Liu. 2021. One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing. 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2003.819861"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"crossref","unstructured":"Gengshan Yang Minh Vo Natalia Neverova Deva Ramanan Andrea Vedaldi and Hanbyul Joo. 2021. BANMo: Building Animatable 3D Neural Models from Many Casual Videos. ArXiv.  Gengshan Yang Minh Vo Natalia Neverova Deva Ramanan Andrea Vedaldi and Hanbyul Joo. 2021. BANMo: Building Animatable 3D Neural Models from Many Casual Videos. 
ArXiv.","DOI":"10.1109\/CVPR52688.2022.00288"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3413965"},{"key":"e_1_3_2_2_36_1","volume-title":"Lempitsky","author":"Zakharov Egor","year":"2020","unstructured":"Egor Zakharov , Aleksei Ivakhnenko , Aliaksandra Shysheya , and Victor S . Lempitsky . 2020 . Fast Bi-layer Neural Synthesis of One-Shot Realistic Head Avatars. In ECCV. Egor Zakharov, Aleksei Ivakhnenko, Aliaksandra Shysheya, and Victor S. Lempitsky. 2020. Fast Bi-layer Neural Synthesis of One-Shot Realistic Head Avatars. In ECCV."},{"key":"e_1_3_2_2_37_1","volume-title":"Few-Shot Adversarial Learning of Realistic Neural Talking Head Models. 2019 IEEE\/CVF International Conference on Computer Vision (ICCV).","author":"Zakharov Egor","unstructured":"Egor Zakharov , Aliaksandra Shysheya , Egor Burkov , and Victor S. Lempitsky . 2019 . Few-Shot Adversarial Learning of Realistic Neural Talking Head Models. 2019 IEEE\/CVF International Conference on Computer Vision (ICCV). Egor Zakharov, Aliaksandra Shysheya, Egor Burkov, and Victor S. Lempitsky. 2019. Few-Shot Adversarial Learning of Realistic Neural Talking Head Models. 2019 IEEE\/CVF International Conference on Computer Vision (ICCV)."},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00068"},{"key":"e_1_3_2_2_39_1","volume-title":"Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. 2017 IEEE International Conference on Computer Vision (ICCV)","author":"Zhu Jun-Yan","year":"2017","unstructured":"Jun-Yan Zhu , Taesung Park , Phillip Isola , and Alexei A. Efros . 2017 . Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. 2017 IEEE International Conference on Computer Vision (ICCV) ( 2017 ), 2242--2251. Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A. Efros. 2017. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. 
2017 IEEE International Conference on Computer Vision (ICCV) (2017), 2242--2251."}],"event":{"name":"MM '22: The 30th ACM International Conference on Multimedia","location":"Lisboa Portugal","acronym":"MM '22","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 30th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3547838","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3503161.3547838","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:35Z","timestamp":1750186955000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3547838"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,10]]},"references-count":39,"alternative-id":["10.1145\/3503161.3547838","10.1145\/3503161"],"URL":"https:\/\/doi.org\/10.1145\/3503161.3547838","relation":{},"subject":[],"published":{"date-parts":[[2022,10,10]]},"assertion":[{"value":"2022-10-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}