{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,28]],"date-time":"2026-01-28T03:26:33Z","timestamp":1769570793474,"version":"3.49.0"},"reference-count":88,"publisher":"Association for Computing Machinery (ACM)","issue":"1s","license":[{"start":{"date-parts":[[2021,1,31]],"date-time":"2021-01-31T00:00:00Z","timestamp":1612051200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2021,1,31]]},"abstract":"<jats:p>Current face spoof detection schemes mainly rely on physiological cues such as eye blinking, mouth movements, and micro-expression changes, or textural attributes of the face images [9]. But none of these methods represent a viable mechanism for makeup-induced spoofing, especially since makeup has been widely used. Compared with face alteration techniques such as plastic surgery, makeup is non-permanent and cost efficient, which makes makeup-induced spoofing become a realistic threat to the integrity of a face recognition system. To solve this problem, we propose a generative model to construct spoofing face images (confusing face images) for improving the accuracy and robustness of automatic face recognition. Our network structure is composed of two separate parts, with one using inter-attention mechanism to obtain interested face region, and another using intra-attention to translate imitation style with preserving imitation style-excluding details. These two attention mechanisms can precisely learn imitation style, where inter-attention pays more attention to imitation regions of image and intra-attention learns face attributes with long distance in image. To effectively discriminate generated images, we introduce an imitation style discriminator. Our model (SPGAN) generates face images that transfer the imitation style from target to subject image and preserve the imitation-excluding features. Experimental results demonstrate the performance of our model in improving quality of imitated face images.<\/jats:p>","DOI":"10.1145\/3432817","type":"journal-article","created":{"date-parts":[[2021,4,1]],"date-time":"2021-04-01T01:53:55Z","timestamp":1617242035000},"page":"1-20","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":9,"title":["SPGAN: Face Forgery Using Spoofing Generative Adversarial Networks"],"prefix":"10.1145","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2965-6196","authenticated-orcid":false,"given":"Yidong","family":"Li","sequence":"first","affiliation":[{"name":"Beijing Jiaotong University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wenhua","family":"Liu","sequence":"additional","affiliation":[{"name":"Beijing Jiaotong University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8408-3816","authenticated-orcid":false,"given":"Yi","family":"Jin","sequence":"additional","affiliation":[{"name":"Beijing Jiaotong University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yuanzhouhan","family":"Cao","sequence":"additional","affiliation":[{"name":"Beijing Jiaotong University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,3,31]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/IJCB.2011.6117503"},{"key":"e_1_2_1_2_1","unstructured":"Martin Arjovsky Soumith Chintala and L\u00e9on Bottou. 2017. Wasserstein GAN. In ICML. DOI:https:\/\/doi.org\/arXiv:1701.07875  Martin Arjovsky Soumith Chintala and L\u00e9on Bottou. 2017. Wasserstein GAN. In ICML. DOI:https:\/\/doi.org\/arXiv:1701.07875"},{"key":"#cr-split#-e_1_2_1_3_1.1","doi-asserted-by":"crossref","unstructured":"Konstantinos Bousmalis Nathan Silberman David Dohan Dumitru Erhan and Dilip Krishnan. 2017. Unsupervised pixel-level domain adaptation with generative adversarial networks. In CVPR. DOI:https:\/\/doi.org\/10.1109\/CVPR.2017.18 10.1109\/CVPR.2017.18","DOI":"10.1109\/CVPR.2017.18"},{"key":"#cr-split#-e_1_2_1_3_1.2","doi-asserted-by":"crossref","unstructured":"Konstantinos Bousmalis Nathan Silberman David Dohan Dumitru Erhan and Dilip Krishnan. 2017. Unsupervised pixel-level domain adaptation with generative adversarial networks. In CVPR. DOI:https:\/\/doi.org\/10.1109\/CVPR.2017.18","DOI":"10.1109\/CVPR.2017.18"},{"key":"#cr-split#-e_1_2_1_4_1.1","doi-asserted-by":"crossref","unstructured":"Huiwen Chang Jingwan Lu Fisher Yu and Adam Finkelstein. 2018. PairedCycleGAN asymmetric style transfer for applying and removing makeup. In CVPR. 40-48. DOI:https:\/\/doi.org\/10.1109\/CVPR.2018.00012 10.1109\/CVPR.2018.00012","DOI":"10.1109\/CVPR.2018.00012"},{"key":"#cr-split#-e_1_2_1_4_1.2","doi-asserted-by":"crossref","unstructured":"Huiwen Chang Jingwan Lu Fisher Yu and Adam Finkelstein. 2018. PairedCycleGAN asymmetric style transfer for applying and removing makeup. In CVPR. 40-48. DOI:https:\/\/doi.org\/10.1109\/CVPR.2018.00012","DOI":"10.1109\/CVPR.2018.00012"},{"key":"e_1_2_1_5_1","volume-title":"Yoshua Bengio, and Wenjie Li.","author":"Che Tong","year":"2017","unstructured":"Tong Che , Yanran Li , Athul Paul Jacob , Yoshua Bengio, and Wenjie Li. 2017 . Mode regularized generative adversarial networks. In ICLR. DOI :https:\/\/doi.org\/arXiv:1612.02136 Tong Che, Yanran Li, Athul Paul Jacob, Yoshua Bengio, and Wenjie Li. 2017. Mode regularized generative adversarial networks. In ICLR. DOI:https:\/\/doi.org\/arXiv:1612.02136"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2015.09.005"},{"key":"e_1_2_1_7_1","doi-asserted-by":"crossref","unstructured":"Li Chen Kun Zhou and Stephen Lin. 2015. Simulating makeup through physics-based manipulation of intrinsic image layers. In CVPR.  Li Chen Kun Zhou and Stephen Lin. 2015. Simulating makeup through physics-based manipulation of intrinsic image layers. In CVPR.","DOI":"10.1109\/CVPR.2015.7299093"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2017.2699184"},{"key":"e_1_2_1_9_1","volume-title":"Texture deformation based generative adversarial networks for face editing. ArXiv Preprint ArXiv:1812.09832","author":"Chen WenTing","year":"2018","unstructured":"WenTing Chen , Xinpeng Xie , Xi Jia , and Linlin Shen . 2018. Texture deformation based generative adversarial networks for face editing. ArXiv Preprint ArXiv:1812.09832 ( 2018 ). WenTing Chen, Xinpeng Xie, Xi Jia, and Linlin Shen. 2018. Texture deformation based generative adversarial networks for face editing. ArXiv Preprint ArXiv:1812.09832 (2018)."},{"key":"#cr-split#-e_1_2_1_10_1.1","doi-asserted-by":"crossref","unstructured":"Jianpeng Cheng Dong Li and Mirella Lapata. 2016. Long short-term memory-networks for machine reading. In EMNLP. 551-561. DOI:https:\/\/doi.org\/10.18653\/v1\/D16-1053 10.18653\/v1","DOI":"10.18653\/v1\/D16-1053"},{"key":"#cr-split#-e_1_2_1_10_1.2","doi-asserted-by":"crossref","unstructured":"Jianpeng Cheng Dong Li and Mirella Lapata. 2016. Long short-term memory-networks for machine reading. In EMNLP. 551-561. DOI:https:\/\/doi.org\/10.18653\/v1\/D16-1053","DOI":"10.18653\/v1\/D16-1053"},{"key":"e_1_2_1_11_1","volume-title":"Stimulus-driven and concept-driven analysis for image caption generation. Neurocomputing","author":"Ding Songtao","year":"2019","unstructured":"Songtao Ding , Shiru Qu , Yuling Xi , and Shaohua Wan . 2019. Stimulus-driven and concept-driven analysis for image caption generation. Neurocomputing ( 2019 ), 520\u2013530. DOI:https:\/\/doi.org\/10.1016\/j.neucom.2019.04.095 10.1016\/j.neucom.2019.04.095 Songtao Ding, Shiru Qu, Yuling Xi, and Shaohua Wan. 2019. Stimulus-driven and concept-driven analysis for image caption generation. Neurocomputing (2019), 520\u2013530. DOI:https:\/\/doi.org\/10.1016\/j.neucom.2019.04.095"},{"key":"#cr-split#-e_1_2_1_12_1.1","doi-asserted-by":"crossref","unstructured":"Harshala Gammulle Tharindu Fernando Simon Denman Sridha Sridharan and Clinton Fookes. 2019. Coupled generative adversarial network for continuous fine-grained action segmentation. In WACV. 200-209. DOI:https:\/\/doi.org\/10.1109\/wacv.2019.00027 10.1109\/wacv.2019.00027","DOI":"10.1109\/WACV.2019.00027"},{"key":"#cr-split#-e_1_2_1_12_1.2","doi-asserted-by":"crossref","unstructured":"Harshala Gammulle Tharindu Fernando Simon Denman Sridha Sridharan and Clinton Fookes. 2019. Coupled generative adversarial network for continuous fine-grained action segmentation. In WACV. 200-209. DOI:https:\/\/doi.org\/10.1109\/wacv.2019.00027","DOI":"10.1109\/WACV.2019.00027"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3377876"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.5555\/3295222.3295327"},{"key":"e_1_2_1_15_1","volume-title":"Unsupervised image-to-image translation with generative adversarial networks. arXiv:CVPR","author":"Hao Dong","year":"2017","unstructured":"Dong Hao , Neekhara Paarth , and Wu Chao . 2017. Unsupervised image-to-image translation with generative adversarial networks. arXiv:CVPR ( 2017 ). DOI:https:\/\/doi.org\/arXiv:1701.02676 Dong Hao, Neekhara Paarth, and Wu Chao. 2017. Unsupervised image-to-image translation with generative adversarial networks. arXiv:CVPR (2017). DOI:https:\/\/doi.org\/arXiv:1701.02676"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.5555\/3045118.3045167"},{"key":"e_1_2_1_17_1","unstructured":"T. Darrell J. Long and E. Shelhamer. 2015. Fully convolutional networks for semantic segmentation. In CVPR. 3431\u20133440.  T. Darrell J. Long and E. Shelhamer. 2015. Fully convolutional networks for semantic segmentation. In CVPR. 3431\u20133440."},{"key":"#cr-split#-e_1_2_1_18_1.1","doi-asserted-by":"crossref","unstructured":"Justin Johnson Alexandre Alahi and Li Fei Fei. 2016. Perceptual losses for real-time style transfer and super-resolution. In ECCV. 694-711. DOI:https:\/\/doi.org\/10.1007\/978-3-319-46475-6_43 10.1007\/978-3-319-46475-6_43","DOI":"10.1007\/978-3-319-46475-6_43"},{"key":"#cr-split#-e_1_2_1_18_1.2","doi-asserted-by":"crossref","unstructured":"Justin Johnson Alexandre Alahi and Li Fei Fei. 2016. Perceptual losses for real-time style transfer and super-resolution. In ECCV. 694-711. DOI:https:\/\/doi.org\/10.1007\/978-3-319-46475-6_43","DOI":"10.1007\/978-3-319-46475-6_43"},{"key":"#cr-split#-e_1_2_1_19_1.1","doi-asserted-by":"crossref","unstructured":"Zhu Jun Yan Park Taesung and Isola Phillip. 2017. Unpaired image-to-image translation using cycle-consistent adversarial networks. In ICCV. 2242-2251. DOI:https:\/\/doi.org\/10.1109\/iccv.2017.244 10.1109\/iccv.2017.244","DOI":"10.1109\/ICCV.2017.244"},{"key":"#cr-split#-e_1_2_1_19_1.2","doi-asserted-by":"crossref","unstructured":"Zhu Jun Yan Park Taesung and Isola Phillip. 2017. Unpaired image-to-image translation using cycle-consistent adversarial networks. In ICCV. 2242-2251. DOI:https:\/\/doi.org\/10.1109\/iccv.2017.244","DOI":"10.1109\/ICCV.2017.244"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2018.01.002"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1167\/16.12.326"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.5555\/3295222.3295346"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.5555\/3045390.3045555"},{"key":"#cr-split#-e_1_2_1_24_1.1","doi-asserted-by":"crossref","unstructured":"Christian Ledig Lucas Theis and Huszar. 2017. Photo-realistic single image super-resolution using a generative adversarial network. In CVPR. 105-114. DOI:https:\/\/doi.org\/10.1109\/CVPR.2017.19 10.1109\/CVPR.2017.19","DOI":"10.1109\/CVPR.2017.19"},{"key":"#cr-split#-e_1_2_1_24_1.2","doi-asserted-by":"crossref","unstructured":"Christian Ledig Lucas Theis and Huszar. 2017. Photo-realistic single image super-resolution using a generative adversarial network. In CVPR. 105-114. DOI:https:\/\/doi.org\/10.1109\/CVPR.2017.19","DOI":"10.1109\/CVPR.2017.19"},{"key":"#cr-split#-e_1_2_1_25_1.1","doi-asserted-by":"crossref","unstructured":"Christian Ledig Lucas Theis Ferenc Huszar Jose Caballero Andrew Cunningham Alejandro Acosta Andrew Aitken Alykhan Tejani Johannes Totz and Zehan Wang. 2017. Photo-realistic single image super-resolution using a generative adversarial network. In CVPR. 105-114. DOI:https:\/\/doi.org\/10.1109\/CVPR.2017.19 10.1109\/CVPR.2017.19","DOI":"10.1109\/CVPR.2017.19"},{"key":"#cr-split#-e_1_2_1_25_1.2","doi-asserted-by":"crossref","unstructured":"Christian Ledig Lucas Theis Ferenc Huszar Jose Caballero Andrew Cunningham Alejandro Acosta Andrew Aitken Alykhan Tejani Johannes Totz and Zehan Wang. 2017. Photo-realistic single image super-resolution using a generative adversarial network. In CVPR. 105-114. DOI:https:\/\/doi.org\/10.1109\/CVPR.2017.19","DOI":"10.1109\/CVPR.2017.19"},{"key":"e_1_2_1_26_1","unstructured":"Lingzhi Li Jianmin Bao Yang Hao Dong Chen and Fang Wen. 2019. FaceShifter: Towards high fidelity and occlusion aware face swapping. In CVPR.  Lingzhi Li Jianmin Bao Yang Hao Dong Chen and Fang Wen. 2019. FaceShifter: Towards high fidelity and occlusion aware face swapping. In CVPR."},{"key":"#cr-split#-e_1_2_1_27_1.1","doi-asserted-by":"crossref","unstructured":"Minjun Li Haozhi Huang Lin Ma Wei Liu Tong Zhang and Yu-Gang Jiang. 2018. Unsupervised image-to-image translation with stacked cycle-consistent adversarial networks. In ECCV. 184-199. DOI:https:\/\/doi.org\/10.1007\/978-3-030-01240-3_12 10.1007\/978-3-030-01240-3_12","DOI":"10.1007\/978-3-030-01240-3_12"},{"key":"#cr-split#-e_1_2_1_27_1.2","doi-asserted-by":"crossref","unstructured":"Minjun Li Haozhi Huang Lin Ma Wei Liu Tong Zhang and Yu-Gang Jiang. 2018. Unsupervised image-to-image translation with stacked cycle-consistent adversarial networks. In ECCV. 184-199. DOI:https:\/\/doi.org\/10.1007\/978-3-030-01240-3_12","DOI":"10.1007\/978-3-030-01240-3_12"},{"key":"e_1_2_1_28_1","volume-title":"Convolutional network for attribute-driven and identity-preserving human face generation. Preprint arXiv 1608.06434","author":"Li Mu","year":"2016","unstructured":"Mu Li , Wangmeng Zuo , and David Zhang . 2016. Convolutional network for attribute-driven and identity-preserving human face generation. Preprint arXiv 1608.06434 ( 2016 ). Mu Li, Wangmeng Zuo, and David Zhang. 2016. Convolutional network for attribute-driven and identity-preserving human face generation. Preprint arXiv 1608.06434 (2016)."},{"key":"e_1_2_1_29_1","volume-title":"Deep identity aware transfer of facial attributes. Preprint arXiv 1610.05586","author":"Li Mu","year":"2016","unstructured":"Mu Li , Wangmeng Zuo , and David Zhang . 2016. Deep identity aware transfer of facial attributes. Preprint arXiv 1610.05586 ( 2016 ). Mu Li, Wangmeng Zuo, and David Zhang. 2016. Deep identity aware transfer of facial attributes. Preprint arXiv 1610.05586 (2016)."},{"key":"e_1_2_1_30_1","volume-title":"Yu Mo, and Yoshua Bengio.","author":"Lin Zhouhan","year":"2017","unstructured":"Zhouhan Lin , Minwei Feng , Cicero Nogueira Dos Santos , Yu Mo, and Yoshua Bengio. 2017 . A structured self-attentive sentence embedding. Preprint arXiv:1703.03130 (2017). Zhouhan Lin, Minwei Feng, Cicero Nogueira Dos Santos, Yu Mo, and Yoshua Bengio. 2017. A structured self-attentive sentence embedding. Preprint arXiv:1703.03130 (2017)."},{"key":"e_1_2_1_31_1","first-page":"10080","volume-title":"Neural Proc. Lett.","author":"Liu Xiaowei","year":"2019","unstructured":"Xiaowei Liu , Kenli Li , and Keqin Li . 2019 . Attentive semantic and perceptual faces completion using self-attention generative adversarial networks . Neural Proc. Lett. (2019). DOI:https:\/\/doi.org\/10.1007\/s11063-019- 10080 - 10082 10.1007\/s11063-019-10080-2 Xiaowei Liu, Kenli Li, and Keqin Li. 2019. Attentive semantic and perceptual faces completion using self-attention generative adversarial networks. Neural Proc. Lett. (2019). DOI:https:\/\/doi.org\/10.1007\/s11063-019-10080-2"},{"key":"#cr-split#-e_1_2_1_32_1.1","doi-asserted-by":"crossref","unstructured":"Chen Long Hanwang Zhang Jun Xiao Liqiang Nie and Tat Seng Chua. 2017. SCA-CNN: Spatial and channel-wise attention in convolutional networks for image captioning. In CVPR. DOI:https:\/\/doi.org\/10.1109\/CVPR.2017.667 10.1109\/CVPR.2017.667","DOI":"10.1109\/CVPR.2017.667"},{"key":"#cr-split#-e_1_2_1_32_1.2","doi-asserted-by":"crossref","unstructured":"Chen Long Hanwang Zhang Jun Xiao Liqiang Nie and Tat Seng Chua. 2017. SCA-CNN: Spatial and channel-wise attention in convolutional networks for image captioning. In CVPR. DOI:https:\/\/doi.org\/10.1109\/CVPR.2017.667","DOI":"10.1109\/CVPR.2017.667"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2018.02.092"},{"key":"e_1_2_1_34_1","unstructured":"Luke Metz Ben Poole David Pfau and Jascha Sohl-Dickstein. 2017. Unrolled generative adversarial networks. In ICLR. DOI:https:\/\/doi.org\/arXiv:1611.02163  Luke Metz Ben Poole David Pfau and Jascha Sohl-Dickstein. 2017. Unrolled generative adversarial networks. In ICLR. DOI:https:\/\/doi.org\/arXiv:1611.02163"},{"key":"e_1_2_1_35_1","volume-title":"Conditional generative adversarial nets. Preprint arXiv 1411.1784","author":"Mirza Mehdi","year":"2014","unstructured":"Mehdi Mirza and Simon Osindero . 2014. Conditional generative adversarial nets. Preprint arXiv 1411.1784 ( 2014 ). Mehdi Mirza and Simon Osindero. 2014. Conditional generative adversarial nets. Preprint arXiv 1411.1784 (2014)."},{"key":"e_1_2_1_36_1","volume-title":"Spectral normalization for generative adversarial networks. Preprint ArXiv","author":"Miyato Takeru","year":"2018","unstructured":"Takeru Miyato , Toshiki Kataoka , Masanori Koyama , and Yuichi Yoshida . 2018. Spectral normalization for generative adversarial networks. Preprint ArXiv ( 2018 ). DOI:https:\/\/doi.org\/arXiv:1802.05957 Takeru Miyato, Toshiki Kataoka, Masanori Koyama, and Yuichi Yoshida. 2018. Spectral normalization for generative adversarial networks. Preprint ArXiv (2018). DOI:https:\/\/doi.org\/arXiv:1802.05957"},{"key":"e_1_2_1_37_1","doi-asserted-by":"crossref","unstructured":"Thomas Brox Olaf Ronneberger and Philipp Fischer. 2015. U-Net: Convolutional networks for biomedical image segmentation. In MICCAI.  Thomas Brox Olaf Ronneberger and Philipp Fischer. 2015. U-Net: Convolutional networks for biomedical image segmentation. In MICCAI.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"e_1_2_1_38_1","unstructured":"Andrew Zisserman Omkar M. Parkhi and Andrea Vedaldi. 2015. Deep face recognition. In BMVC.  Andrew Zisserman Omkar M. Parkhi and Andrea Vedaldi. 2015. Deep face recognition. In BMVC."},{"key":"#cr-split#-e_1_2_1_39_1.1","doi-asserted-by":"crossref","unstructured":"Ankur P. Parikh Oscar Tckstrm Dipanjan Das and Jakob Uszkoreit. 2016. A decomposable attention model for natural language inference. In EMNLP. 2249-2255. DOI:https:\/\/doi.org\/10.18653\/v1\/D16-1244 10.18653\/v1","DOI":"10.18653\/v1\/D16-1244"},{"key":"#cr-split#-e_1_2_1_39_1.2","doi-asserted-by":"crossref","unstructured":"Ankur P. Parikh Oscar Tckstrm Dipanjan Das and Jakob Uszkoreit. 2016. A decomposable attention model for natural language inference. In EMNLP. 2249-2255. DOI:https:\/\/doi.org\/10.18653\/v1\/D16-1244","DOI":"10.18653\/v1\/D16-1244"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIFS.2016.2578288"},{"key":"e_1_2_1_41_1","volume-title":"A deep reinforced model for abstractive summarization. ArXiv Preprint ArXiv:1705.04304","author":"Paulus Romain","year":"2017","unstructured":"Romain Paulus , Caiming Xiong , and Richard Socher . 2017. A deep reinforced model for abstractive summarization. ArXiv Preprint ArXiv:1705.04304 ( 2017 ). Romain Paulus, Caiming Xiong, and Richard Socher. 2017. A deep reinforced model for abstractive summarization. ArXiv Preprint ArXiv:1705.04304 (2017)."},{"key":"e_1_2_1_42_1","volume-title":"\u00c1lvarez","author":"Perarnau Guim","year":"2016","unstructured":"Guim Perarnau , Joost van de Weijer , Bogdan Raducanu , and Jose M . \u00c1lvarez 2016 . Invertible conditional GANs for image editing. arXiv preprint arXiv 1611.06355 (2016). Guim Perarnau, Joost van de Weijer, Bogdan Raducanu, and Jose M. \u00c1lvarez 2016. Invertible conditional GANs for image editing. arXiv preprint arXiv 1611.06355 (2016)."},{"key":"#cr-split#-e_1_2_1_43_1.1","doi-asserted-by":"crossref","unstructured":"Albert Pumarola Antonio Agudo Aleix M. Martinez Alberto Sanfeliu and Francesc Moreno-Noguer. 2018. GANimation: Anatomically-aware facial animation from a single image. In ECCV. 835-851. DOI:https:\/\/doi.org\/10.1007\/978-3-030-01249-6_50 10.1007\/978-3-030-01249-6_50","DOI":"10.1007\/978-3-030-01249-6_50"},{"key":"#cr-split#-e_1_2_1_43_1.2","doi-asserted-by":"crossref","unstructured":"Albert Pumarola Antonio Agudo Aleix M. Martinez Alberto Sanfeliu and Francesc Moreno-Noguer. 2018. GANimation: Anatomically-aware facial animation from a single image. In ECCV. 835-851. DOI:https:\/\/doi.org\/10.1007\/978-3-030-01249-6_50","DOI":"10.1007\/978-3-030-01249-6_50"},{"key":"#cr-split#-e_1_2_1_44_1.1","doi-asserted-by":"crossref","unstructured":"Weichao Qiu and Alan Yuille. 2016. UnrealCV: Connecting computer vision to unreal engine. In ECCV. 909-916. DOI:https:\/\/doi.org\/10.1007\/978-3-319-49409-8_75 10.1007\/978-3-319-49409-8_75","DOI":"10.1007\/978-3-319-49409-8_75"},{"key":"#cr-split#-e_1_2_1_44_1.2","doi-asserted-by":"crossref","unstructured":"Weichao Qiu and Alan Yuille. 2016. UnrealCV: Connecting computer vision to unreal engine. In ECCV. 909-916. DOI:https:\/\/doi.org\/10.1007\/978-3-319-49409-8_75","DOI":"10.1007\/978-3-319-49409-8_75"},{"key":"e_1_2_1_45_1","unstructured":"Alec Radford Luke Metz and Soumith Chintala. 2016. Unsupervised representation learning with deep convolutional generative adversarial networks. Preprint ArXiv. DOI:https:\/\/doi.org\/arXiv:1511.06434  Alec Radford Luke Metz and Soumith Chintala. 2016. Unsupervised representation learning with deep convolutional generative adversarial networks. Preprint ArXiv. DOI:https:\/\/doi.org\/arXiv:1511.06434"},{"key":"#cr-split#-e_1_2_1_46_1.1","doi-asserted-by":"crossref","unstructured":"Stephan R. Richter Vibhav Vineet Stefan Roth and Vladlen Koltun. 2016. Playing for data: Ground truth from computer games. In ECCV. 102-118. DOI:https:\/\/doi.org\/10.1007\/978-3-319-46475-6_7 10.1007\/978-3-319-46475-6_7","DOI":"10.1007\/978-3-319-46475-6_7"},{"key":"#cr-split#-e_1_2_1_46_1.2","doi-asserted-by":"crossref","unstructured":"Stephan R. Richter Vibhav Vineet Stefan Roth and Vladlen Koltun. 2016. Playing for data: Ground truth from computer games. In ECCV. 102-118. DOI:https:\/\/doi.org\/10.1007\/978-3-319-46475-6_7","DOI":"10.1007\/978-3-319-46475-6_7"},{"key":"e_1_2_1_47_1","doi-asserted-by":"crossref","unstructured":"Shao Rui Xiangyuan Lan Jiawei Li and Yuen Pong. 2019. Multi-adversarial discriminative deep domain generalization for face presentation attack detection. In CVPR. 10023\u201310031.  Shao Rui Xiangyuan Lan Jiawei Li and Yuen Pong. 2019. Multi-adversarial discriminative deep domain generalization for face presentation attack detection. In CVPR. 10023\u201310031.","DOI":"10.1109\/CVPR.2019.01026"},{"key":"e_1_2_1_48_1","volume-title":"Sim-to-real robot learning from pixels with progressive nets. Preprint arXiv:1809.07480","author":"Rusu Andrei A.","year":"2016","unstructured":"Andrei A. Rusu , Matej Vecerik , Thomas Rothorl , Nicolas Heess , Razvan Pascanu , and Raia Hadsell . 2016. Sim-to-real robot learning from pixels with progressive nets. Preprint arXiv:1809.07480 ( 2016 ). Andrei A. Rusu, Matej Vecerik, Thomas Rothorl, Nicolas Heess, Razvan Pascanu, and Raia Hadsell. 2016. Sim-to-real robot learning from pixels with progressive nets. Preprint arXiv:1809.07480 (2016)."},{"key":"e_1_2_1_49_1","unstructured":"Tim Salimans Han Zhang Alec Radford and Dimitris Metaxas. 2018. Improving GANs using optimal transport. In ICLR. DOI:https:\/\/arxiv.org\/abs\/1803.05573.  Tim Salimans Han Zhang Alec Radford and Dimitris Metaxas. 2018. Improving GANs using optimal transport. In ICLR. DOI:https:\/\/arxiv.org\/abs\/1803.05573."},{"key":"e_1_2_1_50_1","volume-title":"Generative adversarial networks. Preprint arXiv 1406.2661","author":"Salvaris Mathew","year":"2018","unstructured":"Mathew Salvaris , Danielle Dean , and Wee Hyong Tok . 2018. Generative adversarial networks. Preprint arXiv 1406.2661 ( 2018 ). Mathew Salvaris, Danielle Dean, and Wee Hyong Tok. 2018. Generative adversarial networks. Preprint arXiv 1406.2661 (2018)."},{"key":"#cr-split#-e_1_2_1_51_1.1","doi-asserted-by":"crossref","unstructured":"Wei Shen and Rujie Liu. 2017. Learning residual images for face attribute manipulation. In CVPR. 1225-1233. DOI:https:\/\/doi.org\/10.1109\/cvpr.2017.135 10.1109\/cvpr.2017.135","DOI":"10.1109\/CVPR.2017.135"},{"key":"#cr-split#-e_1_2_1_51_1.2","doi-asserted-by":"crossref","unstructured":"Wei Shen and Rujie Liu. 2017. Learning residual images for face attribute manipulation. In CVPR. 1225-1233. DOI:https:\/\/doi.org\/10.1109\/cvpr.2017.135","DOI":"10.1109\/CVPR.2017.135"},{"key":"e_1_2_1_52_1","unstructured":"Casper Kaae Sonderby Jose Caballero Lucas Theis and Wenzhe Shi. 2017. Amortised MAP inference for image super-resolution. In ICLR.  Casper Kaae Sonderby Jose Caballero Lucas Theis and Wenzhe Shi. 2017. Amortised MAP inference for image super-resolution. In ICLR."},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240508.3240618"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/PG.2007.21"},{"key":"e_1_2_1_55_1","volume-title":"Towards adapting deep visuomotor representations from simulated to real environments. arXiv preprint arXiv 1511.07111","author":"Tzeng Eric","year":"2016","unstructured":"Eric Tzeng , Coline Devin , Judy Hoffman , Chelsea Finn , and Xingchao Peng . 2016. Towards adapting deep visuomotor representations from simulated to real environments. arXiv preprint arXiv 1511.07111 ( 2016 ). Eric Tzeng, Coline Devin, Judy Hoffman, Chelsea Finn, and Xingchao Peng. 2016. Towards adapting deep visuomotor representations from simulated to real environments. arXiv preprint arXiv 1511.07111 (2016)."},{"key":"#cr-split#-e_1_2_1_56_1.1","doi-asserted-by":"crossref","unstructured":"Paul Upchurch Jacob Gardner Geoff Pleiss Robert Pless Noah Snavely Kavita Bala and Kilian Weinberger. 2017. Deep feature interpolation for image content changes. In CVPR. 6090-6099. DOI:https:\/\/doi.org\/10.1109\/CVPR.2017.645 10.1109\/CVPR.2017.645","DOI":"10.1109\/CVPR.2017.645"},{"key":"#cr-split#-e_1_2_1_56_1.2","doi-asserted-by":"crossref","unstructured":"Paul Upchurch Jacob Gardner Geoff Pleiss Robert Pless Noah Snavely Kavita Bala and Kilian Weinberger. 2017. Deep feature interpolation for image content changes. In CVPR. 6090-6099. DOI:https:\/\/doi.org\/10.1109\/CVPR.2017.645","DOI":"10.1109\/CVPR.2017.645"},{"key":"e_1_2_1_57_1","first-page":"1","article-title":"Automated colorization of a grayscale image with seed points propagation","volume":"99","author":"Wan Shaohua","year":"2020","unstructured":"Shaohua Wan , Yu Xia , Lianyong Qi , Yee Hong Yang , and Mohammed Atiquzzaman . 2020 . Automated colorization of a grayscale image with seed points propagation . IEEE Trans. Multimedia 99 (2020), 1 \u2013 1 . DOI:https:\/\/doi.org\/10.1109\/TMM.2020.2976573 10.1109\/TMM.2020.2976573 Shaohua Wan, Yu Xia, Lianyong Qi, Yee Hong Yang, and Mohammed Atiquzzaman. 2020. Automated colorization of a grayscale image with seed points propagation. IEEE Trans. Multimedia 99 (2020), 1\u20131. DOI:https:\/\/doi.org\/10.1109\/TMM.2020.2976573","journal-title":"IEEE Trans. Multimedia"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.5555\/3015812.3015821"},{"key":"#cr-split#-e_1_2_1_59_1.1","doi-asserted-by":"crossref","unstructured":"E. Kolve Y. Zhu and R. Mottaghi. 2017. Target-driven visual navigation in indoor scenes using deep reinforcement learning. In ICRA. 3357-3364. DOI:https:\/\/doi.org\/10.1109\/ICRA.2017.7989381 10.1109\/ICRA.2017.7989381","DOI":"10.1109\/ICRA.2017.7989381"},{"key":"#cr-split#-e_1_2_1_59_1.2","doi-asserted-by":"crossref","unstructured":"E. Kolve Y. Zhu and R. Mottaghi. 2017. Target-driven visual navigation in indoor scenes using deep reinforcement learning. In ICRA. 3357-3364. DOI:https:\/\/doi.org\/10.1109\/ICRA.2017.7989381","DOI":"10.1109\/ICRA.2017.7989381"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/JSTSP.2017.2698143"},{"key":"#cr-split#-e_1_2_1_61_1.1","doi-asserted-by":"crossref","unstructured":"Donggeun Yoo Namil Kim Sunggyun Park Anthony S. Paek and In So Kweon. 2016. Pixel-level domain transfer. In ECCV. 517-532. DOI:https:\/\/doi.org\/10.1007\/978-3-319-46484-8_31 10.1007\/978-3-319-46484-8_31","DOI":"10.1007\/978-3-319-46484-8_31"},{"key":"#cr-split#-e_1_2_1_61_1.2","doi-asserted-by":"crossref","unstructured":"Donggeun Yoo Namil Kim Sunggyun Park Anthony S. Paek and In So Kweon. 2016. Pixel-level domain transfer. In ECCV. 517-532. DOI:https:\/\/doi.org\/10.1007\/978-3-319-46484-8_31","DOI":"10.1007\/978-3-319-46484-8_31"},{"key":"e_1_2_1_62_1","unstructured":"Munyoung Kim Yunjey Choi Minje Choi Jung-Woo Ha Sunghun Kim and Jaegul Choo. 2018. StarGAN:Unified generative adversarial networks for multidomain image-to-image translation. In CVPR. 8789\u20138797.  Munyoung Kim Yunjey Choi Minje Choi Jung-Woo Ha Sunghun Kim and Jaegul Choo. 2018. StarGAN:Unified generative adversarial networks for multidomain image-to-image translation. In CVPR. 8789\u20138797."},{"key":"e_1_2_1_63_1","unstructured":"H. Zhang I. Goodfellow D. Metaxas and A. Odena. 2019. Self-attention generative adversarial networks. In ICML. 7354\u20137363.  H. Zhang I. Goodfellow D. Metaxas and A. Odena. 2019. Self-attention generative adversarial networks. In ICML. 7354\u20137363."},{"key":"e_1_2_1_64_1","doi-asserted-by":"crossref","unstructured":"Han Zhang Tao Xu Hongsheng Li Shaoting Zhang Xiaogang Wang Xiaolei Huang and Dimitris Metaxas. 2017. StackGAN: Text to photo-realistic image synthesis with stacked generative adversarial networks. In ICCV. 5908\u20135916.  Han Zhang Tao Xu Hongsheng Li Shaoting Zhang Xiaogang Wang Xiaolei Huang and Dimitris Metaxas. 2017. StackGAN: Text to photo-realistic image synthesis with stacked generative adversarial networks. In ICCV. 5908\u20135916.","DOI":"10.1109\/ICCV.2017.629"},{"key":"e_1_2_1_65_1","volume-title":"Efros","author":"Zhang Richard","year":"2016","unstructured":"Richard Zhang , Phillip Isola , and Alexei A . Efros . 2016 . Colorful image colorization. In ECCV. 649\u2013666. DOI:https:\/\/doi.org\/10.1007\/978-3-319-46487-9_40 10.1007\/978-3-319-46487-9_40 Richard Zhang, Phillip Isola, and Alexei A. Efros. 2016. Colorful image colorization. In ECCV. 649\u2013666. DOI:https:\/\/doi.org\/10.1007\/978-3-319-46487-9_40"},{"key":"e_1_2_1_66_1","volume-title":"Energy-based generative adversarial network. Preprint ArXiv","author":"Zhao Junbo","year":"2016","unstructured":"Junbo Zhao , Michael Mathieu , and Yann Lecun . 2016. Energy-based generative adversarial network. Preprint ArXiv ( 2016 ). Junbo Zhao, Michael Mathieu, and Yann Lecun. 2016. Energy-based generative adversarial network. Preprint ArXiv (2016)."},{"key":"e_1_2_1_67_1","volume-title":"GeneGAN: Learning object transfiguration and attribute subspace from unpaired data. Preprint arXiv 1705.04932","author":"Zhou Shuchang","year":"2017","unstructured":"Shuchang Zhou , Taihong Xiao , Yi Yang , Dieqiao Feng , Qinyao He , and Weiran He. 2017. GeneGAN: Learning object transfiguration and attribute subspace from unpaired data. Preprint arXiv 1705.04932 ( 2017 ). Shuchang Zhou, Taihong Xiao, Yi Yang, Dieqiao Feng, Qinyao He, and Weiran He. 2017. GeneGAN: Learning object transfiguration and attribute subspace from unpaired data. Preprint arXiv 1705.04932 (2017)."},{"key":"#cr-split#-e_1_2_1_68_1.1","doi-asserted-by":"crossref","unstructured":"Jun Yan Zhu Taesung Park and Phillip Isola. 2017. Unpaired image-to-image translation using cycle-consistent adversarial networks. In ICCV. 2242-2251. DOI:https:\/\/doi.org\/10.1109\/iccv.2017.244 10.1109\/iccv.2017.244","DOI":"10.1109\/ICCV.2017.244"},{"key":"#cr-split#-e_1_2_1_68_1.2","doi-asserted-by":"crossref","unstructured":"Jun Yan Zhu Taesung Park and Phillip Isola. 2017. Unpaired image-to-image translation using cycle-consistent adversarial networks. In ICCV. 2242-2251. DOI:https:\/\/doi.org\/10.1109\/iccv.2017.244","DOI":"10.1109\/ICCV.2017.244"},{"key":"e_1_2_1_69_1","volume-title":"Efros","author":"Zhu Jun Yan","year":"2016","unstructured":"Jun Yan Zhu , Philipp Kr\u00e4henb\u00fchl , Eli Shechtman , and Alexei A . Efros . 2016 . Generative visual manipulation on the natural image manifold. In ECCV. DOI :https:\/\/doi.org\/10.1007\/978-3-319-46454-1_36 10.1007\/978-3-319-46454-1_36 Jun Yan Zhu, Philipp Kr\u00e4henb\u00fchl, Eli Shechtman, and Alexei A. Efros. 2016. Generative visual manipulation on the natural image manifold. In ECCV. DOI:https:\/\/doi.org\/10.1007\/978-3-319-46454-1_36"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3432817","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3432817","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:47:11Z","timestamp":1750193231000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3432817"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,31]]},"references-count":88,"journal-issue":{"issue":"1s","published-print":{"date-parts":[[2021,1,31]]}},"alternative-id":["10.1145\/3432817"],"URL":"https:\/\/doi.org\/10.1145\/3432817","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,1,31]]},"assertion":[{"value":"2020-03-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-10-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-03-31","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}