{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,6]],"date-time":"2025-11-06T15:40:34Z","timestamp":1762443634770,"version":"3.41.0"},"reference-count":40,"publisher":"Association for Computing Machinery (ACM)","issue":"6","license":[{"start":{"date-parts":[[2018,12,4]],"date-time":"2018-12-04T00:00:00Z","timestamp":1543881600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2018,12,31]]},"abstract":"<jats:p>\n            Facial caricature is an art form of drawing faces in an exaggerated way to convey humor or sarcasm. In this paper, we propose the first Generative Adversarial Network (GAN) for unpaired photo-to-caricature translation, which we call \"CariGANs\". It explicitly models geometric exaggeration and appearance stylization using two components:\n            <jats:italic>CariGeoGAN<\/jats:italic>\n            , which only models the geometry-to-geometry transformation from face photos to caricatures, and\n            <jats:italic>CariStyGAN<\/jats:italic>\n            , which transfers the style appearance from caricatures to face photos without any geometry deformation. In this way, a difficult cross-domain translation problem is decoupled into two easier tasks. The perceptual study shows that caricatures generated by our\n            <jats:italic>CariGANs<\/jats:italic>\n            are closer to the hand-drawn ones, and at the same time better preserve the identity, compared to state-of-the-art methods. 
Moreover, our\n            <jats:italic>CariGANs<\/jats:italic>\n            allow users to control the shape exaggeration degree and change the color\/texture style by tuning the parameters or giving an example caricature.\n          <\/jats:p>","DOI":"10.1145\/3272127.3275046","type":"journal-article","created":{"date-parts":[[2018,11,28]],"date-time":"2018-11-28T19:16:10Z","timestamp":1543432570000},"page":"1-14","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":65,"title":["CariGANs"],"prefix":"10.1145","volume":"37","author":[{"given":"Kaidi","family":"Cao","sequence":"first","affiliation":[{"name":"Tsinghua University"}]},{"given":"Jing","family":"Liao","sequence":"additional","affiliation":[{"name":"City University of Hong Kong, Microsoft Research"}]},{"given":"Lu","family":"Yuan","sequence":"additional","affiliation":[{"name":"Microsoft AI Perception and Mixed Reality"}]}],"member":"320","published-online":{"date-parts":[[2018,12,4]]},"reference":[{"key":"e_1_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/259081.259231"},{"key":"e_1_2_2_2_1","volume-title":"Proc. Visual. 165--170","author":"Akleman Ergun","year":"2000","unstructured":"Ergun Akleman , James Palmer , and Ryan Logan . 2000 . Making extreme caricatures with a new interactive 2D deformation technique with simplicial complexes . In Proc. Visual. 165--170 . Ergun Akleman, James Palmer, and Ryan Logan. 2000. Making extreme caricatures with a new interactive 2D deformation technique with simplicial complexes. In Proc. Visual. 
165--170."},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1162\/leon.2007.40.4.392"},{"key":"e_1_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.126"},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.296"},{"key":"e_1_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/641007.641040"},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.361"},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073660"},{"key":"e_1_2_2_9_1","volume-title":"A neural algorithm of artistic style. arXiv preprint arXiv:1508.06576","author":"Gatys Leon A","year":"2015","unstructured":"Leon A Gatys , Alexander S Ecker , and Matthias Bethge . 2015. A neural algorithm of artistic style. arXiv preprint arXiv:1508.06576 ( 2015 ). Leon A Gatys, Alexander S Ecker, and Matthias Bethge. 2015. A neural algorithm of artistic style. arXiv preprint arXiv:1508.06576 (2015)."},{"key":"e_1_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/966131.966133"},{"key":"e_1_2_2_11_1","volume-title":"Reducing the dimensionality of data with neural networks. science 313, 5786","author":"Hinton Geoffrey E","year":"2006","unstructured":"Geoffrey E Hinton and Ruslan R Salakhutdinov . 2006. Reducing the dimensionality of data with neural networks. science 313, 5786 ( 2006 ), 504--507. Geoffrey E Hinton and Ruslan R Salakhutdinov. 2006. Reducing the dimensionality of data with neural networks. science 313, 5786 (2006), 504--507."},{"key":"e_1_2_2_12_1","volume-title":"Multimodal Unsupervised Image-to-image Translation. arXiv preprint arXiv:1804.04732","author":"Huang Xun","year":"2018","unstructured":"Xun Huang , Ming-Yu Liu , Serge Belongie , and Jan Kautz . 2018. Multimodal Unsupervised Image-to-image Translation. arXiv preprint arXiv:1804.04732 ( 2018 ). Xun Huang, Ming-Yu Liu, Serge Belongie, and Jan Kautz. 2018. Multimodal Unsupervised Image-to-image Translation. 
arXiv preprint arXiv:1804.04732 (2018)."},{"key":"e_1_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.632"},{"volume-title":"Proc","author":"Johnson Justin","key":"e_1_2_2_14_1","unstructured":"Justin Johnson , Alexandre Alahi , and Li Fei-Fei . 2016. Perceptual losses for real-time style transfer and super-resolution . In Proc . ECCV. Springer , 694--711. Justin Johnson, Alexandre Alahi, and Li Fei-Fei. 2016. Perceptual losses for real-time style transfer and super-resolution. In Proc. ECCV. Springer, 694--711."},{"key":"e_1_2_2_15_1","volume-title":"Progressive growing of gans for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196","author":"Karras Tero","year":"2017","unstructured":"Tero Karras , Timo Aila , Samuli Laine , and Jaakko Lehtinen . 2017. Progressive growing of gans for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196 ( 2017 ). Tero Karras, Timo Aila, Samuli Laine, and Jaakko Lehtinen. 2017. Progressive growing of gans for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196 (2017)."},{"key":"e_1_2_2_16_1","volume-title":"Learning to discover cross-domain relations with generative adversarial networks. arXiv preprint arXiv:1703.05192","author":"Kim Taeksoo","year":"2017","unstructured":"Taeksoo Kim , Moonsu Cha , Hyunsoo Kim , Jungkwon Lee , and Jiwon Kim . 2017. Learning to discover cross-domain relations with generative adversarial networks. arXiv preprint arXiv:1703.05192 ( 2017 ). Taeksoo Kim, Moonsu Cha, Hyunsoo Kim, Jungkwon Lee, and Jiwon Kim. 2017. Learning to discover cross-domain relations with generative adversarial networks. arXiv preprint arXiv:1703.05192 (2017)."},{"key":"e_1_2_2_17_1","volume-title":"Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980","author":"Kingma Diederik P","year":"2014","unstructured":"Diederik P Kingma and Jimmy Ba . 2014 . Adam: A method for stochastic optimization. 
arXiv preprint arXiv:1412.6980 (2014). Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)."},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSMC.1999.816567"},{"key":"e_1_2_2_19_1","volume-title":"Proc. International Conference on Multimedia Modeling. Springer, 536--547","author":"Hai Le Nguyen Kim","year":"2011","unstructured":"Nguyen Kim Hai Le , Yong Peng Why , and Golam Ashraf . 2011 . Shape stylized face caricatures . In Proc. International Conference on Multimedia Modeling. Springer, 536--547 . Nguyen Kim Hai Le, Yong Peng Why, and Golam Ashraf. 2011. Shape stylized face caricatures. In Proc. International Conference on Multimedia Modeling. Springer, 536--547."},{"key":"e_1_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.5555\/826030.826637"},{"key":"e_1_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073683"},{"key":"e_1_2_2_22_1","volume-title":"Proc. ACCV","volume":"2","author":"Chiang Wen-Hung Liao Pei-Ying","year":"2004","unstructured":"Pei-Ying Chiang Wen-Hung Liao and Tsai-Yen Li . 2004 . Automatic caricature generation by analyzing facial features . In Proc. ACCV , Vol. 2 . Pei-Ying Chiang Wen-Hung Liao and Tsai-Yen Li. 2004. Automatic caricature generation by analyzing facial features. In Proc. ACCV, Vol. 2."},{"key":"e_1_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/1180639.1180783"},{"key":"e_1_2_2_24_1","unstructured":"Ming-Yu Liu Thomas Breuel and Jan Kautz. 2017. Unsupervised image-to-image translation networks. In Advances in Neural Information Processing Systems. 700--708.   Ming-Yu Liu Thomas Breuel and Jan Kautz. 2017. Unsupervised image-to-image translation networks. In Advances in Neural Information Processing Systems. 
700--708."},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.425"},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.304"},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/1186223.1186294"},{"key":"e_1_2_2_28_1","volume-title":"Proc. of NIPS.","author":"Paszke Adam","year":"2017","unstructured":"Adam Paszke , Sam Gross , Soumith Chintala , Gregory Chanan , Edward Yang , Zachary DeVito , Zeming Lin , Alban Desmaison , Luca Antiga , and Adam Lerer . 2017 . Automatic differentiation in PyTorch . In Proc. of NIPS. Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, and Adam Lerer. 2017. Automatic differentiation in PyTorch. In Proc. of NIPS."},{"volume-title":"How to draw caricatures","author":"Redman Lenn","key":"e_1_2_2_29_1","unstructured":"Lenn Redman . 1984. How to draw caricatures . Vol. 1 . Contemporary Books Chicago , IL. Lenn Redman. 1984. How to draw caricatures. Vol. 1. Contemporary Books Chicago, IL."},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897824.2925968"},{"key":"e_1_2_2_31_1","doi-asserted-by":"crossref","unstructured":"Rupesh N Shet Ka H Lai Eran A Edirisinghe and Paul WH Chung. 2005. Use of neural networks in automatic caricature generation: an approach based on drawing style capture. (2005).  Rupesh N Shet Ka H Lai Eran A Edirisinghe and Paul WH Chung. 2005. Use of neural networks in automatic caricature generation: an approach based on drawing style capture. (2005).","DOI":"10.1049\/cp:20050066"},{"key":"e_1_2_2_32_1","volume-title":"Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman . 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 ( 2014 ). 
Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)."},{"key":"e_1_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/81.558448"},{"key":"e_1_2_2_34_1","volume-title":"Unsupervised cross-domain image generation. arXiv preprint arXiv:1611.02200","author":"Taigman Yaniv","year":"2016","unstructured":"Yaniv Taigman , Adam Polyak , and Lior Wolf . 2016. Unsupervised cross-domain image generation. arXiv preprint arXiv:1611.02200 ( 2016 ). Yaniv Taigman, Adam Polyak, and Lior Wolf. 2016. Unsupervised cross-domain image generation. arXiv preprint arXiv:1611.02200 (2016)."},{"volume-title":"Proc","author":"Tseng Chien-Chung","key":"e_1_2_2_35_1","unstructured":"Chien-Chung Tseng and Jenn-Jier James Lien . 2007. Synthesis of exaggerative caricature with inter and intra correlations . In Proc . ACCV. Springer , 314--323. Chien-Chung Tseng and Jenn-Jier James Lien. 2007. Synthesis of exaggerative caricature with inter and intra correlations. In Proc. ACCV. Springer, 314--323."},{"key":"e_1_2_2_36_1","volume-title":"Proc. CVPR. IEEE, 861--868","author":"Yang Fei","year":"2012","unstructured":"Fei Yang , Lubomir Bourdev , Eli Shechtman , Jue Wang , and Dimitris Metaxas . 2012 . Facial expression editing in video using a temporally-smooth factorization . In Proc. CVPR. IEEE, 861--868 . Fei Yang, Lubomir Bourdev, Eli Shechtman, Jue Wang, and Dimitris Metaxas. 2012. Facial expression editing in video using a temporally-smooth factorization. In Proc. CVPR. IEEE, 861--868."},{"key":"e_1_2_2_37_1","volume-title":"Dualgan: Unsupervised dual learning for image-to-image translation. arXiv preprint","author":"Yi Zili","year":"2017","unstructured":"Zili Yi , Hao Zhang , Ping Tan , and Minglun Gong . 2017 . Dualgan: Unsupervised dual learning for image-to-image translation. arXiv preprint (2017). Zili Yi, Hao Zhang, Ping Tan, and Minglun Gong. 2017. 
Dualgan: Unsupervised dual learning for image-to-image translation. arXiv preprint (2017)."},{"key":"e_1_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.244"},{"key":"e_1_2_2_39_1","unstructured":"Jun-Yan Zhu Richard Zhang Deepak Pathak Trevor Darrell Alexei A Efros Oliver Wang and Eli Shechtman. 2017b. Toward multimodal image-to-image translation. In Advances in Neural Information Processing Systems. 465--476.   Jun-Yan Zhu Richard Zhang Deepak Pathak Trevor Darrell Alexei A Efros Oliver Wang and Eli Shechtman. 2017b. Toward multimodal image-to-image translation. In Advances in Neural Information Processing Systems. 465--476."},{"key":"e_1_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.371"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3272127.3275046","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/dl.acm.org\/ft_gateway.cfm?id=3275046&ftid=2020796&dwn=1","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:44:04Z","timestamp":1750207444000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3272127.3275046"}},"subtitle":["unpaired photo-to-caricature 
translation"],"short-title":[],"issued":{"date-parts":[[2018,12,4]]},"references-count":40,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2018,12,31]]}},"alternative-id":["10.1145\/3272127.3275046"],"URL":"https:\/\/doi.org\/10.1145\/3272127.3275046","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"type":"print","value":"0730-0301"},{"type":"electronic","value":"1557-7368"}],"subject":[],"published":{"date-parts":[[2018,12,4]]},"assertion":[{"value":"2018-12-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}