{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,24]],"date-time":"2026-06-24T19:13:17Z","timestamp":1782328397565,"version":"3.54.5"},"publisher-location":"New York, NY, USA","reference-count":93,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,8,7]],"date-time":"2022-08-07T00:00:00Z","timestamp":1659830400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-sa\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,8,7]]},"DOI":"10.1145\/3528233.3530757","type":"proceedings-article","created":{"date-parts":[[2022,7,20]],"date-time":"2022-07-20T13:56:43Z","timestamp":1658325403000},"page":"1-10","source":"Crossref","is-referenced-by-count":1463,"title":["Palette: Image-to-Image Diffusion Models"],"prefix":"10.1145","author":[{"given":"Chitwan","family":"Saharia","sequence":"first","affiliation":[{"name":"Google Research, Canada"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"William","family":"Chan","sequence":"additional","affiliation":[{"name":"Google Research, Canada"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Huiwen","family":"Chang","sequence":"additional","affiliation":[{"name":"Google Research, United States of America"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Chris","family":"Lee","sequence":"additional","affiliation":[{"name":"Google Research, United States of America"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jonathan","family":"Ho","sequence":"additional","affiliation":[{"name":"Google Research, United States of America"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Tim","family":"Salimans","sequence":"additional","affiliation":[{"name":"Google Research, 
Netherlands"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"David","family":"Fleet","sequence":"additional","affiliation":[{"name":"Google Research, Canada"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mohammad","family":"Norouzi","sequence":"additional","affiliation":[{"name":"Google Research, Canada"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2022,8,7]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Lynton Ardizzone Carsten L\u00fcth Jakob Kruse Carsten Rother and Ullrich K\u00f6the. 2019. Guided Image Generation with Conditional Invertible Neural Networks. In arXiv:1907.02392. Lynton Ardizzone Carsten L\u00fcth Jakob Kruse Carsten Rother and Ullrich K\u00f6the. 2019. Guided Image Generation with Conditional Invertible Neural Networks. In arXiv:1907.02392."},{"key":"e_1_3_2_1_2_1","unstructured":"Martin Arjovsky Soumith Chintala and L\u00e9on Bottou. 2017. Wasserstein GAN. In arXiv. Martin Arjovsky Soumith Chintala and L\u00e9on Bottou. 2017. Wasserstein GAN. In arXiv."},{"key":"e_1_3_2_1_3_1","unstructured":"Jacob Austin Daniel Johnson Jonathan Ho Danny Tarlow and Rianne van\u00a0den Berg. 2021. Structured Denoising Diffusion Models in Discrete State-Spaces. arXiv preprint arXiv:2107.03006(2021). Jacob Austin Daniel Johnson Jonathan Ho Danny Tarlow and Rianne van\u00a0den Berg. 2021. Structured Denoising Diffusion Models in Discrete State-Spaces. arXiv preprint arXiv:2107.03006(2021)."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1531326.1531330"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/344779.344972"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00234"},{"key":"e_1_3_2_1_7_1","unstructured":"Ruojin Cai Guandao Yang Hadar Averbuch-Elor Zekun Hao Serge Belongie Noah Snavely and Bharath Hariharan. 2020. Learning Gradient Fields for Shape Generation. In ECCV. 
Ruojin Cai Guandao Yang Hadar Averbuch-Elor Zekun Hao Serge Belongie Noah Snavely and Bharath Hariharan. 2020. Learning Gradient Fields for Shape Generation. In ECCV."},{"key":"e_1_3_2_1_8_1","unstructured":"Liang-Chieh Chen George Papandreou Florian Schroff and Hartwig Adam. 2017. Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587(2017). Liang-Chieh Chen George Papandreou Florian Schroff and Hartwig Adam. 2017. Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587(2017)."},{"key":"e_1_3_2_1_9_1","unstructured":"Nanxin Chen Yu Zhang Heiga Zen Ron\u00a0J. Weiss Mohammad Norouzi and William Chan. 2021a. WaveGrad: Estimating Gradients for Waveform Generation. In ICLR. Nanxin Chen Yu Zhang Heiga Zen Ron\u00a0J. Weiss Mohammad Norouzi and William Chan. 2021a. WaveGrad: Estimating Gradients for Waveform Generation. In ICLR."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Nanxin Chen Yu Zhang Heiga Zen Ron\u00a0J. Weiss Mohammad Norouzi Najim Dehak and William Chan. 2021b. WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis. In INTERSPEECH. Nanxin Chen Yu Zhang Heiga Zen Ron\u00a0J. Weiss Mohammad Norouzi Najim Dehak and William Chan. 2021b. WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis. In INTERSPEECH.","DOI":"10.21437\/Interspeech.2021-1897"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"crossref","unstructured":"Yen-Chi Cheng Chieh\u00a0Hubert Lin Hsin-Ying Lee Jian Ren Sergey Tulyakov and Ming-Hsuan Yang. 2021. In&Out: Diverse Image Outpainting via GAN Inversion. arXiv preprint arXiv:2104.00675(2021). Yen-Chi Cheng Chieh\u00a0Hubert Lin Hsin-Ying Lee Jian Ren Sergey Tulyakov and Ming-Hsuan Yang. 2021. In&Out: Diverse Image Outpainting via GAN Inversion. 
arXiv preprint arXiv:2104.00675(2021).","DOI":"10.1109\/CVPR52688.2022.01114"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00916"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"crossref","unstructured":"Ryan Dahl Mohammad Norouzi and Jonathon Shlens. 2017. Pixel recursive super resolution. In ICCV. Ryan Dahl Mohammad Norouzi and Jonathon Shlens. 2017. Pixel recursive super resolution. In ICCV.","DOI":"10.1109\/ICCV.2017.581"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.307"},{"key":"e_1_3_2_1_16_1","unstructured":"Prafulla Dhariwal and Alex Nichol. 2021. Diffusion models beat gans on image synthesis. arXiv preprint arXiv:2105.05233(2021). Prafulla Dhariwal and Alex Nichol. 2021. Diffusion models beat gans on image synthesis. arXiv preprint arXiv:2105.05233(2021)."},{"key":"e_1_3_2_1_17_1","volume-title":"Density estimation using real NVP. arXiv:1605.08803","author":"Dinh Laurent","year":"2016","unstructured":"Laurent Dinh , Jascha Sohl-Dickstein , and Samy Bengio . 2016. Density estimation using real NVP. arXiv:1605.08803 ( 2016 ). Laurent Dinh, Jascha Sohl-Dickstein, and Samy Bengio. 2016. Density estimation using real NVP. arXiv:1605.08803 (2016)."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.73"},{"key":"e_1_3_2_1_19_1","volume-title":"Generating Images with Perceptual Similarity Metrics based on Deep Networks. arXiv 1602.0264","author":"Dosovitskiy Alexey","year":"2016","unstructured":"Alexey Dosovitskiy and Thomas Brox . 2016. Generating Images with Perceptual Similarity Metrics based on Deep Networks. arXiv 1602.0264 ( 2016 ). Alexey Dosovitskiy and Thomas Brox. 2016. Generating Images with Perceptual Similarity Metrics based on Deep Networks. 
arXiv 1602.0264 (2016)."},{"key":"e_1_3_2_1_20_1","volume-title":"Proceedings, Part VIII 16","author":"Ehrlich Max","year":"2020","unstructured":"Max Ehrlich , Larry Davis , Ser-Nam Lim , and Abhinav Shrivastava . 2020 . Quantization guided jpeg artifact correction. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020 , Proceedings, Part VIII 16 . Springer, 293\u2013309. Max Ehrlich, Larry Davis, Ser-Nam Lim, and Abhinav Shrivastava. 2020. Quantization guided jpeg artifact correction. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020, Proceedings, Part VIII 16. Springer, 293\u2013309."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.517"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2019.2895280"},{"key":"e_1_3_2_1_23_1","volume-title":"Generative Adversarial Networks. NIPS","author":"Goodfellow J","year":"2014","unstructured":"Ian\u00a0 J Goodfellow , Jean Pouget-Abadie , Mehdi Mirza , Bing Xu , David Warde-Farley , Sherjil Ozair , Aaron Courville , and Yoshua Bengio . 2014. Generative Adversarial Networks. NIPS ( 2014 ). Ian\u00a0J Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative Adversarial Networks. NIPS (2014)."},{"key":"e_1_3_2_1_24_1","volume-title":"Pixcolor: Pixel recursive colorization. arXiv preprint arXiv:1705.07208(2017).","author":"Guadarrama Sergio","year":"2017","unstructured":"Sergio Guadarrama , Ryan Dahl , David Bieber , Mohammad Norouzi , Jonathon Shlens , and Kevin Murphy . 2017 . Pixcolor: Pixel recursive colorization. arXiv preprint arXiv:1705.07208(2017). Sergio Guadarrama, Ryan Dahl, David Bieber, Mohammad Norouzi, Jonathon Shlens, and Kevin Murphy. 2017. Pixcolor: Pixel recursive colorization. 
arXiv preprint arXiv:1705.07208(2017)."},{"key":"e_1_3_2_1_25_1","unstructured":"Ishaan Gulrajani Faruk Ahmed Martin Arjovsky Vincent Dumoulin and Aaron Courville. 2017. Improved training of wasserstein gans. arXiv preprint arXiv:1704.00028(2017). Ishaan Gulrajani Faruk Ahmed Martin Arjovsky Vincent Dumoulin and Aaron Courville. 2017. Improved training of wasserstein gans. arXiv preprint arXiv:1704.00028(2017)."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58529-7_41"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00458"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/1276377.1276382"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-33709-3_2"},{"key":"e_1_3_2_1_30_1","unstructured":"Jonathan Ho Ajay Jain and Pieter Abbeel. 2020. Denoising diffusion probabilistic models. arXiv preprint arXiv:2006.11239(2020). Jonathan Ho Ajay Jain and Pieter Abbeel. 2020. Denoising diffusion probabilistic models. arXiv preprint arXiv:2006.11239(2020)."},{"key":"e_1_3_2_1_31_1","unstructured":"Jonathan Ho Chitwan Saharia William Chan David\u00a0J. Fleet Mohammad Norouzi and Tim Salimans. 2021. Cascaded Diffusion Models for High Fidelity Image Generation. In arXiv. Jonathan Ho Chitwan Saharia William Chan David\u00a0J. Fleet Mohammad Norouzi and Tim Salimans. 2021. Cascaded Diffusion Models for High Fidelity Image Generation. In arXiv."},{"key":"e_1_3_2_1_32_1","unstructured":"Emiel Hoogeboom Didrik Nielsen Priyank Jaini Patrick Forr\u00e9 and Max Welling. 2021. Argmax flows and multinomial diffusion: Towards non-autoregressive language models. arXiv preprint arXiv:2102.05379(2021). Emiel Hoogeboom Didrik Nielsen Priyank Jaini Patrick Forr\u00e9 and Max Welling. 2021. Argmax flows and multinomial diffusion: Towards non-autoregressive language models. 
arXiv preprint arXiv:2102.05379(2021)."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073659"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"crossref","unstructured":"Phillip Isola Jun-Yan Zhu Tinghui\u00a0Zhou and Alexei A.\u00a0Efros. 2017a. Image-to-Image Translation with Conditional Adversarial Nets. In CVPR. Phillip Isola Jun-Yan Zhu Tinghui\u00a0Zhou and Alexei A.\u00a0Efros. 2017a. Image-to-Image Translation with Conditional Adversarial Nets. In CVPR.","DOI":"10.1109\/CVPR.2017.632"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.632"},{"key":"e_1_3_2_1_36_1","volume-title":"Solving linear inverse problems using the prior implicit in a denoiser. arXiv preprint","author":"Kadkhodaie Zahra","year":"2021","unstructured":"Zahra Kadkhodaie and Eero\u00a0 P Simoncelli . 2021. Solving linear inverse problems using the prior implicit in a denoiser. arXiv preprint 2007.13640(2021). Zahra Kadkhodaie and Eero\u00a0P Simoncelli. 2021. Solving linear inverse problems using the prior implicit in a denoiser. arXiv preprint 2007.13640(2021)."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"crossref","unstructured":"Jiwon Kim Jung\u00a0Kwon Lee and Kyoung\u00a0Mu Lee. 2016. Deeply-recursive convolutional network for image super-resolution. In CVPR. 1637\u20131645. Jiwon Kim Jung\u00a0Kwon Lee and Kyoung\u00a0Mu Lee. 2016. Deeply-recursive convolutional network for image super-resolution. In CVPR. 1637\u20131645.","DOI":"10.1109\/CVPR.2016.181"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"crossref","unstructured":"Soo\u00a0Ye Kim Kfir Aberman Nori Kanazawa Rahul Garg Neal Wadhwa Huiwen Chang Nikhil Karnad Munchurl Kim and Orly Liba. 2021. Zoom-to-Inpaint: Image Inpainting with High-Frequency Details. arxiv:2012.09401\u00a0[cs.CV] Soo\u00a0Ye Kim Kfir Aberman Nori Kanazawa Rahul Garg Neal Wadhwa Huiwen Chang Nikhil Karnad Munchurl Kim and Orly Liba. 2021. 
Zoom-to-Inpaint: Image Inpainting with High-Frequency Details. arxiv:2012.09401\u00a0[cs.CV]","DOI":"10.1109\/CVPRW56347.2022.00063"},{"key":"e_1_3_2_1_39_1","volume-title":"Glow: Generative Flow with Invertible 1x1 Convolutions. In NIPS","author":"Kingma Diederik P.","year":"2018","unstructured":"Diederik\u00a0 P. Kingma and Prafulla Dhariwal . 2018 . Glow: Generative Flow with Invertible 1x1 Convolutions. In NIPS. Diederik\u00a0P. Kingma and Prafulla Dhariwal. 2018. Glow: Generative Flow with Invertible 1x1 Convolutions. In NIPS."},{"key":"e_1_3_2_1_40_1","unstructured":"Diederik\u00a0P Kingma Tim Salimans Ben Poole and Jonathan Ho. 2021. Variational Diffusion Models. arXiv preprint arXiv:2107.00630(2021). Diederik\u00a0P Kingma Tim Salimans Ben Poole and Jonathan Ho. 2021. Variational Diffusion Models. arXiv preprint arXiv:2107.00630(2021)."},{"key":"e_1_3_2_1_41_1","unstructured":"Diederik\u00a0P Kingma and Max Welling. 2013. Auto-Encoding Variational Bayes. In ICLR. Diederik\u00a0P Kingma and Max Welling. 2013. Auto-Encoding Variational Bayes. In ICLR."},{"key":"e_1_3_2_1_42_1","volume-title":"Diffwave: A versatile diffusion model for audio synthesis. arXiv preprint arXiv:2009.09761(2020).","author":"Kong Zhifeng","year":"2020","unstructured":"Zhifeng Kong , Wei Ping , Jiaji Huang , Kexin Zhao , and Bryan Catanzaro . 2020 . Diffwave: A versatile diffusion model for audio synthesis. arXiv preprint arXiv:2009.09761(2020). Zhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, and Bryan Catanzaro. 2020. Diffwave: A versatile diffusion model for audio synthesis. arXiv preprint arXiv:2009.09761(2020)."},{"key":"e_1_3_2_1_43_1","volume-title":"DiffWave: A Versatile Diffusion Model for Audio Synthesis. ICLR","author":"Kong Zhifeng","year":"2021","unstructured":"Zhifeng Kong , Wei Ping , Jiaji Huang , Kexin Zhao , and Bryan Catanzaro . 2021. DiffWave: A Versatile Diffusion Model for Audio Synthesis. ICLR ( 2021 ). Zhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, and Bryan Catanzaro. 2021. 
DiffWave: A Versatile Diffusion Model for Audio Synthesis. ICLR (2021)."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2366145.2366150","article-title":"Quality prediction for image completion","volume":"31","author":"Kopf Johannes","year":"2012","unstructured":"Johannes Kopf , Wolf Kienzle , Steven Drucker , and Sing\u00a0Bing Kang . 2012 . Quality prediction for image completion . ACM Transactions on Graphics (ToG) 31 , 6 (2012), 1 \u2013 8 . Johannes Kopf, Wolf Kienzle, Steven Drucker, and Sing\u00a0Bing Kang. 2012. Quality prediction for image completion. ACM Transactions on Graphics (ToG) 31, 6 (2012), 1\u20138.","journal-title":"ACM Transactions on Graphics (ToG)"},{"key":"e_1_3_2_1_45_1","volume-title":"Colorization Transformer. In ICLR","author":"Kumar Manoj","year":"2021","unstructured":"Manoj Kumar , Dirk Weissenborn , and Nal Kalchbrenner . 2021 . Colorization Transformer. In ICLR 2021. https:\/\/openreview.net\/forum?id=5NA1PinlGFu Manoj Kumar, Dirk Weissenborn, and Nal Kalchbrenner. 2021. Colorization Transformer. In ICLR 2021. https:\/\/openreview.net\/forum?id=5NA1PinlGFu"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46493-0_35"},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"crossref","unstructured":"Christian Ledig Lucas Theis Ferenc Husz\u00e1r Jose Caballero Andrew Cunningham Alejandro Acosta Andrew Aitken Alykhan Tejani Johannes Totz Zehan Wang 2017. Photo-realistic single image super-resolution using a generative adversarial network. In ICCV. Christian Ledig Lucas Theis Ferenc Husz\u00e1r Jose Caballero Andrew Cunningham Alejandro Acosta Andrew Aitken Alykhan Tejani Johannes Totz Zehan Wang 2017. Photo-realistic single image super-resolution using a generative adversarial network. In ICCV.","DOI":"10.1109\/CVPR.2017.19"},{"key":"e_1_3_2_1_48_1","unstructured":"Chieh\u00a0Hubert Lin Hsin-Ying Lee Yen-Chi Cheng Sergey Tulyakov and Ming-Hsuan Yang. 2021. 
InfinityGAN: Towards Infinite-Resolution Image Synthesis. arXiv preprint arXiv:2104.03963(2021). Chieh\u00a0Hubert Lin Hsin-Ying Lee Yen-Chi Cheng Sergey Tulyakov and Ming-Hsuan Yang. 2021. InfinityGAN: Towards Infinite-Resolution Image Synthesis. arXiv preprint arXiv:2104.03963(2021)."},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01252-6_6"},{"key":"e_1_3_2_1_50_1","volume-title":"Proceedings, Part II 16","author":"Liu Hongyu","year":"2020","unstructured":"Hongyu Liu , Bin Jiang , Yibing Song , Wei Huang , and Chao Yang . 2020 . Rethinking image inpainting via a mutual encoder-decoder with feature equalizations. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020 , Proceedings, Part II 16 . Springer, 725\u2013741. Hongyu Liu, Bin Jiang, Yibing Song, Wei Huang, and Chao Yang. 2020. Rethinking image inpainting via a mutual encoder-decoder with feature equalizations. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020, Proceedings, Part II 16. Springer, 725\u2013741."},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.01065"},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2018.00121"},{"key":"e_1_3_2_1_53_1","unstructured":"Chenlin Meng Yang Song Jiaming Song Jiajun Wu Jun-Yan Zhu and Stefano Ermon. 2021. SDEdit: Image Synthesis and Editing with Stochastic Differential Equations. arXiv preprint arXiv:2108.01073(2021). Chenlin Meng Yang Song Jiaming Song Jiajun Wu Jun-Yan Zhu and Stefano Ermon. 2021. SDEdit: Image Synthesis and Editing with Stochastic Differential Equations. arXiv preprint arXiv:2108.01073(2021)."},{"key":"e_1_3_2_1_54_1","volume-title":"PULSE: Self-supervised photo upsampling via latent space exploration of generative models. In CVPR.","author":"Menon Sachit","year":"2020","unstructured":"Sachit Menon , Alexandru Damian , Shijia Hu , Nikhil Ravi , and Cynthia Rudin . 
2020 . PULSE: Self-supervised photo upsampling via latent space exploration of generative models. In CVPR. Sachit Menon, Alexandru Damian, Shijia Hu, Nikhil Ravi, and Cynthia Rudin. 2020. PULSE: Self-supervised photo upsampling via latent space exploration of generative models. In CVPR."},{"key":"e_1_3_2_1_55_1","unstructured":"Luke Metz Ben Poole David Pfau and Jascha Sohl-Dickstein. 2016. Unrolled generative adversarial networks. arXiv preprint arXiv:1611.02163(2016). Luke Metz Ben Poole David Pfau and Jascha Sohl-Dickstein. 2016. Unrolled generative adversarial networks. arXiv preprint arXiv:1611.02163(2016)."},{"key":"e_1_3_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2019.00408"},{"key":"e_1_3_2_1_57_1","unstructured":"Alex Nichol and Prafulla Dhariwal. 2021. Improved Denoising Diffusion Probabilistic Models. arXiv preprint arXiv:2102.09672(2021). Alex Nichol and Prafulla Dhariwal. 2021. Improved Denoising Diffusion Probabilistic Models. arXiv preprint arXiv:2102.09672(2021)."},{"key":"e_1_3_2_1_58_1","unstructured":"Niki Parmar Ashish Vaswani Jakob Uszkoreit Lukasz Kaiser Noam Shazeer Alexander Ku and Dustin Tran. 2018. Image transformer. In ICML. Niki Parmar Ashish Vaswani Jakob Uszkoreit Lukasz Kaiser Noam Shazeer Alexander Ku and Dustin Tran. 2018. Image transformer. In ICML."},{"key":"e_1_3_2_1_59_1","unstructured":"Guocheng Qian Jinjin Gu Jimmy Ren Chao Dong Furong Zhao and Juan Lin. 2019. Trinity of Pixel Enhancement: a Joint Solution for Demosaicking Denoising and Super-Resolution. In arXiv:1905.02538. Guocheng Qian Jinjin Gu Jimmy Ren Chao Dong Furong Zhao and Juan Lin. 2019. Trinity of Pixel Enhancement: a Joint Solution for Demosaicking Denoising and Super-Resolution. In arXiv:1905.02538."},{"key":"e_1_3_2_1_60_1","unstructured":"Alec Radford Luke Metz and Soumith Chintala. 2015. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434(2015). 
Alec Radford Luke Metz and Soumith Chintala. 2015. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434(2015)."},{"key":"e_1_3_2_1_61_1","unstructured":"Suman Ravuri and Oriol Vinyals. 2019. Classification accuracy score for conditional generative models. arXiv preprint arXiv:1905.10887(2019). Suman Ravuri and Oriol Vinyals. 2019. Classification accuracy score for conditional generative models. arXiv preprint arXiv:1905.10887(2019)."},{"key":"e_1_3_2_1_62_1","doi-asserted-by":"crossref","unstructured":"Amelie Royer Alexander Kolesnikov and Christoph\u00a0H. Lampert. 2017. Probabilistic Image Colorization. In arXiv:1705.04258. Amelie Royer Alexander Kolesnikov and Christoph\u00a0H. Lampert. 2017. Probabilistic Image Colorization. In arXiv:1705.04258.","DOI":"10.5244\/C.31.85"},{"key":"e_1_3_2_1_63_1","unstructured":"Chitwan Saharia Jonathan Ho William Chan Tim Salimans David\u00a0J Fleet and Mohammad Norouzi. 2021. Image super-resolution via iterative refinement. arXiv preprint arXiv:2104.07636(2021). Chitwan Saharia Jonathan Ho William Chan Tim Salimans David\u00a0J Fleet and Mohammad Norouzi. 2021. Image super-resolution via iterative refinement. arXiv preprint arXiv:2104.07636(2021)."},{"key":"e_1_3_2_1_64_1","unstructured":"Tim Salimans Andrej Karpathy Xi Chen and Diederik\u00a0P. Kingma. 2017. PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modifications. In ICLR. Tim Salimans Andrej Karpathy Xi Chen and Diederik\u00a0P. Kingma. 2017. PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modifications. In ICLR."},{"key":"e_1_3_2_1_65_1","unstructured":"Hiroshi Sasaki Chris\u00a0G Willcocks and Toby\u00a0P Breckon. 2021. UNIT-DDPM: UNpaired Image Translation with Denoising Diffusion Probabilistic Models. arXiv preprint arXiv:2104.05358(2021). Hiroshi Sasaki Chris\u00a0G Willcocks and Toby\u00a0P Breckon. 2021. 
UNIT-DDPM: UNpaired Image Translation with Denoising Diffusion Probabilistic Models. arXiv preprint arXiv:2104.05358(2021)."},{"key":"e_1_3_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10599-4_2"},{"key":"e_1_3_2_1_67_1","unstructured":"Abhishek Sinha Jiaming Song Chenlin Meng and Stefano Ermon. 2021. D2C: Diffusion-Denoising Models for Few-shot Conditional Generation. arXiv preprint arXiv:2106.06819(2021). Abhishek Sinha Jiaming Song Chenlin Meng and Stefano Ermon. 2021. D2C: Diffusion-Denoising Models for Few-shot Conditional Generation. arXiv preprint arXiv:2106.06819(2021)."},{"key":"e_1_3_2_1_68_1","unstructured":"Jascha Sohl-Dickstein Eric Weiss Niru Maheswaranathan and Surya Ganguli. 2015. Deep unsupervised learning using nonequilibrium thermodynamics. In ICML. PMLR 2256\u20132265. Jascha Sohl-Dickstein Eric Weiss Niru Maheswaranathan and Surya Ganguli. 2015. Deep unsupervised learning using nonequilibrium thermodynamics. In ICML. PMLR 2256\u20132265."},{"key":"e_1_3_2_1_69_1","unstructured":"Yang Song and Stefano Ermon. 2020. Improved Techniques for Training Score-Based Generative Models. arXiv preprint arXiv:2006.09011(2020). Yang Song and Stefano Ermon. 2020. Improved Techniques for Training Score-Based Generative Models. arXiv preprint arXiv:2006.09011(2020)."},{"key":"e_1_3_2_1_70_1","unstructured":"Yang Song Jascha Sohl-Dickstein Diederik\u00a0P. Kingma Abhishek Kumar Stefano Ermon and Ben Poole. 2021. Score-Based Generative Modeling through Stochastic Differential Equations. In ICLR. Yang Song Jascha Sohl-Dickstein Diederik\u00a0P. Kingma Abhishek Kumar Stefano Ermon and Ben Poole. 2021. Score-Based Generative Modeling through Stochastic Differential Equations. In ICLR."},{"key":"e_1_3_2_1_71_1","unstructured":"Yaniv Taigman Adam Polyak and Lior Wolf. 2016. Unsupervised cross-domain image generation. arXiv preprint arXiv:1611.02200(2016). Yaniv Taigman Adam Polyak and Lior Wolf. 2016. Unsupervised cross-domain image generation. 
arXiv preprint arXiv:1611.02200(2016)."},{"key":"e_1_3_2_1_72_1","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision. 10521\u201310530","author":"Teterwak Piotr","year":"2019","unstructured":"Piotr Teterwak , Aaron Sarna , Dilip Krishnan , Aaron Maschinot , David Belanger , Ce Liu , and William\u00a0 T Freeman . 2019 . Boundless: Generative adversarial networks for image extension . In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 10521\u201310530 . Piotr Teterwak, Aaron Sarna, Dilip Krishnan, Aaron Maschinot, David Belanger, Ce Liu, and William\u00a0T Freeman. 2019. Boundless: Generative adversarial networks for image extension. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 10521\u201310530."},{"key":"e_1_3_2_1_73_1","volume-title":"NVAE: A Deep Hierarchical Variational Autoencoder. In NeurIPS.","author":"Vahdat Arash","year":"2020","unstructured":"Arash Vahdat and Jan Kautz . 2020 . NVAE: A Deep Hierarchical Variational Autoencoder. In NeurIPS. Arash Vahdat and Jan Kautz. 2020. NVAE: A Deep Hierarchical Variational Autoencoder. In NeurIPS."},{"key":"e_1_3_2_1_74_1","unstructured":"Arash Vahdat Karsten Kreis and Jan Kautz. 2021. Score-based Generative Modeling in Latent Space. arXiv preprint arXiv:2106.05931(2021). Arash Vahdat Karsten Kreis and Jan Kautz. 2021. Score-based Generative Modeling in Latent Space. arXiv preprint arXiv:2106.05931(2021)."},{"key":"e_1_3_2_1_75_1","unstructured":"Aaron van\u00a0den Oord Nal Kalchbrenner Oriol Vinyals Lasse Espeholt Alex Graves and Koray Kavukcuoglu. 2016. Conditional image generation with PixelCNN decoders. In NIPS. 4790\u20134798. Aaron van\u00a0den Oord Nal Kalchbrenner Oriol Vinyals Lasse Espeholt Alex Graves and Koray Kavukcuoglu. 2016. Conditional image generation with PixelCNN decoders. In NIPS. 
4790\u20134798."},{"key":"e_1_3_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01270"},{"key":"e_1_3_2_1_77_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N. Gomez Lukasz Kaiser and Illia Polosukhin. 2017. Attention Is All You Need. In NIPS. Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N. Gomez Lukasz Kaiser and Illia Polosukhin. 2017. Attention Is All You Need. In NIPS."},{"key":"e_1_3_2_1_78_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661229.2661278"},{"key":"e_1_3_2_1_79_1","volume-title":"Wide-Context Semantic Image Extrapolation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1399\u20131408","author":"Wang Yi","year":"2019","unstructured":"Yi Wang , Xin Tao , Xiaoyong Shen , and Jiaya Jia . 2019 a. Wide-Context Semantic Image Extrapolation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1399\u20131408 . Yi Wang, Xin Tao, Xiaoyong Shen, and Jiaya Jia. 2019a. Wide-Context Semantic Image Extrapolation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1399\u20131408."},{"key":"e_1_3_2_1_80_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00149"},{"key":"e_1_3_2_1_81_1","unstructured":"Dingdong Yang Seunghoon Hong Yunseok Jang Tianchen Zhao and Honglak Lee. 2019b. Diversity-sensitive conditional generative adversarial networks. arXiv preprint arXiv:1901.09024(2019). Dingdong Yang Seunghoon Hong Yunseok Jang Tianchen Zhao and Honglak Lee. 2019b. Diversity-sensitive conditional generative adversarial networks. 
arXiv preprint arXiv:1901.09024(2019)."},{"key":"e_1_3_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.01066"},{"key":"e_1_3_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00753"},{"key":"e_1_3_2_1_84_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00577"},{"key":"e_1_3_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00457"},{"key":"e_1_3_2_1_86_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00259"},{"key":"e_1_3_2_1_87_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46487-9_40"},{"key":"e_1_3_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00578"},{"key":"e_1_3_2_1_89_1","unstructured":"Shengyu Zhao Jonathan Cui Yilun Sheng Yue Dong Xiao Liang Eric\u00a0I Chang and Yan Xu. 2021. Large scale image completion via co-modulated generative adversarial networks. arXiv preprint arXiv:2103.10428(2021). Shengyu Zhao Jonathan Cui Yilun Sheng Yue Dong Xiao Liang Eric\u00a0I Chang and Yan Xu. 2021. Large scale image completion via co-modulated generative adversarial networks. arXiv preprint arXiv:2103.10428(2021)."},{"key":"e_1_3_2_1_90_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00153"},{"key":"e_1_3_2_1_91_1","volume-title":"Places: A 10 million Image Database for Scene Recognition","author":"Zhou Bolei","year":"2017","unstructured":"Bolei Zhou , Agata Lapedriza , Aditya Khosla , Aude Oliva , and Antonio Torralba . 2017 . Places: A 10 million Image Database for Scene Recognition . IEEE Transactions on Pattern Analysis and Machine Intelligence ( 2017). Bolei Zhou, Agata Lapedriza, Aditya Khosla, Aude Oliva, and Antonio Torralba. 2017. Places: A 10 million Image Database for Scene Recognition. 
IEEE Transactions on Pattern Analysis and Machine Intelligence (2017)."},{"key":"e_1_3_2_1_92_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.244"},{"key":"e_1_3_2_1_93_1","unstructured":"Jun-Yan Zhu Richard Zhang Deepak Pathak Trevor Darrell Alexei\u00a0A Efros Oliver Wang and Eli Shechtman. 2017b. Multimodal Image-to-Image Translation by Enforcing Bi-Cycle Consistency. In Advances in neural information processing systems. 465\u2013476. Jun-Yan Zhu Richard Zhang Deepak Pathak Trevor Darrell Alexei\u00a0A Efros Oliver Wang and Eli Shechtman. 2017b. Multimodal Image-to-Image Translation by Enforcing Bi-Cycle Consistency. In Advances in neural information processing systems. 465\u2013476."}],"event":{"name":"SIGGRAPH '22: Special Interest Group on Computer Graphics and Interactive Techniques Conference","location":"Vancouver BC Canada","acronym":"SIGGRAPH '22","sponsor":["SIGGRAPH ACM Special Interest Group on Computer Graphics and Interactive Techniques"]},"container-title":["Special Interest Group on Computer Graphics and Interactive Techniques Conference Proceedings"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3528233.3530757","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:42Z","timestamp":1750186962000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3528233.3530757"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,7]]},"references-count":93,"alternative-id":["10.1145\/3528233.3530757","10.1145\/3528233"],"URL":"https:\/\/doi.org\/10.1145\/3528233.3530757","relation":{},"subject":[],"published":{"date-parts":[[2022,8,7]]}}}