{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,24]],"date-time":"2026-06-24T14:56:32Z","timestamp":1782312992156,"version":"3.54.5"},"reference-count":40,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2021,7,19]],"date-time":"2021-07-19T00:00:00Z","timestamp":1626652800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100003977","name":"Israel Science Foundation","doi-asserted-by":"crossref","award":["2366\/16, 2492\/20"],"award-info":[{"award-number":["2366\/16, 2492\/20"]}],"id":[{"id":"10.13039\/501100003977","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2021,8,31]]},"abstract":"<jats:p>In recent years, considerable progress has been made in the visual quality of Generative Adversarial Networks (GANs). Even so, these networks still suffer from degradation in quality for high-frequency content, stemming from a spectrally biased architecture, and similarly unfavorable loss functions. To address this issue, we present a novel general-purpose Style and WAvelet based GAN (SWAGAN) that implements progressive generation in the frequency domain. SWAGAN incorporates wavelets throughout its generator and discriminator architectures, enforcing a frequency-aware latent representation at every step of the way. This approach, designed to directly tackle the spectral bias of neural networks, yields an improvement in the ability to generate medium and high frequency content, including structures which other networks fail to learn. 
We demonstrate the advantage of our method by integrating it into the StyleGAN2 framework, and verifying that content generation in the wavelet domain leads to more realistic high-frequency content, even when trained for fewer iterations. Furthermore, we verify that our model's latent space retains the qualities that allow StyleGAN to serve as a basis for a multitude of editing tasks, and show that our frequency-aware approach also induces improved high-frequency performance in downstream tasks.<\/jats:p>","DOI":"10.1145\/3450626.3459836","type":"journal-article","created":{"date-parts":[[2021,7,20]],"date-time":"2021-07-20T00:04:26Z","timestamp":1626739466000},"page":"1-11","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":110,"title":["SWAGAN"],"prefix":"10.1145","volume":"40","author":[{"given":"Rinon","family":"Gal","sequence":"first","affiliation":[{"name":"Tel Aviv University, Israel"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Dana Cohen","family":"Hochberg","sequence":"additional","affiliation":[{"name":"Tel Aviv University, Israel"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Amit","family":"Bermano","sequence":"additional","affiliation":[{"name":"Tel Aviv University, Israel"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Daniel","family":"Cohen-Or","sequence":"additional","affiliation":[{"name":"Tel Aviv University, Israel"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2021,7,19]]},"reference":[{"key":"e_1_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00453"},{"key":"e_1_2_2_2_1","volume-title":"Infogan: Interpretable representation learning by information maximizing generative adversarial nets. In Advances in neural information processing systems. 
2172--2180.","author":"Chen Xi","year":"2016","unstructured":"Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, and Pieter Abbeel. 2016. Infogan: Interpretable representation learning by information maximizing generative adversarial nets. In Advances in neural information processing systems. 2172--2180."},{"key":"e_1_2_2_3_1","volume-title":"SSD-GAN: Measuring the Realness in the Spatial and Spectral Domains. arXiv preprint arXiv:2012.05535","author":"Chen Yuanqi","year":"2020","unstructured":"Yuanqi Chen, Ge Li, Cece Jin, Shan Liu, and Thomas Li. 2020. SSD-GAN: Measuring the Realness in the Spatial and Spectral Domains. arXiv preprint arXiv:2012.05535 (2020)."},{"key":"e_1_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/18.57199"},{"key":"e_1_2_2_5_1","doi-asserted-by":"crossref","unstructured":"Ingrid Daubechies. 1992. Ten lectures on wavelets. SIAM.","DOI":"10.1137\/1.9781611970104"},{"key":"e_1_2_2_6_1","volume-title":"Deep generative image models using a laplacian pyramid of adversarial networks. arXiv preprint arXiv:1506.05751","author":"Denton Emily","year":"2015","unstructured":"Emily Denton, Soumith Chintala, Arthur Szlam, and Rob Fergus. 2015. Deep generative image models using a laplacian pyramid of adversarial networks. 
arXiv preprint arXiv:1506.05751 (2015)."},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00791"},{"key":"e_1_2_2_8_1","volume-title":"Fourier Spectrum Discrepancies in Deep Network Generated Images. arXiv preprint arXiv:1911.06465","author":"Dzanic Tarik","year":"2019","unstructured":"Tarik Dzanic, Karan Shah, and Freddie Witherden. 2019. Fourier Spectrum Discrepancies in Deep Network Generated Images. arXiv preprint arXiv:1911.06465 (2019)."},{"key":"e_1_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2016.7532596"},{"key":"e_1_2_2_10_1","volume-title":"Advances in Neural Information Processing Systems 27","author":"Goodfellow Ian","unstructured":"Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative Adversarial Nets. In Advances in Neural Information Processing Systems 27, Z. Ghahramani, M. Welling, C. Cortes, N. D. Lawrence, and K. Q. Weinberger (Eds.). Curran Associates, Inc., 2672--2680. 
http:\/\/papers.nips.cc\/paper\/5423-generative-adversarial-nets.pdf"},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.187"},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-019-01154-8"},{"key":"e_1_2_2_13_1","volume-title":"A deep convolutional neural network using directional wavelets for low-dose X-ray CT reconstruction. Medical physics 44, 10","author":"Kang Eunhee","year":"2017","unstructured":"Eunhee Kang, Junhong Min, and Jong Chul Ye. 2017. A deep convolutional neural network using directional wavelets for low-dose X-ray CT reconstruction. Medical physics 44, 10 (2017), e360--e375."},{"key":"e_1_2_2_14_1","volume-title":"MSG-GAN: multi-scale gradient GAN for stable image synthesis. arXiv preprint arXiv:1903.06048","author":"Karnewar Animesh","year":"2019","unstructured":"Animesh Karnewar and Oliver Wang. 2019. MSG-GAN: multi-scale gradient GAN for stable image synthesis. arXiv preprint arXiv:1903.06048 (2019)."},{"key":"e_1_2_2_15_1","volume-title":"Progressive growing of gans for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196","author":"Karras Tero","year":"2017","unstructured":"Tero Karras, Timo Aila, Samuli Laine, and Jaakko Lehtinen. 2017. Progressive growing of gans for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196 (2017)."},{"key":"e_1_2_2_16_1","volume-title":"Proc. 
NeurIPS.","author":"Karras Tero","year":"2020","unstructured":"Tero Karras, Miika Aittala, Janne Hellsten, Samuli Laine, Jaakko Lehtinen, and Timo Aila. 2020a. Training Generative Adversarial Networks with Limited Data. In Proc. NeurIPS."},{"key":"e_1_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00453"},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00813"},{"key":"e_1_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.19"},{"key":"e_1_2_2_20_1","volume-title":"Wavelet-Based Dual-Branch Network for Image Demoir\u00e9ing. arXiv preprint arXiv:2007.07173","author":"Liu Lin","year":"2020","unstructured":"Lin Liu, Jianzhuang Liu, Shanxin Yuan, Gregory Slabaugh, Ales Leonardis, Wengang Zhou, and Qi Tian. 2020. Wavelet-Based Dual-Branch Network for Image Demoir\u00e9ing. arXiv preprint arXiv:2007.07173 (2020)."},{"key":"e_1_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2921451"},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.01215"},{"key":"e_1_2_2_23_1","unstructured":"Ziwei Liu, Ping Luo, Xiaogang Wang, and Xiaoou Tang. 2018. Large-scale CelebFaces Attributes (CelebA) dataset. (2018)."},{"key":"e_1_2_2_24_1","volume-title":"Nerf: Representing scenes as neural radiance fields for view synthesis. 
arXiv preprint arXiv:2003.08934","author":"Mildenhall Ben","year":"2020","unstructured":"Ben Mildenhall, Pratul P Srinivasan, Matthew Tancik, Jonathan T Barron, Ravi Ramamoorthi, and Ren Ng. 2020. Nerf: Representing scenes as neural radiance fields for view synthesis. arXiv preprint arXiv:2003.08934 (2020)."},{"key":"e_1_2_2_25_1","volume-title":"Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784","author":"Mirza Mehdi","year":"2014","unstructured":"Mehdi Mirza and Simon Osindero. 2014. Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784 (2014)."},{"key":"e_1_2_2_26_1","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision. 7588--7597","author":"Nguyen-Phuoc Thu","year":"2019","unstructured":"Thu Nguyen-Phuoc, Chuan Li, Lucas Theis, Christian Richardt, and Yong-Liang Yang. 2019. Hologan: Unsupervised learning of 3d representations from natural images. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 7588--7597."},{"key":"e_1_2_2_27_1","volume-title":"Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434","author":"Radford Alec","year":"2015","unstructured":"Alec Radford, Luke Metz, and Soumith Chintala. 2015. 
Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434 (2015)."},{"key":"e_1_2_2_28_1","volume-title":"International Conference on Machine Learning. PMLR, 5301--5310","author":"Rahaman Nasim","year":"2019","unstructured":"Nasim Rahaman, Aristide Baratin, Devansh Arpit, Felix Draxler, Min Lin, Fred Hamprecht, Yoshua Bengio, and Aaron Courville. 2019. On the spectral bias of neural networks. In International Conference on Machine Learning. PMLR, 5301--5310."},{"key":"e_1_2_2_29_1","volume-title":"Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation. arXiv preprint arXiv:2008.00951","author":"Richardson Elad","year":"2020","unstructured":"Elad Richardson, Yuval Alaluf, Or Patashnik, Yotam Nitzan, Yaniv Azar, Stav Shapiro, and Daniel Cohen-Or. 2020. Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation. arXiv preprint arXiv:2008.00951 (2020)."},{"key":"e_1_2_2_30_1","doi-asserted-by":"crossref","unstructured":"Yujun Shen, Jinjin Gu, Xiaoou Tang, and Bolei Zhou. 2020. Interpreting the Latent Space of GANs for Semantic Face Editing. 
In CVPR.","DOI":"10.1109\/CVPR42600.2020.00926"},{"key":"e_1_2_2_31_1","unstructured":"Gage Skidmore. 2016. Gal Gadot image by Gage Skidmore [CC BY-SA 3.0] via Wikimedia Commons - https:\/\/commons.wikimedia.org\/w\/index.php?curid=50402815. (2016)."},{"key":"e_1_2_2_32_1","volume-title":"Fourier features let networks learn high frequency functions in low dimensional domains. arXiv preprint arXiv:2006.10739","author":"Tancik Matthew","year":"2020","unstructured":"Matthew Tancik, Pratul P Srinivasan, Ben Mildenhall, Sara Fridovich-Keil, Nithin Raghavan, Utkarsh Singhal, Ravi Ramamoorthi, Jonathan T Barron, and Ren Ng. 2020. Fourier features let networks learn high frequency functions in low dimensional domains. arXiv preprint arXiv:2006.10739 (2020)."},{"key":"e_1_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/3306346.3323035"},{"key":"e_1_2_2_34_1","volume-title":"Designing an Encoder for StyleGAN Image Manipulation. arXiv preprint arXiv:2102.02766","author":"Tov Omer","year":"2021","unstructured":"Omer Tov, Yuval Alaluf, Yotam Nitzan, Or Patashnik, and Daniel Cohen-Or. 2021. Designing an Encoder for StyleGAN Image Manipulation. arXiv preprint arXiv:2102.02766 (2021)."},{"key":"e_1_2_2_35_1","volume-title":"Multi-level Wavelet-based Generative Adversarial Network for Perceptual Quality Enhancement of Compressed Video. 
arXiv preprint arXiv:2008.00499","author":"Wang Jianyi","year":"2020","unstructured":"Jianyi Wang, Xin Deng, Mai Xu, Congyong Chen, and Yuhang Song. 2020. Multi-level Wavelet-based Generative Adversarial Network for Perceptual Quality Enhancement of Compressed Video. arXiv preprint arXiv:2008.00499 (2020)."},{"key":"e_1_2_2_36_1","volume-title":"International Conference on Learning Representations.","author":"Williams Travis","year":"2018","unstructured":"Travis Williams and Robert Li. 2018. Wavelet pooling for convolutional neural networks. In International Conference on Learning Representations."},{"key":"e_1_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00913"},{"key":"e_1_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3357254.3358600"},{"key":"e_1_2_2_39_1","volume-title":"Image Super-Resolution Using a Wavelet-based Generative Adversarial Network. arXiv preprint arXiv:1907.10213","author":"Zhang Qi","year":"2019","unstructured":"Qi Zhang, Huafeng Wang, and Sichen Yang. 2019a. Image Super-Resolution Using a Wavelet-based Generative Adversarial Network. arXiv preprint arXiv:1907.10213 (2019)."},{"key":"e_1_2_2_40_1","first-page":"165","article-title":"Joint sub-bands learning with clique structures for wavelet domain super-resolution","volume":"31","author":"Zhong Zhisheng","year":"2018","unstructured":"Zhisheng Zhong, Tiancheng Shen, Yibo Yang, Zhouchen Lin, and Chao Zhang. 2018. 
Joint sub-bands learning with clique structures for wavelet domain super-resolution. Advances in Neural Information Processing Systems 31 (2018), 165--175.","journal-title":"Advances in Neural Information Processing Systems"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3450626.3459836","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3450626.3459836","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:17:20Z","timestamp":1750191440000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3450626.3459836"}},"subtitle":["a style-based wavelet-driven generative model"],"short-title":[],"issued":{"date-parts":[[2021,7,19]]},"references-count":40,"aliases":["10.1145\/3476576.3476707","10.1145\/3476576.3476707"],"journal-issue":{"issue":"4","published-print":{"date-parts":[[2021,8,31]]}},"alternative-id":["10.1145\/3450626.3459836"],"URL":"https:\/\/doi.org\/10.1145\/3450626.3459836","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,7,19]]},"assertion":[{"value":"2021-07-19","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}