{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T14:49:00Z","timestamp":1774968540021,"version":"3.50.1"},"reference-count":47,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2024,10,31]],"date-time":"2024-10-31T00:00:00Z","timestamp":1730332800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Intell. Syst. Technol."],"published-print":{"date-parts":[[2024,10,31]]},"abstract":"<jats:p>Text Style Transfer (TST) is a relevant branch of natural language processing that aims to control the style attributes of a piece of text while preserving its original content. To address TST in the absence of parallel data, Cycle-consistent Generative Adversarial Networks (CycleGANs) have recently emerged as promising solutions. Existing CycleGAN-based TST approaches suffer from the following limitations: (1) They apply self-supervision, based on the cycle-consistency principle, in the latent space. This approach turns out to be less robust to mixed-style inputs, i.e., when the source text is partly in the original and partly in the target style; (2) Generators and discriminators rely on recurrent networks, which are exposed to known issues with long-term text dependencies; (3) The target style is weakly enforced, as the discriminator distinguishes real from fake sentences without explicitly accounting for the generated text's style. We propose a new CycleGAN-based TST approach that applies self-supervision directly at the sequence level to effectively handle mixed-style inputs and employs Transformers to leverage the attention mechanism for both text encoding and decoding. We also employ a pre-trained style classifier to guide the generation of text in the target style while maintaining the original content's meaning. The experimental results achieved on the formality and sentiment transfer tasks show that our approach outperforms existing ones, both CycleGAN-based and not (including an open-source Large Language Model), on benchmark data and shows better robustness to mixed-style inputs.<\/jats:p>","DOI":"10.1145\/3678179","type":"journal-article","created":{"date-parts":[[2024,7,18]],"date-time":"2024-07-18T13:11:55Z","timestamp":1721308315000},"page":"1-38","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":11,"title":["Self-supervised Text Style Transfer Using Cycle-Consistent Adversarial Networks"],"prefix":"10.1145","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8838-064X","authenticated-orcid":false,"given":"Moreno","family":"La Quatra","sequence":"first","affiliation":[{"name":"Kore University of Enna, Enna, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-1744-6674","authenticated-orcid":false,"given":"Giuseppe","family":"Gallipoli","sequence":"additional","affiliation":[{"name":"Politecnico di Torino, Turin, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7185-5247","authenticated-orcid":false,"given":"Luca","family":"Cagliero","sequence":"additional","affiliation":[{"name":"Politecnico di Torino, Turin, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,11,7]]},"reference":[{"key":"e_1_3_3_2_2","first-page":"4671","volume-title":"Proceedings of the Annual Conference on Neural Information Processing Systems 2018 (NeurIPS \u201918)","volume":"31","author":"Chen Liqun","year":"2018","unstructured":"Liqun Chen, Shuyang Dai, Chenyang Tao, Haichao Zhang, Zhe Gan, Dinghan Shen, Yizhe Zhang, Guoyin Wang, Ruiyi Zhang, and Lawrence Carin. 2018. Adversarial Text Generation via Feature-Mover's Distance. In Proceedings of the Annual Conference on Neural Information Processing Systems 2018 (NeurIPS \u201918), Vol. 31. Curran Associates, Inc., 4671\u20134682. https:\/\/proceedings.neurips.cc\/paper\/2018\/hash\/074177d3eb6371e32c16c55a3b8f706b-Abstract.html"},{"issue":"70","key":"e_1_3_3_3_2","first-page":"1","article-title":"Scaling Instruction-Finetuned Language Models","volume":"25","author":"Chung Hyung Won","year":"2024","unstructured":"Hyung Won Chung, Le Hou, Shayne Longpre, Barret Zoph, Yi Tay, William Fedus, Yunxuan Li, Xuezhi Wang, Mostafa Dehghani, Siddhartha Brahma, Albert Webson, Shixiang Shane Gu, Zhuyun Dai, Mirac Suzgun, Xinyun Chen, Aakanksha Chowdhery, Alex Castro-Ros, Marie Pellat, Kevin Robinson, Dasha Valter, Sharan Narang, Gaurav Mishra, Adams Yu, Vincent Zhao, Yanping Huang, Andrew Dai, Hongkun Yu, Slav Petrov, Ed H. Chi, Jeff Dean, Jacob Devlin, Adam Roberts, Denny Zhou, Quoc V. Le, and Jason Wei. 2024. Scaling Instruction-Finetuned Language Models. Journal of Machine Learning Research 25, 70 (2024), 1\u201353.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_3_4_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1601"},{"key":"e_1_3_3_5_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-1423"},{"key":"e_1_3_3_6_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.11330"},{"key":"e_1_3_3_7_2","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"27","author":"Goodfellow Ian","year":"2014","unstructured":"Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative Adversarial Nets. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 27. Curran Associates, Inc. Retrieved from https:\/\/proceedings.neurips.cc\/paper\/2014\/file\/5ca3e9b122f61f8f06494c97b1afccf3-Paper.pdf"},{"key":"e_1_3_3_8_2","volume-title":"Proceedings of the 8th International Conference on Learning Representations (ICLR \u201920)","author":"He Junxian","year":"2020","unstructured":"Junxian He, Xinyi Wang, Graham Neubig, and Taylor Berg-Kirkpatrick. 2020. A Probabilistic Formulation of Unsupervised Text Style Transfer. In Proceedings of the 8th International Conference on Learning Representations (ICLR \u201920). Addis Abeba. DOI: https:\/\/arxiv.org\/abs\/2002.03912"},{"key":"e_1_3_3_9_2","unstructured":"Zhiting Hu Zichao Yang Xiaodan Liang Ruslan Salakhutdinov and Eric P. Xing. 2018. Toward Controlled Generation of Text. arXiv:1703.00955. Retrieved from https:\/\/arxiv.org\/abs\/1703.00955"},{"key":"e_1_3_3_10_2","doi-asserted-by":"publisher","DOI":"10.1145\/3544903.3544906"},{"key":"e_1_3_3_11_2","first-page":"24941","volume-title":"Proceedings of the Annual Conference on Neural Information Processing Systems 2021 (NeurIPS \u201921)","author":"Hu Zhiting","year":"2021","unstructured":"Zhiting Hu and Li Erran Li. 2021. A Causal Lens for Controllable Text Generation. In Proceedings of the Annual Conference on Neural Information Processing Systems 2021 (NeurIPS \u201921), 24941\u201324955. Retrieved from https:\/\/proceedings.neurips.cc\/paper\/2021\/hash\/d0f5edad9ac19abed9e235c0fe0aa59f-Abstract.html"},{"key":"e_1_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.findings-acl.138"},{"key":"e_1_3_3_13_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.coling-main.201"},{"key":"e_1_3_3_14_2","doi-asserted-by":"publisher","DOI":"10.1162\/coli_a_00426"},{"key":"e_1_3_3_15_2","unstructured":"Nitish Shirish Keskar Bryan McCann Lav R. Varshney Caiming Xiong and Richard Socher. 2019. CTRL: A conditional transformer language model for controllable generation. arXiv:1909.05858. Retrieved from http:\/\/arxiv.org\/abs\/1909.05858"},{"key":"e_1_3_3_16_2","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1181"},{"key":"e_1_3_3_17_2","volume-title":"Proceedings of the 7th International Conference on Learning Representations (ICLR \u201919)","author":"Lample Guillaume","year":"2019","unstructured":"Guillaume Lample, Sandeep Subramanian, Eric Smith, Ludovic Denoyer, Marc\u2019Aurelio Ranzato, and Y-Lan Boureau. 2019. Multiple-Attribute Text Rewriting. In Proceedings of the 7th International Conference on Learning Representations (ICLR \u201919). OpenReview.net. Retrieved from https:\/\/openreview.net\/forum?id=H1g2NhC5KQ"},{"key":"e_1_3_3_18_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.703"},{"key":"e_1_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N18-1169"},{"key":"e_1_3_3_20_2","unstructured":"Dayiheng Liu Jie Fu Yidan Zhang Chris Pal and Jiancheng Lv. 2019a. Revision in Continuous Space: Fine-Grained Control of Text Style Transfer. arXiv:1905.12304. Retrieved from https:\/\/arxiv.org\/abs\/1905.12304"},{"key":"e_1_3_3_21_2","unstructured":"Yinhan Liu Myle Ott Naman Goyal Jingfei Du Mandar Joshi Danqi Chen Omer Levy Mike Lewis Luke Zettlemoyer and Veselin Stoyanov. 2019b. RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv:1907.11692. Retrieved from https:\/\/arxiv.org\/abs\/1907.11692"},{"key":"e_1_3_3_22_2","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.8268839"},{"key":"e_1_3_3_23_2","volume-title":"Proceedings of the 7th International Conference on Learning Representations (ICLR \u201919)","author":"Loshchilov Ilya","year":"2019","unstructured":"Ilya Loshchilov and Frank Hutter. 2019. Decoupled Weight Decay Regularization. In Proceedings of the 7th International Conference on Learning Representations (ICLR \u201919). OpenReview.net. DOI: https:\/\/openreview.net\/forum?id=Bkg6RiCqY7"},{"key":"e_1_3_3_24_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2019\/711"},{"key":"e_1_3_3_25_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acl-long.31"},{"key":"e_1_3_3_26_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W18-6319"},{"key":"e_1_3_3_27_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1080"},{"key":"e_1_3_3_28_2","first-page":"140","article-title":"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer","volume":"21","author":"Raffel Colin","year":"2020","unstructured":"Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J. Liu. 2020. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. Journal of Machine Learning Research 21, 140 (2020), 1\u201367. DOI: http:\/\/jmlr.org\/papers\/v21\/20-074.html","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_3_29_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N18-1012"},{"key":"e_1_3_3_30_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.findings-acl.344"},{"key":"e_1_3_3_31_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acl-short.94"},{"key":"e_1_3_3_32_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-45439-5_36"},{"key":"e_1_3_3_33_2","unstructured":"Victor Sanh Lysandre Debut Julien Chaumond and Thomas Wolf. 2019. DistilBERT a distilled version of BERT: Smaller faster cheaper and lighter. arXiv:1910.01108. arXiv:1910.01108. Retrieved from https:\/\/arxiv.org\/abs\/1910.01108"},{"key":"e_1_3_3_34_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1499"},{"key":"e_1_3_3_35_2","doi-asserted-by":"publisher","DOI":"10.5555\/3295222.3295427"},{"key":"e_1_3_3_36_2","doi-asserted-by":"publisher","DOI":"10.1145\/3448733"},{"key":"e_1_3_3_37_2","doi-asserted-by":"publisher","DOI":"10.1109\/TAI.2021.3115992"},{"key":"e_1_3_3_38_2","unstructured":"Hugo Touvron Louis Martin Kevin Stone Peter Albert Amjad Almahairi Yasmine Babaei Nikolay Bashlykov Soumya Batra Prajjwal Bhargava Shruti Bhosale Dan Bikel Lukas Blecher Cristian Canton Ferrer Moya Chen Guillem Cucurull David Esiobu Jude Fernandes Jeremy Fu Wenyin Fu Brian Fuller Cynthia Gao Vedanuj Goswami Naman Goyal Anthony Hartshorn Saghar Hosseini Rui Hou Hakan Inan Marcin Kardas Viktor Kerkez Madian Khabsa Isabel Kloumann Artem Korenev Punit Singh Koura Marie-Anne Lachaux Thibaut Lavril Jenya Lee Diana Liskovich Yinghai Lu Yuning Mao Xavier Martinet Todor Mihaylov Pushkar Mishra Igor Molybog Yixin Nie Andrew Poulton Jeremy Reizenstein Rashi Rungta Kalyan Saladi Alan Schelten Ruan Silva Eric Michael Smith Ranjan Subramanian Xiaoqing Ellen Tan Binh Tang Ross Taylor Adina Williams Jian Xiang Kuan Puxin Xu Zheng Yan Iliyan Zarov Yuchen Zhang Angela Fan Melanie Kambadur Sharan Narang Aurelien Rodriguez Robert Stojnic Sergey Edunov and Thomas Scialom. 2023. Llama 2: Open foundation and fine-tuned chat models. arXiv:2307.09288. arXiv:2307.09288. Retrieved from https:\/\/arxiv.org\/abs\/2307.09288"},{"key":"e_1_3_3_39_2","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"30","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is All you Need. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 30. Curran Associates, Inc. Retrieved from https:\/\/proceedings.neurips.cc\/paper\/2017\/file\/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf"},{"key":"e_1_3_3_40_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1365"},{"key":"e_1_3_3_41_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-demos.6"},{"key":"e_1_3_3_42_2","first-page":"433","volume-title":"Proceedings of the 35th Conference on Uncertainty in Artificial Intelligence (UAI \u201919)","volume":"115","author":"Xie Yujia","year":"2019","unstructured":"Yujia Xie, Xiangfeng Wang, Ruijia Wang, and Hongyuan Zha. 2019. A Fast Proximal Point Method for Computing Exact Wasserstein Distance. In Proceedings of the 35th Conference on Uncertainty in Artificial Intelligence (UAI \u201919), Vol. 115, AUAI Press, 433\u2013453. DOI: http:\/\/proceedings.mlr.press\/v115\/xie20b.html"},{"key":"e_1_3_3_43_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1090"},{"key":"e_1_3_3_44_2","first-page":"2899","volume-title":"Proceedings of COLING 2012. The COLING 2012 Organizing Committee","author":"Xu Wei","year":"2012","unstructured":"Wei Xu, Alan Ritter, Bill Dolan, Ralph Grishman, and Colin Cherry. 2012. Paraphrasing for Style. In Proceedings of COLING 2012. The COLING 2012 Organizing Committee, 2899\u20132914. Retrieved from https:\/\/aclanthology.org\/C12-1177"},{"key":"e_1_3_3_45_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1138"},{"key":"e_1_3_3_46_2","unstructured":"Zhirui Zhang Shuo Ren Shujie Liu Jianyong Wang Peng Chen Mu Li Ming Zhou and Enhong Chen. 2018a. Style Transfer as Unsupervised Machine Translation. arXiv:1808.07894. Retrieved from https:\/\/arxiv.org\/abs\/1808.07894"},{"key":"e_1_3_3_47_2","unstructured":"Yanpeng Zhao Wei Bi Deng Cai Xiaojiang Liu Kewei Tu and Shuming Shi. 2018. Language Style Transfer from Sentences with Arbitrary Unknown Styles. arXiv:1808.04071. Retrieved from http:\/\/arxiv.org\/abs\/1808.04071"},{"key":"e_1_3_3_48_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.244"}],"container-title":["ACM Transactions on Intelligent Systems and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3678179","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3678179","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T22:54:08Z","timestamp":1750287248000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3678179"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,10,31]]},"references-count":47,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2024,10,31]]}},"alternative-id":["10.1145\/3678179"],"URL":"https:\/\/doi.org\/10.1145\/3678179","relation":{},"ISSN":["2157-6904","2157-6912"],"issn-type":[{"value":"2157-6904","type":"print"},{"value":"2157-6912","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,10,31]]},"assertion":[{"value":"2023-06-23","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-06-26","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-11-07","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}