{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T14:23:14Z","timestamp":1775830994219,"version":"3.50.1"},"reference-count":49,"publisher":"Association for Computing Machinery (ACM)","issue":"6","license":[{"start":{"date-parts":[[2020,10,13]],"date-time":"2020-10-13T00:00:00Z","timestamp":1602547200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Nature Science Foundation of China","doi-asserted-by":"crossref","award":["U1636201"],"award-info":[{"award-number":["U1636201"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2020,11,30]]},"abstract":"<jats:p>In this article, conditional-transforming variational autoencoders (CTVAEs) are proposed for generating diverse short text conversations. In conditional variational autoencoders (CVAEs), the prior distribution of latent variable z follows a multivariate Gaussian distribution with mean and variance modulated by the input conditions. Previous work found that this distribution tended to become condition-independent in practical applications. Thus, this article designs CTVAEs to enhance the influence of conditions in CVAEs. In a CTVAE model, the latent variable z is sampled by performing a non-linear transformation on the combination of the input conditions and the samples from a condition-independent prior distribution N (0, I). In our experiments using a Chinese Sina Weibo dataset, the CTVAE model derives z samples for decoding with better condition-dependency than that of the CVAE model. 
The earth mover\u2019s distance (EMD) between the distributions of the latent variable z at the training stage, and the testing stage is also reduced by using the CTVAE model. In subjective preference tests, our proposed CTVAE model performs significantly better than CVAE and sequence-to-sequence (Seq2Seq) models on generating diverse, informative, and topic-relevant responses.<\/jats:p>","DOI":"10.1145\/3402884","type":"journal-article","created":{"date-parts":[[2020,10,13]],"date-time":"2020-10-13T11:47:34Z","timestamp":1602589654000},"page":"1-13","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Condition-Transforming Variational Autoencoder for Generating Diverse Short Text Conversations"],"prefix":"10.1145","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9800-3271","authenticated-orcid":false,"given":"Yu-Ping","family":"Ruan","sequence":"first","affiliation":[{"name":"National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei, P.R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhen-Hua","family":"Ling","sequence":"additional","affiliation":[{"name":"National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei, P.R. 
China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiaodan","family":"Zhu","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, Queen\u2019s University, Kingston, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2020,10,13]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the 3rd International Conference on Learning Representations, ICLR 2015","author":"Bahdanau Dzmitry","year":"2015","unstructured":"Dzmitry Bahdanau , Kyunghyun Cho , and Yoshua Bengio . 2015 . Neural machine translation by jointly learning to align and translate . In Proceedings of the 3rd International Conference on Learning Representations, ICLR 2015 , San Diego, CA, May 7\u20139 , 2015. http:\/\/arxiv.org\/abs\/1409.0473. Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural machine translation by jointly learning to align and translate. In Proceedings of the 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, May 7\u20139, 2015. http:\/\/arxiv.org\/abs\/1409.0473."},{"key":"e_1_2_1_2_1","volume-title":"Learning end-to-end goal-oriented dialog. CoRR abs\/1605.07683","author":"Bordes Antoine","year":"2016","unstructured":"Antoine Bordes and Jason Weston . 2016. Learning end-to-end goal-oriented dialog. CoRR abs\/1605.07683 ( 2016 ). arxiv:1605.07683 http:\/\/arxiv.org\/abs\/1605.07683. Antoine Bordes and Jason Weston. 2016. Learning end-to-end goal-oriented dialog. CoRR abs\/1605.07683 (2016). arxiv:1605.07683 http:\/\/arxiv.org\/abs\/1605.07683."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/K16-1002"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1152"},{"key":"e_1_2_1_5_1","volume-title":"BERT: Pre-training of deep bidirectional transformers for language understanding. 
arXiv preprint arXiv:1810.04805","author":"Devlin Jacob","year":"2018","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2018 . BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018). Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)."},{"key":"e_1_2_1_6_1","unstructured":"Li Dong Nan Yang Wenhui Wang Furu Wei Xiaodong Liu Yu Wang Jianfeng Gao Ming Zhou and Hsiao-Wuen Hon. 2019. Unified language model pre-training for natural language understanding and generation. In Advances in Neural Information Processing Systems. 13042--13054.  Li Dong Nan Yang Wenhui Wang Furu Wei Xiaodong Liu Yu Wang Jianfeng Gao Ming Zhou and Hsiao-Wuen Hon. 2019. Unified language model pre-training for natural language understanding and generation. In Advances in Neural Information Processing Systems. 13042--13054."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1354"},{"key":"e_1_2_1_8_1","volume-title":"Proceedings of the 32nd AAAI Conference on Artificial Intelligence.","author":"Ghazvininejad Marjan","year":"2018","unstructured":"Marjan Ghazvininejad , Chris Brockett , Ming-Wei Chang , Bill Dolan , Jianfeng Gao , Wen-tau Yih, and Michel Galley . 2018 . A knowledge-grounded neural conversation model . In Proceedings of the 32nd AAAI Conference on Artificial Intelligence. Marjan Ghazvininejad, Chris Brockett, Ming-Wei Chang, Bill Dolan, Jianfeng Gao, Wen-tau Yih, and Michel Galley. 2018. A knowledge-grounded neural conversation model. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence."},{"key":"e_1_2_1_9_1","unstructured":"Ian Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville and Yoshua Bengio. 2014. Generative adversarial nets. 
In Advances in Neural Information Processing Systems. 2672--2680.  Ian Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in Neural Information Processing Systems. 2672--2680."},{"key":"e_1_2_1_10_1","volume-title":"Proceedings of the International Conference on Machine Learning. 1587--1596","author":"Hu Zhiting","year":"2017","unstructured":"Zhiting Hu , Zichao Yang , Xiaodan Liang , Ruslan Salakhutdinov , and Eric P Xing . 2017 . Toward controlled generation of text . In Proceedings of the International Conference on Machine Learning. 1587--1596 . Zhiting Hu, Zichao Yang, Xiaodan Liang, Ruslan Salakhutdinov, and Eric P Xing. 2017. Toward controlled generation of text. In Proceedings of the International Conference on Machine Learning. 1587--1596."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1139"},{"key":"e_1_2_1_12_1","volume-title":"Proceedings of the 3rd International Conference on Learning Representations, ICLR 2015","author":"Diederik","year":"2015","unstructured":"Diederik P. Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization . In Proceedings of the 3rd International Conference on Learning Representations, ICLR 2015 , San Diego, CA, May 7--9 , 2015 . Diederik P. Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In Proceedings of the 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, May 7--9, 2015."},{"key":"e_1_2_1_13_1","volume-title":"Danilo Jimenez Rezende, and Max Welling","author":"Kingma Diederik P.","year":"2014","unstructured":"Diederik P. Kingma , Shakir Mohamed , Danilo Jimenez Rezende, and Max Welling . 2014 . Semi-supervised learning with deep generative models. In Advances in Neural Information Processing Systems . 3581--3589. Diederik P. Kingma, Shakir Mohamed, Danilo Jimenez Rezende, and Max Welling. 2014. 
Semi-supervised learning with deep generative models. In Advances in Neural Information Processing Systems. 3581--3589."},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of the 2nd International Conference on Learning Representations, ICLR 2014","author":"Diederik","year":"2014","unstructured":"Diederik P. Kingma and Max Welling. 2014. Auto-encoding variational bayes . In Proceedings of the 2nd International Conference on Learning Representations, ICLR 2014 , Banff, AB, Canada, April 14--16 , 2014 . http:\/\/arxiv.org\/abs\/1312.6114 Diederik P. Kingma and Max Welling. 2014. Auto-encoding variational bayes. In Proceedings of the 2nd International Conference on Learning Representations, ICLR 2014, Banff, AB, Canada, April 14--16, 2014. http:\/\/arxiv.org\/abs\/1312.6114"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2018\/631"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N16-1014"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1094"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D16-1127"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D16-1230"},{"key":"e_1_2_1_20_1","volume-title":"Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. 1412--1421","author":"Luong Thang","unstructured":"Thang Luong , Hieu Pham , and Christopher D. Manning . 2015. Effective approaches to attention-based neural machine translation . In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. 1412--1421 . Thang Luong, Hieu Pham, and Christopher D. Manning. 2015. Effective approaches to attention-based neural machine translation. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. 1412--1421."},{"key":"e_1_2_1_21_1","unstructured":"Wentao Ma Yiming Cui Nan Shao Su He Weinan Zhang Ting Liu Shijin Wang and Guoping Hu. 2019. 
TripleNet: Triple attention network for multi-turn response selection in retrieval-based chatbots. In CoNLL.  Wentao Ma Yiming Cui Nan Shao Su He Weinan Zhang Ting Liu Shijin Wang and Guoping Hu. 2019. TripleNet: Triple attention network for multi-turn response selection in retrieval-based chatbots. In CoNLL."},{"key":"e_1_2_1_22_1","first-page":"2579","article-title":"Visualizing data using t-SNE","author":"van der Maaten Laurens","year":"2008","unstructured":"Laurens van der Maaten and Geoffrey Hinton . 2008 . Visualizing data using t-SNE . Journal of Machine Learning Research 9 , Nov. (2008), 2579 -- 2605 . Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of Machine Learning Research 9, Nov. (2008), 2579--2605.","journal-title":"Journal of Machine Learning Research 9"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2010-343"},{"key":"e_1_2_1_24_1","volume-title":"Improving language understanding by generative pre-training.","author":"Radford Alec","year":"2018","unstructured":"Alec Radford , Karthik Narasimhan , Tim Salimans , and Ilya Sutskever . 2018. Improving language understanding by generative pre-training. Retrieved from https:\/\/s3-us-west-2.amazonaws.com\/openai-assets\/researchcovers\/languageunsupervised\/languageunderstandingpaper.pdf ( 2018 ). Alec Radford, Karthik Narasimhan, Tim Salimans, and Ilya Sutskever. 2018. Improving language understanding by generative pre-training. Retrieved from https:\/\/s3-us-west-2.amazonaws.com\/openai-assets\/researchcovers\/languageunsupervised\/languageunderstandingpaper.pdf (2018)."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2019.8683853"},{"key":"e_1_2_1_26_1","volume-title":"Perceptual Metrics for Image Database Navigation","author":"Rubner Yossi","unstructured":"Yossi Rubner and Carlo Tomasi . 2001. The earth mover\u2019s distance . In Perceptual Metrics for Image Database Navigation . Springer , 13--28. 
Yossi Rubner and Carlo Tomasi. 2001. The earth mover\u2019s distance. In Perceptual Metrics for Image Database Navigation. Springer, 13--28."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1026543900054"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1017\/S0269888906000944"},{"key":"e_1_2_1_29_1","volume-title":"Proceedings of the 30th AAAI Conference on Artificial Intelligence, February 12--17","author":"Serban Iulian Vlad","year":"2016","unstructured":"Iulian Vlad Serban , Alessandro Sordoni , Yoshua Bengio , Aaron C. Courville , and Joelle Pineau . 2016 . Building end-to-end dialogue systems using generative hierarchical neural network models . In Proceedings of the 30th AAAI Conference on Artificial Intelligence, February 12--17 , 2016, Phoenix, Arizona 3776--3784. Iulian Vlad Serban, Alessandro Sordoni, Yoshua Bengio, Aaron C. Courville, and Joelle Pineau. 2016. Building end-to-end dialogue systems using generative hierarchical neural network models. In Proceedings of the 30th AAAI Conference on Artificial Intelligence, February 12--17, 2016, Phoenix, Arizona 3776--3784."},{"key":"e_1_2_1_30_1","doi-asserted-by":"crossref","unstructured":"Iulian Vlad Serban Alessandro Sordoni Ryan Lowe Laurent Charlin Joelle Pineau Aaron C. Courville and Yoshua Bengio. 2017. A hierarchical latent variable encoder-decoder model for generating dialogues. In AAAI. 3295--3301.  Iulian Vlad Serban Alessandro Sordoni Ryan Lowe Laurent Charlin Joelle Pineau Aaron C. Courville and Yoshua Bengio. 2017. A hierarchical latent variable encoder-decoder model for generating dialogues. In AAAI. 3295--3301.","DOI":"10.1609\/aaai.v31i1.10983"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P15-1152"},{"key":"e_1_2_1_32_1","unstructured":"Lifeng Shang Tetsuya Sakai Zhengdong Lu Hang Li Ryuichiro Higashinaka and Yusuke Miyao. 2016. Overview of the NTCIR-12 short text conversation task. In NTCIR.  
Lifeng Shang Tetsuya Sakai Zhengdong Lu Hang Li Ryuichiro Higashinaka and Yusuke Miyao. 2016. Overview of the NTCIR-12 short text conversation task. In NTCIR."},{"key":"e_1_2_1_33_1","volume-title":"Proceedings of the 32nd AAAI Conference on Artificial Intelligence.","author":"Shen Xiaoyu","year":"2018","unstructured":"Xiaoyu Shen , Hui Su , Shuzi Niu , and Vera Demberg . 2018 . Improving variational encoder-decoders in dialogue generation . In Proceedings of the 32nd AAAI Conference on Artificial Intelligence. Xiaoyu Shen, Hui Su, Shuzi Niu, and Vera Demberg. 2018. Improving variational encoder-decoders in dialogue generation. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence."},{"key":"e_1_2_1_34_1","unstructured":"Kihyuk Sohn Honglak Lee and Xinchen Yan. 2015. Learning structured output representation using deep conditional generative models. In Advances in Neural Information Processing Systems. 3483--3491.  Kihyuk Sohn Honglak Lee and Xinchen Yan. 2015. Learning structured output representation using deep conditional generative models. In Advances in Neural Information Processing Systems. 3483--3491."},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2019\/721"},{"key":"e_1_2_1_36_1","volume-title":"Le","author":"Sutskever Ilya","year":"2014","unstructured":"Ilya Sutskever , Oriol Vinyals , and Quoc V . Le . 2014 . Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems . 3104--3112. Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems. 3104--3112."},{"key":"e_1_2_1_37_1","volume-title":"A neural conversational model. arXiv preprint arXiv:1506.05869","author":"Vinyals Oriol","year":"2015","unstructured":"Oriol Vinyals and Quoc Le. 2015. A neural conversational model. arXiv preprint arXiv:1506.05869 ( 2015 ). Oriol Vinyals and Quoc Le. 2015. 
A neural conversational model. arXiv preprint arXiv:1506.05869 (2015)."},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/E17-1042"},{"key":"e_1_2_1_39_1","volume-title":"Young","author":"Williams Jason D.","year":"2007","unstructured":"Jason D. Williams and Steve J . Young . 2007 . Partially observable Markov decision processes for spoken dialog systems. Computer Speech & Language 21, 2 (2007), 393--422. DOI:https:\/\/doi.org\/10.1016\/j.csl.2006.06.008 10.1016\/j.csl.2006.06.008 Jason D. Williams and Steve J. Young. 2007. Partially observable Markov decision processes for spoken dialog systems. Computer Speech & Language 21, 2 (2007), 393--422. DOI:https:\/\/doi.org\/10.1016\/j.csl.2006.06.008"},{"key":"e_1_2_1_40_1","volume-title":"Thirty-Second AAAI Conference on Artificial Intelligence.","author":"Wu Yu","year":"2018","unstructured":"Yu Wu , Wei Wu , Dejian Yang , Can Xu , and Zhoujun Li . 2018 . Neural response generation with dynamic vocabularies . In Thirty-Second AAAI Conference on Artificial Intelligence. Yu Wu, Wei Wu, Dejian Yang, Can Xu, and Zhoujun Li. 2018. Neural response generation with dynamic vocabularies. In Thirty-Second AAAI Conference on Artificial Intelligence."},{"key":"e_1_2_1_41_1","doi-asserted-by":"crossref","unstructured":"Chen Xing Wei Wu Yu Wu Jie Liu Yalou Huang Ming Zhou and Wei-Ying Ma. 2017. Topic aware neural response generation. In AAAI. 3351--3357.  Chen Xing Wei Wu Yu Wu Jie Liu Yalou Huang Ming Zhou and Wei-Ying Ma. 2017. Topic aware neural response generation. In AAAI. 3351--3357.","DOI":"10.1609\/aaai.v31i1.10981"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/2911451.2911542"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46493-0_47"},{"key":"e_1_2_1_44_1","volume-title":"Le","author":"Yang Zhilin","year":"2019","unstructured":"Zhilin Yang , Zihang Dai , Yiming Yang , Jaime Carbonell , Russ R. Salakhutdinov , and Quoc V . Le . 2019 . 
Xlnet : Generalized autoregressive pretraining for language understanding. In Advances in Neural Information Processing Systems . 5754--5764. Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Russ R. Salakhutdinov, and Quoc V. Le. 2019. Xlnet: Generalized autoregressive pretraining for language understanding. In Advances in Neural Information Processing Systems. 5754--5764."},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2012.2225812"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1061"},{"key":"e_1_2_1_47_1","doi-asserted-by":"crossref","unstructured":"Ganbin Zhou Ping Luo Rongyu Cao Fen Lin Bo Chen and Qing He. 2017. Mechanism-aware neural machine for dialogue response generation. In AAAI. 3400--3407.  Ganbin Zhou Ping Luo Rongyu Cao Fen Lin Bo Chen and Qing He. 2017. Mechanism-aware neural machine for dialogue response generation. In AAAI. 3400--3407.","DOI":"10.1609\/aaai.v31i1.10976"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1104"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1366"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information 
Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3402884","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3402884","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:41:35Z","timestamp":1750200095000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3402884"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,13]]},"references-count":49,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2020,11,30]]}},"alternative-id":["10.1145\/3402884"],"URL":"https:\/\/doi.org\/10.1145\/3402884","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,10,13]]},"assertion":[{"value":"2019-07-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-05-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-10-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}