{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T18:52:37Z","timestamp":1775674357655,"version":"3.50.1"},"reference-count":186,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2021,8,30]],"date-time":"2021-08-30T00:00:00Z","timestamp":1630281600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100007569","name":"Carl-Zeiss-Stiftung","doi-asserted-by":"publisher","award":["062017-02"],"award-info":[{"award-number":["062017-02"]}],"id":[{"id":"10.13039\/100007569","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>Neural encoder-decoder models for language generation can be trained to predict words directly from linguistic or non-linguistic inputs. When generating with these so-called end-to-end models, however, the NLG system needs an additional decoding procedure that determines the output sequence, given the infinite search space over potential sequences that could be generated with the given vocabulary. This survey paper provides an overview of the different ways of implementing decoding on top of neural network-based generation models. Research into decoding has become a real trend in the area of neural language generation, and numerous recent papers have shown that the choice of decoding method has a considerable impact on the quality and various linguistic properties of the generation output of a neural NLG system. This survey aims to contribute to a more systematic understanding of decoding methods across different areas of neural NLG. We group the reviewed methods with respect to the broad type of objective that they optimize in the generation of the sequence\u2014likelihood, diversity, and task-specific linguistic constraints or goals\u2014and discuss their respective strengths and weaknesses.<\/jats:p>","DOI":"10.3390\/info12090355","type":"journal-article","created":{"date-parts":[[2021,8,30]],"date-time":"2021-08-30T11:01:37Z","timestamp":1630321297000},"page":"355","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":13,"title":["Decoding Methods in Neural Language Generation: A Survey"],"prefix":"10.3390","volume":"12","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1384-1218","authenticated-orcid":false,"given":"Sina","family":"Zarrie\u00df","sequence":"first","affiliation":[{"name":"Faculty for Linguistics and Literature Studies, Bielefeld University, 33615 Bielefeld, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Henrik","family":"Voigt","sequence":"additional","affiliation":[{"name":"Faculty of Mathematics and Computer Science, Friedrich Schiller University Jena, 07743 Jena, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Simeon","family":"Sch\u00fcz","sequence":"additional","affiliation":[{"name":"Faculty for Linguistics and Literature Studies, Bielefeld University, 33615 Bielefeld, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,8,30]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Reiter, E., and Dale, R. (2000). Building Natural Language Generation Systems, Cambridge University Press.","DOI":"10.1017\/CBO9780511519857"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1613\/jair.5477","article-title":"Survey of the state of the art in natural language generation: Core tasks, applications and evaluation","volume":"61","author":"Gatt","year":"2018","journal-title":"J. Artif. Intell. Res."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Wen, T.H., Ga\u0161i\u0107, M., Mrk\u0161i\u0107, N., Su, P.H., Vandyke, D., and Young, S. (2015, January 17\u201321). Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Lisbon, Portugal.","DOI":"10.18653\/v1\/D15-1199"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Gehrmann, S., Dai, F., Elder, H., and Rush, A. (2018, January 5\u20138). End-to-End Content and Plan Selection for Data-to-Text Generation. Proceedings of the 11th International Conference on Natural Language Generation, Tilburg, The Netherlands.","DOI":"10.18653\/v1\/W18-6505"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Castro Ferreira, T., Moussallem, D., K\u00e1d\u00e1r, \u00c1., Wubben, S., and Krahmer, E. (2018, January 15\u201320). NeuralREG: An end-to-end approach to referring expression generation. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, Melbourne, Australia.","DOI":"10.18653\/v1\/P18-1182"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Castro Ferreira, T., van der Lee, C., van Miltenburg, E., and Krahmer, E. (2019, January 3\u20137). Neural data-to-text generation: A comparison between pipeline and end-to-end architectures. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), Hong Kong, China.","DOI":"10.18653\/v1\/D19-1052"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1016\/j.csl.2019.06.009","article-title":"Evaluating the state-of-the-art of end-to-end natural language generation: The e2e nlg challenge","volume":"59","author":"Novikova","year":"2020","journal-title":"Comput. Speech Lang."},{"key":"ref_8","unstructured":"Lowerre, B.T. (1976). The HARPY Speech Recognition System. [Ph. D. Thesis, Carnegie Mellon University]."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1017\/S1351324997001502","article-title":"Building applied natural language generation systems","volume":"3","author":"Reiter","year":"1997","journal-title":"Nat. Lang. Eng."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"264","DOI":"10.1162\/tacl_a_00313","article-title":"Leveraging pre-trained checkpoints for sequence generation tasks","volume":"8","author":"Rothe","year":"2020","journal-title":"Trans. Assoc. Comput. Linguist."},{"key":"ref_11","first-page":"9","article-title":"Language models are unsupervised multitask learners","volume":"1","author":"Radford","year":"2019","journal-title":"OpenAI Blog"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Serban, I., Sordoni, A., Bengio, Y., Courville, A., and Pineau, J. (2016, January 12\u201317). Building end-to-end dialogue systems using generative hierarchical neural network models. Proceedings of the AAAI Conference on Artificial Intelligence, Phoenix, AZ, USA.","DOI":"10.1609\/aaai.v30i1.9883"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Kukich, K. (1983). Design of a knowledge-based report generator. Proceedings of the 21st Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics.","DOI":"10.3115\/981311.981340"},{"key":"ref_14","unstructured":"McKeown, K. (1992). Text Generation, Cambridge University Press."},{"key":"ref_15","unstructured":"Busemann, S., and Horacek, H. (2021, August 27). A Flexible Shallow Approach to Text Generation. Available online: https:\/\/aclanthology.org\/W98-1425.pdf."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Gatt, A., and Reiter, E. (2009, January 30\u201331). SimpleNLG: A realisation engine for practical applications. Proceedings of the 12th European Workshop on Natural Language Generation (ENLG 2009), Athens, Greece.","DOI":"10.3115\/1610195.1610208"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1162\/0891201053630291","article-title":"Real versus template-based natural language generation: A false opposition?","volume":"31","author":"Theune","year":"2005","journal-title":"Comput. Linguist."},{"key":"ref_18","unstructured":"Langkilde, I. (2000, January 29). Forest-based statistical sentence generation. Proceedings of the 1st Meeting of the North American Chapter of the Association for Computational Linguistics, Seattle, WA, USA."},{"key":"ref_19","unstructured":"Ratnaparkhi, A. (2000, January 29). Trainable Methods for Surface Natural Language Generation. Proceedings of the 1st Meeting of the North American Chapter of the Association for Computational Linguistics, Seattle, WA, USA."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Cahill, A., Forst, M., and Rohrer, C. (2007, January 17\u201320). Stochastic Realisation Ranking for a Free Word Order Language. Proceedings of the Eleventh European Workshop on Natural Language Generation (ENLG 07), Saarbr\u00fccken, Germany.","DOI":"10.3115\/1610163.1610168"},{"key":"ref_21","unstructured":"White, M., Rajkumar, R., and Martin, S. (2007, January 11). Towards broad coverage surface realization with CCG. Proceedings of the Workshop on Using Corpora for NLG: Language Generation and Machine Translation (UCNLG+ MT), Columbus, OH, USA."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"431","DOI":"10.1017\/S1351324907004664","article-title":"Automatic generation of weather forecast texts using comprehensive probabilistic generation-space models","volume":"14","author":"Belz","year":"2008","journal-title":"Nat. Lang. Eng."},{"key":"ref_23","unstructured":"Angeli, G., Liang, P., and Klein, D. (2010, January 9\u201311). A simple domain-independent probabilistic approach to generation. Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, Cambridge, MA, USA."},{"key":"ref_24","unstructured":"Konstas, I., and Lapata, M. (2012, January 3\u20138). Unsupervised concept-to-text generation with hypergraphs. Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Stroudsburg, PA, USA."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1162\/coli.2007.33.2.201","article-title":"Hierarchical phrase-based translation","volume":"33","author":"Chiang","year":"2007","journal-title":"Comput. Linguist."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"763","DOI":"10.1162\/COLI_a_00199","article-title":"Stochastic Language Generation in Dialogue using Factored Language Models","volume":"40","author":"Mairesse","year":"2014","journal-title":"Comput. Linguist."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"455","DOI":"10.1162\/COLI_a_00063","article-title":"Controlling User Perceptions of Linguistic Style: Trainable Generation of Personality Traits","volume":"37","author":"Mairesse","year":"2011","journal-title":"Comput. Linguist."},{"key":"ref_28","unstructured":"Belz, A., White, M., Espinosa, D., Kow, E., Hogan, D., and Stent, A. (2011, January 28\u201331). The first surface realisation shared task: Overview and evaluation results. Proceedings of the 13th European Workshop on Natural Language Generation, Nancy, France."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Mille, S., Belz, A., Bohnet, B., Graham, Y., and Wanner, L. (2019, January 3). The second multilingual surface realisation shared task (SR\u201919): Overview and evaluation results. Proceedings of the 2nd Workshop on Multilingual Surface Realisation (MSR 2019), Hong Kong, China.","DOI":"10.18653\/v1\/D19-6301"},{"key":"ref_30","unstructured":"Bohnet, B., Wanner, L., Mille, S., and Burga, A. (2010, January 23\u201327). Broad coverage multilingual deep sentence generation with a stochastic multi-level realizer. Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010), Beijing, China."},{"key":"ref_31","unstructured":"Bohnet, B., Bj\u00f6rkelund, A., Kuhn, J., Seeker, W., and Zarriess, S. (2012, January 12\u201314). Generating non-projective word order in statistical linearization. Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Jeju Island, Korea."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Pourdamghani, N., Knight, K., and Hermjakob, U. (2016). Generating English from Abstract Meaning Representations. Proceedings of the 9th International Natural Language Generation Conference, Association for Computational Linguistics.","DOI":"10.18653\/v1\/W16-6603"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Flanigan, J., Dyer, C., Smith, N.A., and Carbonell, J. (2016). Generation from Abstract Meaning Representation using Tree Transducers. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics.","DOI":"10.18653\/v1\/N16-1087"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1162\/089120103321337458","article-title":"Word reordering and a dynamic programming beam search algorithm for statistical machine translation","volume":"29","author":"Tillmann","year":"2003","journal-title":"Comput. Linguist."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Koehn, P. (2004). Pharaoh: A beam search decoder for phrase-based statistical machine translation models. Conference of the Association for Machine Translation in the Americas, Springer.","DOI":"10.1007\/978-3-540-30194-3_13"},{"key":"ref_36","unstructured":"Rush, A.M., and Collins, M. (2011, January 19\u201324). Exact decoding of syntactic translation models through lagrangian relaxation. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Oregon, Portland."},{"key":"ref_37","unstructured":"Rush, A., Chang, Y.W., and Collins, M. (2013, January 18\u201321). Optimal beam search for machine translation. Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, WA, USA."},{"key":"ref_38","unstructured":"Sutskever, I., Vinyals, O., and Le, Q.V. Sequence to Sequence Learning with Neural Networks. Proceedings of the 27th International Conference on Neural Information Processing Systems-Volume 2, NIPS\u201914."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Graves, A. (2012). Sequence transduction with recurrent neural networks. arXiv.","DOI":"10.1007\/978-3-642-24797-2"},{"key":"ref_40","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., and Polosukhin, I. (2017). Attention is all you need. arXiv."},{"key":"ref_41","unstructured":"Ranzato, M., Chopra, S., Auli, M., and Zaremba, W. (2016, January 2\u20134). Sequence level training with recurrent neural networks. Proceedings of the 4th International Conference on Learning Representations, San Juan, Puerto Rico."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Rennie, S.J., Marcheret, E., Mroueh, Y., Ross, J., and Goel, V. (2017, January 21\u201326). Self-critical sequence training for image captioning. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.131"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Chen, H., Ding, G., Zhao, S., and Han, J. (2018, January 2\u20137). Temporal-Difference Learning With Sampling Baseline for Image Captioning. Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, LA, USA.","DOI":"10.1609\/aaai.v32i1.12263"},{"key":"ref_44","unstructured":"Bahdanau, D., Brakel, P., Xu, K., Goyal, A., Lowe, R., Pineau, J., Courville, A.C., and Bengio, Y. (2017). An Actor-Critic Algorithm for Sequence Prediction. arXiv."},{"key":"ref_45","unstructured":"Zhang, L., Sung, F., Liu, F., Xiang, T., Gong, S., Yang, Y., and Hospedales, T.M. (2017). Actor-Critic Sequence Training for Image Captioning. arXiv."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"229","DOI":"10.1007\/BF00992696","article-title":"Simple statistical gradient-following algorithms for connectionist reinforcement learning","volume":"8","author":"Williams","year":"2004","journal-title":"Mach. Learn."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Papineni, K., Roukos, S., Ward, T., and Zhu, W.J. (2002, January 6\u201312). BLEU: A method for automatic evaluation of machine translation. Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, Philadelphia, PA, USA.","DOI":"10.3115\/1073083.1073135"},{"key":"ref_48","unstructured":"Oord, A., Li, Y., Babuschkin, I., Simonyan, K., Vinyals, O., Kavukcuoglu, K., Driessche, G., Lockhart, E., Cobo, L., and Stimberg, F. (2018, January 10\u201315). Parallel wavenet: Fast high-fidelity speech synthesis. Proceedings of the International conference on machine learning, Stockholm, Sweden."},{"key":"ref_49","unstructured":"Gu, J., Bradbury, J., Xiong, C., Li, V.O., and Socher, R. (May, January 30). Non-autoregressive neural machine translation. Proceedings of the ICLR, Vancouver, BC, Canada."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Lee, J., Mansimov, E., and Cho, K. (November, January 31). Deterministic Non-Autoregressive Neural Sequence Modeling by Iterative Refinement. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium.","DOI":"10.18653\/v1\/D18-1149"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"661","DOI":"10.1162\/tacl_a_00292","article-title":"Insertion-based Decoding with Automatically Inferred Generation Order","volume":"7","author":"Gu","year":"2019","journal-title":"Trans. Assoc. Comput. Linguist."},{"key":"ref_52","unstructured":"Stern, M., Chan, W., Kiros, J., and Uszkoreit, J. (2019, January 10\u201315). Insertion transformer: Flexible sequence generation via insertion operations. Proceedings of the International Conference on Machine Learning, Long Beach, CA, USA."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Chen, Y., Gilroy, S., Maletti, A., May, J., and Knight, K. (2018, January 1\u20136). Recurrent Neural Networks as Weighted Language Recognizers. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, New Orleans, LO, USA.","DOI":"10.18653\/v1\/N18-1205"},{"key":"ref_54","unstructured":"Li, J., Monroe, W., and Jurafsky, D. (2016). A Simple, Fast Diverse Decoding Algorithm for Neural Generation. arXiv."},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Klein, G., Kim, Y., Deng, Y., Senellart, J., and Rush, A. (August, January 30). OpenNMT: Open-Source Toolkit for Neural Machine Translation. Proceedings of the ACL 2017, Vancouver, BC, Canada.","DOI":"10.18653\/v1\/P17-4012"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Stahlberg, F., and Byrne, B. (2019, January 3\u20134). On NMT Search Errors and Model Errors: Cat Got Your Tongue?. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), Hong Kong, China.","DOI":"10.18653\/v1\/D19-1331"},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"795","DOI":"10.1162\/tacl_a_00346","article-title":"Best-First Beam Search","volume":"8","author":"Meister","year":"2020","journal-title":"Trans. Assoc. Comput. Linguist."},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Huang, L., Zhao, K., and Ma, M. (2017). When to Finish? Optimal Beam Search for Neural Text Generation (modulo beam size). Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics.","DOI":"10.18653\/v1\/D17-1227"},{"key":"ref_59","unstructured":"Newman, B., Hewitt, J., Liang, P., and Manning, C.D. The EOS Decision and Length Extrapolation. Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP."},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"He, W., He, Z., Wu, H., and Wang, H. (2016, January 12\u201317). Improved neural machine translation with SMT features. Proceedings of the AAAI Conference on Artificial Intelligence, Phoenix, AZ, USA.","DOI":"10.1609\/aaai.v30i1.9983"},{"key":"ref_61","unstructured":"Murray, K., and Chiang, D. (November, January 31). Correcting Length Bias in Neural Machine Translation. Proceedings of the Third Conference on Machine Translation, Belgium, Brussels."},{"key":"ref_62","unstructured":"Bahdanau, D., Cho, K., and Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. arXiv."},{"key":"ref_63","unstructured":"Freitag, M., and Al-Onaizan, Y. (August, January 30). Beam Search Strategies for Neural Machine Translation. Proceedings of the First Workshop on Neural Machine Translation, Vancouver, BC, Canada."},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Yang, Y., Huang, L., and Ma, M. (November, January 31). Breaking the Beam Search Curse: A Study of (Re-)Scoring Methods and Stopping Criteria for Neural Machine Translation. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium.","DOI":"10.18653\/v1\/D18-1342"},{"key":"ref_65","unstructured":"Koehn, P., and Knowles, R. (August, January 30). Six Challenges for Neural Machine Translation. Proceedings of the First Workshop on Neural Machine Translation, Vancouver, BC, Canada."},{"key":"ref_66","unstructured":"Cohen, E., and Beck, C. (2019, January 10\u201315). Empirical analysis of beam search performance degradation in neural sequence models. Proceedings of the International Conference on Machine Learning, Long Beach, CA, USA."},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Vinyals, O., Toshev, A., Bengio, S., and Erhan, D. (2015, January 7\u201312). Show and tell: A neural image caption generator. Proceedings of the Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298935"},{"key":"ref_68","doi-asserted-by":"crossref","unstructured":"Karpathy, A., and Fei-Fei, L. (2015, January 7\u201312). Deep visual-semantic alignments for generating image descriptions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298932"},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Sountsov, P., and Sarawagi, S. (2016, January 1\u20135). Length bias in Encoder Decoder Models and a Case for Global Conditioning. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austin, TX, USA.","DOI":"10.18653\/v1\/D16-1158"},{"key":"ref_70","doi-asserted-by":"crossref","unstructured":"Zarrie\u00df, S., and Schlangen, D. (2018, January 5\u20138). Decoding Strategies for Neural Referring Expression Generation. Proceedings of the 11th International Conference on Natural Language Generation, Tilburg, The Netherlands.","DOI":"10.18653\/v1\/W18-6563"},{"key":"ref_71","unstructured":"Holtzman, A., Buys, J., Du, L., Forbes, M., and Choi, Y. (2020, January 30). The Curious Case of Neural Text Degeneration. Proceedings of the International Conference on Learning Representations, Addis Ababa, Ethiopia."},{"key":"ref_72","doi-asserted-by":"crossref","unstructured":"Shao, Y., Gouws, S., Britz, D., Goldie, A., Strope, B., and Kurzweil, R. (2017, January 7\u201311). Generating High-Quality and Informative Conversation Responses with Sequence-to-Sequence Models. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark.","DOI":"10.18653\/v1\/D17-1235"},{"key":"ref_73","unstructured":"Kulikov, I., Miller, A., Cho, K., and Weston, J. (November, January 29). Importance of Search and Evaluation Strategies in Neural Dialogue Modeling. Proceedings of the 12th International Conference on Natural Language Generation, Tokyo, Japan."},{"key":"ref_74","doi-asserted-by":"crossref","unstructured":"Meister, C., Cotterell, R., and Vieira, T. (2020, January 16\u201320). If beam search is the answer, what was the question?. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Online.","DOI":"10.18653\/v1\/2020.emnlp-main.170"},{"key":"ref_75","doi-asserted-by":"crossref","unstructured":"Huang, T.H., Ferraro, F., Mostafazadeh, N., Misra, I., Agrawal, A., Devlin, J., Girshick, R., He, X., Kohli, P., and Batra, D. (2016, January 12\u201317). Visual storytelling. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego, CA, USA.","DOI":"10.18653\/v1\/N16-1147"},{"key":"ref_76","doi-asserted-by":"crossref","first-page":"652","DOI":"10.1109\/TPAMI.2016.2587640","article-title":"Show and tell: Lessons learned from the 2015 mscoco image captioning challenge","volume":"39","author":"Vinyals","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_77","first-page":"849","article-title":"Speakers optimize information density through syntactic reduction","volume":"19","author":"Levy","year":"2007","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_78","doi-asserted-by":"crossref","unstructured":"Ghazvininejad, M., Brockett, C., Chang, M.W., Dolan, B., Gao, J., Yih, W.t., and Galley, M. (2018, January 2\u20137). A knowledge-grounded neural conversation model. Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, LO, USA.","DOI":"10.1609\/aaai.v32i1.11977"},{"key":"ref_79","doi-asserted-by":"crossref","unstructured":"Shuster, K., Humeau, S., Bordes, A., and Weston, J. (2020, January 5\u201310). Image-Chat: Engaging Grounded Conversations. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online.","DOI":"10.18653\/v1\/2020.acl-main.219"},{"key":"ref_80","unstructured":"Wu, Y., Schuster, M., Chen, Z., Le, Q.V., Norouzi, M., Macherey, W., Krikun, M., Cao, Y., Gao, Q., and Macherey, K. (2016). Google\u2019s neural machine translation system: Bridging the gap between human and machine translation. arXiv."},{"key":"ref_81","doi-asserted-by":"crossref","unstructured":"Sennrich, R., Haddow, B., and Birch, A. (2016, January 7\u201312). Neural Machine Translation of Rare Words with Subword Units. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Berlin, Germany.","DOI":"10.18653\/v1\/P16-1162"},{"key":"ref_82","doi-asserted-by":"crossref","first-page":"339","DOI":"10.1162\/tacl_a_00065","article-title":"Google\u2019s multilingual neural machine translation system: Enabling zero-shot translation","volume":"5","author":"Johnson","year":"2017","journal-title":"Trans. Assoc. Comput. Linguist."},{"key":"ref_83","unstructured":"Vaswani, A., Bengio, S., Brevdo, E., Chollet, F., Gomez, A.N., Gouws, S., Jones, L., Kaiser, \u0141., Kalchbrenner, N., and Parmar, N. (2018). Tensor2tensor for neural machine translation. arXiv."},{"key":"ref_84","doi-asserted-by":"crossref","unstructured":"Ott, M., Edunov, S., Baevski, A., Fan, A., Gross, S., Ng, N., Grangier, D., and Auli, M. (2019, January 2\u20137). fairseq: A Fast, Extensible Toolkit for Sequence Modeling. Proceedings of the NAACL-HLT 2019: Demonstrations, Minneapolis, MN, USA.","DOI":"10.18653\/v1\/N19-4009"},{"key":"ref_85","unstructured":"Song, K., Tan, X., Qin, T., Lu, J., and Liu, T.Y. (2019, January 10\u201315). MASS: Masked Sequence to Sequence Pre-training for Language Generation. Proceedings of the International Conference on Machine Learning, Long Beach, CA, USA."},{"key":"ref_86","unstructured":"See, A., Liu, P.J., and Manning, C.D. (August, January 30). Get To The Point: Summarization with Pointer-Generator Networks. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, Vancouver, BC, Canada."},{"key":"ref_87","doi-asserted-by":"crossref","unstructured":"Gehrmann, S., Deng, Y., and Rush, A. (November, January 31). Bottom-Up Abstractive Summarization. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium.","DOI":"10.18653\/v1\/D18-1443"},{"key":"ref_88","doi-asserted-by":"crossref","unstructured":"Kry\u015bci\u0144ski, W., Paulus, R., Xiong, C., and Socher, R. (November, January 31). Improving Abstraction in Text Summarization. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium.","DOI":"10.18653\/v1\/D18-1207"},{"key":"ref_89","doi-asserted-by":"crossref","unstructured":"Narayan, S., Cohen, S.B., and Lapata, M. (November, January 31). Don\u2019t Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium.","DOI":"10.18653\/v1\/D18-1206"},{"key":"ref_90","doi-asserted-by":"crossref","unstructured":"Liu, Y., and Lapata, M. (2019, January 3\u20137). Text Summarization with Pretrained Encoders. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), Hong Kong, China.","DOI":"10.18653\/v1\/D19-1387"},{"key":"ref_91","unstructured":"Wallach, H., Larochelle, H., Beygelzimer, A., d\u2019Alch\u00e9-Buc, F., Fox, E., and Garnett, R. (2019). Unified Language Model Pre-training for Natural Language Understanding and Generation. Advances in Neural Information Processing Systems, Curran Associates, Inc."},{"key":"ref_92","unstructured":"Vinyals, O., and Le, Q. (2015). A neural conversational model. arXiv."},{"key":"ref_93","doi-asserted-by":"crossref","unstructured":"Du\u0161ek, O., and Jur\u010d\u00ed\u010dek, F. (2016, January 13\u201315). A Context-aware Natural Language Generator for Dialogue Systems. Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Los Angeles, CA, USA.","DOI":"10.18653\/v1\/W16-3622"},{"key":"ref_94","doi-asserted-by":"crossref","unstructured":"Li, J., Monroe, W., Ritter, A., Jurafsky, D., Galley, M., and Gao, J. (2016, January 1\u20135). Deep Reinforcement Learning for Dialogue Generation. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austin, TX, USA.","DOI":"10.18653\/v1\/D16-1127"},{"key":"ref_95","doi-asserted-by":"crossref","unstructured":"Das, A., Kottur, S., Gupta, K., Singh, A., Yadav, D., Moura, J.M., Parikh, D., and Batra, D. (2017, January 21\u201326). Visual dialog. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.121"},{"key":"ref_96","doi-asserted-by":"crossref","unstructured":"Li, J., Galley, M., Brockett, C., Gao, J., and Dolan, B. (2016, January 12\u201317). A Diversity-Promoting Objective Function for Neural Conversation Models. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego, CA, USA.","DOI":"10.18653\/v1\/N16-1014"},{"key":"ref_97","doi-asserted-by":"crossref","unstructured":"Baheti, A., Ritter, A., Li, J., and Dolan, B. (November, January 31). Generating More Interesting Responses in Neural Conversation Models with Distributional Constraints. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium.","DOI":"10.18653\/v1\/D18-1431"},{"key":"ref_98","unstructured":"Wolf, T., Sanh, V., Chaumond, J., and Delangue, C. (2019). TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents. arXiv."},{"key":"ref_99","doi-asserted-by":"crossref","unstructured":"Fan, A., Lewis, M., and Dauphin, Y. (2018). Hierarchical Neural Story Generation. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Association for Computational Linguistics.","DOI":"10.18653\/v1\/P18-1082"},{"key":"ref_100","doi-asserted-by":"crossref","unstructured":"Holtzman, A., Buys, J., Forbes, M., Bosselut, A., Golub, D., and Choi, Y. (2018, January 15\u201320). Learning to Write with Cooperative Discriminators. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, Melbourne, Australia.","DOI":"10.18653\/v1\/P18-1152"},{"key":"ref_101","doi-asserted-by":"crossref","unstructured":"See, A., Pappu, A., Saxena, R., Yerukola, A., and Manning, C.D. (2019, January 3\u20134). Do Massively Pretrained Language Models Make Better Storytellers?. Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), Hong Kong, China.","DOI":"10.18653\/v1\/K19-1079"},{"key":"ref_102","doi-asserted-by":"crossref","unstructured":"Zhai, F., Demberg, V., and Koller, A. (2020, January 8\u201313). Story Generation with Rich Details. Proceedings of the 28th International Conference on Computational Linguistics, Barcelona, Spain.","DOI":"10.18653\/v1\/2020.coling-main.212"},{"key":"ref_103","unstructured":"Caccia, M., Caccia, L., Fedus, W., Larochelle, H., Pineau, J., and Charlin, L. (2020, January 26\u201330). Language GANs Falling Short. Proceedings of the 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia."},{"key":"ref_104","doi-asserted-by":"crossref","unstructured":"Kiddon, C., Zettlemoyer, L., and Choi, Y. (2016, January 1\u20135). Globally coherent text generation with neural checklist models. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austin, TX, USA.","DOI":"10.18653\/v1\/D16-1032"},{"key":"ref_105","doi-asserted-by":"crossref","unstructured":"Wiseman, S., Shieber, S., and Rush, A. (2017, January 1\u20137). Challenges in Data-to-Document Generation. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark.","DOI":"10.18653\/v1\/D17-1239"},{"key":"ref_106","doi-asserted-by":"crossref","unstructured":"Puzikov, Y., and Gurevych, I. (2018, January 5\u20138). E2E NLG challenge: Neural models vs. templates. Proceedings of the 11th International Conference on Natural Language Generation, Tilburg, The Netherlands.","DOI":"10.18653\/v1\/W18-6557"},{"key":"ref_107","doi-asserted-by":"crossref","unstructured":"Gehrmann, S., Dai, F.Z., Elder, H., and Rush, A.M. (2018). End-to-end content and plan selection for data-to-text generation. arXiv.","DOI":"10.18653\/v1\/W18-6505"},{"key":"ref_108","doi-asserted-by":"crossref","unstructured":"Marcheggiani, D., and Perez-Beltrachini, L. (2018, January 5\u20138). Deep Graph Convolutional Encoders for Structured Data to Text Generation. Proceedings of the 11th International Conference on Natural Language Generation, Tilburg, The Netherlands.","DOI":"10.18653\/v1\/W18-6501"},{"key":"ref_109","unstructured":"Puduppully, R., Dong, L., and Lapata, M. (February, January 27). Data-to-text generation with content selection and planning. Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA."},{"key":"ref_110","doi-asserted-by":"crossref","unstructured":"Kale, M., and Rastogi, A. (2020, January 15\u201318). Text-to-Text Pre-Training for Data-to-Text Tasks. Proceedings of the 13th International Conference on Natural Language Generation, Dublin, Ireland.","DOI":"10.18653\/v1\/2020.inlg-1.14"},{"key":"ref_111","doi-asserted-by":"crossref","unstructured":"Zhao, C., Walker, M., and Chaturvedi, S. (2020). Bridging the Structural Gap Between Encoding and Decoding for Data-To-Text Generation. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics.","DOI":"10.18653\/v1\/2020.acl-main.224"},{"key":"ref_112","unstructured":"Xu, K., Ba, J., Kiros, R., Cho, K., Courville, A., Salakhudinov, R., Zemel, R., and Bengio, Y. (2015, January 6\u201311). Show, attend and tell: Neural image caption generation with visual attention. Proceedings of the International Conference on Machine Learning, Lille, France."},{"key":"ref_113","doi-asserted-by":"crossref","unstructured":"Lu, J., Xiong, C., Parikh, D., and Socher, R. (2017, January 21\u201326). Knowing when to look: Adaptive attention via a visual sentinel for image captioning. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.345"},{"key":"ref_114","doi-asserted-by":"crossref","unstructured":"Anderson, P., He, X., Buehler, C., Teney, D., Johnson, M., Gould, S., and Zhang, L. (2018, January 18\u201323). Bottom-up and top-down attention for image captioning and visual question answering. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00636"},{"key":"ref_115","doi-asserted-by":"crossref","unstructured":"Cornia, M., Stefanini, M., Baraldi, L., and Cucchiara, R. (2020, January 14\u201319). Meshed-memory transformer for image captioning. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01059"},{"key":"ref_116","unstructured":"Ippolito, D., Kriz, R., Sedoc, J., Kustikova, M., and Callison-Burch, C. (August, January 28). Comparison of Diverse Decoding Methods from Conditional Language Models. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy."},{"key":"ref_117","doi-asserted-by":"crossref","unstructured":"Yu, L., Tan, H., Bansal, M., and Berg, T.L. (2017, January 21\u201326). A joint speakerlistener-reinforcer model for referring expressions. Proceedings of the Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.375"},{"key":"ref_118","doi-asserted-by":"crossref","first-page":"101184","DOI":"10.1016\/j.csl.2020.101184","article-title":"Generating unambiguous and diverse referring expressions","volume":"68","author":"Panagiaris","year":"2021","journal-title":"Comput. Speech Lang."},{"key":"ref_119","unstructured":"Yu, H., Wang, J., Huang, Z., Yang, Y., and Xu, W. (July, January 26). Video paragraph captioning using hierarchical recurrent neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA."},{"key":"ref_120","doi-asserted-by":"crossref","unstructured":"Krause, J., Johnson, J., Krishna, R., and Fei-Fei, L. (2017, January 21\u2013216). A hierarchical approach for generating descriptive image paragraphs. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.356"},{"key":"ref_121","doi-asserted-by":"crossref","unstructured":"Krishna, R., Hata, K., Ren, F., Fei-Fei, L., and Carlos Niebles, J. (2017, January 22\u201329). Dense-captioning events in videos. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.83"},{"key":"ref_122","doi-asserted-by":"crossref","unstructured":"Melas-Kyriazi, L., Rush, A.M., and Han, G. (November, January 31). Training for diversity in image paragraph captioning. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium.","DOI":"10.18653\/v1\/D18-1084"},{"key":"ref_123","doi-asserted-by":"crossref","unstructured":"Chatterjee, M., and Schwing, A.G. (2018, January 8\u201314). Diverse and coherent paragraph generation from images. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01216-8_45"},{"key":"ref_124","doi-asserted-by":"crossref","unstructured":"Wang, X., Chen, W., Wu, J., Wang, Y.F., and Wang, W.Y. (2018, January 18\u201323). Video captioning via hierarchical reinforcement learning. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00443"},{"key":"ref_125","doi-asserted-by":"crossref","unstructured":"Salvador, A., Drozdzal, M., Giro-i Nieto, X., and Romero, A. (2019, January 15\u201320). Inverse cooking: Recipe generation from food images. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.01070"},{"key":"ref_126","doi-asserted-by":"crossref","unstructured":"Song, L., Zhang, Y., Wang, Z., and Gildea, D. (2018, January 15\u201320). A Graph-to-Sequence Model for AMR-to-Text Generation. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, Melbourne, Australia.","DOI":"10.18653\/v1\/P18-1150"},{"key":"ref_127","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1162\/tacl_a_00297","article-title":"AMR-to-text generation with graph transformer","volume":"8","author":"Wang","year":"2020","journal-title":"Trans. Assoc. Comput. Linguist."},{"key":"ref_128","doi-asserted-by":"crossref","unstructured":"Mager, M., Fernandez Astudillo, R., Naseem, T., Sultan, M.A., Lee, Y.S., Florian, R., and Roukos, S. (2020, January 6\u20138). GPT-too: A Language-Model-First Approach for AMR-to-Text Generation. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online.","DOI":"10.18653\/v1\/2020.acl-main.167"},{"key":"ref_129","unstructured":"McKeown, K., Kukich, K., and Shaw, J. (April, January 31). Practical issues in automatic documentation generation. Proceedings of the 3rd Applied Natural Language Processing Conference (ANLP 94), Trento, Italy."},{"key":"ref_130","doi-asserted-by":"crossref","unstructured":"Shetty, R., Rohrbach, M., Anne Hendricks, L., Fritz, M., and Schiele, B. (2017, January 22\u201329). Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.445"},{"key":"ref_131","doi-asserted-by":"crossref","unstructured":"Dai, B., Fidler, S., Urtasun, R., and Lin, D. (2017, January 22\u201329). Towards Diverse and Natural Image Descriptions via a Conditional GAN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.323"},{"key":"ref_132","doi-asserted-by":"crossref","unstructured":"van Miltenburg, E., Elliott, D., and Vossen, P. (2018, January 21\u201325). Measuring the Diversity of Automatic Image Descriptions. Proceedings of the 27th International Conference on Computational Linguistics, Santa Fe, NM, USA.","DOI":"10.18653\/v1\/W17-3503"},{"key":"ref_133","doi-asserted-by":"crossref","unstructured":"Paiva, D.S., and Evans, R. (2005). Empirically-based Control of Natural Language Generation. Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL\u201905), Association for Computational Linguistics.","DOI":"10.3115\/1219840.1219848"},{"key":"ref_134","unstructured":"Viethen, J., and Dale, R. (2010, January 9\u201310). Speaker-dependent variation in content selection for referring expression generation. Proceedings of the Australasian Language Technology Association Workshop, Melbourne, Australia."},{"key":"ref_135","doi-asserted-by":"crossref","unstructured":"Castro Ferreira, T., Krahmer, E., and Wubben, S. (2016). Towards more variation in text generation: Developing and evaluating variation models for choice of referential form. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Association for Computational Linguistics.","DOI":"10.18653\/v1\/P16-1054"},{"key":"ref_136","doi-asserted-by":"crossref","unstructured":"Chen, F., Ji, R., Sun, X., Wu, Y., and Su, J. (2018, January 18\u201323). Groupcap: Group-based image captioning with structured relevance and diversity constraints. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00146"},{"key":"ref_137","unstructured":"Dai, B., Fidler, S., and Lin, D. (2018, January 3\u20138). A neural compositional paradigm for image captioning. Proceedings of the Advances in Neural Information Processing Systems, Montr\u00e9al, QC, Canada."},{"key":"ref_138","doi-asserted-by":"crossref","unstructured":"Deshpande, A., Aneja, J., Wang, L., Schwing, A.G., and Forsyth, D. (2019, January 15\u201320). Fast, diverse and accurate image captioning guided by part-of-speech. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.01095"},{"key":"ref_139","doi-asserted-by":"crossref","unstructured":"Wang, Q., and Chan, A.B. (2019, January 15\u201320). Describing like humans: On diversity in image captioning. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00432"},{"key":"ref_140","unstructured":"Belz, A., and Reiter, E. (2006). Comparing automatic and human evaluation of NLG systems. Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics, Association for Computational Linguistics."},{"key":"ref_141","doi-asserted-by":"crossref","first-page":"529","DOI":"10.1162\/coli.2009.35.4.35405","article-title":"An investigation into the validity of some metrics for automatically evaluating natural language generation systems","volume":"35","author":"Reiter","year":"2009","journal-title":"Comput. Linguist."},{"key":"ref_142","doi-asserted-by":"crossref","unstructured":"Gkatzia, D., and Mahamood, S. (2015, January 10\u201311). A snapshot of NLG evaluation practices 2005-2014. Proceedings of the 15th European Workshop on Natural Language Generation (ENLG), Brighton, UK.","DOI":"10.18653\/v1\/W15-4708"},{"key":"ref_143","doi-asserted-by":"crossref","unstructured":"Novikova, J., Du\u0161ek, O., Cercas Curry, A., and Rieser, V. (2017). Why We Need New Evaluation Metrics for NLG. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics.","DOI":"10.18653\/v1\/D17-1238"},{"key":"ref_144","doi-asserted-by":"crossref","unstructured":"Howcroft, D.M., Belz, A., Clinciu, M.A., Gkatzia, D., Hasan, S.A., Mahamood, S., Mille, S., van Miltenburg, E., Santhanam, S., and Rieser, V. (2020). Twenty Years of Confusion in Human Evaluation: NLG Needs Evaluation Sheets and Standardised Definitions. Proceedings of the 13th International Conference on Natural Language Generation, Association for Computational Linguistics.","DOI":"10.18653\/v1\/2020.inlg-1.23"},{"key":"ref_145","doi-asserted-by":"crossref","unstructured":"Nichols, J. (1992). Linguistic Diversity in Space and Time, University of Chicago Press.","DOI":"10.7208\/chicago\/9780226580593.001.0001"},{"key":"ref_146","unstructured":"Gimpel, K., Batra, D., Dyer, C., and Shakhnarovich, G. (2013). A Systematic Exploration of Diversity in Machine Translation. Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics."},{"key":"ref_147","unstructured":"Vijayakumar, A.K., Cogswell, M., Selvaraju, R.R., Sun, Q., Lee, S., Crandall, D., and Batra, D. (2016). Diverse beam search: Decoding diverse solutions from neural sequence models. arXiv."},{"key":"ref_148","unstructured":"Zhang, Y., Galley, M., Gao, J., Gan, Z., Li, X., Brockett, C., and Dolan, B. (2018). Generating Informative and Diverse Conversational Responses via Adversarial Information Maximization. Proceedings of the 32nd International Conference on Neural Information Processing Systems, Curran Associates Inc."},{"key":"ref_149","unstructured":"Wang, Z., Wu, F., Lu, W., Xiao, J., Li, X., Zhang, Z., and Zhuang, Y. (2016, January 9\u201315). Diverse Image Captioning via GroupTalk. Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, New York, NY, USA."},{"key":"ref_150","doi-asserted-by":"crossref","unstructured":"Zhu, Y., Lu, S., Zheng, L., Guo, J., Zhang, W., Wang, J., and Yu, Y. (2018, January 8\u201312). Texygen: A benchmarking platform for text generation models. Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, Ann Arbor, MI, USA.","DOI":"10.1145\/3209978.3210080"},{"key":"ref_151","doi-asserted-by":"crossref","unstructured":"Alihosseini, D., Montahaei, E., and Soleymani Baghshah, M. (2019). Jointly Measuring Diversity and Quality in Text Generation Models. Proceedings of the Workshop on Methods for Optimizing and Evaluating Neural Language Generation, Association for Computational Linguistics.","DOI":"10.18653\/v1\/W19-2311"},{"key":"ref_152","unstructured":"Hashimoto, T., Zhang, H., and Liang, P. (2019). Unifying Human and Statistical Evaluation for Natural Language Generation. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Association for Computational Linguistics."},{"key":"ref_153","doi-asserted-by":"crossref","unstructured":"Zhang, S., Dinan, E., Urbanek, J., Szlam, A., Kiela, D., and Weston, J. (2018). Personalizing Dialogue Agents: I have a dog, do you have pets too?. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Association for Computational Linguistics.","DOI":"10.18653\/v1\/P18-1205"},{"key":"ref_154","doi-asserted-by":"crossref","unstructured":"Kriz, R., Sedoc, J., Apidianaki, M., Zheng, C., Kumar, G., Miltsakaki, E., and Callison-Burch, C. (2019). Complexity-weighted loss and diverse reranking for sentence simplification. arXiv.","DOI":"10.18653\/v1\/N19-1317"},{"key":"ref_155","doi-asserted-by":"crossref","first-page":"101094","DOI":"10.1016\/j.csl.2020.101094","article-title":"Cluster-based beam search for pointer-generator chatbot grounded by knowledge","volume":"64","author":"Tam","year":"2020","journal-title":"Comput. Speech Lang."},{"key":"ref_156","doi-asserted-by":"crossref","unstructured":"Hotate, K., Kaneko, M., and Komachi, M. (2020). Generating Diverse Corrections with Local Beam Search for Grammatical Error Correction. Proceedings of the 28th International Conference on Computational Linguistics, International Committee on Computational Linguistics.","DOI":"10.18653\/v1\/2020.coling-main.193"},{"key":"ref_157","first-page":"147","article-title":"A learning algorithm for Boltzmann machines","volume":"9","author":"Ackley","year":"1985","journal-title":"Cogn. Sci."},{"key":"ref_158","doi-asserted-by":"crossref","unstructured":"Massarelli, L., Petroni, F., Piktus, A., Ott, M., Rockt\u00e4schel, T., Plachouras, V., Silvestri, F., and Riedel, S. (2020). How Decoding Strategies Affect the Verifiability of Generated Text. Findings of the Association for Computational Linguistics: EMNLP 2020, Association for Computational Linguistics.","DOI":"10.18653\/v1\/2020.findings-emnlp.22"},{"key":"ref_159","doi-asserted-by":"crossref","unstructured":"Sch\u00fcz, S., Han, T., and Zarrie\u00df, S. (2021, January 29\u201331). Diversity as a By-Product: Goal-oriented Language Generation Leads to Linguistic Variation. Proceedings of the 22nd Annual SIGdial Meeting on Discourse and Dialogue. Association for Computational Linguistics, Singapore.","DOI":"10.18653\/v1\/2021.sigdial-1.43"},{"key":"ref_160","unstructured":"Zhang, H., Duckworth, D., Ippolito, D., and Neelakantan, A. (2021). Trading Off Diversity and Quality in Natural Language Generation. Proceedings of the Workshop on Human Evaluation of NLP Systems (HumEval), Association for Computational Linguistics."},{"key":"ref_161","doi-asserted-by":"crossref","unstructured":"Anderson, P., Fernando, B., Johnson, M., and Gould, S. (2017). Guided Open Vocabulary Image Captioning with Constrained Beam Search. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics.","DOI":"10.18653\/v1\/D17-1098"},{"key":"ref_162","doi-asserted-by":"crossref","unstructured":"Balakrishnan, A., Rao, J., Upasani, K., White, M., and Subba, R. (2019). Constrained Decoding for Neural NLG from Compositional Representations in Task-Oriented Dialogue. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics.","DOI":"10.18653\/v1\/P19-1080"},{"key":"ref_163","doi-asserted-by":"crossref","unstructured":"Hokamp, C., and Liu, Q. (2017). Lexically constrained decoding for sequence generation using grid beam search. arXiv.","DOI":"10.18653\/v1\/P17-1141"},{"key":"ref_164","doi-asserted-by":"crossref","unstructured":"Post, M., and Vilar, D. (2018). Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), Association for Computational Linguistics.","DOI":"10.18653\/v1\/N18-1119"},{"key":"ref_165","doi-asserted-by":"crossref","unstructured":"Zhang, X., and Lapata, M. (2014). Chinese Poetry Generation with Recurrent Neural Networks. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics.","DOI":"10.3115\/v1\/D14-1074"},{"key":"ref_166","doi-asserted-by":"crossref","unstructured":"Ghazvininejad, M., Shi, X., Choi, Y., and Knight, K. (2016). Generating Topical Poetry. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics.","DOI":"10.18653\/v1\/D16-1126"},{"key":"ref_167","doi-asserted-by":"crossref","unstructured":"Hopkins, J., and Kiela, D. (2017). Automatically Generating Rhythmic Verse with Neural Networks. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Association for Computational Linguistics.","DOI":"10.18653\/v1\/P17-1016"},{"key":"ref_168","doi-asserted-by":"crossref","unstructured":"Andreas, J., and Klein, D. (2016). Reasoning about Pragmatics with Neural Listeners and Speakers. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics.","DOI":"10.18653\/v1\/D16-1125"},{"key":"ref_169","doi-asserted-by":"crossref","unstructured":"Cohn-Gordon, R., Goodman, N., and Potts, C. (2018). Pragmatically Informative Image Captioning with Character-Level Inference. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), Association for Computational Linguistics.","DOI":"10.18653\/v1\/N18-2070"},{"key":"ref_170","doi-asserted-by":"crossref","unstructured":"Vedantam, R., Bengio, S., Murphy, K., Parikh, D., and Chechik, G. (2017, January 21\u201326). Context-aware captions from context-agnostic supervision. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.120"},{"key":"ref_171","doi-asserted-by":"crossref","unstructured":"Zarrie\u00df, S., and Schlangen, D. (2019). Know What You Don\u2019t Know: Modeling a Pragmatic Speaker that Refers to Objects of Unknown Categories. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics.","DOI":"10.18653\/v1\/P19-1063"},{"key":"ref_172","unstructured":"Shen, S., Fried, D., Andreas, J., and Klein, D. (2019). Pragmatically Informative Text Generation. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Association for Computational Linguistics."},{"key":"ref_173","doi-asserted-by":"crossref","unstructured":"Kim, H., Kim, B., and Kim, G. (2020). Will I Sound Like Me? Improving Persona Consistency in Dialogues through Pragmatic Self-Consciousness. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics.","DOI":"10.18653\/v1\/2020.emnlp-main.65"},{"key":"ref_174","doi-asserted-by":"crossref","unstructured":"Gu, J., Cho, K., and Li, V.O. (2017). Trainable Greedy Decoding for Neural Machine Translation. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics.","DOI":"10.18653\/v1\/D17-1210"},{"key":"ref_175","doi-asserted-by":"crossref","unstructured":"Krahmer, E., and van Deemter, K. (2011). Computational Generation of Referring Expressions: A Survey. Comput. Linguist., 38.","DOI":"10.1162\/COLI_a_00088"},{"key":"ref_176","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1207\/s15516709cog1902_3","article-title":"Computational interpretations of the Gricean maxims in the generation of referring expressions","volume":"19","author":"Dale","year":"1995","journal-title":"Cogn. Sci."},{"key":"ref_177","doi-asserted-by":"crossref","first-page":"998","DOI":"10.1126\/science.1218633","article-title":"Predicting pragmatic reasoning in language games","volume":"336","author":"Frank","year":"2012","journal-title":"Science"},{"key":"ref_178","doi-asserted-by":"crossref","unstructured":"Du\u0161ek, O., Novikova, J., and Rieser, V. (2018). Findings of the E2E NLG Challenge. Proceedings of the 11th International Conference on Natural Language Generation, Association for Computational Linguistics.","DOI":"10.18653\/v1\/W18-6539"},{"key":"ref_179","unstructured":"Chen, Y., Cho, K., Bowman, S.R., and Li, V.O. (May, January 30). Stable and Effective Trainable Greedy Decoding for Sequence to Sequence Learning. Proceedings of the 6th International Conference on Learning Representations (ICLR 2018), Workshop Track, Online."},{"key":"ref_180","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1162\/tacl_a_00254","article-title":"Analysis methods in neural language processing: A survey","volume":"7","author":"Belinkov","year":"2019","journal-title":"Trans. Assoc. Comput. Linguist."},{"key":"ref_181","doi-asserted-by":"crossref","unstructured":"Devlin, J., Cheng, H., Fang, H., Gupta, S., Deng, L., He, X., Zweig, G., and Mitchell, M. (2015). Language Models for Image Captioning: The Quirks and What Works. Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), Association for Computational Linguistics.","DOI":"10.3115\/v1\/P15-2017"},{"key":"ref_182","doi-asserted-by":"crossref","unstructured":"Kazemzadeh, S., Ordonez, V., Matten, M., and Berg, T.L. (2014, January 25\u201329). ReferItGame: Referring to Objects in Photographs of Natural Scenes. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2014), Doha, Qatar.","DOI":"10.3115\/v1\/D14-1086"},{"key":"ref_183","doi-asserted-by":"crossref","unstructured":"De Vries, H., Strub, F., Chandar, S., Pietquin, O., Larochelle, H., and Courville, A. (2017, January 21\u201326). Guesswhat?! visual object discovery through multi-modal dialogue. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.475"},{"key":"ref_184","doi-asserted-by":"crossref","unstructured":"Grice, H.P. (1975). Logic and conversation. Speech Acts, Brill.","DOI":"10.1163\/9789004368811_003"},{"key":"ref_185","unstructured":"Clark, H.H. (1996). Using Language, Cambridge University Press."},{"key":"ref_186","doi-asserted-by":"crossref","unstructured":"Gkatzia, D., Hastie, H., and Lemon, O. (2014, January 26\u201330). Finding middle ground? Multi-objective Natural Language Generation from time-series data. Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, Gothenburg, Sweden.","DOI":"10.3115\/v1\/E14-4041"}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/12\/9\/355\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:52:46Z","timestamp":1760165566000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/12\/9\/355"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,30]]},"references-count":186,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2021,9]]}},"alternative-id":["info12090355"],"URL":"https:\/\/doi.org\/10.3390\/info12090355","relation":{},"ISSN":["2078-2489"],"issn-type":[{"value":"2078-2489","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,8,30]]}}}