{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:27:23Z","timestamp":1760146043340,"version":"build-2065373602"},"reference-count":51,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2024,9,21]],"date-time":"2024-09-21T00:00:00Z","timestamp":1726876800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Amazon com, Inc."}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Future Internet"],"abstract":"<jats:p>This article introduces Alquist 5.0, our SocialBot that was designed for the Alexa Prize SocialBot Grand Challenge 5. Building upon previous iterations, we present the integration of our novel neural response generator (NRG) Barista within a hybrid architecture that combines traditional predefined dialogues with advanced neural response generation. We provide a comprehensive analysis of the current state-of-the-art NRGs and large language models (LLMs), leveraging these insights to enhance Barista\u2019s capabilities. A key focus of our development was in ensuring the safety of our chatbot and implementing robust measures to prevent profanity and inappropriate content. Additionally, we incorporated a new search engine to improve information retrieval and response accuracy. Expanding the capabilities of our system, we designed Alquist 5.0 to accommodate multimodal devices, utilizing APL templates enriched with custom features to deliver an outstanding conversational experience complemented by an excellent user interface. This paper offers detailed insights into the development of Alquist 5.0, which effectively addresses evolving user demands while preserving its empathetic and knowledgeable conversational prowess across a wide range of topics.<\/jats:p>","DOI":"10.3390\/fi16090344","type":"journal-article","created":{"date-parts":[[2024,9,23]],"date-time":"2024-09-23T09:15:07Z","timestamp":1727082907000},"page":"344","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Alquist 5.0: Dialogue Trees Meet Generative Models, a Novel Approach for Enhancing SocialBot Conversations"],"prefix":"10.3390","volume":"16","author":[{"given":"Ondrej","family":"Kobza","sequence":"first","affiliation":[{"name":"Faculty of Electrical Engineering, Czech Technical University in Prague, Jugosl\u00e1vsk\u00fdch partyz\u00e1n\u016f 1580, 160 00 Praha, Czech Republic"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-1861-3778","authenticated-orcid":false,"given":"David","family":"Herel","sequence":"additional","affiliation":[{"name":"Faculty of Electrical Engineering, Czech Technical University in Prague, Jugosl\u00e1vsk\u00fdch partyz\u00e1n\u016f 1580, 160 00 Praha, Czech Republic"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jan","family":"Cuhel","sequence":"additional","affiliation":[{"name":"Faculty of Electrical Engineering, Czech Technical University in Prague, Jugosl\u00e1vsk\u00fdch partyz\u00e1n\u016f 1580, 160 00 Praha, Czech Republic"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tommaso","family":"Gargiani","sequence":"additional","affiliation":[{"name":"Faculty of Electrical Engineering, Czech Technical University in Prague, Jugosl\u00e1vsk\u00fdch partyz\u00e1n\u016f 1580, 160 00 Praha, Czech Republic"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Petr","family":"Marek","sequence":"additional","affiliation":[{"name":"Faculty of Electrical Engineering, Czech Technical University in Prague, Jugosl\u00e1vsk\u00fdch partyz\u00e1n\u016f 1580, 160 00 Praha, Czech Republic"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jan","family":"Sedivy","sequence":"additional","affiliation":[{"name":"Faculty of Electrical Engineering, Czech Technical University in Prague, Jugosl\u00e1vsk\u00fdch partyz\u00e1n\u016f 1580, 160 00 Praha, Czech Republic"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2024,9,21]]},"reference":[{"key":"ref_1","unstructured":"Johnston, M., Flagg, C., Gottardi, A., Sahai, S., Lu, Y., Sagi, S., Dai, L., Goyal, P., Hedayatnia, B., and Hu, L. (2023, January 12). Advancing Open Domain Dialog: The Fifth Alexa Prize SocialBot Grand Challenge. Proceedings of the Alexa Prize SocialBot Grand Challenge 5 Proceedings, Online."},{"key":"ref_2","unstructured":"Konr\u00e1d, J., Pichl, J., Marek, P., Lorenc, P., Ta, V.D., Kobza, O., H\u1ef3lov\u00e1, L., and \u0160ediv\u1ef3, J. (2021). Alquist 4.0: Towards social intelligence using generative models and dialogue personalization. arXiv."},{"key":"ref_3","unstructured":"Merlo, P., Tiedemann, J., and Tsarfaty, R. (2021, January 19\u201323). Recipes for Building an Open-Domain Chatbot. Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, Online."},{"key":"ref_4","unstructured":"Shuster, K., Xu, J., Komeili, M., Ju, D., Smith, E.M., Roller, S., Ung, M., Chen, M., Arora, K., and Lane, J. (2022). BlenderBot 3: A deployed conversational agent that continually learns to responsibly engage. arXiv."},{"key":"ref_5","unstructured":"Jurafsky, D., and Martin, J.H. (2009). Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, Pearson Prentice Hall."},{"key":"ref_6","unstructured":"Pichl, J., Marek, P., Konr\u00e1d, J., Matul\u00edk, M., and \u0160ediv\u00fd, J. (2020). Alquist 2.0: Alexa Prize Socialbot Based on Sub-Dialogue Models. arXiv."},{"key":"ref_7","unstructured":"Lemon, O., Hakkani-Tur, D., Li, J.J., Ashrafzadeh, A., Garcia, D.H., Alikhani, M., Vandyke, D., and Du\u0161ek, O. (2022, January 7\u20139). Neural Generation Meets Real People: Building a Social, Informative Open-Domain Dialogue Agent. Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, Edinburgh, UK."},{"key":"ref_8","unstructured":"Moschitti, A., Pang, B., and Daelemans, W. (2014, January 25\u201329). Learning Phrase Representations using RNN Encoder\u2013Decoder for Statistical Machine Translation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long Short-term Memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput."},{"key":"ref_10","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., and Polosukhin, I. (2017). Attention Is All You Need. arXiv."},{"key":"ref_11","unstructured":"Devlin, J., Chang, M.W., Lee, K., and Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv."},{"key":"ref_12","unstructured":"Lan, Z., Chen, M., Goodman, S., Gimpel, K., Sharma, P., and Soricut, R. (2020). ALBERT: A Lite BERT for Self-supervised Learning of Language Representations. arXiv."},{"key":"ref_13","unstructured":"Radford, A., and Narasimhan, K. (2018). Improving Language Understanding by Generative Pre-Training, OpenAI."},{"key":"ref_14","first-page":"1","article-title":"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer","volume":"21","author":"Raffel","year":"2020","journal-title":"J. Mach. Learn. Res."},{"key":"ref_15","unstructured":"Celikyilmaz, A., and Wen, T.H. (2020, January 5\u201310). DIALOGPT: Large-Scale Generative Pre-training for Conversational Response Generation. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Online."},{"key":"ref_16","unstructured":"Touvron, H., Lavril, T., Izacard, G., Martinet, X., Lachaux, M.A., Lacroix, T., Rozi\u00e8re, B., Goyal, N., Hambro, E., and Azhar, F. (2023). LLaMA: Open and Efficient Foundation Language Models. arXiv."},{"key":"ref_17","unstructured":"Penedo, G., Malartic, Q., Hesslow, D., Cojocaru, R., Cappelli, A., Alobeidli, H., Pannier, B., Almazrouei, E., and Launay, J. (2023). The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only. arXiv."},{"key":"ref_18","unstructured":"He, P., Liu, X., Gao, J., and Chen, W. (2021, January 3\u20137). DeBERTa: Decoding-enhanced BERT with Disentangled Attention. Proceedings of the International Conference on Learning Representations, Virtual Event."},{"key":"ref_19","unstructured":"Chung, H.W., Hou, L., Longpre, S., Zoph, B., Tay, Y., Fedus, W., Li, Y., Wang, X., Dehghani, M., and Brahma, S. (2022). Scaling Instruction-Finetuned Language Models. arXiv."},{"key":"ref_20","unstructured":"Merlo, P., Tiedemann, J., and Tsarfaty, R. (2021, January 19\u201323). Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering. Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, Online."},{"key":"ref_21","unstructured":"Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., K\u00fcttler, H., Lewis, M., tau Yih, W., and Rockt\u00e4schel, T. (2021). Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. arXiv."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"71","DOI":"10.32604\/iasc.2023.032783","article-title":"ALBERT with Knowledge Graph Encoder Utilizing Semantic Similarity for Commonsense Question Answering","volume":"36","author":"Choi","year":"2023","journal-title":"Intell. Autom. Soft Comput."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Herel, D., Cisneros, H., and Mikolov, T. (2022). Preserving Semantics in Textual Adversarial Attacks. arXiv.","DOI":"10.3233\/FAIA230376"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3488534","article-title":"A Transformer-based Framework for Neutralizing and Reversing the Political Polarity of News Articles","volume":"5","author":"Liu","year":"2021","journal-title":"Proc. ACM Hum.-Comput. Interact."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Davidson, T., Warmsley, D., Macy, M., and Weber, I. (2017, January 15\u201318). Automated Hate Speech Detection and the Problem of Offensive Language. Proceedings of the 11th International AAAI Conference on Web and Social Media, ICWSM \u201917, Montreal, QC, Canada.","DOI":"10.1609\/icwsm.v11i1.14955"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Wulczyn, E., Thain, N., and Dixon, L. (2017, January 3\u20137). Ex machina: Personal attacks seen at scale. Proceedings of the 26th International World Wide Web Conference, WWW 2017, Perth, Australia.","DOI":"10.1145\/3038912.3052591"},{"key":"ref_27","unstructured":"Li, H., Levow, G.A., Yu, Z., Gupta, C., Sisman, B., Cai, S., Vandyke, D., Dethlefs, N., Wu, Y., and Li, J.J. (2021, January 29\u201331). Assessing Political Prudence of Open-domain Chatbots. Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, Online."},{"key":"ref_28","unstructured":"Zong, C., Xia, F., Li, W., and Navigli, R. (2021, January 1\u20136). RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Online."},{"key":"ref_29","unstructured":"Riloff, E., Chiang, D., Hockenmaier, J., and Tsujii, J. (November, January 31). Learning Gender-Neutral Word Embeddings. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium."},{"key":"ref_30","unstructured":"Walker, M., Ji, H., and Stent, A. (2018, January 1\u20136). Gender Bias in Coreference Resolution. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), New Orleans, LA, USA."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"012133","DOI":"10.1088\/1757-899X\/879\/1\/012133","article-title":"Effect of User Interface and User Experience on Application Sales","volume":"879","author":"Pratama","year":"2020","journal-title":"IOP Conf. Ser. Mater. Sci. Eng."},{"key":"ref_32","unstructured":"Rodr\u00edguez Cantelar, M., Cal, D.d.l., Estecha Garitagoitia, M.S., Grande Guti\u00e9rrez, A., Mart\u00edn, D., Rodr\u00edguez N\u00fa\u00f1ez-Milara, N., Mart\u00ednez JIm\u00e9nez, R., and D\u2019Haro Enr\u00edquez, L.F. (2021). Genuine2: An open domain chatbot based on generative models. Alexa Prize SocialBot Grand Challenge 4 Proceedings, Amazon Science."},{"key":"ref_33","unstructured":"Adiwardana, D., Luong, M.T., So, D.R., Hall, J., Fiedel, N., Thoppilan, R., Yang, Z., Kulshreshtha, A., Nemade, G., and Lu, Y. (2020). Towards a Human-like Open-Domain Chatbot. arXiv."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Roller, S., Dinan, E., Goyal, N., Ju, D., Williamson, M., Liu, Y., Xu, J., Ott, M., Shuster, K., and Smith, E.M. (2020). Recipes for building an open-domain chatbot. arXiv.","DOI":"10.18653\/v1\/2021.eacl-main.24"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Xu, J., Szlam, A., and Weston, J. (2021). Beyond Goldfish Memory: Long-Term Open-Domain Conversation. arXiv.","DOI":"10.18653\/v1\/2022.acl-long.356"},{"key":"ref_36","unstructured":"Muresan, S., Nakov, P., and Villavicencio, A. (2022, January 22\u201327). Internet-Augmented Dialogue Generation. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Dublin, Ireland."},{"key":"ref_37","unstructured":"Lee, J., Shim, M., Son, S., Park, C., Kim, Y., and Lim, H. (2022). There is no rose without a thorn: Finding weaknesses on BlenderBot 2.0 in terms of Model, Data and User-Centric Approach. arXiv."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Kobza, O., Herel, D., Cuhel, J., Gargiani, T., Pichl, J., Marek, P., Konrad, J., and Sedivy, J. (2023). Enhancements in BlenderBot 3: Expanding Beyond a Singular Model Governance and Boosting Generational Performance. Future Internet, 15.","DOI":"10.3390\/fi15120384"},{"key":"ref_39","unstructured":"Khatri, C., Hedayatnia, B., Venkatesh, A., Nunn, J., Pan, Y., Liu, Q., Song, H., Gottardi, A., Kwatra, S., and Pancholi, S. (2018). Advancing the State of the Art in Open Domain Dialog Systems through the Alexa Prize. arXiv."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Gopalakrishnan, K., Hedayatnia, B., Chen, Q., Gottardi, A., Kwatra, S., Venkatesh, A., Gabriel, R., and Hakkani-T\u00fcr, D. (2019). Topical-Chat: Towards Knowledge-Grounded Open-Domain Conversations. Proc. Interspeech, 1891\u20131895.","DOI":"10.21437\/Interspeech.2019-3079"},{"key":"ref_41","unstructured":"Korhonen, A., Traum, D., and M\u00e0rquez, L. (August, January 28). Towards Empathetic Open-domain Conversation Models: A New Benchmark and Dataset. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy."},{"key":"ref_42","unstructured":"Nakamura, S., Gasic, M., Zuckerman, I., Skantze, G., Nakano, M., Papangelis, A., Ultes, S., and Yoshino, K. (2019, January 11\u201313). Coached Conversational Preference Elicitation: A Case Study in Understanding Movie Preferences. Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, Stockholm, Sweden."},{"key":"ref_43","unstructured":"Hu, E.J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., and Chen, W. (2022, January 25\u201329). LoRA: Low-Rank Adaptation of Large Language Models. Proceedings of the International Conference on Learning Representations, Virtual Event."},{"key":"ref_44","unstructured":"Soltan, S., Ananthakrishnan, S., FitzGerald, J.G.M., Gupta, R., Hamza, W., Khan, H., Peris, C., Rawls, S., Rosenbaum, A., and Rumshisky, A. (2022). AlexaTM 20B: Few-shot learning using a large-scale multilingual seq2seq model. arXiv."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"1","DOI":"10.29297\/orbit.v1i2.49","article-title":"Why we should have seen that coming: Comments on microsoft\u2019s tay \u201cexperiment\u201d, and wider implications","volume":"1","author":"Wolf","year":"2017","journal-title":"Orbit J."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"103541","DOI":"10.1109\/ACCESS.2021.3098979","article-title":"When the Timeline Meets the Pipeline: A Survey on Automated Cyberbullying Detection","volume":"9","author":"Elsafoury","year":"2021","journal-title":"IEEE Access"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Sun, H., Xu, G., Deng, J., Cheng, J., Zheng, C., Zhou, H., Peng, N., Zhu, X., and Huang, M. (2022, January 22\u201327). On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark. Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, Dublin, Ireland.","DOI":"10.18653\/v1\/2022.findings-acl.308"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Nadeem, M., Bethke, A., and Reddy, S. (2020). StereoSet: Measuring stereotypical bias in pretrained language models. arXiv.","DOI":"10.18653\/v1\/2021.acl-long.416"},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1162\/tacl_a_00051","article-title":"Enriching Word Vectors with Subword Information","volume":"5","author":"Bojanowski","year":"2017","journal-title":"Trans. Assoc. Comput. Linguist."},{"key":"ref_50","unstructured":"Le, C.P., Dai, L., Johnston, M., Liu, Y., Walker, M., and Ghanadan, R. (2023). Improving open-domain dialogue evaluation with a causal inference model. arXiv."},{"key":"ref_51","unstructured":"Antypas, D., Ushio, A., Camacho-Collados, J., Silva, V., Neves, L., and Barbieri, F. (2022, January 12\u201317). Twitter Topic Classification. Proceedings of the 29th International Conference on Computational Linguistics, Gyeongju, Republic of Korea."}],"container-title":["Future Internet"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-5903\/16\/9\/344\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T16:01:31Z","timestamp":1760112091000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-5903\/16\/9\/344"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,21]]},"references-count":51,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2024,9]]}},"alternative-id":["fi16090344"],"URL":"https:\/\/doi.org\/10.3390\/fi16090344","relation":{},"ISSN":["1999-5903"],"issn-type":[{"type":"electronic","value":"1999-5903"}],"subject":[],"published":{"date-parts":[[2024,9,21]]}}}