{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,9,6]],"date-time":"2024-09-06T21:03:31Z","timestamp":1725656611495},"reference-count":0,"publisher":"Sociedade Brasileira de Computa\u00e7\u00e3o","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"abstract":"<jats:p>A chatbot is an artificial-intelligence-based system aimed at chatting with users, commonly used as a virtual assistant to help people or answer questions. Intent classification is an essential task for chatbots: it aims to identify what the user wants in a given dialogue. However, for many domains, little data are available to properly train those systems. In this work, we evaluate the performance of two methods to generate synthetic data for chatbots, one based on template questions and another based on neural text generation. We build four datasets that are used to train chatbot components in the intent classification task. We intend to simulate the task of migrating a search-based portal to an interactive dialogue-based information service by using artificial datasets for initial model training. 
Our results show that template-based datasets are slightly superior to neural-generated ones in our application domain; however, neural-generated datasets present good results and are a viable option when one has limited access to domain experts to hand-code text templates.<\/jats:p>","DOI":"10.5753\/stil.2021.17806","type":"proceedings-article","created":{"date-parts":[[2021,12,6]],"date-time":"2021-12-06T15:11:08Z","timestamp":1638803468000},"page":"265-274","source":"Crossref","is-referenced-by-count":1,"title":["Evaluation of Synthetic Datasets Generation for Intent Classification Tasks in Portuguese"],"prefix":"10.5753","author":[{"given":"Robson T.","family":"Paula","sequence":"first","affiliation":[]},{"given":"D\u00e9cio G.","family":"Aguiar Neto","sequence":"additional","affiliation":[]},{"given":"Davi","family":"Romero","sequence":"additional","affiliation":[]},{"given":"Paulo T.","family":"Guerra","sequence":"additional","affiliation":[]}],"member":"3742","published-online":{"date-parts":[[2021,11,29]]},"event":{"name":"Simp\u00f3sio Brasileiro de Tecnologia da Informa\u00e7\u00e3o e da Linguagem Humana","number":"13","location":"Brasil","acronym":"STIL 2021"},"container-title":["Anais do XIII Simp\u00f3sio Brasileiro de Tecnologia da Informa\u00e7\u00e3o e da Linguagem Humana (STIL 
2021)"],"original-title":[],"link":[{"URL":"https:\/\/sol.sbc.org.br\/index.php\/stil\/article\/download\/17806\/17640","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/sol.sbc.org.br\/index.php\/stil\/article\/download\/17806\/17640","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,12,6]],"date-time":"2021-12-06T15:11:49Z","timestamp":1638803509000},"score":1,"resource":{"primary":{"URL":"https:\/\/sol.sbc.org.br\/index.php\/stil\/article\/view\/17806"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,29]]},"references-count":0,"URL":"https:\/\/doi.org\/10.5753\/stil.2021.17806","relation":{},"subject":[],"published":{"date-parts":[[2021,11,29]]}}}