{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,9,8]],"date-time":"2024-09-08T12:59:47Z","timestamp":1725800387468},"reference-count":0,"publisher":"Sociedade Brasileira de Computa\u00e7\u00e3o","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"abstract":"<jats:p>Textual similarity deals with determining how similar two pieces of texts are, considering the lexical (surface forms) or semantic (meaning) closeness. In this paper we applied word embeddings for measuring e-commerce product title similarity in Brazilian Portuguese. We generated some domainspecific word embeddings (using Word2Vec, FastText and GloVe) and compared them with general-domain models (word embeddings and BERT models). We concluded that the cosine similarity calculated using the domain-specific word embeddings was a good approach to distinguish between similar and nonsimilar products, but the multilingual BERT pre-trained model proved to be the best one.<\/jats:p>","DOI":"10.5753\/stil.2021.17791","type":"proceedings-article","created":{"date-parts":[[2021,12,6]],"date-time":"2021-12-06T10:11:08Z","timestamp":1638785468000},"page":"121-132","source":"Crossref","is-referenced-by-count":1,"title":["Measuring Brazilian Portuguese Product Titles Similarity using Embeddings"],"prefix":"10.5753","author":[{"given":"Alan da Silva","family":"Romualdo","sequence":"first","affiliation":[]},{"given":"Livy","family":"Real","sequence":"additional","affiliation":[]},{"given":"Helena de Medeiros","family":"Caseli","sequence":"additional","affiliation":[]}],"member":"3742","published-online":{"date-parts":[[2021,11,29]]},"event":{"name":"Simp\u00f3sio Brasileiro de Tecnologia da Informa\u00e7\u00e3o e da Linguagem Humana","number":"13","location":"Brasil","acronym":"STIL 2021"},"container-title":["Anais do XIII Simp\u00f3sio Brasileiro de Tecnologia da Informa\u00e7\u00e3o e da Linguagem Humana (STIL 2021)"],"original-title":[],"link":[{"URL":"https:\/\/sol.sbc.org.br\/index.php\/stil\/article\/download\/17791\/17625","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/sol.sbc.org.br\/index.php\/stil\/article\/download\/17791\/17625","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,12,6]],"date-time":"2021-12-06T10:12:30Z","timestamp":1638785550000},"score":1,"resource":{"primary":{"URL":"https:\/\/sol.sbc.org.br\/index.php\/stil\/article\/view\/17791"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,29]]},"references-count":0,"URL":"https:\/\/doi.org\/10.5753\/stil.2021.17791","relation":{},"subject":[],"published":{"date-parts":[[2021,11,29]]}}}