{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,29]],"date-time":"2025-06-29T12:03:51Z","timestamp":1751198631421,"version":"3.40.5"},"reference-count":0,"publisher":"IOS Press","isbn-type":[{"type":"print","value":"9781643684369"},{"type":"electronic","value":"9781643684376"}],"license":[{"start":{"date-parts":[[2023,9,28]],"date-time":"2023-09-28T00:00:00Z","timestamp":1695859200000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,9,28]]},"abstract":"<jats:p>Explaining artificial intelligence models can be utilized to launch targeted adversarial attacks on text classification algorithms. Understanding the reasoning behind the model\u2019s decisions makes it easier to prepare such samples. Most of the current text-based adversarial attacks rely on brute-force by using SHAP approach to identify the importance of tokens in the samples, we modify the crucial ones to prepare targeted attacks. We base our results on experiments using 5 datasets. Our results show that our approach outperforms TextBugger and TextFooler, achieving better results with 4 out of 5 datasets against TextBugger, and 3 out of 5 datasets against TextFooler, while minimizing perturbation introduced to the texts. In particular, we managed to outperform the efficacy of TextFooler by over 3100% and TextBugger by over 420% on the WikiPL dataset, additionally keeping high cosine similarity between the original text sample and the adversarial example. The evaluation of the results was additionally supported through a survey to assess their quality and ensure that the text perturbations did not change the intended class according to subjective, human classification.<\/jats:p>","DOI":"10.3233\/faia230356","type":"book-chapter","created":{"date-parts":[[2023,9,29]],"date-time":"2023-09-29T09:10:44Z","timestamp":1695978644000},"source":"Crossref","is-referenced-by-count":1,"title":["Do Not Trust Me: Explainability Against Text Classification"],"prefix":"10.3233","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0620-8123","authenticated-orcid":false,"given":"Mateusz","family":"Gniewkowski","sequence":"first","affiliation":[{"name":"Wroc\u0142aw University of Technology"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-0381-9202","authenticated-orcid":false,"given":"Pawe\u0142","family":"Walkowiak","sequence":"additional","affiliation":[{"name":"Wroc\u0142aw University of Technology"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0266-5802","authenticated-orcid":false,"given":"Piotr","family":"Syga","sequence":"additional","affiliation":[{"name":"Wroc\u0142aw University of Technology"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3141-8712","authenticated-orcid":false,"given":"Marek","family":"Klonowski","sequence":"additional","affiliation":[{"name":"Wroc\u0142aw University of Technology"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7749-4251","authenticated-orcid":false,"given":"Tomasz","family":"Walkowiak","sequence":"additional","affiliation":[{"name":"Wroc\u0142aw University of Technology"}]}],"member":"7437","container-title":["Frontiers in Artificial Intelligence and Applications","ECAI 2023"],"original-title":[],"link":[{"URL":"https:\/\/ebooks.iospress.nl\/pdf\/doi\/10.3233\/FAIA230356","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,29]],"date-time":"2023-09-29T09:10:46Z","timestamp":1695978646000},"score":1,"resource":{"primary":{"URL":"https:\/\/ebooks.iospress.nl\/doi\/10.3233\/FAIA230356"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,28]]},"ISBN":["9781643684369","9781643684376"],"references-count":0,"URL":"https:\/\/doi.org\/10.3233\/faia230356","relation":{},"ISSN":["0922-6389","1879-8314"],"issn-type":[{"type":"print","value":"0922-6389"},{"type":"electronic","value":"1879-8314"}],"subject":[],"published":{"date-parts":[[2023,9,28]]}}}