{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,20]],"date-time":"2026-02-20T19:04:25Z","timestamp":1771614265552,"version":"3.50.1"},"reference-count":42,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2021,3,26]],"date-time":"2021-03-26T00:00:00Z","timestamp":1616716800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000185","name":"Defense Advanced Research Projects Agency","doi-asserted-by":"publisher","award":["FA8650-18-C-7823"],"award-info":[{"award-number":["FA8650-18-C-7823"]}],"id":[{"id":"10.13039\/100000185","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>With the online presence of more than half the world population, social media plays a very important role in the lives of individuals as well as businesses alike. Social media enables businesses to advertise their products, build brand value, and reach out to their customers. To leverage these social media platforms, it is important for businesses to process customer feedback in the form of posts and tweets. Sentiment analysis is the process of identifying the emotion, either positive, negative or neutral, associated with these social media texts. The presence of sarcasm in texts is the main hindrance in the performance of sentiment analysis. Sarcasm is a linguistic expression often used to communicate the opposite of what is said, usually something that is very unpleasant, with an intention to insult or ridicule. Inherent ambiguity in sarcastic expressions makes sarcasm detection very difficult. In this work, we focus on detecting sarcasm in textual conversations from various social networking platforms and online media. 
To this end, we develop an interpretable deep learning model using multi-head self-attention and gated recurrent units. The multi-head self-attention module aids in identifying crucial sarcastic cue-words from the input, and the recurrent units learn long-range dependencies between these cue-words to better classify the input text. We show the effectiveness of our approach by achieving state-of-the-art results on multiple datasets from social networking platforms and online media. Models trained using our proposed approach are easily interpretable and enable identifying sarcastic cues in the input text which contribute to the final classification score. We visualize the learned attention weights on a few sample input texts to showcase the effectiveness and interpretability of our model.<\/jats:p>","DOI":"10.3390\/e23040394","type":"journal-article","created":{"date-parts":[[2021,3,26]],"date-time":"2021-03-26T13:17:53Z","timestamp":1616764673000},"page":"394","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":39,"title":["Interpretable Multi-Head Self-Attention Architecture for Sarcasm Detection in Social Media"],"prefix":"10.3390","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4008-075X","authenticated-orcid":false,"given":"Ramya","family":"Akula","sequence":"first","affiliation":[{"name":"Complex Adaptive Systems Lab, Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3302-9382","authenticated-orcid":false,"given":"Ivan","family":"Garibay","sequence":"additional","affiliation":[{"name":"Complex Adaptive Systems Lab, Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA"}]}],"member":"1968","published-online":{"date-parts":[[2021,3,26]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"288","DOI":"10.1037\/0894-4105.19.3.288","article-title":"The 
neuroanatomical basis of understanding sarcasm and its relationship to social cognition","volume":"19","author":"Tomer","year":"2005","journal-title":"Neuropsychology"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Skalicky, S., and Crossley, S. (2018, January 6). Linguistic Features of Sarcasm and Metaphor Production Quality. Proceedings of the Workshop on Figurative Language Processing, New Orleans, LA, USA.","DOI":"10.18653\/v1\/W18-0902"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Kreuz, R.J., and Caucci, G.M. (2007, January 26). Lexical influences on the perception of sarcasm. Proceedings of the Workshop on Computational Approaches to Figurative Language, Association for Computational Linguistics, Rochester, NY, USA.","DOI":"10.3115\/1611528.1611529"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Joshi, A., Sharma, V., and Bhattacharyya, P. (2015, January 26\u201331). Harnessing context incongruity for sarcasm detection. Proceedings of the 53rd Annual Meeting of the ACL and the 7th IJCNLP, Beijing, China.","DOI":"10.3115\/v1\/P15-2124"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Ghosh, A., and Veale, T. (2017, January 7\u201311). Magnets for sarcasm: Making sarcasm detection timely, contextual and very personal. Proceedings of the 2017 Conference on EMNLP, Copenhagen, Denmark.","DOI":"10.18653\/v1\/D17-1050"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Ilic, S., Marrese-Taylor, E., Balazs, J., and Matsuo, Y. (2018, January 31). Deep contextualized word representations for detecting sarcasm and irony. Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, Brussels, Belgium.","DOI":"10.18653\/v1\/W18-6202"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"755","DOI":"10.1162\/coli_a_00336","article-title":"Sarcasm analysis using conversation context","volume":"44","author":"Ghosh","year":"2018","journal-title":"Comput. 
Linguist."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Xiong, T., Zhang, P., Zhu, H., and Yang, Y. (2019, January 13\u201317). Sarcasm Detection with Self-matching Networks and Low-rank Bilinear Pooling. Proceedings of the World Wide Web Conference, San Francisco, CA, USA.","DOI":"10.1145\/3308558.3313735"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Liu, L., Priestley, J.L., Zhou, Y., Ray, H.E., and Han, M. (2019). A2text-net: A novel deep neural network for sarcasm detection. Proceedings of the 2019 IEEE First International Conference on Cognitive Machine Intelligence (CogMI), IEEE.","DOI":"10.1109\/CogMI48466.2019.00025"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Carvalho, P., Sarmento, L., Silva, M.J., and De Oliveira, E. (2009). Clues for detecting irony in user-generated contents: Oh...!! it\u2019s so easy. Proceedings of the 1st International CIKM Workshop on Topic-Sentiment Analysis for Mass Opinion, Association for Computing Machinery.","DOI":"10.1145\/1651461.1651471"},{"key":"ref_11","unstructured":"Gonz\u00e1lez-Ib\u00e1nez, R., Muresan, S., and Wacholder, N. (2011, January 19\u201324). Identifying sarcasm in Twitter: A closer look. Proceedings of the 49th Annual Meeting of the ACL: Human Language Technologies: Short Papers, Portland, OR, USA."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Tsur, O., Davidov, D., and Rappoport, A. (2010, January 23\u201326). ICWSM\u2014A great catchy name: Semi-supervised recognition of sarcastic sentences in online product reviews. Proceedings of the Fourth International AAAI Conference on Weblogs and Social Media, Washington, DC, USA.","DOI":"10.1609\/icwsm.v4i1.14018"},{"key":"ref_13","unstructured":"Davidov, D., Tsur, O., and Rappoport, A. (2010). Semi-supervised recognition of sarcastic sentences in twitter and amazon. 
Proceedings of the Fourteenth Conference on Computational Natural Language Learning, Association for Computational Linguistics."},{"key":"ref_14","unstructured":"Riloff, E., Qadir, A., Surve, P., De Silva, L., Gilbert, N., and Huang, R. (2013, January 18\u201321). Sarcasm as contrast between a positive sentiment and negative situation. Proceedings of the 2013 Conference on EMNLP, Seattle, WA, USA."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Wallace, B.C., and Charniak, E. (2015, January 26\u201331). Sparse, contextually informed models for irony detection: Exploiting user communities, entities and sentiment. Proceedings of the 53rd Annual Meeting of the ACL and the 7th IJCNLP, Beijing, China.","DOI":"10.3115\/v1\/P15-1100"},{"key":"ref_16","unstructured":"Poria, S., Cambria, E., Hazarika, D., and Vij, P. (2016, January 11\u201316). A Deeper Look into Sarcastic Tweets Using Deep Convolutional Neural Networks. Proceedings of the COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, Osaka, Japan."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Amir, S., Wallace, B.C., Lyu, H., Carvalho, P., and Silva, M.J. (2016, January 11\u201312). Modelling Context with User Embeddings for Sarcasm Detection in Social Media. Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, Berlin, Germany.","DOI":"10.18653\/v1\/K16-1017"},{"key":"ref_18","unstructured":"Hazarika, D., Poria, S., Gorantla, S., Cambria, E., Zimmermann, R., and Mihalcea, R. (2018, January 20\u201326). CASCADE: Contextual Sarcasm Detection in Online Discussion Forums. Proceedings of the 27th International Conference on Computational Linguistics, Santa Fe, NM, USA."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Rajadesingan, A., Zafarani, R., and Liu, H. (2015, January 2\u20136). Sarcasm detection on twitter: A behavioral modeling approach. 
Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, Shanghai, China.","DOI":"10.1145\/2684822.2685316"},{"key":"ref_20","unstructured":"Zhang, M., Zhang, Y., and Fu, G. (2016, January 11\u201316). Tweet sarcasm detection using deep neural network. Proceedings of the COLING 2016, The 26th International Conference on Computational Linguistics: Technical Papers, Osaka, Japan."},{"key":"ref_21","unstructured":"Pt\u00e1\u010dek, T., Habernal, I., and Hong, J. (2014, January 23\u201329). Sarcasm detection on czech and english twitter. Proceedings of the COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers, Dublin, Ireland."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Wang, Z., Wu, Z., Wang, R., and Ren, Y. (2015). Twitter sarcasm detection exploiting a context-based model. Proceedings of the International Conference on Web Information Systems Engineering, Springer.","DOI":"10.1007\/978-3-319-26190-4_6"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Joshi, A., Tripathi, V., Bhattacharyya, P., and Carman, M. (2016, January 11\u201312). Harnessing sequence labeling for sarcasm detection in dialogue from tv series \u2018friends\u2019. Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, Berlin, Germany.","DOI":"10.18653\/v1\/K16-1015"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Ghosh, A., and Veale, T. (2016). Fracking sarcasm using neural network. Proceedings of the 7th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, Association for Computational Linguistics.","DOI":"10.18653\/v1\/W16-0425"},{"key":"ref_25","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (2017, January 4\u20139). Attention is all you need. 
Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C., Moi, A., Cistac, P., Rault, T., Louf, R., and Funtowicz, M. (2019). HuggingFace\u2019s Transformers: State-of-the-art Natural Language Processing. arXiv.","DOI":"10.18653\/v1\/2020.emnlp-demos.6"},{"key":"ref_27","unstructured":"Mikolov, T., Chen, K., Corrado, G., and Dean, J. (2013). Efficient estimation of word representations in vector space. arXiv."},{"key":"ref_28","unstructured":"Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., and Dean, J. (2013, January 5\u201310). Distributed representations of words and phrases and their compositionality. Proceedings of the Advances in Neural Information Processing Systems, Lake Tahoe, NV, USA."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Pennington, J., Socher, R., and Manning, C. (2014, January 25\u201329). Glove: Global vectors for word representation. Proceedings of the 2014 Conference on EMNLP, Doha, Qatar.","DOI":"10.3115\/v1\/D14-1162"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Joulin, A., Grave, \u00c9., Bojanowski, P., and Mikolov, T. (2017, January 3\u20137). Bag of Tricks for Efficient Text Classification. Proceedings of the 15th Conference of the European Chapter of the ACL, Valencia, Spain.","DOI":"10.18653\/v1\/E17-2068"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Peters, M.E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., and Zettlemoyer, L. (2018, January 1\u20136). Deep contextualized word representations. Proceedings of the NAACL-HLT, New Orleans, LA, USA.","DOI":"10.18653\/v1\/N18-1202"},{"key":"ref_32","unstructured":"Devlin, J., Chang, M.W., Lee, K., and Toutanova, K. (2019, January 2\u20137). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. 
Proceedings of the 2019 Conference of NAACL: Human Language Technologies, Minneapolis, MN, USA."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., and Torralba, A. (2016, January 27\u201330). Learning deep features for discriminative localization. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.319"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., and Batra, D. (2017, January 22\u201329). Grad-cam: Visual explanations from deep networks via gradient-based localization. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.74"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Oraby, S., Harrison, V., Reed, L., Hernandez, E., Riloff, E., and Walker, M. (2016, January 13\u201315). Creating and Characterizing a Diverse Corpus of Sarcasm in Dialogue. Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Los Angeles, CA, USA.","DOI":"10.18653\/v1\/W16-3604"},{"key":"ref_36","unstructured":"Walker, M.A., Tree, J.E.F., Anand, P., Abbott, R., and King, J. (2012, January 23\u201325). A Corpus for Research on Deliberation and Debate. Proceedings of the LREC, Istanbul, Turkey."},{"key":"ref_37","unstructured":"Khodak, M., Saunshi, N., and Vodrahalli, K. (2018, January 7\u201312). A Large Self-Annotated Corpus for Sarcasm. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan."},{"key":"ref_38","unstructured":"Misra, R., and Arora, P. (2019). Sarcasm Detection using Hybrid Neural Network. arXiv."},{"key":"ref_39","unstructured":"Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., and Antiga, L. (2019, January 8\u201314). 
PyTorch: An Imperative Style, High-Performance Deep Learning Library. Proceedings of the Advances in Neural Information Processing Systems 32, Vancouver, BC, Canada."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Far\u00edas, D.I.H., Patti, V., and Rosso, P. (2016). Irony detection in twitter: The role of affective content. Proceedings of the ACM Transactions on Internet Technology (TOIT), Association for Computing Machinery.","DOI":"10.1145\/2930663"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Tay, Y., Luu, A.T., Hui, S.C., and Su, J. (2018, January 15\u201320). Reasoning with Sarcasm by Reading In-Between. Proceedings of the 56th Annual Meeting of the ACL, Melbourne, Australia.","DOI":"10.18653\/v1\/P18-1093"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Clark, K., Khandelwal, U., Levy, O., and Manning, C.D. (2019, January 1). What Does BERT Look at? An Analysis of BERT\u2019s Attention. Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, Florence, Italy.","DOI":"10.18653\/v1\/W19-4828"}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/23\/4\/394\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T05:41:28Z","timestamp":1760161288000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/23\/4\/394"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,26]]},"references-count":42,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2021,4]]}},"alternative-id":["e23040394"],"URL":"https:\/\/doi.org\/10.3390\/e23040394","relation":{},"ISSN":["1099-4300"],"issn-type":[{"value":"1099-4300","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,3,26]]}}}