{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,2]],"date-time":"2026-02-02T07:58:24Z","timestamp":1770019104670,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":40,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,6,26]],"date-time":"2022-06-26T00:00:00Z","timestamp":1656201600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,6,26]]},"DOI":"10.1145\/3501247.3531557","type":"proceedings-article","created":{"date-parts":[[2022,6,24]],"date-time":"2022-06-24T22:41:09Z","timestamp":1656110469000},"page":"382-389","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":19,"title":["Multimodal Zero-Shot Hateful Meme Detection"],"prefix":"10.1145","author":[{"given":"Jiawen","family":"Zhu","sequence":"first","affiliation":[{"name":"Singapore University of Technology and Design, Singapore"}]},{"given":"Roy Ka-Wei","family":"Lee","sequence":"additional","affiliation":[{"name":"Singapore University of Technology and Design, Singapore"}]},{"given":"Wen Haw","family":"Chong","sequence":"additional","affiliation":[{"name":"Singapore Management University, Singapore"}]}],"member":"320","published-online":{"date-parts":[[2022,6,26]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-75762-5_55"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3041021.3054223"},{"key":"e_1_3_2_1_3_1","volume-title":"CVAE-GAN: fine-grained image generation through asymmetric training","author":"Bao Jianmin","unstructured":"Jianmin Bao , Dong Chen , Fang Wen , Houqiang Li , and Gang Hua . 2017. CVAE-GAN: fine-grained image generation through asymmetric training . 
In IEEE ICCV. 2745\u20132754. Jianmin Bao, Dong Chen, Fang Wen, Houqiang Li, and Gang Hua. 2017. CVAE-GAN: fine-grained image generation through asymmetric training. In IEEE ICCV. 2745\u20132754."},{"key":"e_1_3_2_1_4_1","volume-title":"Generalized Zero-Shot Learning Using Multimodal Variational Auto-Encoder With Semantic Concepts","author":"Bendre Nihar","unstructured":"Nihar Bendre , Kevin Desai , and Peyman Najafirad . 2021. Generalized Zero-Shot Learning Using Multimodal Variational Auto-Encoder With Semantic Concepts . In IEEE ICIP. 1284\u20131288. Nihar Bendre, Kevin Desai, and Peyman Najafirad. 2021. Generalized Zero-Shot Learning Using Multimodal Variational Auto-Encoder With Semantic Concepts. In IEEE ICIP. 1284\u20131288."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394231.3397890"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3091478.3091487"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/SocialCom-PASSAT.2012.55"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"crossref","unstructured":"Thomas Davidson Dana Warmsley Michael Macy and Ingmar Weber. 2017. Automated hate speech detection and the problem of offensive language. In Eleventh international aaai conference on web and social media.  Thomas Davidson Dana Warmsley Michael Macy and Ingmar Weber. 2017. Automated hate speech detection and the problem of offensive language. In Eleventh international aaai conference on web and social media.","DOI":"10.1609\/icwsm.v11i1.14955"},{"key":"e_1_3_2_1_9_1","volume-title":"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding.","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2019 . BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. 
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/2740908.2742760"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-3013"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"crossref","unstructured":"Ross Girshick. 2015. Fast R-CNN. arxiv:1504.08083\u00a0[cs.CV]  Ross Girshick. 2015. Fast R-CNN. arxiv:1504.08083\u00a0[cs.CV]","DOI":"10.1109\/ICCV.2015.169"},{"key":"e_1_3_2_1_13_1","unstructured":"Yoav Goldberg and Omer Levy. 2014. word2vec Explained: deriving Mikolov et al.\u2019s negative-sampling word-embedding method. arXiv preprint arXiv:1402.3722(2014).  Yoav Goldberg and Omer Levy. 2014. word2vec Explained: deriving Mikolov et al.\u2019s negative-sampling word-embedding method. arXiv preprint arXiv:1402.3722(2014)."},{"key":"e_1_3_2_1_14_1","volume-title":"Exploring hate speech detection in multimodal publications","author":"Gomez Raul","unstructured":"Raul Gomez , Jaume Gibert , Lluis Gomez , and Dimosthenis Karatzas . 2020. Exploring hate speech detection in multimodal publications . In IEEE ICCV. 1470\u20131478. Raul Gomez, Jaume Gibert, Lluis Gomez, and Dimosthenis Karatzas. 2020. Exploring hate speech detection in multimodal publications. In IEEE ICCV. 1470\u20131478."},{"key":"e_1_3_2_1_15_1","unstructured":"Ian Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville and Yoshua Bengio. 2014. Generative adversarial nets. NeurIPS 27(2014).  Ian Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville and Yoshua Bengio. 2014. Generative adversarial nets. NeurIPS 27(2014)."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3270101.3270103"},{"key":"e_1_3_2_1_17_1","volume-title":"The hateful memes challenge: Detecting hate speech in multimodal memes. 
arXiv","author":"Kiela Douwe","year":"2020","unstructured":"Douwe Kiela , Hamed Firooz , Aravind Mohan , Vedanuj Goswami , Amanpreet Singh , Pratik Ringshia , and Davide Testuggine . 2020. The hateful memes challenge: Detecting hate speech in multimodal memes. arXiv ( 2020 ). Douwe Kiela, Hamed Firooz, Aravind Mohan, Vedanuj Goswami, Amanpreet Singh, Pratik Ringshia, and Davide Testuggine. 2020. The hateful memes challenge: Detecting hate speech in multimodal memes. arXiv (2020)."},{"key":"e_1_3_2_1_18_1","volume-title":"Auto-encoding variational bayes. arXiv","author":"Kingma P","year":"2013","unstructured":"Diederik\u00a0 P Kingma and Max Welling . 2013. Auto-encoding variational bayes. arXiv ( 2013 ). Diederik\u00a0P Kingma and Max Welling. 2013. Auto-encoding variational bayes. arXiv (2013)."},{"key":"e_1_3_2_1_19_1","unstructured":"Anders Boesen\u00a0Lindbo Larsen S\u00f8ren\u00a0Kaae S\u00f8nderby Hugo Larochelle and Ole Winther. 2016. Autoencoding beyond pixels using a learned similarity metric. In ICML. PMLR 1558\u20131566.  Anders Boesen\u00a0Lindbo Larsen S\u00f8ren\u00a0Kaae S\u00f8nderby Hugo Larochelle and Ole Winther. 2016. Autoencoding beyond pixels using a learned similarity metric. In ICML. PMLR 1558\u20131566."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475625"},{"key":"e_1_3_2_1_21_1","volume-title":"Visualbert: A simple and performant baseline for vision and language. arXiv","author":"Li Liunian\u00a0Harold","year":"2019","unstructured":"Liunian\u00a0Harold Li , Mark Yatskar , Da Yin , Cho-Jui Hsieh , and Kai-Wei Chang . 2019 . Visualbert: A simple and performant baseline for vision and language. arXiv (2019). Liunian\u00a0Harold Li, Mark Yatskar, Da Yin, Cho-Jui Hsieh, and Kai-Wei Chang. 2019. Visualbert: A simple and performant baseline for vision and language. arXiv (2019)."},{"key":"e_1_3_2_1_22_1","volume-title":"Early Prediction of Hate Speech Propagation. 
In 2021 International Conference on Data Mining Workshops (ICDMW). IEEE, 967\u2013974","author":"Lin Ken-Yu","year":"2021","unstructured":"Ken-Yu Lin , Roy Ka-Wei Lee , Wei Gao , and Wen-Chih Peng . 2021 . Early Prediction of Hate Speech Propagation. In 2021 International Conference on Data Mining Workshops (ICDMW). IEEE, 967\u2013974 . Ken-Yu Lin, Roy Ka-Wei Lee, Wei Gao, and Wen-Chih Peng. 2021. Early Prediction of Hate Speech Propagation. In 2021 International Conference on Data Mining Workshops (ICDMW). IEEE, 967\u2013974."},{"key":"e_1_3_2_1_23_1","volume-title":"A Multimodal Framework for the Detection of Hateful Memes. arXiv","author":"Lippe Phillip","year":"2020","unstructured":"Phillip Lippe , Nithin Holla , Shantanu Chandra , Santhosh Rajamanickam , Georgios Antoniou , Ekaterina Shutova , and Helen Yannakoudakis . 2020. A Multimodal Framework for the Detection of Hateful Memes. arXiv ( 2020 ). Phillip Lippe, Nithin Holla, Shantanu Chandra, Santhosh Rajamanickam, Georgios Antoniou, Ekaterina Shutova, and Helen Yannakoudakis. 2020. A Multimodal Framework for the Detection of Hateful Memes. arXiv (2020)."},{"key":"e_1_3_2_1_24_1","volume-title":"Vilbert: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks. arXiv","author":"Lu Jiasen","year":"2019","unstructured":"Jiasen Lu , Dhruv Batra , Devi Parikh , and Stefan Lee . 2019 . Vilbert: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks. arXiv (2019). Jiasen Lu, Dhruv Batra, Devi Parikh, and Stefan Lee. 2019. Vilbert: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks. arXiv (2019)."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W16-3638"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2872427.2883062"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-3006"},{"key":"e_1_3_2_1_28_1","volume-title":"2019. 
PyTorch: An Imperative Style","author":"Paszke Adam","year":"2019","unstructured":"Adam Paszke , Sam Gross , and Francisco\u00a0Massa et al. 2019. PyTorch: An Imperative Style , High-Performance Deep Learning Library . arxiv: 1912 .01703\u00a0[cs.LG] Adam Paszke, Sam Gross, and Francisco\u00a0Massa et al.2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. arxiv:1912.01703\u00a0[cs.LG]"},{"key":"e_1_3_2_1_29_1","volume-title":"Detecting Hateful Memes Using a Multimodal Deep Ensemble. arXiv","author":"Sandulescu Vlad","year":"2020","unstructured":"Vlad Sandulescu . 2020. Detecting Hateful Memes Using a Multimodal Deep Ensemble. arXiv ( 2020 ). Vlad Sandulescu. 2020. Detecting Hateful Memes Using a Multimodal Deep Ensemble. arXiv (2020)."},{"key":"e_1_3_2_1_30_1","volume-title":"Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. 32\u201341","author":"Suryawanshi Shardul","unstructured":"Shardul Suryawanshi and Chakravarthi et al.2020. Multimodal meme dataset (multioff) for identifying offensive content in image and text . In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. 32\u201341 . Shardul Suryawanshi and Chakravarthi et al.2020. Multimodal meme dataset (multioff) for identifying offensive content in image and text. In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. 32\u201341."},{"key":"e_1_3_2_1_31_1","volume-title":"Lxmert: Learning cross-modality encoder representations from transformers. arXiv","author":"Tan Hao","year":"2019","unstructured":"Hao Tan and Mohit Bansal . 2019 . Lxmert: Learning cross-modality encoder representations from transformers. arXiv (2019). Hao Tan and Mohit Bansal. 2019. Lxmert: Learning cross-modality encoder representations from transformers. arXiv (2019)."},{"key":"e_1_3_2_1_32_1","volume-title":"Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge. 
arXiv","author":"Velioglu Riza","year":"2020","unstructured":"Riza Velioglu and Jewgeni Rose . 2020. Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge. arXiv ( 2020 ). Riza Velioglu and Jewgeni Rose. 2020. Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge. arXiv (2020)."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W16-5618"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N16-2013"},{"key":"e_1_3_2_1_35_1","volume-title":"Feature Generating Networks for Zero-Shot Learning","author":"Xian Yongqin","unstructured":"Yongqin Xian , Tobias Lorenz , Bernt Schiele , and Zeynep Akata . 2018. Feature Generating Networks for Zero-Shot Learning . In IEEE CVPR. Yongqin Xian, Tobias Lorenz, Bernt Schiele, and Zeynep Akata. 2018. Feature Generating Networks for Zero-Shot Learning. In IEEE CVPR."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/2396761.2398556"},{"key":"e_1_3_2_1_37_1","volume-title":"European semantic web conference","author":"Zhang Ziqi","unstructured":"Ziqi Zhang , David Robinson , and Jonathan Tepper . 2018. Detecting hate speech on twitter using a convolution-gru based deep neural network . In European semantic web conference . Springer , 745\u2013760. Ziqi Zhang, David Robinson, and Jonathan Tepper. 2018. Detecting hate speech on twitter using a convolution-gru based deep neural network. In European semantic web conference. Springer, 745\u2013760."},{"key":"e_1_3_2_1_38_1","volume-title":"Multimodal Learning for Hateful Memes Detection. arXiv","author":"Zhou Yi","year":"2020","unstructured":"Yi Zhou and Zhenhao Chen . 2020. Multimodal Learning for Hateful Memes Detection. arXiv ( 2020 ). Yi Zhou and Zhenhao Chen. 2020. Multimodal Learning for Hateful Memes Detection. 
arXiv (2020)."},{"key":"e_1_3_2_1_39_1","volume-title":"Zheng Wang, and Heng\u00a0Tao Shen.","author":"Zhu Jiawen","year":"2020","unstructured":"Jiawen Zhu , Xing Xu , Fumin Shen , Roy Ka-Wei Lee , Zheng Wang, and Heng\u00a0Tao Shen. 2020 . Ocean : A Dual Learning Approach For Generalized Zero-Shot Sketch-Based Image Retrieval. In IEEE ICME. 1\u20136. Jiawen Zhu, Xing Xu, Fumin Shen, Roy Ka-Wei Lee, Zheng Wang, and Heng\u00a0Tao Shen. 2020. Ocean: A Dual Learning Approach For Generalized Zero-Shot Sketch-Based Image Retrieval. In IEEE ICME. 1\u20136."},{"key":"e_1_3_2_1_40_1","volume-title":"Enhance Multimodal Transformer With External Label And In-Domain Pretrain: Hateful Meme Challenge Winning Solution. arXiv","author":"Zhu Ron","year":"2020","unstructured":"Ron Zhu . 2020. Enhance Multimodal Transformer With External Label And In-Domain Pretrain: Hateful Meme Challenge Winning Solution. arXiv ( 2020 ). Ron Zhu. 2020. Enhance Multimodal Transformer With External Label And In-Domain Pretrain: Hateful Meme Challenge Winning Solution. 
arXiv (2020)."}],"event":{"name":"WebSci '22: 14th ACM Web Science Conference 2022","location":"Barcelona Spain","acronym":"WebSci '22","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web"]},"container-title":["14th ACM Web Science Conference 2022"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3501247.3531557","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3501247.3531557","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:12:18Z","timestamp":1750191138000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3501247.3531557"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,6,26]]},"references-count":40,"alternative-id":["10.1145\/3501247.3531557","10.1145\/3501247"],"URL":"https:\/\/doi.org\/10.1145\/3501247.3531557","relation":{},"subject":[],"published":{"date-parts":[[2022,6,26]]},"assertion":[{"value":"2022-06-26","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}