This repository contains resources (conference/journal publications and references to datasets) associated with the work "Detecting and Understanding Harmful Memes: A Survey", accepted at IJCAI-ECAI 2022 (Survey Track).
The automatic identification of harmful content online is of major concern for social media platforms, policymakers, and society. Researchers have studied textual, visual, and audio content, but typically in isolation. Yet, harmful content often combines multiple modalities, as in the case of memes, which are of particular interest due to their viral nature. With this in mind, here we offer a comprehensive survey with a focus on harmful memes. Based on a systematic analysis of recent literature, we first propose a new typology of harmful memes, and then we highlight and summarize the relevant state of the art. One interesting finding is that many types of harmful memes are not really studied, e.g., those featuring self-harm and extremism, partly due to the lack of suitable datasets. We further find that existing datasets mostly capture multi-class scenarios, which do not cover the full affective spectrum that memes can represent. Another observation is that memes can propagate globally through repackaging in different languages, and that they can also be multilingual, blending different cultures. We conclude by highlighting several challenges related to multimodal semiotics, technological constraints, and non-trivial social engagement, and we present several open-ended aspects, such as delineating online harm and empirically examining related frameworks and assistive interventions, which we believe will motivate and drive future research.
This repo collects comprehensive and up-to-date resources for automatic harmful meme detection. We list datasets first; a minimal data-loading sketch follows the list.
- Harmful Memes: Shraman Pramanick, Shivam Sharma, Dimitar Dimitrov, Md Shad Akhtar, Preslav Nakov, Tanmoy Chakraborty, MOMENTA: A Multimodal Framework for Detecting Harmful Memes and Their Targets, 2021. [Paper] [Dataset]
- Harmful Memes: Shraman Pramanick, Dimitar Dimitrov, Rituparna Mukherjee, Shivam Sharma, Md. Shad Akhtar, Preslav Nakov, Tanmoy Chakraborty, Detecting Harmful Memes and Their Targets, 2021. [Paper] [Dataset]
- Hateful Memes: Douwe Kiela, Hamed Firooz, Aravind Mohan, Vedanuj Goswami, Amanpreet Singh, Pratik Ringshia, Davide Testuggine, The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes, 2020. [Paper] [Dataset]
- Fine-grained Propaganda Memes: Dimitar Dimitrov, Bishr Bin Ali, Shaden Shaar, Firoj Alam, Fabrizio Silvestri, Hamed Firooz, Preslav Nakov, Giovanni Da San Martino, Detecting Propaganda Techniques in Memes, 2021. [Paper] [Dataset]
- Racist or Sexist Memes: Haris Bin Zia, Ignacio Castro, Gareth Tyson, Racist or Sexist Meme? Classifying Memes beyond Hateful, 2021. [Paper] [Dataset]
- Antisemitism Detection: Mohit Chandra, Dheeraj Pailla, Himanshu Bhatia, Aadilmehdi Sanchawala, Manish Gupta, Manish Shrivastava, Ponnurangam Kumaraguru, Subverting the Jewtocracy: Online Antisemitism Detection Using Multimodal Deep Learning, 2021.
- Troll Memes: Shardul Suryawanshi, Bharathi Raja Chakravarthi, Pranav Varma, Mihael Arcan, John P. McCrae, Paul Buitelaar, A Dataset for Troll Classification of TamilMemes, 2020. [Paper] [Dataset]
- Sexist Memes: Elisabetta Fersini, Francesca Gasparini, Silvia Corchs, Detecting Sexist MEME On The Web: A Study on Textual and Visual Cues, 2019. [Paper] [Dataset]
- Offensive Memes: Benet Oriol Sabat, Cristian Canton Ferrer, Xavier Giro-i-Nieto, Hate Speech in Pixels: Detection of Offensive Memes towards Automatic Moderation, 2019. [Paper] [Dataset]
- Hateful Memes (Pinterest): Hannah Kirk, Yennie Jun, Paulius Rauba, Gal Wachtel, Ruining Li, Xingjian Bai, Noah Broestl, Martin Doff-Sotta, Aleksandar Shtedritski, Yuki M Asano, Memes in the Wild: Assessing the Generalizability of the Hateful Memes Challenge Dataset, 2021. [Paper] [[Dataset](https://github.com/iamatulsingh/pinterest-image-scrap)]
- MMHS150K: Raul Gomez, Jaume Gibert, Lluis Gomez, Dimosthenis Karatzas, Exploring Hate Speech Detection in Multimodal Publications, 2019. [Paper] [Dataset]
- MultiOFF: Shardul Suryawanshi, Bharathi Raja Chakravarthi, Mihael Arcan, Paul Buitelaar, Multimodal Meme Dataset (MultiOFF) for Identifying Offensive Content in Image and Text, 2020. [Paper] [Dataset]
Relevant papers and surveys:
- Detecting Harmful Memes and Their Targets [paper]
- MOMENTA: A Multimodal Framework for Detecting Harmful Memes and Their Targets [paper]
- Racist or Sexist Meme? Classifying Memes beyond Hateful [paper]
- “Subverting the Jewtocracy”: Online Antisemitism Detection Using Multimodal Deep Learning [paper]
- Detecting Sexist MEME On The Web: A Study on Textual and Visual Cues [paper]
- Memes in the Wild: Assessing the Generalizability of the Hateful Memes Challenge Dataset [paper]
- Hate Speech in Pixels: Detection of Offensive Memes towards Automatic Moderation [paper]
- Disentangling Hate in Online Memes [paper]
- Exploring Hate Speech Detection in Multimodal Publications [paper]
- AOMD: An Analogy-aware Approach to Offensive Meme Detection on Social Media [paper]
- KnowMeme: A Knowledge-enriched Graph Neural Network Solution to Offensive Meme Detection [paper]
- An approach to detect offence in Memes using Natural Language Processing (NLP) and Deep learning [paper]
- Multimodal Meme Dataset (MultiOFF) for Identifying Offensive Content in Image and Text [paper]
- Detecting Propaganda Techniques in Memes [paper]
- MinD at SemEval-2021 Task 6: Propaganda Detection using Transfer Learning and Multimodal Fusion [paper]
- Volta at SemEval-2021 Task 6: Towards Detecting Persuasive Texts and Images using Textual and Multimodal Ensemble [paper]
- Alpha at SemEval-2021 Task 6: Transformer Based Propaganda Classification [paper]
- Fine-grained analysis of propaganda in news article [paper]
- Detection of Cyberbullying Incidents on the Instagram Social Network [paper]
- A Dataset for Troll Classification of TamilMemes [paper]
- Violence detection in hollywood movies by the fusion of visual and mid-level audio cues [paper]
- The self-harmed, visualized, and reblogged: Remaking of self-injury narratives on Tumblr [paper]
- A unified taxonomy of harmful content [paper]
- A survey on multimodal disinformation detection [paper]
- A survey on stance detection for mis- and disinformation identification [paper]
- Detecting abusive language on online platforms: A critical analysis [paper]
- A Survey on Computational Propaganda Detection [paper]
- A multimodal memes classification: A survey and open research issues [paper]
- A survey on automatic detection of hate speech in text [paper]
- Automatic cyberbullying detection: A systematic review [paper]
- A survey on hate speech detection using natural language processing [paper]
- A survey of video violence detection [paper]
- A survey of fake news: Fundamental theories, detection methods, and opportunities [paper]
Below we provide a list of recent work with current state-of-the-art results; a minimal fusion-baseline sketch follows the table.
Title | Types | Task | Dataset | Task Type | Approach | AUC | Acc | F1 | Details |
---|---|---|---|---|---|---|---|---|---|
Detecting Harmful Memes and Their Targets | Harm | Harmful vs. non-harmful | HarMeme (COVID-19) | Binary | VisualBERT (pretrained on MS COCO) | | 0.81 | 0.80 | |
 | | Very harmful vs. partially harmful vs. non-harmful | HarMeme (COVID-19) | Multiclass | | | 0.74 | 0.54 | |
 | | Target identification of harmful memes | HarMeme (COVID-19) | Multiclass | | | 0.76 | 0.66 | |
MOMENTA: A Multimodal Framework for Detecting Harmful Memes and Their Targets | Harm | Harmful vs. non-harmful | Harm-C | Binary | MOMENTA: CLIP, VGG-19, DistilBERT, cross-modal attention fusion (CMAF) | | 0.84 | 0.83 | |
 | | Very harmful vs. partially harmful vs. non-harmful | Harm-C | Multiclass | | | 0.77 | 0.55 | |
 | | Target identification of harmful memes | Harm-C | Multiclass | | | 0.78 | 0.70 | |
 | | Harmful vs. non-harmful | Harm-P | Binary | | | 0.90 | 0.88 | |
 | | Very harmful vs. partially harmful vs. non-harmful | Harm-P | Multiclass | | | 0.87 | 0.67 | |
 | | Target identification of harmful memes | Harm-P | Multiclass | | | 0.79 | 0.69 | |
Racist or Sexist Meme? Classifying Memes beyond Hateful | Hate | Protected category (PC) identification | FBHM | Multilabel | CIMG + CTXT + LASER + LaBSE | 0.96 | | | |
 | | Detecting attack type (AT) | FBHM | Multilabel | CIMG + CTXT + LASER + LaBSE | 0.97 | | | |
"Subverting the Jewtocracy": Online Antisemitism Detection Using Multimodal Deep Learning | Hate | Antisemitic content detection | GAB | Binary | MFAS | | 0.91 | | |
 | | | Twitter | Binary | MFAS | | 0.71 | | |
 | | Antisemitism category classification | GAB | Multiclass | MFAS | | 0.67 | | |
 | | | Twitter | Multiclass | MFAS | | 0.68 | | |
Detecting Sexist MEME On The Web: A Study on Textual and Visual Cues | Hate | Automatic detection of sexist memes | The MEME | Binary | Late fusion | | 0.76 | | Multimodal late fusion; hand-crafted visual descriptors: low-level greyscale, colored, photographic, and semantic features; bag-of-words textual features; SVM classifier |
Memes in the Wild: Assessing the Generalizability of the Hateful Memes Challenge Dataset | Hate | Hateful meme detection | FBHM | Binary | CLIP (linear probe) | 0.56 | | | |
 | | | Pinterest | Binary | CLIP (linear probe) | 0.57 | | | |
Hate Speech in Pixels: Detection of Offensive Memes towards Automatic Moderation | Hate | Hateful meme detection | | Binary | BERT, VGG-16, MLP | | 0.83 | | |
Disentangling Hate in Online Memes | Hate | Hateful meme detection | FBHM | Binary | DisMultiHate (BERT, Faster R-CNN, disentangled representations) | 0.83 | 0.76 | | |
 | | | MultiOFF | Binary | DisMultiHate | | 0.65 | | |
Exploring Hate Speech Detection in Multimodal Publications | Hate | Hate speech detection in multimodal publications | MMHS150K | Binary | FCM (feature concatenation model), Inception-V3, LSTM | 0.73 | 0.68 | 0.70 | |
AOMD: An Analogy-aware Approach to Offensive Meme Detection on Social Media | Offensive | Offensive (analogy) meme detection | GAB | Binary | Analogy-aware multimodal representation learning (Faster R-CNN, ResNet-50, GloVe-based LSTM, BERT), MLP | | 0.69 | 0.56 | |
 | | | Reddit | Binary | | | 0.72 | 0.49 | |
KnowMeme: A Knowledge-enriched Graph Neural Network Solution to Offensive Meme Detection | Offensive | Offensive meme detection | Reddit | Binary | YOLOv4, ConceptNet, GNN | | 0.73 | 0.49 | Knowledge-aware multimodal entity extraction (KMEE) with YOLOv4; knowledge-enriched information network construction with ConceptNet; supervised offensive meme classification with a GNN |
 | | | GAB | Binary | | | 0.70 | 0.55 | |
An approach to detect offence in Memes using Natural Language Processing (NLP) and Deep learning | Offensive | Offensive meme detection | Offensive + Intensity dataset | Binary | CNN, fastText, LSTM (sigmoid) | | 0.96 | | |
 | | Offense intensity prediction | | Multiclass | CNN, fastText, LSTM (softmax) | | 0.99 | | |
Multimodal Meme Dataset (MultiOFF) for Identifying Offensive Content in Image and Text | Offensive | Offensive content detection | MultiOFF | Binary | Early fusion: stacked LSTM/BiLSTM/CNN-Text + VGG16 | | | 0.50 | Early fusion technique (stacked LSTM/BiLSTM/CNN-Text + VGG16) |
Detecting Propaganda Techniques in Memes | Propaganda | Detecting the type of propaganda techniques used in memes | | Multilabel | VisualBERT (pretrained on MS COCO) | | | 0.48 | Micro F1 |
MinD at SemEval-2021 Task 6: Propaganda Detection using Transfer Learning and Multimodal Fusion | Propaganda | Propaganda technique detection (unimodal: text) | | Multilabel | Ensemble: BERT, RoBERTa, XLNet, ALBERT, DistilBERT, DeBERTa, embeddings, char n-grams | | | 0.59 | Micro F1; team name: MinD |
Volta at SemEval-2021 Task 6: Towards Detecting Persuasive Texts and Images using Textual and Multimodal Ensemble | Propaganda | Propaganda technique and span detection (unimodal: text) | | Multilabel | RoBERTa | | | 0.48 | Micro F1; team name: Volta |
Alpha at SemEval-2021 Task 6: Transformer Based Propaganda Classification | Propaganda | Propaganda technique detection (multimodal: meme) | | Multilabel | RoBERTa, embeddings | | | 0.58 | Micro F1; team name: Alpha |
Detection of Cyberbullying Incidents on the Instagram Social Network | Cyberbullying | Detecting incidents of cyberbullying | Instagram | Binary | SVD + (unigram, 3-gram), kernel PCA + (metadata, image categories) + linear SVM | | 0.87 | | |
A Dataset for Troll Classification of TamilMemes | Cyberbullying | Detecting troll memes | TamilMemes | Binary | ResNet (training: TamilMemes) | | | 0.52 | Macro F1 |
 | | | | | ResNet (training: TamilMemes + ImageNet) | | | 0.52 | |
 | | | | | MobileNet (training: TamilMemes + ImageNet + Flickr1k) | | | 0.47 | |
 | | | | | ResNet (training: TamilMemes + ImageNet + Flickr30k) | | | 0.52 | |
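Many approaches in the table share the same two-stream recipe: encode the image and the text separately, then fuse the representations for classification. The sketch below illustrates that generic recipe in PyTorch; it is not the exact model of any paper above, and the encoder choices and head sizes (`bert-base-uncased`, ResNet-50, a 512-unit MLP) are illustrative assumptions.

```python
import torch
import torch.nn as nn
from torchvision.models import resnet50
from transformers import AutoModel

class FusionMemeClassifier(nn.Module):
    """Generic two-stream meme classifier: image + text -> fused features."""

    def __init__(self, num_classes=2, text_model="bert-base-uncased"):
        super().__init__()
        # Image stream: ResNet-50 backbone with the classification head removed.
        self.image_encoder = resnet50(weights="IMAGENET1K_V2")
        self.image_encoder.fc = nn.Identity()  # -> 2048-d image features
        # Text stream: a pretrained transformer; we take the [CLS] vector.
        self.text_encoder = AutoModel.from_pretrained(text_model)  # -> 768-d
        # Fusion head: concatenate both streams, then a small MLP.
        self.classifier = nn.Sequential(
            nn.Linear(2048 + 768, 512),
            nn.ReLU(),
            nn.Dropout(0.2),
            nn.Linear(512, num_classes),
        )

    def forward(self, pixel_values, input_ids, attention_mask):
        img = self.image_encoder(pixel_values)  # (B, 2048)
        txt = self.text_encoder(
            input_ids=input_ids, attention_mask=attention_mask
        ).last_hidden_state[:, 0]               # (B, 768), [CLS] token
        return self.classifier(torch.cat([img, txt], dim=-1))
```

Here `pixel_values` is a `(B, 3, 224, 224)` tensor normalized with ImageNet statistics, and `input_ids`/`attention_mask` come from the matching tokenizer (`AutoTokenizer.from_pretrained("bert-base-uncased")`). Strictly, concatenating features like this is feature-level (early) fusion, as in the MultiOFF and FCM baselines; late fusion, as in Fersini et al., would instead train unimodal classifiers and combine their output scores.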
Please cite the following paper:
```bibtex
@inproceedings{ijcai2022Shivam,
  title     = {Detecting and Understanding Harmful Memes: A Survey},
  author    = {Shivam Sharma and Firoj Alam and Md. Shad Akhtar and Dimitar Dimitrov and Giovanni Da San Martino and Hamed Firooz and Alon Halevy and Fabrizio Silvestri and Preslav Nakov and Tanmoy Chakraborty},
  booktitle = {Proceedings of the 31st International Joint Conference on Artificial Intelligence and the 25th European Conference on Artificial Intelligence},
  series    = {IJCAI-ECAI~'22},
  publisher = {International Joint Conferences on Artificial Intelligence Organization},
  year      = {2022},
  note      = {Survey Track},
  address   = {Vienna, Austria},
}
```