OSPC: Artificial VLM Features for Hateful Meme Detection

Peter Gr\"onquist

arXiv:2407.12836·cs.CL·July 19, 2024

OSPC: Artificial VLM Features for Hateful Meme Detection

Peter Gr\"onquist

PDF

TL;DR

This paper presents a computationally efficient method using large Vision-Language Models to detect hateful memes by generating specialized feature encodings, achieving promising results with less resource demand.

Contribution

The paper introduces a novel approach leveraging VLMs for feature extraction in hate speech detection, reducing the need for extensive training and resources.

Findings

01

Achieved AUROC of 0.76 and accuracy of 0.69 on test data.

02

Utilized probabilistic features from VLMs for classification.

03

Applicable to resource-constrained environments and private models.

Abstract

The digital revolution and the advent of the world wide web have transformed human communication, notably through the emergence of memes. While memes are a popular and straightforward form of expression, they can also be used to spread misinformation and hate due to their anonymity and ease of use. In response to these challenges, this paper introduces a solution developed by team 'Baseline' for the AI Singapore Online Safety Prize Challenge. Focusing on computational efficiency and feature engineering, the solution achieved an AUROC of 0.76 and an accuracy of 0.69 on the test dataset. As key features, the solution leverages the inherent probabilistic capabilities of large Vision-Language Models (VLMs) to generate task-adapted feature encodings from text, and applies a distilled quantization tailored to the specific cultural nuances present in Singapore. This type of processing and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Cosine Annealing · Layer Normalization · Linear Layer · Weight Decay · Softmax · Discriminative Fine-Tuning · Attention Dropout