Beyond Hate: Differentiating Uncivil and Intolerant Speech in Multimodal Content Moderation

Nils A. Herrmann; Tobias Eder; Jingyi He; Georg Groh

arXiv:2603.22985·cs.CL·March 25, 2026

Beyond Hate: Differentiating Uncivil and Intolerant Speech in Multimodal Content Moderation

Nils A. Herrmann, Tobias Eder, Jingyi He, Georg Groh

PDF

Open Access

TL;DR

This paper introduces a fine-grained annotation scheme to distinguish incivility and intolerance in multimodal content, improving moderation accuracy and reducing harmful content under different model training approaches.

Contribution

It proposes a novel annotation scheme based on communication science, applied to memes, and demonstrates how combining coarse and fine-grained labels enhances moderation models.

Findings

01

Fine-grained annotations improve model performance.

02

Models trained with fine-grained labels have more balanced error profiles.

03

Combining coarse and fine labels enhances moderation reliability.

Abstract

Current multimodal toxicity benchmarks typically use a single binary hatefulness label. This coarse approach conflates two fundamentally different characteristics of expression: tone and content. Drawing on communication science theory, we introduce a fine-grained annotation scheme that distinguishes two separable dimensions: incivility (rude or dismissive tone) and intolerance (content that attacks pluralism and targets groups or identities) and apply it to 2,030 memes from the Hateful Memes dataset. We evaluate different vision-language models under coarse-label training, transfer learning across label schemes and a joint learning approach that combines the coarse hatefulness label with our fine-grained annotations. Our results show that fine-grained annotations complement existing coarse labels and, when used jointly, improve overall model performance. Moreover, models trained with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Sentiment Analysis and Opinion Mining · Misinformation and Its Impacts