TANDEM: Temporal-Aware Neural Detection for Multimodal Hate Speech
Girish A. Koushik, Helen Treharne, Diptesh Kanojia

TL;DR
TANDEM introduces a novel temporal-aware neural framework that enhances multimodal hate speech detection by providing interpretable, structured reasoning and precise temporal grounding, significantly improving target identification accuracy.
Contribution
The paper presents TANDEM, a unified reinforcement learning-based approach that transforms hate speech detection into a structured reasoning task with temporal grounding, improving interpretability and accuracy.
Findings
Achieves 0.73 F1 in target identification, a 30% improvement over state-of-the-art.
Outperforms zero-shot and context-augmented baselines across three benchmark datasets.
Demonstrates stable reasoning over extended temporal sequences without dense supervision.
Abstract
Social media platforms are increasingly dominated by long-form multimodal content, where harmful narratives are constructed through a complex interplay of audio, visual, and textual cues. While automated systems can flag hate speech with high accuracy, they often function as "black boxes" that fail to provide the granular, interpretable evidence, such as precise timestamps and target identities, required for effective human-in-the-loop moderation. In this work, we introduce TANDEM, a unified framework that transforms audio-visual hate detection from a binary classification task into a structured reasoning problem. Our approach employs a novel tandem reinforcement learning strategy where vision-language and audio-language models optimize each other through self-constrained cross-modal context, stabilizing reasoning over extended temporal sequences without requiring dense frame-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Adversarial Robustness in Machine Learning · Spam and Phishing Detection
