Trustworthy Hate Speech Detection Through Visual Augmentation

Ziyuan Yang; Ming Yan; Yingyu Chen; Hui Wang; Zexin Lu and; Yi Zhang

arXiv:2409.13557·cs.CV·September 23, 2024

Trustworthy Hate Speech Detection Through Visual Augmentation

Ziyuan Yang, Ming Yan, Yingyu Chen, Hui Wang, Zexin Lu and, Yi Zhang

PDF

Open Access

TL;DR

This paper introduces TrusV-HSD, a novel multimodal approach that enhances hate speech detection by integrating visual data and a trustworthy loss to improve semantic understanding and reduce uncertainty.

Contribution

It presents a new multimodal hate speech detection method that leverages visual augmentation and trustworthy loss without needing paired data, improving detection accuracy.

Findings

01

Significant performance improvements over existing methods.

02

Effective extraction of trustworthy semantic information.

03

Robustness to uncertainty in hate speech detection.

Abstract

The surge of hate speech on social media platforms poses a significant challenge, with hate speech detection~(HSD) becoming increasingly critical. Current HSD methods focus on enriching contextual information to enhance detection performance, but they overlook the inherent uncertainty of hate speech. We propose a novel HSD method, named trustworthy hate speech detection method through visual augmentation (TrusV-HSD), which enhances semantic information through integration with diffused visual images and mitigates uncertainty with trustworthy loss. TrusV-HSD learns semantic representations by effectively extracting trustworthy information through multi-modal connections without paired data. Our experiments on public HSD datasets demonstrate the effectiveness of TrusV-HSD, showing remarkable improvements over conventional methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection

MethodsFocus