Trustworthy Hate Speech Detection Through Visual Augmentation
Ziyuan Yang, Ming Yan, Yingyu Chen, Hui Wang, Zexin Lu and, Yi Zhang

TL;DR
This paper introduces TrusV-HSD, a novel multimodal approach that enhances hate speech detection by integrating visual data and a trustworthy loss to improve semantic understanding and reduce uncertainty.
Contribution
It presents a new multimodal hate speech detection method that leverages visual augmentation and trustworthy loss without needing paired data, improving detection accuracy.
Findings
Significant performance improvements over existing methods.
Effective extraction of trustworthy semantic information.
Robustness to uncertainty in hate speech detection.
Abstract
The surge of hate speech on social media platforms poses a significant challenge, with hate speech detection~(HSD) becoming increasingly critical. Current HSD methods focus on enriching contextual information to enhance detection performance, but they overlook the inherent uncertainty of hate speech. We propose a novel HSD method, named trustworthy hate speech detection method through visual augmentation (TrusV-HSD), which enhances semantic information through integration with diffused visual images and mitigates uncertainty with trustworthy loss. TrusV-HSD learns semantic representations by effectively extracting trustworthy information through multi-modal connections without paired data. Our experiments on public HSD datasets demonstrate the effectiveness of TrusV-HSD, showing remarkable improvements over conventional methods.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
MethodsFocus
