ReZG: Retrieval-Augmented Zero-Shot Counter Narrative Generation for   Hate Speech

Shuyu Jiang; Wenyi Tang; Xingshu Chen; Rui Tang; Haizhou Wang and; Wenxian Wang

arXiv:2310.05650·cs.CL·December 31, 2024·1 cites

ReZG: Retrieval-Augmented Zero-Shot Counter Narrative Generation for Hate Speech

Shuyu Jiang, Wenyi Tang, Xingshu Chen, Rui Tang, Haizhou Wang and, Wenxian Wang

PDF

Open Access 1 Repo

TL;DR

ReZG introduces a retrieval-augmented zero-shot method for generating highly specific counter narratives to combat hate speech, overcoming data annotation limitations and generalizing to unseen targets effectively.

Contribution

It proposes a multi-dimensional retrieval and energy-based constrained decoding approach enabling zero-shot counter narrative generation for unseen hate speech targets.

Findings

01

ReZG outperforms baselines with 2.0%+ relevance improvement.

02

ReZG achieves 4.5%+ higher countering success rate.

03

The method demonstrates strong generalization to unseen hate speech targets.

Abstract

The proliferation of hate speech (HS) on social media poses a serious threat to societal security. Automatic counter narrative (CN) generation, as an active strategy for HS intervention, has garnered increasing attention in recent years. Existing methods for automatically generating CNs mainly rely on re-training or fine-tuning pre-trained language models (PLMs) on human-curated CN corpora. Unfortunately, the annotation speed of CN corpora cannot keep up with the growth of HS targets, while generating specific and effective CNs for unseen targets remains a significant challenge for the model. To tackle this issue, we propose Retrieval-Augmented Zero-shot Generation (ReZG) to generate CNs with high-specificity for unseen targets. Specifically, we propose a multi-dimensional hierarchical retrieval method that integrates stance, semantics, and fitness, extending the retrieval metric from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mollyshuu/rezg
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection