Generate, Prune, Select: A Pipeline for Counterspeech Generation against Online Hate Speech

Wanzheng Zhu; Suma Bhat

arXiv:2106.01625 · cs.CL · June 4, 2021 · 6 cites

TL;DR

This paper introduces a three-module pipeline combining generation, filtering, and selection to produce diverse, relevant counterspeech responses to online hate speech, improving over standard NLG methods.

Contribution

The paper proposes a novel pipeline that enhances counterspeech generation by integrating a generative model, a BERT-based filter, and a retrieval-based selector, addressing diversity and relevance issues.

Findings

01. Improved diversity and relevance in counterspeech responses.

02. Effective filtering of ungrammatical responses using BERT.

03. Demonstrated superiority on three datasets.

Abstract

Countermeasures that effectively fight the ever-increasing hate speech online without blocking freedom of speech are of great social interest. Natural Language Generation (NLG) is uniquely capable of developing scalable solutions. However, off-the-shelf NLG methods are primarily sequence-to-sequence neural models, and they are limited in that they generate either commonplace, repetitive, and safe responses regardless of the hate speech (e.g., "Please refrain from using such language.") or irrelevant responses, making them ineffective for de-escalating hateful conversations. In this paper, we design a three-module pipeline approach to effectively improve diversity and relevance. Our proposed pipeline first generates various counterspeech candidates with a generative model to promote diversity, then filters out the ungrammatical ones using a BERT model, and finally selects the most relevant…
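
The three stages described in the abstract (generate, prune, select) can be illustrated with a minimal Python sketch. This is not the authors' implementation: every function name here is hypothetical, the generator is replaced by a fixed candidate pool, the BERT grammaticality filter is replaced by a toy heuristic, and the retrieval-based selector is approximated by bag-of-words cosine similarity. It only shows how the stages chain together.

```python
import math
import re
from collections import Counter
from typing import Callable, List, Optional


def tokenize(text: str) -> List[str]:
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def generate(pool: List[str]) -> List[str]:
    # ASSUMPTION: stand-in for the generative model. A real system would
    # sample many diverse candidates; here we just return a fixed pool.
    return list(pool)


def looks_grammatical(sentence: str) -> bool:
    # ASSUMPTION: toy stand-in for the BERT-based filter. It keeps sentences
    # with at least three tokens that end in terminal punctuation.
    stripped = sentence.strip()
    return len(tokenize(stripped)) >= 3 and stripped.endswith((".", "!", "?"))


def prune(candidates: List[str], keep: Callable[[str], bool]) -> List[str]:
    """Drop every candidate that the filter rejects."""
    return [c for c in candidates if keep(c)]


def select(hate_speech: str, candidates: List[str]) -> str:
    # ASSUMPTION: relevance is approximated by word overlap with the input.
    query = Counter(tokenize(hate_speech))
    return max(candidates, key=lambda c: cosine(query, Counter(tokenize(c))))


def counterspeech(hate_speech: str, pool: List[str]) -> Optional[str]:
    """Run generate -> prune -> select; return None if nothing survives."""
    candidates = prune(generate(pool), looks_grammatical)
    return select(hate_speech, candidates) if candidates else None


if __name__ == "__main__":
    pool = [
        "Please refrain from using such language.",
        "town the worthless are people",
        "No town makes a person worthless; people deserve respect.",
    ]
    print(counterspeech("People from that town are all worthless.", pool))
```

In this toy run, the scrambled second candidate has the highest word overlap with the input, so the pruning step is what keeps it from being selected; the generic "Please refrain..." reply survives pruning but loses to the more relevant candidate at the selection step.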

Code & Models

Repositories

WanzhengZhu/GPS
PyTorch · Official

Taxonomy

Topics: Hate Speech and Cyberbullying Detection · Topic Modeling

Methods: Multi-Head Attention · Linear Layer · Adam · Linear Warmup With Linear Decay · Layer Normalization · Residual Connection · WordPiece · Attention Dropout · Dense Connections