COT: A Generative Approach for Hate Speech Counter-Narratives via   Contrastive Optimal Transport

Linhao Zhang; Li Jin; Guangluan Xu; Xiaoyu Li; and Xian Sun

arXiv:2406.12304·cs.CL·June 19, 2024

COT: A Generative Approach for Hate Speech Counter-Narratives via Contrastive Optimal Transport

Linhao Zhang, Li Jin, Guangluan Xu, Xiaoyu Li, and Xian Sun

PDF

Open Access

TL;DR

This paper introduces a contrastive optimal transport framework for generating targeted, diverse, and relevant counter-narratives to hate speech, addressing limitations of previous methods focused on fluency.

Contribution

The novel framework combines optimal transport, self-contrastive learning, and target-oriented decoding to improve relevance, diversity, and target interaction in hate speech counter-narrative generation.

Findings

01

Significantly outperforms existing methods on benchmark datasets.

02

Effectively incorporates hatred target information into generation.

03

Enhances diversity and relevance of generated counter-narratives.

Abstract

Counter-narratives, which are direct responses consisting of non-aggressive fact-based arguments, have emerged as a highly effective approach to combat the proliferation of hate speech. Previous methodologies have primarily focused on fine-tuning and post-editing techniques to ensure the fluency of generated contents, while overlooking the critical aspects of individualization and relevance concerning the specific hatred targets, such as LGBT groups, immigrants, etc. This research paper introduces a novel framework based on contrastive optimal transport, which effectively addresses the challenges of maintaining target interaction and promoting diversification in generating counter-narratives. Firstly, an Optimal Transport Kernel (OTK) module is leveraged to incorporate hatred target information in the token representations, in which the comparison pairs are extracted between original…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection