COT: A Generative Approach for Hate Speech Counter-Narratives via Contrastive Optimal Transport
Linhao Zhang, Li Jin, Guangluan Xu, Xiaoyu Li, and Xian Sun

TL;DR
This paper introduces a contrastive optimal transport framework for generating targeted, diverse, and relevant counter-narratives to hate speech, addressing limitations of previous methods focused on fluency.
Contribution
The novel framework combines optimal transport, self-contrastive learning, and target-oriented decoding to improve relevance, diversity, and target interaction in hate speech counter-narrative generation.
Findings
Significantly outperforms existing methods on benchmark datasets.
Effectively incorporates hatred target information into generation.
Enhances diversity and relevance of generated counter-narratives.
Abstract
Counter-narratives, which are direct responses consisting of non-aggressive fact-based arguments, have emerged as a highly effective approach to combat the proliferation of hate speech. Previous methodologies have primarily focused on fine-tuning and post-editing techniques to ensure the fluency of generated contents, while overlooking the critical aspects of individualization and relevance concerning the specific hatred targets, such as LGBT groups, immigrants, etc. This research paper introduces a novel framework based on contrastive optimal transport, which effectively addresses the challenges of maintaining target interaction and promoting diversification in generating counter-narratives. Firstly, an Optimal Transport Kernel (OTK) module is leveraged to incorporate hatred target information in the token representations, in which the comparison pairs are extracted between original…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
