CONAN -- COunter NArratives through Nichesourcing: a Multilingual   Dataset of Responses to Fight Online Hate Speech

Y.L. Chung; E. Kuzmenko; S.S. Tekiroglu; M. Guerini

arXiv:1910.03270·cs.CL·October 9, 2019

CONAN -- COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech

Y.L. Chung, E. Kuzmenko, S.S. Tekiroglu, M. Guerini

PDF

TL;DR

This paper introduces CONAN, a large multilingual dataset of hate speech and counter-narrative pairs created by experts, aiming to provide an alternative to censorship for combating online hate speech.

Contribution

It presents the first large-scale, multilingual, expert-annotated dataset of hate speech and counter-narratives, including annotations and data augmentation techniques.

Findings

01

Dataset quality assessed through initial experiments

02

Multilingual data enhances cross-lingual hate speech countering

03

Expert annotations improve response relevance and accuracy

Abstract

Although there is an unprecedented effort to provide adequate responses in terms of laws and policies to hate content on social media platforms, dealing with hatred online is still a tough problem. Tackling hate speech in the standard way of content deletion or user suspension may be charged with censorship and overblocking. One alternate strategy, that has received little attention so far by the research community, is to actually oppose hate content with counter-narratives (i.e. informed textual responses). In this paper, we describe the creation of the first large-scale, multilingual, expert-based dataset of hate speech/counter-narrative pairs. This dataset has been built with the effort of more than 100 operators from three different NGOs that applied their training and expertise to the task. Together with the collected data we also provide additional annotations about expert…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.