EkoHate: Abusive Language and Hate Speech Detection for Code-switched Political Discussions on Nigerian Twitter
Comfort Eseohen Ilevbare, Jesujoba O. Alabi, David Ifeoluwa Adelani,, Firdous Damilola Bakare, Oluwatoyin Bunmi Abiola, Oluwaseyi Adesina, Adeyemo

TL;DR
This paper introduces EkoHate, a new dataset for detecting abusive language and hate speech in Nigerian code-switched political Twitter discussions, and evaluates state-of-the-art methods for this task.
Contribution
It presents a novel dataset for hate speech detection in Nigerian political discussions and provides an empirical evaluation of various detection methods.
Findings
Achieved 95.1 F1 score in binary classification
Achieved 70.3 F1 score in four-label classification
Dataset transfers well to other offensive language datasets
Abstract
Nigerians have a notable online presence and actively discuss political and topical matters. This was particularly evident throughout the 2023 general election, where Twitter was used for campaigning, fact-checking and verification, and even positive and negative discourse. However, little or none has been done in the detection of abusive language and hate speech in Nigeria. In this paper, we curated code-switched Twitter data directed at three musketeers of the governorship election on the most populous and economically vibrant state in Nigeria; Lagos state, with the view to detect offensive speech in political discussions. We developed EkoHate -- an abusive language and hate speech dataset for political discussions between the three candidates and their followers using a binary (normal vs offensive) and fine-grained four-label annotation scheme. We analysed our dataset and provided an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHate Speech and Cyberbullying Detection
