Selecting and combining complementary feature representations and   classifiers for hate speech detection

Rafael M. O. Cruz; Woshington V. de Sousa; George D. C.; Cavalcanti

arXiv:2201.06721·cs.CL·January 19, 2022·1 cites

Selecting and combining complementary feature representations and classifiers for hate speech detection

Rafael M. O. Cruz, Woshington V. de Sousa, George D. C., Cavalcanti

PDF

Open Access 1 Repo

TL;DR

This paper introduces a framework for selecting and combining multiple feature extraction methods and classifiers to improve hate speech detection accuracy, demonstrating significant performance gains over existing approaches.

Contribution

It proposes a novel framework for analyzing and selecting complementary feature and classifier combinations to build effective multi-classifier systems for hate speech detection.

Findings

01

The framework effectively identifies complementary techniques.

02

The resulting multi-classifier system outperforms single models.

03

Significant improvements over heuristic selection methods.

Abstract

Hate speech is a major issue in social networks due to the high volume of data generated daily. Recent works demonstrate the usefulness of machine learning (ML) in dealing with the nuances required to distinguish between hateful posts from just sarcasm or offensive language. Many ML solutions for hate speech detection have been proposed by either changing how features are extracted from the text or the classification algorithm employed. However, most works consider only one type of feature extraction and classification algorithm. This work argues that a combination of multiple feature extraction techniques and different classification models is needed. We propose a framework to analyze the relationship between multiple feature extraction and classification techniques to understand how they complement each other. The framework is used to select a subset of complementary techniques to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

menelau/hate-speech-mcs
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Internet Traffic Analysis and Secure E-voting · Advanced Malware Detection Techniques