Selecting and combining complementary feature representations and classifiers for hate speech detection
Rafael M. O. Cruz, Woshington V. de Sousa, George D. C., Cavalcanti

TL;DR
This paper introduces a framework for selecting and combining multiple feature extraction methods and classifiers to improve hate speech detection accuracy, demonstrating significant performance gains over existing approaches.
Contribution
It proposes a novel framework for analyzing and selecting complementary feature and classifier combinations to build effective multi-classifier systems for hate speech detection.
Findings
The framework effectively identifies complementary techniques.
The resulting multi-classifier system outperforms single models.
Significant improvements over heuristic selection methods.
Abstract
Hate speech is a major issue in social networks due to the high volume of data generated daily. Recent works demonstrate the usefulness of machine learning (ML) in dealing with the nuances required to distinguish between hateful posts from just sarcasm or offensive language. Many ML solutions for hate speech detection have been proposed by either changing how features are extracted from the text or the classification algorithm employed. However, most works consider only one type of feature extraction and classification algorithm. This work argues that a combination of multiple feature extraction techniques and different classification models is needed. We propose a framework to analyze the relationship between multiple feature extraction and classification techniques to understand how they complement each other. The framework is used to select a subset of complementary techniques to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Internet Traffic Analysis and Secure E-voting · Advanced Malware Detection Techniques
