Hate Speech Detection and Racial Bias Mitigation in Social Media based on BERT model
Marzieh Mozafari, Reza Farahbakhsh, Noel Crespi

TL;DR
This paper presents a BERT-based transfer learning approach for hate speech detection on Twitter, introducing a bias mitigation mechanism that reduces racial bias in classifier predictions, advancing fairer automated content moderation.
Contribution
It proposes a novel bias alleviation method using re-weighted samples during fine-tuning of BERT for hate speech detection, addressing classifier bias rather than dataset bias.
Findings
Bias exists in classifiers, favoring certain racial language groups.
The bias mitigation mechanism significantly reduces racial bias in predictions.
The approach improves fairness in hate speech detection systems.
Abstract
Disparate biases associated with datasets and trained classifiers in hateful and abusive content identification tasks have raised many concerns recently. Although the problem of biased datasets on abusive language detection has been addressed more frequently, biases arising from trained classifiers have not yet been a matter of concern. Here, we first introduce a transfer learning approach for hate speech detection based on an existing pre-trained language model called BERT and evaluate the proposed model on two publicly available datasets annotated for racism, sexism, hate or offensive content on Twitter. Next, we introduce a bias alleviation mechanism in hate speech detection task to mitigate the effect of bias in training set during the fine-tuning of our pre-trained BERT-based model. Toward that end, we use an existing regularization method to reweight input samples, thereby…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods7 Fastest Ways to Call American Airlines Reservations Number (USA Guide) · Linear Layer · Multi-Head Attention · Layer Normalization · Attention Is All You Need · Dropout · Residual Connection · Attention Dropout · Weight Decay · Softmax
