To BAN or not to BAN: Bayesian Attention Networks for Reliable Hate Speech Detection
Kristian Miok, Blaz Skrlj, Daniela Zaharie, Marko Robnik-Sikonja

TL;DR
This paper introduces Bayesian Attention Networks with Monte Carlo dropout in transformer models like BERT to improve hate speech detection reliability, providing well-calibrated confidence estimates and interpretability insights.
Contribution
It presents a novel Bayesian approach for transformer-based hate speech detection that enhances reliability estimation and interpretability over existing methods.
Findings
Monte Carlo dropout offers reliable confidence estimates.
The approach achieves state-of-the-art classification performance.
Reliability scores reduce workload in content moderation.
Abstract
Hate speech is an important problem in the management of user-generated content. To remove offensive content or ban misbehaving users, content moderators need reliable hate speech detectors. Recently, deep neural networks based on the transformer architecture, such as the (multilingual) BERT model, achieve superior performance in many natural language classification tasks, including hate speech detection. So far, these methods have not been able to quantify their output in terms of reliability. We propose a Bayesian method using Monte Carlo dropout within the attention layers of the transformer models to provide well-calibrated reliability estimates. We evaluate and visualize the results of the proposed approach on hate speech detection problems in several languages. Additionally, we test if affective dimensions can enhance the information extracted by the BERT model in hate speech…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Monte Carlo Dropout · Adam · Dense Connections · Linear Warmup With Linear Decay · Residual Connection · Refunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Layer Normalization · Attention Is All You Need
