Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis
Jo\~ao A. Leite, Diego F. Silva, Kalina Bontcheva, Carolina, Scarton

TL;DR
This paper introduces a new large-scale dataset for toxic language detection in Brazilian Portuguese social media comments, evaluates BERT models, and highlights challenges in classifying various toxicity types.
Contribution
The paper provides the first large-scale annotated dataset for Brazilian Portuguese toxic comments and analyzes multilingual versus monolingual model performance.
Findings
BERT models achieved 76% macro-F1 in binary toxicity detection.
Monolingual data remains crucial for accurate toxicity classification.
Classifying less frequent toxicity types remains challenging.
Abstract
Hate speech and toxic comments are a common concern of social media platform users. Although these comments are, fortunately, the minority in these platforms, they are still capable of causing harm. Therefore, identifying these comments is an important task for studying and preventing the proliferation of toxicity in social media. Previous work in automatically detecting toxic comments focus mainly in English, with very few work in languages like Brazilian Portuguese. In this paper, we propose a new large-scale dataset for Brazilian Portuguese with tweets annotated as either toxic or non-toxic or in different types of toxicity. We present our dataset collection and annotation process, where we aimed to select candidates covering multiple demographic groups. State-of-the-art BERT models were able to achieve 76% macro-F1 score using monolingual data in the binary case. We also show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Authorship Attribution and Profiling · Spam and Phishing Detection
MethodsLinear Layer · Adam · Dense Connections · WordPiece · Multi-Head Attention · Layer Normalization · Linear Warmup With Linear Decay · Attention Dropout · Weight Decay · Dropout
