Toxic Language Detection in Social Media for Brazilian Portuguese: New   Dataset and Multilingual Analysis

Jo\~ao A. Leite; Diego F. Silva; Kalina Bontcheva; Carolina; Scarton

arXiv:2010.04543·cs.CL·October 12, 2020·43 cites

Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis

Jo\~ao A. Leite, Diego F. Silva, Kalina Bontcheva, Carolina, Scarton

PDF

Open Access 1 Repo 5 Datasets

TL;DR

This paper introduces a new large-scale dataset for toxic language detection in Brazilian Portuguese social media comments, evaluates BERT models, and highlights challenges in classifying various toxicity types.

Contribution

The paper provides the first large-scale annotated dataset for Brazilian Portuguese toxic comments and analyzes multilingual versus monolingual model performance.

Findings

01

BERT models achieved 76% macro-F1 in binary toxicity detection.

02

Monolingual data remains crucial for accurate toxicity classification.

03

Classifying less frequent toxicity types remains challenging.

Abstract

Hate speech and toxic comments are a common concern of social media platform users. Although these comments are, fortunately, the minority in these platforms, they are still capable of causing harm. Therefore, identifying these comments is an important task for studying and preventing the proliferation of toxicity in social media. Previous work in automatically detecting toxic comments focus mainly in English, with very few work in languages like Brazilian Portuguese. In this paper, we propose a new large-scale dataset for Brazilian Portuguese with tweets annotated as either toxic or non-toxic or in different types of toxicity. We present our dataset collection and annotation process, where we aimed to select candidates covering multiple demographic groups. State-of-the-art BERT models were able to achieve 76% macro-F1 score using monolingual data in the binary case. We also show that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

JAugusto97/ToLD-Br
noneOfficial

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Authorship Attribution and Profiling · Spam and Phishing Detection

MethodsLinear Layer · Adam · Dense Connections · WordPiece · Multi-Head Attention · Layer Normalization · Linear Warmup With Linear Decay · Attention Dropout · Weight Decay · Dropout