Unified and Multilingual Author Profiling for Detecting Haters

Ipek Baris Schlicht; Angel Felipe Magnoss\~ao de Paula

arXiv:2109.09233·cs.CL·September 21, 2021·1 cites

Unified and Multilingual Author Profiling for Detecting Haters

Ipek Baris Schlicht, Angel Felipe Magnoss\~ao de Paula

PDF

Open Access 1 Repo

TL;DR

This paper introduces a unified multilingual user profiling framework that detects hate speech spreaders across languages using sentence transformers and attention mechanisms, providing explainability and outperforming existing models.

Contribution

The paper proposes a novel multilingual user profiling method with attention-based explainability that surpasses current state-of-the-art transformer models.

Findings

01

Outperforms existing multilingual transformer models

02

Provides explainability through attention weights

03

Effective in identifying hate speech spreaders across languages

Abstract

This paper presents a unified user profiling framework to identify hate speech spreaders by processing their tweets regardless of the language. The framework encodes the tweets with sentence transformers and applies an attention mechanism to select important tweets for learning user profiles. Furthermore, the attention layer helps to explain why a user is a hate speech spreader by producing attention weights at both token and post level. Our proposed model outperformed the state-of-the-art multilingual transformer models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

isspek/cross-lingual-cyberbullying
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Spam and Phishing Detection · Cybercrime and Law Enforcement Studies