Analysis of Socially Unacceptable Discourse with Zero-shot Learning

Rayane Ghilene; Dimitra Niaouri; Michele Linardi; and Julien Longhi

arXiv:2409.13735·cs.CL·September 24, 2024

Analysis of Socially Unacceptable Discourse with Zero-shot Learning

Rayane Ghilene, Dimitra Niaouri, Michele Linardi, and Julien Longhi

PDF

Open Access 1 Repo

TL;DR

This paper explores the use of zero-shot learning with pre-trained transformer models to detect and analyze socially unacceptable discourse online, aiming to improve tools for responsible communication.

Contribution

It introduces an entailment-based zero-shot classification approach for SUD detection, demonstrating its effectiveness without requiring labeled training data.

Findings

01

Good generalization to unseen data

02

Effective in characterizing extremist narratives

03

Supports development of robust SUD analysis tools

Abstract

Socially Unacceptable Discourse (SUD) analysis is crucial for maintaining online positive environments. We investigate the effectiveness of Entailment-based zero-shot text classification (unsupervised method) for SUD detection and characterization by leveraging pre-trained transformer models and prompting techniques. The results demonstrate good generalization capabilities of these models to unseen data and highlight the promising nature of this approach for generating labeled datasets for the analysis and characterization of extremist narratives. The findings of this research contribute to the development of robust tools for studying SUD and promoting responsible communication online.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rayaneghilene/ARENAS_Automatic_Extremist_Analysis/tree/main/Entailment_framework
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Terrorism, Counterterrorism, and Political Violence