A Unified Deep Learning Architecture for Abuse Detection
Antigoni-Maria Founta, Despoina Chatzakou, Nicolas Kourtellis, Jeremy, Blackburn, Athena Vakali, Ilias Leontiadis

TL;DR
This paper introduces a comprehensive deep learning framework that integrates user metadata and textual analysis to detect various types of online abuse on Twitter, significantly outperforming existing methods.
Contribution
The authors present a unified deep learning architecture capable of detecting multiple abusive behaviors without task-specific tuning, leveraging both metadata and text patterns.
Findings
Achieves 21-45% improvement in AUC over state-of-the-art methods.
Effectively detects diverse abusive behaviors like hate speech, sexism, racism, and sarcasm.
Demonstrates robustness across multiple Twitter datasets.
Abstract
Hate speech, offensive language, sexism, racism and other types of abusive behavior have become a common phenomenon in many online social media platforms. In recent years, such diverse abusive behaviors have been manifesting with increased frequency and levels of intensity. This is due to the openness and willingness of popular media platforms, such as Twitter and Facebook, to host content of sensitive or controversial topics. However, these platforms have not adequately addressed the problem of online abusive behavior, and their responsiveness to the effective detection and blocking of such inappropriate behavior remains limited. In the present paper, we study this complex problem by following a more holistic approach, which considers the various aspects of abusive behavior. To make the approach tangible, we focus on Twitter data and analyze user and textual properties from different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
