Deep Learning Approaches for Detecting Adversarial Cyberbullying and   Hate Speech in Social Networks

Sylvia Worlali Azumah; Nelly Elsayed; Zag ElSayed; Murat Ozer; Amanda; La Guardia

arXiv:2406.17793·cs.LG·June 27, 2024

Deep Learning Approaches for Detecting Adversarial Cyberbullying and Hate Speech in Social Networks

Sylvia Worlali Azumah, Nelly Elsayed, Zag ElSayed, Murat Ozer, Amanda, La Guardia

PDF

Open Access

TL;DR

This paper presents a deep learning-based method, specifically an LSTM model, for detecting adversarial cyberbullying and hate speech in social network text data, achieving high accuracy and outperforming previous approaches.

Contribution

It introduces a novel LSTM-based detection model with a correction algorithm tailored for adversarial attack content in social media texts.

Findings

01

LSTM model achieved 87.57% accuracy.

02

Model outperformed previous studies.

03

High precision, recall, and AUC-ROC scores.

Abstract

Cyberbullying is a significant concern intricately linked to technology that can find resolution through technological means. Despite its prevalence, technology also provides solutions to mitigate cyberbullying. To address growing concerns regarding the adverse impact of cyberbullying on individuals' online experiences, various online platforms and researchers are actively adopting measures to enhance the safety of digital environments. While researchers persist in crafting detection models to counteract or minimize cyberbullying, malicious actors are deploying adversarial techniques to circumvent these detection methods. This paper focuses on detecting cyberbullying in adversarial attack content within social networking site text data, specifically emphasizing hate speech. Utilizing a deep learning-based approach with a correction algorithm, this paper yielded significant results. An…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection