Promoting Security and Trust on Social Networks: Explainable   Cyberbullying Detection Using Large Language Models in a Stream-Based Machine   Learning Framework

Silvia Garc\'ia-M\'endez; Francisco De Arriba-P\'erez

arXiv:2505.03746·cs.SI·May 8, 2025

Promoting Security and Trust on Social Networks: Explainable Cyberbullying Detection Using Large Language Models in a Stream-Based Machine Learning Framework

Silvia Garc\'ia-M\'endez, Francisco De Arriba-P\'erez

PDF

TL;DR

This paper presents a real-time, stream-based machine learning framework utilizing large language models and explainability tools to detect cyberbullying on social media, achieving high accuracy and enhancing trustworthiness.

Contribution

It introduces an innovative, real-time cyberbullying detection system that combines stream-based ML, LLMs for feature engineering, and explainability dashboards, advancing current methods.

Findings

01

Achieves nearly 90% performance across metrics.

02

Outperforms existing cyberbullying detection methods.

03

Provides an explainability dashboard to increase trust.

Abstract

Social media platforms enable instant and ubiquitous connectivity and are essential to social interaction and communication in our technological society. Apart from its advantages, these platforms have given rise to negative behaviors in the online community, the so-called cyberbullying. Despite the many works involving generative Artificial Intelligence (AI) in the literature lately, there remain opportunities to study its performance apart from zero/few-shot learning strategies. Accordingly, we propose an innovative and real-time solution for cyberbullying detection that leverages stream-based Machine Learning (ML) models able to process the incoming samples incrementally and Large Language Models (LLMS) for feature engineering to address the evolving nature of abusive and hate speech online. An explainability dashboard is provided to promote the system's trustworthiness, reliability,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.