Promoting Security and Trust on Social Networks: Explainable Cyberbullying Detection Using Large Language Models in a Stream-Based Machine Learning Framework
Silvia Garc\'ia-M\'endez, Francisco De Arriba-P\'erez

TL;DR
This paper presents a real-time, stream-based machine learning framework utilizing large language models and explainability tools to detect cyberbullying on social media, achieving high accuracy and enhancing trustworthiness.
Contribution
It introduces an innovative, real-time cyberbullying detection system that combines stream-based ML, LLMs for feature engineering, and explainability dashboards, advancing current methods.
Findings
Achieves nearly 90% performance across metrics.
Outperforms existing cyberbullying detection methods.
Provides an explainability dashboard to increase trust.
Abstract
Social media platforms enable instant and ubiquitous connectivity and are essential to social interaction and communication in our technological society. Apart from its advantages, these platforms have given rise to negative behaviors in the online community, the so-called cyberbullying. Despite the many works involving generative Artificial Intelligence (AI) in the literature lately, there remain opportunities to study its performance apart from zero/few-shot learning strategies. Accordingly, we propose an innovative and real-time solution for cyberbullying detection that leverages stream-based Machine Learning (ML) models able to process the incoming samples incrementally and Large Language Models (LLMS) for feature engineering to address the evolving nature of abusive and hate speech online. An explainability dashboard is provided to promote the system's trustworthiness, reliability,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
