Identifying Cyberbullying Roles in Social Media
Manuel Sandoval, Mohammed Abuhamad, Patrick Furman, Mujtaba Nazari, Deborah L. Hall, Yasin N. Silva

TL;DR
This study applies machine learning, especially fine-tuned RoBERTa with oversampling, to accurately identify roles in cyberbullying incidents on social media, achieving high F1 scores and highlighting challenges with data imbalance.
Contribution
It introduces a novel approach using LLMs and oversampling techniques for cyberbullying role detection, outperforming previous models and analyzing class-specific performance.
Findings
The best model, a fine-tuned RoBERTa trained on oversampled data, achieved an F1 score of 83.5%.
Oversampling improves model performance significantly.
Models perform well on classes with more data and less ambiguity.
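The oversampling idea in the findings above can be sketched as follows. This is a minimal illustration of plain random oversampling, assuming minority-class examples are duplicated until every class matches the largest one; the summary does not say which oversampling variant the authors used. The function name `random_oversample`, the role labels, and the toy messages are all hypothetical.

```python
import random
from collections import Counter


def random_oversample(texts, labels, seed=0):
    """Duplicate minority-class examples at random until every class
    has as many examples as the largest class (illustrative only)."""
    rng = random.Random(seed)

    # Group the examples by their class label.
    by_class = {}
    for text, label in zip(texts, labels):
        by_class.setdefault(label, []).append(text)

    # Every class is grown to the size of the largest class.
    target = max(len(items) for items in by_class.values())

    pairs = []
    for label, items in by_class.items():
        extra = [rng.choice(items) for _ in range(target - len(items))]
        pairs.extend((text, label) for text in items + extra)

    # Shuffle so duplicated examples are not clustered together.
    rng.shuffle(pairs)
    return [t for t, _ in pairs], [l for _, l in pairs]


# Hypothetical toy data; the labels are placeholders, not the dataset's schema.
texts = ["msg a", "msg b", "msg c", "msg d", "msg e", "msg f", "msg g"]
labels = ["harasser"] * 4 + ["victim"] * 2 + ["bystander_defender"]

balanced_texts, balanced_labels = random_oversample(texts, labels)
print(Counter(balanced_labels))
```

In practice, oversampling of this kind is usually applied to the training split only, so that duplicated examples do not leak into validation or test data and inflate the reported scores.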
Abstract
Social media has revolutionized communication, allowing people worldwide to connect and interact instantly. However, it has also led to increases in cyberbullying, which poses a significant threat to children and adolescents globally, affecting their mental health and well-being. It is critical to accurately detect the roles of individuals involved in cyberbullying incidents to effectively address the issue on a large scale. This study explores the use of machine learning models to detect the roles involved in cyberbullying interactions. After examining the AMiCA dataset and addressing class imbalance issues, we evaluate the performance of various models built with four underlying LLMs (i.e., BERT, RoBERTa, T5, and GPT-2) for role detection. Our analysis shows that oversampling techniques help improve model performance. The best model, a fine-tuned RoBERTa using oversampled data,…
Taxonomy
Topics: Hate Speech and Cyberbullying Detection · Bullying, Victimization, and Aggression · Social Media and Politics
Methods: Attention Is All You Need · Linear Layer · Linear Warmup With Linear Decay · Byte Pair Encoding · Dense Connections · Multi-Head Attention · Inverse Square Root Schedule · RoBERTa · Residual Connection
