Exploration and Evaluation of Bias in Cyberbullying Detection with   Machine Learning

Andrew Root; Liam Jakubowski; Mounika Vanamala

arXiv:2412.00609·cs.LG·December 3, 2024

Exploration and Evaluation of Bias in Cyberbullying Detection with Machine Learning

Andrew Root, Liam Jakubowski, Mounika Vanamala

PDF

Open Access

TL;DR

This paper investigates how biases from data collection and labeling affect cyberbullying detection models, emphasizing the importance of dataset curation and cross-dataset evaluation for real-world effectiveness.

Contribution

It provides a detailed analysis of bias sources in cyberbullying datasets and evaluates model generalization across different datasets, highlighting challenges in real-world deployment.

Findings

01

Models experience a significant drop in Macro F1 Score when tested on unseen datasets.

02

Biases from data collection and labeling significantly impact model performance.

03

Cross-dataset evaluation is crucial for assessing real-world applicability.

Abstract

It is well known that the usefulness of a machine learning model is due to its ability to generalize to unseen data. This study uses three popular cyberbullying datasets to explore the effects of data, how it's collected, and how it's labeled, on the resulting machine learning models. The bias introduced from differing definitions of cyberbullying and from data collection is discussed in detail. An emphasis is made on the impact of dataset expansion methods, which utilize current data points to fetch and label new ones. Furthermore, explicit testing is performed to evaluate the ability of a model to generalize to unseen datasets through cross-dataset evaluation. As hypothesized, the models have a significant drop in the Macro F1 Score, with an average drop of 0.222. As such, this study effectively highlights the importance of dataset curation and cross-dataset testing for creating…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection