Cyberbullying Identification Using Participant-Vocabulary Consistency
Elaheh Raisi, Bert Huang

TL;DR
This paper introduces a participant-vocabulary consistency model to identify cyberbullying by discovering instigators, victims, and new bullying vocabulary from social media data, addressing vocabulary change and social network structure.
Contribution
It presents a novel model that simultaneously detects cyberbullying participants and evolving vocabulary using social interaction data and a seed dictionary.
Findings
Effective detection of new bullying vocabulary
Successful identification of victims and bullies
Applicable to Twitter and Ask.fm datasets
Abstract
With the rise of social media, people can now form relationships and communities easily regardless of location, race, ethnicity, or gender. However, the power of social media simultaneously enables harmful online behavior such as harassment and bullying. Cyberbullying is a serious social problem, making it an important topic in social network analysis. Machine learning methods can potentially help provide better understanding of this phenomenon, but they must address several key challenges: the rapidly changing vocabulary involved in cyber- bullying, the role of social network structure, and the scale of the data. In this study, we propose a model that simultaneously discovers instigators and victims of bullying as well as new bullying vocabulary by starting with a corpus of social interactions and a seed dictionary of bullying indicators. We formulate an objective function based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Bullying, Victimization, and Aggression · Advanced Malware Detection Techniques
