A survey of textual cyber abuse detection using cutting-edge language models and large language models
Jose A. Diaz-Garcia, Joao Paulo Carvalho

TL;DR
This survey reviews how cutting-edge language models and large language models are used to detect and generate online abuse on social media, highlighting technological advances and ethical considerations.
Contribution
It provides a comprehensive overview of the role of LMs and LLMs in cyber abuse detection and generation, emphasizing recent innovations and challenges.
Findings
LLMs enhance automated detection of abusive content.
LLMs can generate harmful content, raising ethical concerns.
Technological advances improve detection but also pose risks.
Abstract
The success of social media platforms has facilitated the emergence of various forms of online abuse within digital communities. This abuse manifests in multiple ways, including hate speech, cyberbullying, emotional abuse, grooming, and sexting. In this paper, we present a comprehensive analysis of the different forms of abuse prevalent in social media, with a particular focus on how emerging technologies, such as Language Models (LMs) and Large Language Models (LLMs), are reshaping both the detection and generation of abusive content within these networks. We delve into the mechanisms through which social media abuse is perpetuated, exploring the psychological and social impact. Additionally, we examine the dual role of advanced language models-highlighting their potential to enhance automated detection systems for abusive behavior while also acknowledging their capacity to generate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Cybercrime and Law Enforcement Studies · Information and Cyber Security
MethodsFocus
