"bot lane noob" Towards Deployment of NLP-based Toxicity Detectors in Video Games

Jonas Ave; Irdin Pekaric; Matthias Frohner; Giovanni Apruzzese

arXiv:2604.10175·cs.CR·April 14, 2026

"bot lane noob" Towards Deployment of NLP-based Toxicity Detectors in Video Games

Jonas Ave, Irdin Pekaric, Matthias Frohner, Giovanni Apruzzese

PDF

TL;DR

This paper introduces a new dataset and an NLP-based toxicity detector specifically designed for live video game chat, demonstrating improved accuracy and practical deployment tools.

Contribution

It provides a fine-grained labeled dataset of in-game messages, develops a superior toxicity detector, and releases resources including a browser extension for real-time toxicity detection.

Findings

01

The detector outperforms existing general-purpose toxicity detectors.

02

The dataset contains 1.4k toxic and 13.8k non-toxic messages from LoL matches.

03

The browser extension effectively flags toxic content without third-party queries.

Abstract

Toxicity and harassment are widespread in the video-gaming context. Especially in competitive online multiplayer scenarios, gamers oftentimes send harmful messages to other players (teammates or opponents) whose consequences span from mild annoyance to withdrawal and depression. Abundant prior work tackled these problems, e.g., pointing out the negative effects of toxic interactions. However, few works proposed countermeasures specifically developed and tested on textual messages sent during a match -- i.e., when the "harassment" actually occurs. We posit that such a scarcity stems from the lack of high-quality datasets that can be used to devise "automated" detectors based on natural-language processing (NLP) and machine learning (ML), and which can -- ideally -- mitigate the harm of toxic comments during a gaming session. This work provides a foundation for addressing the problem of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.