"bot lane noob" Towards Deployment of NLP-based Toxicity Detectors in Video Games
Jonas Ave, Irdin Pekaric, Matthias Frohner, Giovanni Apruzzese

TL;DR
This paper introduces a new dataset and an NLP-based toxicity detector specifically designed for live video game chat, demonstrating improved accuracy and practical deployment tools.
Contribution
It provides a fine-grained labeled dataset of in-game messages, develops a superior toxicity detector, and releases resources including a browser extension for real-time toxicity detection.
Findings
The detector outperforms existing general-purpose toxicity detectors.
The dataset contains 1.4k toxic and 13.8k non-toxic messages from LoL matches.
The browser extension effectively flags toxic content without third-party queries.
Abstract
Toxicity and harassment are widespread in the video-gaming context. Especially in competitive online multiplayer scenarios, gamers oftentimes send harmful messages to other players (teammates or opponents) whose consequences span from mild annoyance to withdrawal and depression. Abundant prior work tackled these problems, e.g., pointing out the negative effects of toxic interactions. However, few works proposed countermeasures specifically developed and tested on textual messages sent during a match -- i.e., when the "harassment" actually occurs. We posit that such a scarcity stems from the lack of high-quality datasets that can be used to devise "automated" detectors based on natural-language processing (NLP) and machine learning (ML), and which can -- ideally -- mitigate the harm of toxic comments during a gaming session. This work provides a foundation for addressing the problem of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
