ToxiTwitch: Toward Emote-Aware Hybrid Moderation for Live Streaming Platforms
Baktash Ansari, Elias Martin, Afra Mashhadi

TL;DR
This paper explores emote-aware toxicity detection on Twitch, introducing ToxiTwitch, a hybrid model that combines large language model embeddings with traditional classifiers, achieving up to 80% accuracy.
Contribution
It presents ToxiTwitch, a novel hybrid approach integrating LLM embeddings of text and emotes with machine learning classifiers for improved toxicity detection.
Findings
Incorporating emotes enhances toxicity detection accuracy.
ToxiTwitch outperforms BERT-based models with up to 80% accuracy.
Hybrid models show promise in complex, multimodal chat environments.
Abstract
The rapid growth of live-streaming platforms such as Twitch has introduced complex challenges in moderating toxic behavior. Traditional moderation approaches, such as human annotation and keyword-based filtering, have demonstrated utility, but human moderators on Twitch constantly struggle to scale effectively in the fast-paced, high-volume, and context-rich chat environment of the platform while also facing harassment themselves. Recent advances in large language models (LLMs), such as DeepSeek-R1-Distill and Llama-3-8B-Instruct, offer new opportunities for toxicity detection, especially in understanding nuanced, multimodal communication involving emotes. In this work, we present an exploratory comparison of toxicity detection approaches tailored to Twitch. Our analysis reveals that incorporating emotes improves the detection of toxic behavior. To this end, we introduce ToxiTwitch, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Advanced Malware Detection Techniques · Spam and Phishing Detection
