Dynamic Content Moderation in Livestreams: Combining Supervised Classification with MLLM-Boosted Similarity Matching

Wei Chee Yew; Hailun Xu; Sanjay Saha; Xiaotian Fan; Hiok Hian Ong; David Yuchen Wang; Kanchan Sarkar; Zhenheng Yang; Danhui Guan

arXiv:2512.03553·cs.CV·January 29, 2026

Dynamic Content Moderation in Livestreams: Combining Supervised Classification with MLLM-Boosted Similarity Matching

Wei Chee Yew, Hailun Xu, Sanjay Saha, Xiaotian Fan, Hiok Hian Ong, David Yuchen Wang, Kanchan Sarkar, Zhenheng Yang, Danhui Guan

PDF

Open Access

TL;DR

This paper introduces a hybrid content moderation system for livestreams that combines supervised classification with MLLM-boosted similarity matching, effectively detecting both known violations and novel unwanted content.

Contribution

It presents a scalable, multimodal moderation framework that integrates supervised learning and reference-based matching, enhanced by large language models for improved accuracy.

Findings

01

Achieves 67% recall at 80% precision for classification

02

Achieves 76% recall at 80% precision for similarity matching

03

Reduces unwanted livestream views by 6-8% in production

Abstract

Content moderation remains a critical yet challenging task for large-scale user-generated video platforms, especially in livestreaming environments where moderation must be timely, multimodal, and robust to evolving forms of unwanted content. We present a hybrid moderation framework deployed at production scale that combines supervised classification for known violations with reference-based similarity matching for novel or subtle cases. This hybrid design enables robust detection of both explicit violations and novel edge cases that evade traditional classifiers. Multimodal inputs (text, audio, visual) are processed through both pipelines, with a multimodal large language model (MLLM) distilling knowledge into each to boost accuracy while keeping inference lightweight. In production, the classification pipeline achieves 67% recall at 80% precision, and the similarity pipeline achieves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis