MTikGuard System: A Transformer-Based Multimodal System for Child-Safe Content Moderation on TikTok

Dat Thanh Nguyen; Nguyen Hung Lam; Anh Hoang-Thi Nguyen; Trong-Hop Do

arXiv:2511.17955·cs.CL·November 25, 2025

MTikGuard System: A Transformer-Based Multimodal System for Child-Safe Content Moderation on TikTok

Dat Thanh Nguyen, Nguyen Hung Lam, Anh Hoang-Thi Nguyen, Trong-Hop Do

PDF

Open Access

TL;DR

MTikGuard is a real-time, multimodal content moderation system for TikTok that combines an expanded dataset, advanced fusion techniques, and scalable architecture to effectively detect harmful content targeting children.

Contribution

The paper introduces an extended TikHarm dataset, a novel multimodal classification framework, and a scalable streaming architecture for real-time harmful content detection on TikTok.

Findings

01

Achieved 89.37% accuracy and 89.45% F1-score in harmful content detection.

02

Expanded dataset with 4,723 labeled videos including diverse real-world samples.

03

Demonstrated effective deployment of multimodal fusion in a scalable streaming system.

Abstract

With the rapid rise of short-form videos, TikTok has become one of the most influential platforms among children and teenagers, but also a source of harmful content that can affect their perception and behavior. Such content, often subtle or deceptive, challenges traditional moderation methods due to the massive volume and real-time nature of uploads. This paper presents MTikGuard, a real-time multimodal harmful content detection system for TikTok, with three key contributions: (1) an extended TikHarm dataset expanded to 4,723 labeled videos by adding diverse real-world samples, (2) a multimodal classification framework integrating visual, audio, and textual features to achieve state-of-the-art performance with 89.37% accuracy and 89.45% F1-score, and (3) a scalable streaming architecture built on Apache Kafka and Apache Spark for real-time deployment. The results demonstrate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Misinformation and Its Impacts · Bullying, Victimization, and Aggression