FakeSV-VLM: Taming VLM for Detecting Fake Short-Video News via Progressive Mixture-Of-Experts Adapter

Junxi Wang; Yaxiong Wang; Lechao Cheng; Zhun Zhong

arXiv:2508.19639·cs.MM·August 28, 2025

FakeSV-VLM: Taming VLM for Detecting Fake Short-Video News via Progressive Mixture-Of-Experts Adapter

Junxi Wang, Yaxiong Wang, Lechao Cheng, Zhun Zhong

PDF

TL;DR

FakeSV-VLM is a novel framework that leverages large vision-language models with a mixture of experts and inconsistency detection modules to improve fake short-video news detection accuracy.

Contribution

This paper introduces FakeSV-VLM, a VLM-based approach with a progressive mixture of experts and inconsistency analysis for enhanced fake news detection in short videos.

Findings

01

Outperforms state-of-the-art models by +3.32% and +5.02% on benchmark datasets.

02

Effectively categorizes videos into four real/fake scenarios using specialized experts.

03

Demonstrates robustness and superior accuracy in fake news detection.

Abstract

We present FakeSV-VLM in this paper, a new VLM-based framework for detecting fake news on short video platforms. Despite significant efforts to combat this issue due to the severe threat that fake news videos pose to public information security, existing methods still fall short in detection accuracy, often due to lack of knowledge to verify the news is real or not. However, large Vision Language Models (VLMs) have absorbed extensive real-world knowledge from massive multimodal datasets. Motivated by this, we adapt advanced VLMs for fake news detection in short videos. Upon close examination of news samples, we observe that short video samples can be categorized into four distinct scenarios: both video and text are real (for real samples), or both are fake, or either the video or text is fake (for fake samples). Inspired by this insight, we design four experts tailored to handle each…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.