VMID: A Multimodal Fusion LLM Framework for Detecting and Identifying Misinformation of Short Videos
Weihao Zhong, Yinhao Xiao, Minghui Xu, Xiuzhen Cheng

TL;DR
This paper introduces VMID, a multimodal fusion framework utilizing large language models to detect misinformation in short videos, significantly improving accuracy over existing single-modal methods.
Contribution
The paper presents a novel multimodal fake news detection framework that combines multi-level video analysis with large language models for enhanced misinformation identification.
Findings
Achieves 90.93% accuracy in fake news detection.
Outperforms baseline models like SV-FEND with 81.05% accuracy.
Demonstrates robustness and effectiveness in real-world scenarios.
Abstract
Short video platforms have become important channels for news dissemination, offering a highly engaging and immediate way for users to access current events and share information. However, these platforms have also emerged as significant conduits for the rapid spread of misinformation, as fake news and rumors can leverage the visual appeal and wide reach of short videos to circulate extensively among audiences. Existing fake news detection methods mainly rely on single-modal information, such as text or images, or apply only basic fusion techniques, limiting their ability to handle the complex, multi-layered information inherent in short videos. To address these limitations, this paper presents a novel fake news detection method based on multimodal information, designed to identify misinformation through a multi-level analysis of video content. This approach effectively utilizes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Network Security and Intrusion Detection · Spam and Phishing Detection
