MM-FusionNet: Context-Aware Dynamic Fusion for Multi-modal Fake News Detection with Large Vision-Language Models
Junhao He, Tianyu Liu, Jingyuan Zhao, Benjamin Turner

TL;DR
This paper presents MM-FusionNet, a novel framework using large vision-language models with a context-aware dynamic fusion module to improve multi-modal fake news detection by adaptively prioritizing textual and visual information.
Contribution
Introduction of the CADFM module that employs bi-directional attention and dynamic gating for effective multi-modal fusion in fake news detection.
Findings
Achieves a state-of-the-art F1-score of 0.938 on LMFND dataset.
Outperforms existing multi-modal and single-modal baselines.
Demonstrates robustness and interpretability in real-world scenarios.
Abstract
The proliferation of multi-modal fake news on social media poses a significant threat to public trust and social stability. Traditional detection methods, primarily text-based, often fall short due to the deceptive interplay between misleading text and images. While Large Vision-Language Models (LVLMs) offer promising avenues for multi-modal understanding, effectively fusing diverse modal information, especially when their importance is imbalanced or contradictory, remains a critical challenge. This paper introduces MM-FusionNet, an innovative framework leveraging LVLMs for robust multi-modal fake news detection. Our core contribution is the Context-Aware Dynamic Fusion Module (CADFM), which employs bi-directional cross-modal attention and a novel dynamic modal gating network. This mechanism adaptively learns and assigns importance weights to textual and visual features based on their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Spam and Phishing Detection
