MMD-Thinker: Adaptive Multi-Dimensional Thinking for Multimodal Misinformation Detection
Junjie Wu, Guohong Fu

TL;DR
MMD-Thinker introduces an adaptive multi-dimensional reasoning framework for multimodal misinformation detection, leveraging tailored thinking modes, instruction tuning, and reinforcement learning to improve accuracy and robustness against evolving misinformation.
Contribution
The paper presents a novel two-stage framework with tailored thinking modes and reinforcement learning, along with a new dataset for multimodal misinformation detection, addressing reasoning limitations of existing models.
Findings
Achieves state-of-the-art performance on multiple benchmarks.
Effectively models complex multimodal misinformation reasoning.
Maintains flexible inference and token efficiency.
Abstract
Multimodal misinformation floods on various social media, and continues to evolve in the era of AI-generated content (AIGC). The emerged misinformation with low creation cost and high deception poses significant threats to society. While recent studies leverage general-purpose multimodal large language models (MLLMs) to achieve remarkable results in detection, they encounter two critical limitations: (1) Insufficient reasoning, where general-purpose MLLMs often follow the uniform reasoning paradigm but generate inaccurate explanations and judgments, due to the lack of the task-specific knowledge of multimodal misinformation detection. (2) Reasoning biases, where a single thinking mode make detectors a suboptimal path for judgment, struggling to keep pace with the fast-growing and intricate multimodal misinformation. In this paper, we propose MMD-Thinker, a two-stage framework for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Topic Modeling · Explainable Artificial Intelligence (XAI)
