T^2Agent A Tool-augmented Multimodal Misinformation Detection Agent with Monte Carlo Tree Search
Xing Cui, Yueying Zou, Zekun Li, Peipei Li, Xinyuan Xu, Xuannan Liu, Huaibo Huang

TL;DR
This paper introduces T^2Agent, a tool-augmented multimodal misinformation detection system that uses Monte Carlo Tree Search to dynamically select and coordinate multiple verification tools for improved detection accuracy.
Contribution
The paper presents a novel agent framework combining an extensible toolkit with MCTS and a multi-source verification strategy for effective multimodal misinformation detection.
Findings
Outperforms existing baselines on challenging benchmarks.
Effectively integrates multiple verification tools via a greedy selector.
Demonstrates strong potential as a training-free detector.
Abstract
Real-world multimodal misinformation often arises from mixed forgery sources, requiring dynamic reasoning and adaptive verification. However, existing methods mainly rely on static pipelines and limited tool usage, limiting their ability to handle such complexity and diversity. To address this challenge, we propose \method, a novel misinformation detection agent that incorporates an extensible toolkit with Monte Carlo Tree Search (MCTS). The toolkit consists of modular tools such as web search, forgery detection, and consistency analysis. Each tool is described using standardized templates, enabling seamless integration and future expansion. To avoid inefficiency from using all tools simultaneously, a greedy search-based selector is proposed to identify a task-relevant subset. This subset then serves as the action space for MCTS to dynamically collect evidence and perform multi-source…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNetwork Security and Intrusion Detection · Spam and Phishing Detection
MethodsALIGN
