Is Your LLM-Based Multi-Agent a Reliable Real-World Planner? Exploring Fraud Detection in Travel Planning

Junchi Yao; Jianhua Xu; Tianyu Xin; Ziyi Wang; Shenzhe Zhu; Shu Yang; Di Wang

arXiv:2505.16557·cs.MA·June 16, 2025

Is Your LLM-Based Multi-Agent a Reliable Real-World Planner? Exploring Fraud Detection in Travel Planning

Junchi Yao, Jianhua Xu, Tianyu Xin, Ziyi Wang, Shenzhe Zhu, Shu Yang, Di Wang

PDF

Open Access

TL;DR

This paper introduces WandaPlan, an evaluation environment for assessing the reliability of LLM-based multi-agent planning systems in real-world scenarios, especially focusing on detecting various types of fraud to improve trustworthiness.

Contribution

The paper presents WandaPlan, a novel evaluation framework that models real-world data with deceptive content and assesses the vulnerabilities of current planning systems to fraud.

Findings

01

Existing systems are vulnerable to misinformation and coordinated fraud.

02

WandaPlan effectively simulates real-world fraud scenarios for testing.

03

An anti-fraud agent can enhance the reliability of planning systems.

Abstract

The rise of Large Language Model-based Multi-Agent Planning has leveraged advanced frameworks to enable autonomous and collaborative task execution. Some systems rely on platforms like review sites and social media, which are prone to fraudulent information, such as fake reviews or misleading descriptions. This reliance poses risks, potentially causing financial losses and harming user experiences. To evaluate the risk of planning systems in real-world applications, we introduce \textbf{WandaPlan}, an evaluation environment mirroring real-world data and injected with deceptive content. We assess system performance across three fraud cases: Misinformation Fraud, Team-Coordinated Multi-Person Fraud, and Level-Escalating Multi-Round Fraud. We reveal significant weaknesses in existing frameworks that prioritize task efficiency over data authenticity. At the same time, we validate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning · Multimodal Machine Learning Applications · Multi-Agent Systems and Negotiation