Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements

Shu Yang; Shenzhe Zhu; Zeyu Wu; Keyu Wang; Junchi Yao; Junchao Wu; Lijie Hu; Mengdi Li; Derek F. Wong; Di Wang

arXiv:2502.12904·cs.CL·May 27, 2025

Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements

Shu Yang, Shenzhe Zhu, Zeyu Wu, Keyu Wang, Junchi Yao, Junchao Wu, Lijie Hu, Mengdi Li, Derek F. Wong, Di Wang

PDF

Open Access 1 Repo 1 Datasets

TL;DR

Fraud-R1 is a comprehensive benchmark with multi-round evaluation to test LLMs' robustness against various internet fraud and phishing tactics across different languages and interaction settings.

Contribution

Introduces Fraud-R1, a multi-round benchmark with diverse fraud cases and evaluation scenarios to assess LLMs' resistance to online fraud and phishing.

Findings

01

LLMs struggle more in role-play settings and with fake job postings.

02

Significant performance gap exists between Chinese and English LLMs.

03

Multi-round evaluation reveals vulnerabilities in LLM defenses against fraud.

Abstract

We introduce Fraud-R1, a benchmark designed to evaluate LLMs' ability to defend against internet fraud and phishing in dynamic, real-world scenarios. Fraud-R1 comprises 8,564 fraud cases sourced from phishing scams, fake job postings, social media, and news, categorized into 5 major fraud types. Unlike previous benchmarks, Fraud-R1 introduces a multi-round evaluation pipeline to assess LLMs' resistance to fraud at different stages, including credibility building, urgency creation, and emotional manipulation. Furthermore, we evaluate 15 LLMs under two settings: 1. Helpful-Assistant, where the LLM provides general decision-making assistance, and 2. Role-play, where the model assumes a specific persona, widely used in real-world agent-based interactions. Our evaluation reveals the significant challenges in defending against fraud and phishing inducement, especially in role-play settings…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mala-lab/anomalygfm
pytorch

Datasets

Chouoftears/Fraud-R1-LLM-Defense-Fraud-Benchmark
dataset· 53 dl
53 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImbalanced Data Classification Techniques · Financial Distress and Bankruptcy Prediction · Auction Theory and Applications