Exposing LLM Vulnerabilities: Adversarial Scam Detection and Performance

Chen-Wei Chang; Shailik Sarkar; Shutonu Mitra; Qi Zhang; Hossein Salemi; Hemant Purohit; Fengxiu Zhang; Michin Hong; Jin-Hee Cho; Chang-Tien Lu

arXiv:2412.00621·cs.CR·November 5, 2025·2 cites

Exposing LLM Vulnerabilities: Adversarial Scam Detection and Performance

Chen-Wei Chang, Shailik Sarkar, Shutonu Mitra, Qi Zhang, Hossein Salemi, Hemant Purohit, Fengxiu Zhang, Michin Hong, Jin-Hee Cho, Chang-Tien Lu

PDF

Open Access

TL;DR

This paper examines the vulnerabilities of Large Language Models in scam detection by creating a detailed dataset with adversarial examples, revealing high misclassification rates and proposing strategies to enhance robustness.

Contribution

It introduces a comprehensive dataset with adversarial scam messages and analyzes LLM vulnerabilities, offering methods to improve their robustness against adversarial attacks.

Findings

01

Adversarial examples significantly increase misclassification rates.

02

A detailed dataset with nuanced scam types was created.

03

Proposed strategies improve LLM robustness against adversarial scams.

Abstract

Can we trust Large Language Models (LLMs) to accurately predict scam? This paper investigates the vulnerabilities of LLMs when facing adversarial scam messages for the task of scam detection. We addressed this issue by creating a comprehensive dataset with fine-grained labels of scam messages, including both original and adversarial scam messages. The dataset extended traditional binary classes for the scam detection task into more nuanced scam types. Our analysis showed how adversarial examples took advantage of vulnerabilities of a LLM, leading to high misclassification rate. We evaluated the performance of LLMs on these adversarial scam messages and proposed strategies to improve their robustness.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Security and Intrusion Detection · Advanced Malware Detection Techniques · Spam and Phishing Detection