ScamFerret: Detecting Scam Websites Autonomously with Large Language Models

Hiroki Nakano; Takashi Koide; Daiki Chiba

arXiv:2502.10110·cs.CR·February 17, 2025

TL;DR

ScamFerret uses large language models to autonomously detect scam websites across multiple languages by analyzing web content, DNS, and reviews, achieving high accuracy without additional training.

Contribution

Introduces ScamFerret, an LLM-based agent system that identifies scam websites without training or fine-tuning, leveraging natural language understanding for multilingual detection.

Findings

01  Achieves 97.2% accuracy when classifying four types of scam websites in English, using GPT-4

02  Achieves 99.3% accuracy when classifying online shopping websites across three languages, using GPT-4

03  Effectively analyzes external web data for scam identification

Abstract

With the rise of sophisticated scam websites that exploit human psychological vulnerabilities, distinguishing between legitimate and scam websites has become increasingly challenging. This paper presents ScamFerret, an innovative agent system employing a large language model (LLM) to autonomously collect and analyze data from a given URL to determine whether it is a scam. Unlike traditional machine learning models that require large datasets and feature engineering, ScamFerret leverages LLMs' natural language understanding to accurately identify scam websites of various types and languages without requiring additional training or fine-tuning. Our evaluation demonstrated that ScamFerret achieves 0.972 accuracy in classifying four scam types in English and 0.993 accuracy in classifying online shopping websites across three different languages, particularly when using GPT-4. Furthermore,…
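
The collect-then-classify loop described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper or its artifact: the tool names (`web_content`, `dns`, `reviews`) follow the data sources named in the TL;DR, the tool bodies are stubs that return placeholder strings, and `scripted_decide` is a hypothetical stand-in for the LLM, with an arbitrary keyword rule in place of real reasoning.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical tool stubs. A real agent would fetch live data here.
def fetch_web_content(url: str) -> str:
    return f"[stub] page text for {url}"

def lookup_dns(url: str) -> str:
    return f"[stub] DNS records for {url}"

def search_reviews(url: str) -> str:
    return f"[stub] external reviews mentioning {url}"

TOOLS: Dict[str, Callable[[str], str]] = {
    "web_content": fetch_web_content,
    "dns": lookup_dns,
    "reviews": search_reviews,
}

Evidence = List[Tuple[str, str]]   # (tool name, tool output)
Decision = Tuple[str, str]         # ("call", tool name) or ("answer", label)

@dataclass
class Verdict:
    url: str
    is_scam: bool
    evidence: Evidence = field(default_factory=list)

def run_agent(url: str,
              decide: Callable[[str, Evidence], Decision],
              max_steps: int = 5) -> Verdict:
    """Ask `decide` for the next action until it answers or the budget runs out."""
    evidence: Evidence = []
    for _ in range(max_steps):
        action, value = decide(url, evidence)
        if action == "answer":
            return Verdict(url, value == "scam", evidence)
        if action != "call" or value not in TOOLS:
            raise ValueError(f"unknown action: {action!r} {value!r}")
        evidence.append((value, TOOLS[value](url)))
    raise RuntimeError("no verdict within the step budget")

def scripted_decide(url: str, evidence: Evidence) -> Decision:
    """Placeholder for the LLM: call each tool once, then answer.

    The keyword check is an arbitrary stand-in for the model's judgment.
    """
    used = {name for name, _ in evidence}
    for name in TOOLS:
        if name not in used:
            return ("call", name)
    return ("answer", "scam" if "free-prize" in url else "legitimate")

if __name__ == "__main__":
    verdict = run_agent("http://shop.example", scripted_decide)
    print(verdict.is_scam, [name for name, _ in verdict.evidence])
```

In a real system, `decide` would wrap a call to an LLM that sees the URL and the evidence gathered so far and chooses the next tool or a final label. Passing it in as a plain callable keeps the loop testable without network access or an API key.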

Code & Models

Repositories

ScamFerret/artifact (Official)

Taxonomy

Topics: Spam and Phishing Detection · Misinformation and Its Impacts · Topic Modeling