CLASP: Cost-Optimized LLM-based Agentic System for Phishing Detection
Fouad Trad, Ali Chehab

TL;DR
CLASP is a cost-effective LLM-based system that detects phishing websites by analyzing URLs, screenshots, and HTML content, achieving high accuracy with optimized operational costs.
Contribution
Introduces CLASP, a novel multi-agent LLM system for phishing detection that balances performance and cost through strategic agent combination and evaluation.
Findings
Gemini 1.5 Flash achieved 83.01% F1 score.
System processes websites in 2.78 seconds on average.
Cost per 1,000 websites is approximately $3.18.
Abstract
Phishing websites remain a significant cybersecurity threat, necessitating accurate and cost-effective detection mechanisms. In this paper, we present CLASP, a novel system that effectively identifies phishing websites by leveraging multiple intelligent agents, built using large language models (LLMs), to analyze different aspects of a web resource. The system processes URLs or QR codes, employing specialized LLM-based agents that evaluate the URL structure, webpage screenshot, and HTML content to predict potential phishing threats. To optimize performance while minimizing operational costs, we experimented with multiple combination strategies for agent-based analysis, ultimately designing a strategic combination that ensures the per-website evaluation expense remains minimal without compromising detection accuracy. We tested various LLMs, including Gemini 1.5 Flash and GPT-4o mini, to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Advanced Malware Detection Techniques
