From ML to LLM: Evaluating the Robustness of Phishing Webpage Detection Models against Adversarial Attacks

Aditya Kulkarni; Vivek Balachandran; Dinil Mon Divakaran; Tamal Das

arXiv:2407.20361·cs.CR·May 27, 2025·2 cites

From ML to LLM: Evaluating the Robustness of Phishing Webpage Detection Models against Adversarial Attacks

Aditya Kulkarni, Vivek Balachandran, Dinil Mon Divakaran, Tamal Das

PDF

Open Access

TL;DR

This paper introduces PhishOracle, a tool for generating diverse adversarial phishing webpages, and evaluates the robustness of existing detection models, revealing significant vulnerabilities and the need for more resilient solutions.

Contribution

We develop PhishOracle to create diverse adversarial phishing webpages and evaluate the robustness of current detection models, highlighting their vulnerabilities against sophisticated attacks.

Findings

01

Detection models show significant drop in accuracy against adversarial webpages

02

Multimodal large language model-based detectors are more robust but still vulnerable

03

Many adversarial phishing webpages can deceive both models and users

Abstract

Phishing attacks attempt to deceive users into stealing sensitive information, posing a significant cybersecurity threat. Advances in machine learning (ML) and deep learning (DL) have led to the development of numerous phishing webpage detection solutions, but these models remain vulnerable to adversarial attacks. Evaluating their robustness against adversarial phishing webpages is essential. Existing tools contain datasets of pre-designed phishing webpages for a limited number of brands, and lack diversity in phishing features. To address these challenges, we develop PhishOracle, a tool that generates adversarial phishing webpages by embedding diverse phishing features into legitimate webpages. We evaluate the robustness of three existing task-specific models - Stack model, VisualPhishNet, and Phishpedia - against PhishOracle-generated adversarial phishing webpages and observe a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Advanced Malware Detection Techniques