Evaluating the Effectiveness and Robustness of Visual Similarity-based   Phishing Detection Models

Fujiao Ji; Kiho Lee; Hyungjoon Koo; Wenhao You; Euijin Choo,; Hyoungshick Kim; and Doowon Kim

arXiv:2405.19598·cs.CR·January 31, 2025

Evaluating the Effectiveness and Robustness of Visual Similarity-based Phishing Detection Models

Fujiao Ji, Kiho Lee, Hyungjoon Koo, Wenhao You, Euijin Choo,, Hyoungshick Kim, and Doowon Kim

PDF

Open Access

TL;DR

This study evaluates the real-world effectiveness and robustness of visual similarity-based phishing detection models, revealing significant vulnerabilities and emphasizing the need for more resilient techniques against sophisticated evasion strategies.

Contribution

It provides a comprehensive large-scale evaluation of existing models' performance on real-world data and introduces adversarial testing to assess their robustness against evasion tactics.

Findings

01

High accuracy on curated datasets but low on real-world data

02

Attackers evade detection through logo mimicking and simple manipulations

03

Several models show vulnerabilities to adversarial logo manipulations

Abstract

Phishing attacks pose a significant threat to Internet users, with cybercriminals elaborately replicating the visual appearance of legitimate websites to deceive victims. Visual similarity-based detection systems have emerged as an effective countermeasure, but their effectiveness and robustness in real-world scenarios have been underexplored. In this paper, we comprehensively scrutinize and evaluate the effectiveness and robustness of popular visual similarity-based anti-phishing models using a large-scale dataset of 451k real-world phishing websites. Our analyses of the effectiveness reveal that while certain visual similarity-based models achieve high accuracy on curated datasets in the experimental settings, they exhibit notably low performance on real-world datasets, highlighting the importance of real-world evaluation. Furthermore, we find that the attackers evade the detectors…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Text and Document Classification Technologies