Attacking logo-based phishing website detectors with adversarial perturbations
Jehyun Lee, Zhe Xin, Melanie Ng Pei See, Kanav Sabharwal, Giovanni, Apruzzese, Dinil Mon Divakaran

TL;DR
This paper demonstrates that deep learning-based logo phishing detectors are vulnerable to adversarial perturbations, which can craft subtle logo modifications that evade detection and deceive human users with high success rates.
Contribution
The authors introduce a novel generative adversarial attack method targeting logo-based phishing detectors, revealing their vulnerability against adversarial examples.
Findings
Adversarial logos can evade detection with up to 95% success rate.
Humans often fail to distinguish adversarial logos from original ones.
The attack is effective on real-world logo datasets.
Abstract
Recent times have witnessed the rise of anti-phishing schemes powered by deep learning (DL). In particular, logo-based phishing detectors rely on DL models from Computer Vision to identify logos of well-known brands on webpages, to detect malicious webpages that imitate a given brand. For instance, Siamese networks have demonstrated notable performance for these tasks, enabling the corresponding anti-phishing solutions to detect even "zero-day" phishing webpages. In this work, we take the next step of studying the robustness of logo-based phishing detectors against adversarial ML attacks. We propose a novel attack exploiting generative adversarial perturbations to craft "adversarial logos" that evade phishing detectors. We evaluate our attacks through: (i) experiments on datasets containing real logos, to evaluate the robustness of state-of-the-art phishing detectors; and (ii) user…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Spam and Phishing Detection · Hate Speech and Cyberbullying Detection
