Robustness, Cost, and Attack-Surface Concentration in Phishing Detection
Julian Allagan, Mohamed Elbakary, Zohreh Safari, Weizheng Gao, Gabrielle Morgan, Essence Morgan, Vladimir Deriglazov

TL;DR
This paper investigates the robustness of phishing detection models against feature manipulation attacks, revealing that adversarial vulnerability is primarily driven by feature economics rather than model complexity.
Contribution
It introduces a cost-aware evasion framework, diagnostics for robustness, and formalizes the convergence of robustness across models based on feature economics.
Findings
Robustness converges across models under budgeted evasion.
Most successful evasions target a few low-cost features.
Feature restriction improves robustness only when removing dominant features.
Abstract
Phishing detectors built on engineered website features attain near-perfect accuracy under i.i.d.\ evaluation, yet deployment security depends on robustness to post-deployment feature manipulation. We study this gap through a cost-aware evasion framework that models discrete, monotone feature edits under explicit attacker budgets. Three diagnostics are introduced: minimal evasion cost (MEC), the evasion survival rate , and the robustness concentration index (RCI). On the UCI Phishing Websites benchmark (11\,055 instances, 30 ternary features), Logistic Regression, Random Forests, Gradient Boosted Trees, and XGBoost all achieve under static evaluation. Under budgeted sanitization-style evasion, robustness converges across architectures: the median MEC equals 2 with full features, and over 80\% of successful minimal-cost evasions concentrate on three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Advanced Malware Detection Techniques · Adversarial Robustness in Machine Learning
