Robustness, Cost, and Attack-Surface Concentration in Phishing Detection

Julian Allagan; Mohamed Elbakary; Zohreh Safari; Weizheng Gao; Gabrielle Morgan; Essence Morgan; Vladimir Deriglazov

arXiv:2603.19204·cs.LG·March 20, 2026

Robustness, Cost, and Attack-Surface Concentration in Phishing Detection

Julian Allagan, Mohamed Elbakary, Zohreh Safari, Weizheng Gao, Gabrielle Morgan, Essence Morgan, Vladimir Deriglazov

PDF

Open Access

TL;DR

This paper investigates the robustness of phishing detection models against feature manipulation attacks, revealing that adversarial vulnerability is primarily driven by feature economics rather than model complexity.

Contribution

It introduces a cost-aware evasion framework, diagnostics for robustness, and formalizes the convergence of robustness across models based on feature economics.

Findings

01

Robustness converges across models under budgeted evasion.

02

Most successful evasions target a few low-cost features.

03

Feature restriction improves robustness only when removing dominant features.

Abstract

Phishing detectors built on engineered website features attain near-perfect accuracy under i.i.d.\ evaluation, yet deployment security depends on robustness to post-deployment feature manipulation. We study this gap through a cost-aware evasion framework that models discrete, monotone feature edits under explicit attacker budgets. Three diagnostics are introduced: minimal evasion cost (MEC), the evasion survival rate $S (B)$ , and the robustness concentration index (RCI). On the UCI Phishing Websites benchmark (11\,055 instances, 30 ternary features), Logistic Regression, Random Forests, Gradient Boosted Trees, and XGBoost all achieve $AUC \geq 0.979$ under static evaluation. Under budgeted sanitization-style evasion, robustness converges across architectures: the median MEC equals 2 with full features, and over 80\% of successful minimal-cost evasions concentrate on three…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Advanced Malware Detection Techniques · Adversarial Robustness in Machine Learning