Phishing the Phishers with SpecularNet: Hierarchical Graph Autoencoding for Reference-Free Web Phishing Detection
Tailai Song, Pedro Casas, Michela Meo

TL;DR
SpecularNet is a lightweight, reference-free web phishing detection framework that models DOM structures hierarchically, achieving high accuracy with significantly faster inference and better scalability than existing methods.
Contribution
It introduces a novel hierarchical graph autoencoding architecture for reference-free phishing detection, rivaling heavyweight systems in accuracy while being more practical and scalable.
Findings
Achieves 93.9% F1 score on benchmark datasets.
Reduces inference time from seconds to 20 milliseconds per webpage.
Demonstrates robustness and effectiveness in real-world and adversarial scenarios.
Abstract
Phishing remains the most pervasive threat to the Web, enabling large-scale credential theft and financial fraud through deceptive webpages. While recent reference-based and generative-AI-driven phishing detectors achieve strong accuracy, their reliance on external knowledge bases, cloud services, and complex multimodal pipelines fundamentally limits practicality, scalability, and reproducibility. In contrast, conventional deep learning approaches often fail to generalize to evolving phishing campaigns. We introduce SpecularNet, a novel lightweight framework for reference-free web phishing detection that demonstrates how carefully designed compact architectures can rival heavyweight systems. SpecularNet operates solely on the domain name and HTML structure, modeling the Document Object Model (DOM) as a tree and leveraging a hierarchical graph autoencoding architecture with directional,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Advanced Malware Detection Techniques
