PhishLang: A Real-Time, Fully Client-Side Phishing Detection Framework Using MobileBERT
Sayak Saha Roy, Shirin Nilizadeh

TL;DR
PhishLang is a fully client-side, real-time anti-phishing framework using MobileBERT and ensemble methods to detect evolving phishing threats while preserving user privacy.
Contribution
It introduces the first lightweight, client-side phishing detection framework based on MobileBERT with a multi-modal ensemble approach for improved accuracy.
Findings
Outperforms popular anti-phishing tools in detection accuracy
Detects zero-day and evasive phishing attacks effectively
Operates entirely locally without external blocklists
Abstract
In this paper, we introduce PhishLang, the first fully client-side anti-phishing framework built on a lightweight ensemble framework that utilizes advanced language models to analyze the contextual features of a website's source code and URL. Unlike traditional heuristic or machine learning approaches that rely on static features and struggle to adapt to evolving threats, or deep learning models that are computationally intensive, our approach utilizes MobileBERT, a fast and memory-efficient variant of the BERT architecture, to capture nuanced features indicative of phishing attacks. To further enhance detection accuracy, PhishLang employs a multi-modal ensemble approach, combining both the URL and Source detection models. This architecture ensures robustness by allowing one model to compensate for scenarios where the other may fail, or if both models provide ambiguous inferences. As a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Topic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · WordPiece · Linear Warmup With Linear Decay · BERT · MobileBERT · Linear Layer · Attention Dropout · Residual Connection
