Web Phishing Net (WPN): A scalable machine learning approach for real-time phishing campaign detection
Muhammad Fahad Zia, Sri Harish Kalidass

TL;DR
This paper introduces a scalable, privacy-preserving unsupervised machine learning method for real-time detection of phishing campaigns, including AI-generated threats, outperforming traditional approaches in speed and detection rate.
Contribution
The paper presents a novel unsupervised learning approach that is scalable, fast, and capable of detecting entire phishing campaigns without pair-wise comparisons, addressing limitations of existing methods.
Findings
High detection rate for phishing campaigns
Effective against AI-generated phishing URLs
Scalable and privacy-preserving detection method
Abstract
Phishing is the most prevalent type of cyber-attack today and is recognized as the leading source of data breaches with significant consequences for both individuals and corporations. Web-based phishing attacks are the most frequent with vectors such as social media posts and emails containing links to phishing URLs that once clicked on render host systems vulnerable to more sinister attacks. Research efforts to detect phishing URLs have involved the use of supervised learning techniques that use large amounts of data to train models and have high computational requirements. They also involve analysis of features derived from vectors including email contents thus affecting user privacy. Additionally, they suffer from a lack of resilience against evolution of threats especially with the advent of generative AI techniques to bypass these systems as with AI-generated phishing URLs.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Sentiment Analysis and Opinion Mining
