PhishGuard: A Multi-Layered Ensemble Model for Optimal Phishing Website   Detection

Md Sultanul Islam Ovi; Md. Hasibur Rahman; and Mohammad Arif Hossain

arXiv:2409.19825·cs.CR·October 1, 2024

PhishGuard: A Multi-Layered Ensemble Model for Optimal Phishing Website Detection

Md Sultanul Islam Ovi, Md. Hasibur Rahman, and Mohammad Arif Hossain

PDF

Open Access

TL;DR

PhishGuard is a multi-layered ensemble machine learning model that significantly improves phishing website detection accuracy by combining multiple classifiers and advanced feature selection techniques.

Contribution

The paper introduces PhishGuard, a novel ensemble model that integrates several classifiers with optimized feature selection and tuning for superior phishing detection.

Findings

01

Achieved 99.05% detection accuracy on one dataset.

02

Outperformed existing state-of-the-art models.

03

Demonstrated effectiveness of ensemble learning with optimization techniques.

Abstract

Phishing attacks are a growing cybersecurity threat, leveraging deceptive techniques to steal sensitive information through malicious websites. To combat these attacks, this paper introduces PhishGuard, an optimal custom ensemble model designed to improve phishing site detection. The model combines multiple machine learning classifiers, including Random Forest, Gradient Boosting, CatBoost, and XGBoost, to enhance detection accuracy. Through advanced feature selection methods such as SelectKBest and RFECV, and optimizations like hyperparameter tuning and data balancing, the model was trained and evaluated on four publicly available datasets. PhishGuard outperformed state-of-the-art models, achieving a detection accuracy of 99.05% on one of the datasets, with similarly high results across other datasets. This research demonstrates that optimization methods in conjunction with ensemble…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Web Data Mining and Analysis