A Predicting Phishing Websites Using Support Vector Machine and MultiClass Classification Based on Association Rule Techniques

Nancy C. Woods; Virtue Ene Agada; Adebola K. Ojo

arXiv:2505.21141·cs.LG·May 28, 2025

A Predicting Phishing Websites Using Support Vector Machine and MultiClass Classification Based on Association Rule Techniques

Nancy C. Woods, Virtue Ene Agada, Adebola K. Ojo

PDF

Open Access

TL;DR

This paper presents a hybrid approach combining Support Vector Machines and Multi-Class Classification Rules based on Association Rules to improve the accuracy of phishing website detection, achieving over 98% accuracy and high AUC.

Contribution

The study introduces a novel hybrid model integrating SVM and MCAR techniques for more effective phishing website prediction, outperforming individual methods.

Findings

01

98.30% classification accuracy

02

98% Area under the Curve (AUC)

03

82.84% variance in prediction

Abstract

Phishing is a semantic attack which targets the user rather than the computer. It is a new Internet crime in comparison with other forms such as virus and hacking. Considering the damage phishing websites has caused to various economies by collapsing organizations, stealing information and financial diversion, various researchers have embarked on different ways of detecting phishing websites but there has been no agreement about the best algorithm to be used for prediction. This study is interested in integrating the strengths of two algorithms, Support Vector Machines (SVM) and Multi-Class Classification Rules based on Association Rules (MCAR) to establish a strong and better means of predicting phishing websites. A total of 11,056 websites were used from both PhishTank and yahoo directory to verify the effectiveness of this approach. Feature extraction and rules generation were done…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Text and Document Classification Technologies