A Predicting Phishing Websites Using Support Vector Machine and MultiClass Classification Based on Association Rule Techniques
Nancy C. Woods, Virtue Ene Agada, Adebola K. Ojo

TL;DR
This paper presents a hybrid approach combining Support Vector Machines and Multi-Class Classification Rules based on Association Rules to improve the accuracy of phishing website detection, achieving over 98% accuracy and high AUC.
Contribution
The study introduces a novel hybrid model integrating SVM and MCAR techniques for more effective phishing website prediction, outperforming individual methods.
Findings
98.30% classification accuracy
98% Area under the Curve (AUC)
82.84% variance in prediction
Abstract
Phishing is a semantic attack which targets the user rather than the computer. It is a new Internet crime in comparison with other forms such as virus and hacking. Considering the damage phishing websites has caused to various economies by collapsing organizations, stealing information and financial diversion, various researchers have embarked on different ways of detecting phishing websites but there has been no agreement about the best algorithm to be used for prediction. This study is interested in integrating the strengths of two algorithms, Support Vector Machines (SVM) and Multi-Class Classification Rules based on Association Rules (MCAR) to establish a strong and better means of predicting phishing websites. A total of 11,056 websites were used from both PhishTank and yahoo directory to verify the effectiveness of this approach. Feature extraction and rules generation were done…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Text and Document Classification Technologies
