AntiPhishStack: LSTM-based Stacked Generalization Model for Optimized   Phishing URL Detection

Saba Aslam; Hafsa Aslam; Arslan Manzoor; Chen Hui; Abdur Rasool

arXiv:2401.08947·cs.CR·January 23, 2024·1 cites

AntiPhishStack: LSTM-based Stacked Generalization Model for Optimized Phishing URL Detection

Saba Aslam, Hafsa Aslam, Arslan Manzoor, Chen Hui, Abdur Rasool

PDF

Open Access

TL;DR

AntiPhishStack is a novel two-phase deep learning model that combines LSTM, stacking, and meta-classification to improve phishing URL detection without relying on prior phishing-specific features, achieving high accuracy.

Contribution

The paper introduces AntiPhishStack, a unique stacked generalization model utilizing LSTM and feature symmetry, advancing phishing detection methods beyond traditional feature-based approaches.

Findings

01

Achieved 96.04% accuracy on benchmark datasets.

02

Demonstrated superior performance over existing phishing detection models.

03

Validated effectiveness of symmetrical feature learning in cybersecurity.

Abstract

The escalating reliance on revolutionary online web services has introduced heightened security risks, with persistent challenges posed by phishing despite extensive security measures. Traditional phishing systems, reliant on machine learning and manual features, struggle with evolving tactics. Recent advances in deep learning offer promising avenues for tackling novel phishing challenges and malicious URLs. This paper introduces a two-phase stack generalized model named AntiPhishStack, designed to detect phishing sites. The model leverages the learning of URLs and character-level TF-IDF features symmetrically, enhancing its ability to combat emerging phishing threats. In Phase I, features are trained on a base machine learning classifier, employing K-fold cross-validation for robust mean prediction. Phase II employs a two-layered stacked-based LSTM network with five adaptive optimizers…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Sentiment Analysis and Opinion Mining · Text and Document Classification Technologies

MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Balanced Selection