Deep Reinforcement Learning for Phishing Detection with Transformer-Based Semantic Features
Aseer Al Faisal

TL;DR
This paper introduces a hybrid deep reinforcement learning model combining semantic and lexical features for highly accurate and robust phishing detection, demonstrating significant improvements over traditional methods.
Contribution
It proposes a novel QR-DQN framework integrating RoBERTa embeddings with lexical features for phishing detection, enhancing stability and generalization.
Findings
Achieved 99.86% accuracy on test data.
Reduced generalization gap from 1.66% to 0.04%.
Model demonstrated high robustness and reliability.
Abstract
Phishing is a cybercrime in which individuals are deceived into revealing personal information, often resulting in financial loss. These attacks commonly occur through fraudulent messages, misleading advertisements, and compromised legitimate websites. This study proposes a Quantile Regression Deep Q-Network (QR-DQN) approach that integrates RoBERTa semantic embeddings with handcrafted lexical features to enhance phishing detection while accounting for uncertainties. Unlike traditional DQN methods that estimate single scalar Q-values, QR-DQN leverages quantile regression to model the distribution of returns, improving stability and generalization on unseen phishing data. A diverse dataset of 105,000 URLs was curated from PhishTank, OpenPhish, Cloudflare, and other sources, and the model was evaluated using an 80/20 train-test split. The QR-DQN framework achieved a test accuracy of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Cybercrime and Law Enforcement Studies
