Deep Reinforcement Learning for Phishing Detection with Transformer-Based Semantic Features

Aseer Al Faisal

arXiv:2512.06925·cs.LG·December 9, 2025

Deep Reinforcement Learning for Phishing Detection with Transformer-Based Semantic Features

Aseer Al Faisal

PDF

Open Access

TL;DR

This paper introduces a hybrid deep reinforcement learning model combining semantic and lexical features for highly accurate and robust phishing detection, demonstrating significant improvements over traditional methods.

Contribution

It proposes a novel QR-DQN framework integrating RoBERTa embeddings with lexical features for phishing detection, enhancing stability and generalization.

Findings

01

Achieved 99.86% accuracy on test data.

02

Reduced generalization gap from 1.66% to 0.04%.

03

Model demonstrated high robustness and reliability.

Abstract

Phishing is a cybercrime in which individuals are deceived into revealing personal information, often resulting in financial loss. These attacks commonly occur through fraudulent messages, misleading advertisements, and compromised legitimate websites. This study proposes a Quantile Regression Deep Q-Network (QR-DQN) approach that integrates RoBERTa semantic embeddings with handcrafted lexical features to enhance phishing detection while accounting for uncertainties. Unlike traditional DQN methods that estimate single scalar Q-values, QR-DQN leverages quantile regression to model the distribution of returns, improving stability and generalization on unseen phishing data. A diverse dataset of 105,000 URLs was curated from PhishTank, OpenPhish, Cloudflare, and other sources, and the model was evaluated using an 80/20 train-test split. The QR-DQN framework achieved a test accuracy of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Cybercrime and Law Enforcement Studies