A Transferable and Automatic Tuning of Deep Reinforcement Learning for Cost Effective Phishing Detection
Orel Lavie, Asaf Shabtai, Gilad Katz

TL;DR
This paper introduces a method for automatically tuning and transferring deep reinforcement learning policies for cost-effective phishing detection, addressing challenges in configuration, adaptability, and robustness against attacks.
Contribution
It proposes a transferable and automatic tuning approach for DRL policies, improving performance calibration and robustness in phishing detection tasks.
Findings
Effective transfer of security policies across datasets
Enhanced robustness against adversarial attacks
Improved calibration of DRL-based detection performance
Abstract
Many challenging real-world problems require the deployment of ensembles multiple complementary learning models to reach acceptable performance levels. While effective, applying the entire ensemble to every sample is costly and often unnecessary. Deep Reinforcement Learning (DRL) offers a cost-effective alternative, where detectors are dynamically chosen based on the output of their predecessors, with their usefulness weighted against their computational cost. Despite their potential, DRL-based solutions are not widely used in this capacity, partly due to the difficulties in configuring the reward function for each new task, the unpredictable reactions of the DRL agent to changes in the data, and the inability to use common performance metrics (e.g., TPR/FPR) to guide the algorithm's performance. In this study we propose methods for fine-tuning and calibrating DRL-based policies so that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Adversarial Robustness in Machine Learning · Hate Speech and Cyberbullying Detection
