Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking
Hongda Shen, Eren Kurshan

TL;DR
This paper introduces a Deep Q-Network reinforcement learning approach to adaptively select alert thresholds in retail banking fraud detection systems, improving detection rates and operational efficiency over static methods.
Contribution
It presents a novel adaptive threshold policy using reinforcement learning, addressing limitations of fixed thresholds in fraud alert systems.
Findings
Reduces fraud losses compared to static thresholds
Improves operational efficiency of alert processing
Outperforms traditional fixed threshold methods
Abstract
Machine learning models have widely been used in fraud detection systems. Most of the research and development efforts have been concentrated on improving the performance of the fraud scoring models. Yet, the downstream fraud alert systems still have limited to no model adoption and rely on manual steps. Alert systems are pervasively used across all payment channels in retail banking and play an important role in the overall fraud detection process. Current fraud detection systems end up with large numbers of dropped alerts due to their inability to account for the alert processing capacity. Ideally, alert threshold selection enables the system to maximize the fraud detection while balancing the upstream fraud scores and the available bandwidth of the alert processing teams. However, in practice, fixed thresholds that are used for their simplicity do not have this ability. In this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
