Optimizing Credit Limit Adjustments Under Adversarial Goals Using Reinforcement Learning
Sherly Alfonso-S\'anchez, Jes\'us Solano, Alejandro Correa-Bahnsen,, Kristina P. Sendova, and Cristi\'an Bravo

TL;DR
This paper demonstrates how reinforcement learning can be used to optimize credit limit adjustments in banking, balancing revenue maximization and risk minimization through data-driven policies.
Contribution
It introduces an offline reinforcement learning approach for credit limit decisions, outperforming traditional strategies and providing a novel framework for data-driven banking policies.
Findings
Double Q-learning outperforms other strategies.
Optimal policies are complex and data-driven.
Alternative data does not always improve predictions.
Abstract
Reinforcement learning has been explored for many problems, from video games with deterministic environments to portfolio and operations management in which scenarios are stochastic; however, there have been few attempts to test these methods in banking problems. In this study, we sought to find and automatize an optimal credit card limit adjustment policy by employing reinforcement learning techniques. Because of the historical data available, we considered two possible actions per customer, namely increasing or maintaining an individual's current credit limit. To find this policy, we first formulated this decision-making question as an optimization problem in which the expected profit was maximized; therefore, we balanced two adversarial goals: maximizing the portfolio's revenue and minimizing the portfolio's provisions. Second, given the particularities of our problem, we used an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Distress and Bankruptcy Prediction
MethodsDouble Q-learning · Q-Learning
