Offline Deep Reinforcement Learning for Dynamic Pricing of Consumer   Credit

Raad Khraishi; Ramin Okhrati

arXiv:2203.03003·cs.LG·March 8, 2022

Offline Deep Reinforcement Learning for Dynamic Pricing of Consumer Credit

Raad Khraishi, Ramin Okhrati

PDF

Open Access

TL;DR

This paper presents an offline deep reinforcement learning method for personalized consumer credit pricing, leveraging static data and conservative Q-Learning to optimize prices without online testing.

Contribution

It introduces a novel application of offline deep reinforcement learning, specifically conservative Q-Learning, for dynamic credit pricing without requiring online interaction.

Findings

01

Effective personalized pricing policy learned from static data

02

No need for online price experimentation

03

Works on both real and synthetic datasets

Abstract

We introduce a method for pricing consumer credit using recent advances in offline deep reinforcement learning. This approach relies on a static dataset and requires no assumptions on the functional form of demand. Using both real and synthetic data on consumer credit applications, we demonstrate that our approach using the conservative Q-Learning algorithm is capable of learning an effective personalized pricing policy without any online interaction or price experimentation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFinancial Literacy, Pension, Retirement Analysis · Smart Grid Energy Management

MethodsQ-Learning