Autonomous CRM Control via CLV Approximation with Deep Reinforcement   Learning in Discrete and Continuous Action Space

Yegor Tkachenko

arXiv:1504.01840·cs.LG·April 9, 2015·23 cites

Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space

Yegor Tkachenko

PDF

Open Access

TL;DR

This paper presents a deep reinforcement learning framework for autonomous CRM control that estimates customer lifetime value and optimizes marketing actions in both discrete and continuous spaces.

Contribution

It introduces a novel approach combining CLV approximation with deep Q-learning for personalized marketing decision-making.

Findings

01

Effective CLV estimation from client state space.

02

Optimal marketing actions derived for individual clients.

03

Validated on KDD Cup 1998 dataset with promising results.

Abstract

The paper outlines a framework for autonomous control of a CRM (customer relationship management) system. First, it explores how a modified version of the widely accepted Recency-Frequency-Monetary Value system of metrics can be used to define the state space of clients or donors. Second, it describes a procedure to determine the optimal direct marketing action in discrete and continuous action space for the given individual, based on his position in the state space. The procedure involves the use of model-free Q-learning to train a deep neural network that relates a client's position in the state space to rewards associated with possible marketing actions. The estimated value function over the client state space can be interpreted as customer lifetime value, and thus allows for a quick plug-in estimation of CLV for a given client. Experimental results are presented, based on KDD Cup…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCustomer churn and segmentation

MethodsQ-Learning