Sales Time Series Analytics Using Deep Q-Learning
Bohdan M. Pavlyshenko

TL;DR
This paper explores the application of deep Q-learning to sales time series analytics, demonstrating its effectiveness in optimizing pricing and supply-demand decisions through environment modeling and reward maximization.
Contribution
It introduces a deep Q-learning framework for sales analytics, combining parametric and historical data models to optimize decision-making in pricing and supply-demand scenarios.
Findings
Deep Q-learning can effectively optimize sales-related decisions.
Environment modeling with historical data aids cold start of the learning process.
The approach shows promise for real-world business applications.
Abstract
The article describes the use of deep Q-learning models in the problems of sales time series analytics. In contrast to supervised machine learning which is a kind of passive learning using historical data, Q-learning is a kind of active learning with goal to maximize a reward by optimal sequence of actions. Model free Q-learning approach for optimal pricing strategies and supply-demand problems was considered in the work. The main idea of the study is to show that using deep Q-learning approach in time series analytics, the sequence of actions can be optimized by maximizing the reward function when the environment for learning agent interaction can be modeled using the parametric model and in the case of using the model which is based on the historical data. In the pricing optimizing case study environment was modeled using sales dependence on extras price and randomly simulated demand.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Forecasting Techniques and Applications
MethodsQ-Learning
