Reinforcement Learning for Stock Transactions

Ziyi Zhou; Nicholas Stern; Julien Laasri

arXiv:2505.16099·cs.LG·May 27, 2025

Reinforcement Learning for Stock Transactions

Ziyi Zhou, Nicholas Stern, Julien Laasri

PDF

Open Access

TL;DR

This paper explores applying reinforcement learning techniques, including Q-Learning and deep Q-Learning, to optimize stock trading decisions by modeling the problem as a Markov Decision Process and comparing different agents' performance.

Contribution

It introduces a novel RL-based framework for stock trading that defines a custom MDP and compares multiple RL agents and machine learning models for profit maximization.

Findings

01

RL agents can learn effective trading policies

02

Deep Q-Learning outperforms simpler RL methods

03

Machine learning models provide complementary price predictions

Abstract

Much research has been done to analyze the stock market. After all, if one can determine a pattern in the chaotic frenzy of transactions, then they could make a hefty profit from capitalizing on these insights. As such, the goal of our project was to apply reinforcement learning (RL) to determine the best time to buy a stock within a given time frame. With only a few adjustments, our model can be extended to identify the best time to sell a stock as well. In order to use the format of free, real-world data to train the model, we define our own Markov Decision Process (MDP) problem. These two papers [5] [6] helped us in formulating the state space and the reward system of our MDP problem. We train a series of agents using Q-Learning, Q-Learning with linear function approximation, and deep Q-Learning. In addition, we try to predict the stock prices using machine learning regression and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStock Market Forecasting Methods

MethodsQ-Learning