Cooperative Multi-Agent Reinforcement Learning Framework for Scalping Trading
Uk Jo, Taehyun Jo, Wanjun Kim, Iljoo Yoon, Dongseok Lee, Seungho Lee

TL;DR
This paper introduces a novel deep reinforcement learning framework with a custom trading gym, combining supervised learning and RL to mimic scalpers using order book and settlement data, aiming to outperform human traders.
Contribution
The paper proposes a new RL framework with a specialized trading gym and hybrid algorithms, utilizing spatio-temporal CNNs for financial data, and a multi-agent structure for scalping trading.
Findings
Framework successfully mimics scalper behavior
Preliminary experiments show promising results
Potential to surpass human trading performance
Abstract
We explore deep Reinforcement Learning(RL) algorithms for scalping trading and knew that there is no appropriate trading gym and agent examples. Thus we propose gym and agent like Open AI gym in finance. Not only that, we introduce new RL framework based on our hybrid algorithm which leverages between supervised learning and RL algorithm and uses meaningful observations such order book and settlement data from experience watching scalpers trading. That is very crucial information for traders behavior to be decided. To feed these data into our model, we use spatio-temporal convolution layer, called Conv3D for order book data and temporal CNN, called Conv1D for settlement data. Those are preprocessed by episode filter we developed. Agent consists of four sub agents divided to clarify their own goal to make best decision. Also, we adopted value and policy based algorithm to our framework.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Financial Markets and Investment Strategies · Complex Systems and Time Series Analysis
MethodsConvolution
