Bitcoin Transaction Strategy Construction Based on Deep Reinforcement Learning
Fengrui Liu, Yang Li, Baitong Li, Jiaxin Li, Huiyang Xie

TL;DR
This paper develops a deep reinforcement learning framework using PPO and LSTM for automatic high-frequency bitcoin trading, outperforming benchmarks and demonstrating potential for cryptocurrency asset management.
Contribution
It introduces a novel integration of PPO with LSTM for constructing automatic trading strategies in cryptocurrency markets, addressing high volatility and real-time decision-making.
Findings
LSTM outperforms other models in price prediction accuracy.
Proposed strategy yields 31.67% higher returns than benchmarks.
Framework effectively handles high-frequency trading in volatile markets.
Abstract
The emerging cryptocurrency market has lately received great attention for asset allocation due to its decentralization uniqueness. However, its volatility and brand new trading mode have made it challenging to devising an acceptable automatically-generating strategy. This study proposes a framework for automatic high-frequency bitcoin transactions based on a deep reinforcement learning algorithm-proximal policy optimization (PPO). The framework creatively regards the transaction process as actions, returns as awards and prices as states to align with the idea of reinforcement learning. It compares advanced machine learning-based models for static price predictions including support vector machine (SVM), multi-layer perceptron (MLP), long short-term memory (LSTM), temporal convolutional network (TCN), and Transformer by applying them to the real-time bitcoin price and the experimental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Label Smoothing · Softmax · Sigmoid Activation · Dropout · Absolute Position Encodings · Layer Normalization · Position-Wise Feed-Forward Layer
