Cryptocurrency Portfolio Management with Reinforcement Learning: Soft Actor--Critic and Deep Deterministic Policy Gradient Algorithms
Kamal Paykan (Department of Mathematics, Tafresh University, Tafresh, Iran)

TL;DR
This paper introduces a reinforcement learning framework using SAC and DDPG algorithms for cryptocurrency portfolio management, demonstrating improved performance and robustness over traditional methods in volatile markets.
Contribution
It applies advanced deep reinforcement learning algorithms to cryptocurrency portfolio management, showing their effectiveness and robustness in highly volatile market conditions.
Findings
SAC outperforms DDPG in stability and robustness.
Reinforcement learning agents outperform baseline strategies.
Deep RL methods adapt well to volatile cryptocurrency markets.
Abstract
This paper proposes a reinforcement learning--based framework for cryptocurrency portfolio management using the Soft Actor--Critic (SAC) and Deep Deterministic Policy Gradient (DDPG) algorithms. Traditional portfolio optimization methods often struggle to adapt to the highly volatile and nonlinear dynamics of cryptocurrency markets. To address this, we design an agent that learns continuous trading actions directly from historical market data through interaction with a simulated trading environment. The agent optimizes portfolio weights to maximize cumulative returns while minimizing downside risk and transaction costs. Experimental evaluations on multiple cryptocurrencies demonstrate that the SAC and DDPG agents outperform baseline strategies such as equal-weighted and mean--variance portfolios. The SAC algorithm, with its entropy-regularized objective, shows greater stability and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Advanced Bandit Algorithms Research · Financial Markets and Investment Strategies
