Pretrained LLM Adapted with LoRA as a Decision Transformer for Offline RL in Quantitative Trading
Suyeol Yun

TL;DR
This paper introduces a novel offline reinforcement learning approach for quantitative trading by fine-tuning a pre-trained GPT-2 model with LoRA within a Decision Transformer framework, achieving competitive results with existing methods.
Contribution
It combines pre-trained language models and LoRA for efficient offline RL in trading, addressing temporal dependencies and overfitting issues.
Findings
Model learns effectively from expert trajectories.
Achieves superior rewards in certain trading scenarios.
Performs competitively with established offline RL algorithms.
Abstract
Developing effective quantitative trading strategies using reinforcement learning (RL) is challenging due to the high risks associated with online interaction with live financial markets. Consequently, offline RL, which leverages historical market data without additional exploration, becomes essential. However, existing offline RL methods often struggle to capture the complex temporal dependencies inherent in financial time series and may overfit to historical patterns. To address these challenges, we introduce a Decision Transformer (DT) initialized with pre-trained GPT-2 weights and fine-tuned using Low-Rank Adaptation (LoRA). This architecture leverages the generalization capabilities of pre-trained language models and the efficiency of LoRA to learn effective trading policies from expert trajectories solely from historical data. Our model performs competitively with established…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Dense Connections · Label Smoothing · Dropout · Discriminative Fine-Tuning · Linear Layer · Cosine Annealing · Attention Dropout · Layer Normalization
