Pretrained LLM Adapted with LoRA as a Decision Transformer for Offline   RL in Quantitative Trading

Suyeol Yun

arXiv:2411.17900·q-fin.CP·November 28, 2024

Pretrained LLM Adapted with LoRA as a Decision Transformer for Offline RL in Quantitative Trading

Suyeol Yun

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel offline reinforcement learning approach for quantitative trading by fine-tuning a pre-trained GPT-2 model with LoRA within a Decision Transformer framework, achieving competitive results with existing methods.

Contribution

It combines pre-trained language models and LoRA for efficient offline RL in trading, addressing temporal dependencies and overfitting issues.

Findings

01

Model learns effectively from expert trajectories.

02

Achieves superior rewards in certain trading scenarios.

03

Performs competitively with established offline RL algorithms.

Abstract

Developing effective quantitative trading strategies using reinforcement learning (RL) is challenging due to the high risks associated with online interaction with live financial markets. Consequently, offline RL, which leverages historical market data without additional exploration, becomes essential. However, existing offline RL methods often struggle to capture the complex temporal dependencies inherent in financial time series and may overfit to historical patterns. To address these challenges, we introduce a Decision Transformer (DT) initialized with pre-trained GPT-2 weights and fine-tuned using Low-Rank Adaptation (LoRA). This architecture leverages the generalization capabilities of pre-trained language models and the efficiency of LoRA to learn effective trading policies from expert trajectories solely from historical data. Our model performs competitively with established…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

syyunn/finrl-dt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStock Market Forecasting Methods

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Dense Connections · Label Smoothing · Dropout · Discriminative Fine-Tuning · Linear Layer · Cosine Annealing · Attention Dropout · Layer Normalization