Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow
Tian Guo, Emmanuel Hauptmann

TL;DR
This paper investigates fine-tuning large language models for stock return prediction using financial news, comparing different model architectures and integration methods, and demonstrating improved portfolio performance over traditional sentiment analysis.
Contribution
It introduces a comprehensive comparison of encoder-only and decoder-only LLMs for stock prediction and evaluates their integration methods, showing their effectiveness in financial forecasting.
Findings
Aggregated token-level embeddings improve return prediction accuracy.
Decoder LLMs outperform in large investment universes.
LLM-derived signals outperform traditional sentiment scores.
Abstract
Large language models (LLMs) and their fine-tuning techniques have demonstrated superior performance in various language understanding and generation tasks. This paper explores fine-tuning LLMs for stock return forecasting with financial newsflow. In quantitative investing, return forecasting is fundamental for subsequent tasks like stock picking, portfolio optimization, etc. We formulate the model to include text representation and forecasting modules. We propose to compare the encoder-only and decoder-only LLMs, considering they generate text representations in distinct ways. The impact of these different representations on forecasting performance remains an open question. Meanwhile, we compare two simple methods of integrating LLMs' token-level representations into the forecasting module. The experiments on real news and investment universes reveal that: (1) aggregated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods
