Learning to Generate Explainable Stock Predictions using Self-Reflective Large Language Models
Kelvin J.L. Koa, Yunshan Ma, Ritchie Ng, Tat-Seng Chua

TL;DR
This paper introduces the SEP framework, enabling large language models to autonomously generate explainable stock predictions by self-reflection and reinforcement learning, outperforming traditional methods without requiring human-annotated explanations.
Contribution
The SEP framework allows LLMs to self-train for explainable stock prediction tasks without human annotations, combining self-reflection and reinforcement learning techniques.
Findings
Outperforms traditional deep learning in stock prediction accuracy.
Generates human-readable explanations for stock movements.
Effective in portfolio construction tasks.
Abstract
Explaining stock predictions is generally a difficult task for traditional non-generative deep learning models, where explanations are limited to visualizing the attention weights on important texts. Today, Large Language Models (LLMs) present a solution to this problem, given their known capabilities to generate human-readable explanations for their decision-making process. However, the task of stock prediction remains challenging for LLMs, as it requires the ability to weigh the varying impacts of chaotic social texts on stock prices. The problem gets progressively harder with the introduction of the explanation component, which requires LLMs to explain verbally why certain factors are more important than the others. On the other hand, to fine-tune LLMs for such a task, one would need expert-annotated samples of explanation for every stock movement in the training set, which is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods
MethodsEntropy Regularization · Proximal Policy Optimization
