FinDPO: Financial Sentiment Analysis for Algorithmic Trading through Preference Optimization of LLMs
Giorgos Iacovides, Wuyang Zhou, Danilo Mandic

TL;DR
FinDPO introduces a finance-specific LLM framework that leverages preference optimization for improved sentiment analysis, outperforming existing models and enabling effective integration into trading strategies with significant returns.
Contribution
The paper presents FinDPO, a novel preference-based LLM training method tailored for financial sentiment analysis, enhancing generalization and practical trading application.
Findings
Achieves 11% higher accuracy on sentiment benchmarks.
Maintains 67% annual returns in simulated trading.
Attains a Sharpe ratio of 2.0 under realistic costs.
Abstract
Opinions expressed in online finance-related textual data are having an increasingly profound impact on trading decisions and market movements. This trend highlights the vital role of sentiment analysis as a tool for quantifying the nature and strength of such opinions. With the rapid development of Generative AI (GenAI), supervised fine-tuned (SFT) large language models (LLMs) have become the de facto standard for financial sentiment analysis. However, the SFT paradigm can lead to memorization of the training data and often fails to generalize to unseen samples. This is a critical limitation in financial domains, where models must adapt to previously unobserved events and the nuanced, domain-specific language of finance. To this end, we introduce FinDPO, the first finance-specific LLM framework based on post-training human preference alignment via Direct Preference Optimization (DPO).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods
