Fine-tuning of lightweight large language models for sentiment classification on heterogeneous financial textual data
Alvaro Paredes Amorin, Andre Python, Christoph Weisser

TL;DR
This paper evaluates lightweight open-source large language models for financial sentiment analysis across diverse datasets, showing they can perform competitively with limited data and resources, offering a cost-effective alternative to larger models.
Contribution
It demonstrates that lightweight open-source LLMs can effectively generalize financial sentiment analysis across heterogeneous data sources with minimal training data.
Findings
Qwen3 8B and Llama3 8B perform best in most scenarios.
Models achieve good results with only 5% of training data.
Lightweight LLMs are a cost-effective alternative for financial NLP.
Abstract
Large language models (LLMs) play an increasingly important role in financial markets analysis by capturing signals from complex and heterogeneous textual data sources, such as tweets, news articles, reports, and microblogs. However, their performance is dependent on large computational resources and proprietary datasets, which are costly, restricted, and therefore inaccessible to many researchers and practitioners. To reflect realistic situations we investigate the ability of lightweight open-source LLMs -- smaller and publicly available models designed to operate with limited computational resources -- to generalize sentiment understanding from financial datasets of varying sizes, sources, formats, and languages. We compare the benchmark finance natural language processing (NLP) model, FinBERT, and three open-source lightweight LLMs, DeepSeek-LLM 7B, Llama3 8B Instruct, and Qwen3 8B…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Sentiment Analysis and Opinion Mining · Explainable Artificial Intelligence (XAI)
