LabelFusion: Fusing Large Language Models with Transformer Encoders for Robust Financial News Classification
Michael Schlee, Christoph Weisser, Timo Kivim\"aki, Melchizedek Mashiku, Benjamin Saefken

TL;DR
This paper introduces LabelFusion, a hybrid model combining large language models and transformer encoders for robust financial news classification, demonstrating superior performance with ample data and competitive results in low-data scenarios.
Contribution
The paper proposes LabelFusion, a novel hybrid architecture that fuses LLM prompts with transformer embeddings, improving classification accuracy in financial news analysis.
Findings
LabelFusion achieves 96.0% macro F1 on full data.
LLMs perform well even in zero-shot settings.
LabelFusion outperforms standalone models with sufficient data.
Abstract
Financial news plays a central role in shaping investor sentiment and short-term dynamics in commodity markets. Many downstream financial applications, such as commodity price prediction or sentiment modeling, therefore rely on the ability to automatically identify news articles relevant to specific assets. However, obtaining large labeled corpora for financial text classification is costly, and transformer-based classifiers such as RoBERTa often degrade significantly in low-data regimes. Our results show that appropriately prompted out-of-the-box Large Language Models (LLMs) achieve strong performance even in such settings. Furthermore, we propose LabelFusion, a hybrid architecture that combines the output of a prompt-engineered LLM with contextual embeddings produced by a fine-tuned RoBERTa encoder through a lightweight Multilayer Perceptron (MLP) voting layer. Evaluated on a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Topic Modeling · Sentiment Analysis and Opinion Mining
