Enhancing TinyBERT for Financial Sentiment Analysis Using GPT-Augmented FinBERT Distillation
Graison Jos Thomas

TL;DR
This paper introduces a novel approach to improve a small, efficient financial sentiment analysis model by leveraging GPT-4 generated data to distill knowledge from a larger FinBERT model, achieving high accuracy with reduced size.
Contribution
The study presents a two-tiered knowledge distillation method using GPT-4 augmented data to enhance TinyFinBERT's performance for financial sentiment analysis.
Findings
TinyFinBERT achieves comparable accuracy to FinBERT
GPT-4 generated synthetic data improves model training
Distillation reduces model size while maintaining performance
Abstract
In the rapidly evolving field of financial sentiment analysis, the efficiency and accuracy of predictive models are critical due to their significant impact on financial markets. Transformer based models like BERT and large language models (LLMs) like GPT-4, have advanced NLP tasks considerably. Despite their advantages, BERT-based models face challenges with computational intensity in edge computing environments, and the substantial size and compute requirements of LLMs limit their practical deployment. This study proposes leveraging the generative capabilities of LLMs, such as GPT-4 Omni, to create synthetic, domain-specific training data. This approach addresses the challenge of data scarcity and enhances the performance of smaller models by making them competitive with their larger counterparts. The research specifically aims to enhance FinBERT, a BERT model fine-tuned for financial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Label Smoothing · Byte Pair Encoding · Linear Warmup With Cosine Annealing · Linear Layer · {Dispute@FaQ-s}How to file a dispute with Expedia? · Dropout · Absolute Position Encodings
