Enhancing TinyBERT for Financial Sentiment Analysis Using GPT-Augmented   FinBERT Distillation

Graison Jos Thomas

arXiv:2409.18999·cs.CL·October 1, 2024·2 cites

Enhancing TinyBERT for Financial Sentiment Analysis Using GPT-Augmented FinBERT Distillation

Graison Jos Thomas

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel approach to improve a small, efficient financial sentiment analysis model by leveraging GPT-4 generated data to distill knowledge from a larger FinBERT model, achieving high accuracy with reduced size.

Contribution

The study presents a two-tiered knowledge distillation method using GPT-4 augmented data to enhance TinyFinBERT's performance for financial sentiment analysis.

Findings

01

TinyFinBERT achieves comparable accuracy to FinBERT

02

GPT-4 generated synthetic data improves model training

03

Distillation reduces model size while maintaining performance

Abstract

In the rapidly evolving field of financial sentiment analysis, the efficiency and accuracy of predictive models are critical due to their significant impact on financial markets. Transformer based models like BERT and large language models (LLMs) like GPT-4, have advanced NLP tasks considerably. Despite their advantages, BERT-based models face challenges with computational intensity in edge computing environments, and the substantial size and compute requirements of LLMs limit their practical deployment. This study proposes leveraging the generative capabilities of LLMs, such as GPT-4 Omni, to create synthetic, domain-specific training data. This approach addresses the challenge of data scarcity and enhances the performance of smaller models by making them competitive with their larger counterparts. The research specifically aims to enhance FinBERT, a BERT model fine-tuned for financial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

graison-thomas/TinyFinBERT
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStock Market Forecasting Methods

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Label Smoothing · Byte Pair Encoding · Linear Warmup With Cosine Annealing · Linear Layer · {Dispute@FaQ-s}How to file a dispute with Expedia? · Dropout · Absolute Position Encodings