BERT vs GPT for financial engineering
Edward Sharkey, Philip Treleaven

TL;DR
This paper benchmarks Transformer models, showing fine-tuned BERT models outperform GPT models in financial sentiment analysis, highlighting BERT's interpretability and competitive accuracy for financial engineering applications.
Contribution
The study introduces CopBERT, a domain-specific BERT model for financial sentiment analysis, demonstrating its superior performance over GPT models and other BERT variants in financial tasks.
Findings
Fine-tuned BERT outperforms GPT models in sentiment classification.
CopBERT achieves ~10% higher F1-score than GPT4.
BERT offers better interpretability despite lower predictive power.
Abstract
The paper benchmarks several Transformer models [4], to show how these models can judge sentiment from a news event. This signal can then be used for downstream modelling and signal identification for commodity trading. We find that fine-tuned BERT models outperform fine-tuned or vanilla GPT models on this task. Transformer models have revolutionized the field of natural language processing (NLP) in recent years, achieving state-of-the-art results on various tasks such as machine translation, text summarization, question answering, and natural language generation. Among the most prominent transformer models are Bidirectional Encoder Representations from Transformers (BERT) and Generative Pre-trained Transformer (GPT), which differ in their architectures and objectives. A CopBERT model training data and process overview is provided. The CopBERT model outperforms similar domain specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReservoir Engineering and Simulation Methods · Risk and Portfolio Optimization · Scheduling and Optimization Algorithms
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · WordPiece · Linear Warmup With Linear Decay · Label Smoothing · Position-Wise Feed-Forward Layer · Dropout · Dense Connections · Absolute Position Encodings · Softmax
