FinBERT2: A Specialized Bidirectional Encoder for Bridging the Gap in Finance-Specific Deployment of Large Language Models
Xuan Xu, Fufang Wen, Beilin Chu, Zhibing Fu, Qinhong Lin, Jiaqi Liu, Binjie Fei, Yu Li, Linna Zhou, and Zhongliang Yang

TL;DR
FinBERT2 is a large, finance-specific bidirectional encoder that significantly improves financial NLP tasks, including classification, retrieval, and topic modeling, outperforming general models and previous BERT variants.
Contribution
The paper introduces FinBERT2, the largest Chinese financial pretraining corpus and a specialized encoder that bridges the gap between BERT and large language models in finance applications.
Findings
FinBERT2-based models outperform other BERT variants on classification tasks.
Contrastive fine-tuned models surpass open-source and proprietary embedders in retrieval tasks.
FinBERT2 enables superior financial topic clustering and representation.
Abstract
In natural language processing (NLP), the focus has shifted from encoder-only tiny language models like BERT to decoder-only large language models(LLMs) such as GPT-3. However, LLMs' practical application in the financial sector has revealed three limitations: (1) LLMs often perform worse than fine-tuned BERT on discriminative tasks despite costing much higher computational resources, such as market sentiment analysis in financial reports; (2) Application on generative tasks heavily relies on retrieval augmented generation (RAG) methods to provide current and specialized information, with general retrievers showing suboptimal performance on domain-specific retrieval tasks; (3) There are additional inadequacies in other feature-based scenarios, such as topic modeling. We introduce FinBERT2, a specialized bidirectional encoder pretrained on a high-quality, financial-specific corpus of 32b…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Machine Learning in Healthcare · Sentiment Analysis and Opinion Mining
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Attention Is All You Need · WordPiece · Cosine Annealing · Multi-Head Attention · {Dispute@FaQ-s}How to file a dispute with Expedia? · Dropout · Dense Connections · 15 Ways to Contact How can i speak to someone at Delta Airlines
