FinBERT-QA: Financial Question Answering with pre-trained BERT Language Models
Bithiah Yuan

TL;DR
FinBERT-QA is a financial question answering system that leverages pre-trained BERT models to improve answer relevance and retrieval accuracy in financial data, outperforming previous methods on key metrics.
Contribution
The paper introduces a novel financial QA system combining BM25 retrieval with BERT-based re-ranking, and demonstrates significant performance improvements on financial datasets.
Findings
Achieved 16% improvement in MRR
Achieved 17% improvement in NDCG
Achieved 21% improvement in Precision@1
Abstract
Motivated by the emerging demand in the financial industry for the automatic analysis of unstructured and structured data at scale, Question Answering (QA) systems can provide lucrative and competitive advantages to companies by facilitating the decision making of financial advisers. Consequently, we propose a novel financial QA system using the transformer-based pre-trained BERT language model to address the limitations of data scarcity and language specificity in the financial domain. Our system focuses on financial non-factoid answer selection, which retrieves a set of passage-level texts and selects the most relevant as the answer. To increase efficiency, we formulate the answer selection task as a re-ranking problem, in which our system consists of an Answer Retriever using BM25, a simple information retrieval approach, to first return a list of candidate answers, and an Answer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Advanced Text Analysis Techniques · Topic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Warmup With Linear Decay · Dropout · Layer Normalization · Attention Dropout · Softmax · Residual Connection · WordPiece · Linear Layer
