FinBloom: Knowledge Grounding Large Language Model with Real-time Financial Data
Ankur Sinha, Chaitanya Agarwal, Pekka Malo

TL;DR
FinBloom introduces a specialized 7B parameter language model fine-tuned on financial data, enabling real-time, context-aware financial query answering and decision-making with reduced latency.
Contribution
We developed FinBloom 7B, a financial domain-specific LLM fine-tuned on extensive financial news and filings, and created a dataset for real-time financial query context understanding.
Findings
Enhanced real-time financial query answering capabilities.
Significant reduction in data retrieval latency.
Improved performance in dynamic financial tasks.
Abstract
Large language models (LLMs) excel at generating human-like responses but often struggle with interactive tasks that require access to real-time information. This limitation poses challenges in finance, where models must access up-to-date information, such as recent news or price movements, to support decision-making. To address this, we introduce Financial Agent, a knowledge-grounding approach for LLMs to handle financial queries using real-time text and tabular data. Our contributions are threefold: First, we develop a Financial Context Dataset of over 50,000 financial queries paired with the required context. Second, we develop FinBloom 7B, a custom 7 billion parameter LLM, by fine-tuning Bloom 7B on 14 million financial news articles from Reuters and Deutsche Presse-Agentur (DPA), alongside a random sample of 25% from 12 million Securities and Exchange Commission (SEC) filings.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
