FANAL -- Financial Activity News Alerting Language Modeling Framework
Urjitkumar Patel, Fang-Chun Yeh, Chinmay Gondhalekar, Hari Nalluri

TL;DR
FANAL is a specialized BERT-based framework designed for real-time financial news detection and categorization, outperforming large language models in accuracy and cost efficiency.
Contribution
The paper introduces FANAL and ORBERT, novel fine-tuning techniques for financial event detection, achieving superior performance over existing models.
Findings
FANAL outperforms GPT-4o, Llama-3.1 8B, and Phi-3 in accuracy.
FANAL is more cost-efficient than large language models.
ORBERT improves class-wise probability calibration.
Abstract
In the rapidly evolving financial sector, the accurate and timely interpretation of market news is essential for stakeholders needing to navigate unpredictable events. This paper introduces FANAL (Financial Activity News Alerting Language Modeling Framework), a specialized BERT-based framework engineered for real-time financial event detection and analysis, categorizing news into twelve distinct financial categories. FANAL leverages silver-labeled data processed through XGBoost and employs advanced fine-tuning techniques, alongside ORBERT (Odds Ratio BERT), a novel variant of BERT fine-tuned with ORPO (Odds Ratio Preference Optimization) for superior class-wise probability calibration and alignment with financial event relevance. We evaluate FANAL's performance against leading large language models, including GPT-4o, Llama-3.1 8B, and Phi-3, demonstrating its superior accuracy and cost…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence · Data Quality and Management · Business Process Modeling and Analysis
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Softmax · Dropout · Dense Connections · Layer Normalization · Linear Layer · Multi-Head Attention · Weight Decay · Linear Warmup With Linear Decay · WordPiece
