FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Gagan Bhatia, El Moatez Billah Nagoudi, Hasan Cavusoglu, Muhammad, Abdul-Mageed

TL;DR
FinTral is a multimodal financial large language model that integrates diverse data types and surpasses existing models like ChatGPT-3.5 and GPT-4 in financial tasks, demonstrating strong zero-shot performance and real-time analysis capabilities.
Contribution
We present FinTral, a novel multimodal LLM tailored for finance, with domain-specific training, extensive benchmarking, and superior zero-shot performance over existing models.
Findings
FinTral outperforms ChatGPT-3.5 in all evaluated tasks.
FinTral surpasses GPT-4 in five out of nine tasks.
FinTral demonstrates strong zero-shot and real-time financial analysis capabilities.
Abstract
We introduce FinTral, a suite of state-of-the-art multimodal large language models (LLMs) built upon the Mistral-7b model and tailored for financial analysis. FinTral integrates textual, numerical, tabular, and image data. We enhance FinTral with domain-specific pretraining, instruction fine-tuning, and RLAIF training by exploiting a large collection of textual and visual datasets we curate for this work. We also introduce an extensive benchmark featuring nine tasks and 25 datasets for evaluation, including hallucinations in the financial domain. Our FinTral model trained with direct preference optimization employing advanced Tools and Retrieval methods, dubbed FinTral-DPO-T&R, demonstrates an exceptional zero-shot performance. It outperforms ChatGPT-3.5 in all tasks and surpasses GPT-4 in five out of nine tasks, marking a significant advancement in AI-driven financial technology. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStock Market Forecasting Methods
MethodsAttention Is All You Need · Reinforcement Learning from AI Feedback · Linear Layer · Dense Connections · Label Smoothing · Adam · Softmax · Multi-Head Attention · Layer Normalization · Dropout
