A Dutch Financial Large Language Model

Sander Noels; Jorne De Blaere; Tijl De Bie

arXiv:2410.12835·cs.CE·October 18, 2024

A Dutch Financial Large Language Model

Sander Noels, Jorne De Blaere, Tijl De Bie

PDF

1 Repo

TL;DR

This paper introduces FinGEITje, a Dutch financial LLM, along with a large instruction dataset and evaluation benchmark, demonstrating superior performance in financial tasks with an open-source, automated approach.

Contribution

It presents the first Dutch financial LLM, a large instruction dataset, and an automated evaluation method, advancing multilingual financial NLP capabilities.

Findings

01

FinGEITje outperforms existing models on five financial tasks.

02

The dataset and evaluation method are openly available for research.

03

Automated evaluation reduces manual effort in performance assessment.

Abstract

This paper presents FinGEITje, the first Dutch financial Large Language Model (LLM) specifically designed and optimized for various financial tasks. Together with the model, we release a specialized Dutch financial instruction tuning dataset with over 140,000 samples, constructed employing an automated translation and data processing method. The open-source data construction method is provided, facilitating the creation of financial instruction datasets in different languages. To evaluate model performance, the study introduces the first Dutch financial evaluation benchmark, along with an automated evaluation method that utilizes an LLM as an independent evaluator, reducing manual intervention in performance evaluation. The experimental results highlight the superior performance of FinGEITje across five critical Dutch and English financial tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

snoels/fingeit
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.