Fietje: An open, efficient LLM for Dutch
Bram Vanroy

TL;DR
Fietje is an open-source, efficient Dutch language model based on Phi 2, demonstrating competitive performance and emphasizing transparency, with rapid progress in Dutch NLP capabilities through small models.
Contribution
The paper introduces Fietje, a fully open-source Dutch LLM based on Phi 2, showcasing competitive results and emphasizing transparency and reproducibility.
Findings
Fietje achieves competitive results with larger models.
Small models now outperform older, larger Dutch models.
Open-source approach promotes accessibility and reproducibility.
Abstract
This paper introduces Fietje, a family of small language models (SLMs) specifically designed for the Dutch language. The model is based on Phi 2, an English-centric model of 2.7 billion parameters. Fietje demonstrated competitive results with larger language models upon its release. A core emphasis of this work is transparency and reproducibility: Fietje is fully open-source, with model weights, datasets, training, and evaluation code all publicly accessible. The paper discusses the performance of Fietje and many other models on an extensive evaluation suite of benchmarks on reasoning, sentiment analysis, world knowledge, linguistic acceptability and word sense disambiguation. Evaluation results illustrate the rapid progress in the field of LLMs, where recent small models outperform older, larger models that were fine-tuned for Dutch. This trend signals an exciting future for Dutch…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗BramVanroy/fietje-2model· 48 dl· ♡ 1148 dl♡ 11
- 🤗BramVanroy/fietje-2-instructmodel· 183 dl· ♡ 6183 dl♡ 6
- 🤗BramVanroy/fietje-2-chatmodel· 141 dl· ♡ 7141 dl♡ 7
- 🤗RichardErkhov/BramVanroy_-_fietje-2-chat-4bitsmodel· 1 dl1 dl
- 🤗RichardErkhov/BramVanroy_-_fietje-2-chat-8bitsmodel· 1 dl1 dl
- 🤗RichardErkhov/BramVanroy_-_fietje-2-4bitsmodel
- 🤗RichardErkhov/BramVanroy_-_fietje-2-8bitsmodel
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
