Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
Dan Saattrup Nielsen, Kenneth Enevoldsen, Peter Schneider-Kamp

TL;DR
This study compares encoder and decoder language models on multilingual NLU tasks, revealing encoder models outperform decoders despite fewer parameters, and offers new evaluation methods for decoder models in this context.
Contribution
It extends the ScandEval benchmark to include decoder models and introduces a novel evaluation method for their performance on NLU tasks across multiple languages.
Findings
Encoder models outperform decoder models in NLU tasks.
Decoder models require more parameters to achieve similar performance.
UMAP analysis highlights distinct capabilities of encoder and decoder models.
Abstract
This paper explores the performance of encoder and decoder language models on multilingual Natural Language Understanding (NLU) tasks, with a broad focus on Germanic languages. Building upon the ScandEval benchmark, initially restricted to evaluating encoder models, we extend the evaluation framework to include decoder models. We introduce a method for evaluating decoder models on NLU tasks and apply it to the languages Danish, Swedish, Norwegian, Icelandic, Faroese, German, Dutch, and English. Through a series of experiments and analyses, we also address research questions regarding the comparative performance of encoder and decoder models, the impact of NLU task types, and the variation across language resources. Our findings reveal that encoder models can achieve significantly better NLU performance than decoder models despite having orders of magnitude fewer parameters.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
MethodsFocus
