LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain
Joel Niklaus, Veton Matoshi, Pooja Rani, Andrea Galassi, Matthias, St\"urmer, Ilias Chalkidis

TL;DR
LEXTREME is a comprehensive multilingual and multi-task benchmark for legal NLP, designed to evaluate models across diverse languages and datasets, highlighting the field's challenges and progress.
Contribution
The paper introduces LEXTREME, the first multilingual and multi-task legal NLP benchmark, with aggregate scoring and open resources for evaluation.
Findings
Best baseline (XLM-R large) scores 61.3 on aggregate metrics.
Benchmark remains challenging with significant room for improvement.
Provides a diverse, multilingual dataset covering 24 languages.
Abstract
Lately, propelled by the phenomenal advances around the transformer architecture, the legal NLP field has enjoyed spectacular growth. To measure progress, well curated and challenging benchmarks are crucial. However, most benchmarks are English only and in legal NLP specifically there is no multilingual benchmark available yet. Additionally, many benchmarks are saturated, with the best models clearly outperforming the best humans and achieving near perfect scores. We survey the legal NLP literature and select 11 datasets covering 24 languages, creating LEXTREME. To provide a fair comparison, we propose two aggregate scores, one based on the datasets and one on the languages. The best baseline (XLM-R large) achieves both a dataset aggregate score a language aggregate score of 61.3. This indicates that LEXTREME is still very challenging and leaves ample room for improvement. To make it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗joelniklaus/legal-swiss-roberta-basemodel· 118 dl118 dl
- 🤗joelniklaus/legal-xlm-roberta-largemodel· 394 dl· ♡ 5394 dl♡ 5
- 🤗joelniklaus/legal-xlm-roberta-basemodel· 216 dl· ♡ 3216 dl♡ 3
- 🤗joelniklaus/legal-swiss-roberta-largemodel· 116 dl· ♡ 1116 dl♡ 1
- 🤗joelniklaus/legal-english-roberta-largemodel· 2 dl· ♡ 12 dl♡ 1
- 🤗joelniklaus/legal-english-roberta-basemodel· 14 dl14 dl
- 🤗joelniklaus/legal-portuguese-roberta-basemodel· 10 dl· ♡ 210 dl♡ 2
- 🤗joelniklaus/legal-english-longformer-basemodel· ♡ 2♡ 2
- 🤗joelniklaus/legal-swiss-longformer-basemodel· 119 dl· ♡ 2119 dl♡ 2
- 🤗joelniklaus/legal-xlm-longformer-basemodel· 1.0k dl· ♡ 41.0k dl♡ 4
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Artificial Intelligence in Law
