MTEB: Massive Text Embedding Benchmark
Niklas Muennighoff, Nouamane Tazi, Lo\"ic Magne, Nils Reimers

TL;DR
MTEB is a comprehensive benchmark that evaluates 33 text embedding models across 8 tasks and 58 datasets, revealing no single method excels universally, highlighting the need for further research.
Contribution
Introduces the largest and most diverse text embedding benchmark (MTEB) covering multiple tasks, datasets, and languages, with open-source tools and a public leaderboard.
Findings
No embedding method dominates across all tasks
The field has not converged on a universal embedding method
Benchmark covers 58 datasets and 112 languages
Abstract
Text embeddings are commonly evaluated on a small set of datasets from a single task not covering their possible applications to other tasks. It is unclear whether state-of-the-art embeddings on semantic textual similarity (STS) can be equally well applied to other tasks like clustering or reranking. This makes progress in the field difficult to track, as various models are constantly being proposed without proper evaluation. To solve this problem, we introduce the Massive Text Embedding Benchmark (MTEB). MTEB spans 8 embedding tasks covering a total of 58 datasets and 112 languages. Through the benchmarking of 33 models on MTEB, we establish the most comprehensive benchmark of text embeddings to date. We find that no particular text embedding method dominates across all tasks. This suggests that the field has yet to converge on a universal text embedding method and scale it up…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗intfloat/multilingual-e5-largemodel· 4.5M dl· ♡ 11664.5M dl♡ 1166
- 🤗intfloat/multilingual-e5-smallmodel· 3.9M dl· ♡ 2963.9M dl♡ 296
- 🤗intfloat/multilingual-e5-basemodel· 2.5M dl· ♡ 3442.5M dl♡ 344
- 🤗intfloat/e5-base-v2model· 1.6M dl· ♡ 1541.6M dl♡ 154
- 🤗intfloat/multilingual-e5-large-instructmodel· 1.3M dl· ♡ 6091.3M dl♡ 609
- 🤗intfloat/e5-smallmodel· 105k dl· ♡ 44105k dl♡ 44
- 🤗intfloat/e5-basemodel· 118k dl· ♡ 25118k dl♡ 25
- 🤗intfloat/e5-largemodel· 19k dl· ♡ 8019k dl♡ 80
- 🤗intfloat/e5-small-unsupervisedmodel· 285 dl285 dl
- 🤗intfloat/e5-base-unsupervisedmodel· 434 dl· ♡ 2434 dl♡ 2
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Natural Language Processing Techniques
