Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models
Yanzhao Zhang, Mingxin Li, Dingkun Long, Xin Zhang, Huan Lin, Baosong Yang, Pengjun Xie, An Yang, Dayiheng Liu, Junyang Lin, Fei Huang, Jingren Zhou

TL;DR
The paper introduces the Qwen3 Embedding series, a new set of models built on Qwen3 foundation models that significantly improve text embedding and reranking across multiple languages and tasks, achieving state-of-the-art results.
Contribution
It presents a novel multi-stage training pipeline, model merging strategies, and diverse model sizes, advancing text embedding and reranking capabilities with robust multilingual performance.
Findings
Achieves state-of-the-art results on MTEB benchmark.
Excels in multilingual, code, and cross-lingual retrieval tasks.
Models are publicly available for community use.
Abstract
In this work, we introduce the Qwen3 Embedding series, a significant advancement over its predecessor, the GTE-Qwen series, in text embedding and reranking capabilities, built upon the Qwen3 foundation models. Leveraging the Qwen3 LLMs' robust capabilities in multilingual text understanding and generation, our innovative multi-stage training pipeline combines large-scale unsupervised pre-training with supervised fine-tuning on high-quality datasets. Effective model merging strategies further ensure the robustness and adaptability of the Qwen3 Embedding series. During the training process, the Qwen3 LLMs serve not only as backbone models but also play a crucial role in synthesizing high-quality, rich, and diverse training data across multiple domains and languages, thus enhancing the training pipeline. The Qwen3 Embedding series offers a spectrum of model sizes (0.6B, 4B, 8B) for both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Qwen/Qwen3-Embedding-0.6Bmodel· 5.6M dl· ♡ 9555.6M dl♡ 955
- 🤗Qwen/Qwen3-Embedding-8Bmodel· 1.6M dl· ♡ 6321.6M dl♡ 632
- 🤗Qwen/Qwen3-Embedding-4Bmodel· 1.6M dl· ♡ 2411.6M dl♡ 241
- 🤗Qwen/Qwen3-Reranker-8Bmodel· 190k dl· ♡ 216190k dl♡ 216
- 🤗Qwen/Qwen3-Reranker-0.6Bmodel· 906k dl· ♡ 326906k dl♡ 326
- 🤗Qwen/Qwen3-Embedding-0.6B-GGUFmodel· 31k dl· ♡ 50831k dl♡ 508
- 🤗alexliap/Qwen3-Embedding-8B-NVFP4model· 1.0k dl· ♡ 11.0k dl♡ 1
- 🤗Qwen/Qwen3-Reranker-4Bmodel· 675k dl· ♡ 121675k dl♡ 121
- 🤗Qwen/Qwen3-Embedding-8B-GGUFmodel· 29k dl· ♡ 10829k dl♡ 108
- 🤗Qwen/Qwen3-Embedding-4B-GGUFmodel· 16k dl· ♡ 9316k dl♡ 93
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Natural Language Processing Techniques
