Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Yanzhao Zhang; Mingxin Li; Dingkun Long; Xin Zhang; Huan Lin; Baosong Yang; Pengjun Xie; An Yang; Dayiheng Liu; Junyang Lin; Fei Huang; Jingren Zhou

arXiv:2506.05176·cs.CL·June 12, 2025·3 cites

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Yanzhao Zhang, Mingxin Li, Dingkun Long, Xin Zhang, Huan Lin, Baosong Yang, Pengjun Xie, An Yang, Dayiheng Liu, Junyang Lin, Fei Huang, Jingren Zhou

PDF

Open Access 1 Repo 10 Models 1 Datasets

TL;DR

The paper introduces the Qwen3 Embedding series, a new set of models built on Qwen3 foundation models that significantly improve text embedding and reranking across multiple languages and tasks, achieving state-of-the-art results.

Contribution

It presents a novel multi-stage training pipeline, model merging strategies, and diverse model sizes, advancing text embedding and reranking capabilities with robust multilingual performance.

Findings

01

Achieves state-of-the-art results on MTEB benchmark.

02

Excels in multilingual, code, and cross-lingual retrieval tasks.

03

Models are publicly available for community use.

Abstract

In this work, we introduce the Qwen3 Embedding series, a significant advancement over its predecessor, the GTE-Qwen series, in text embedding and reranking capabilities, built upon the Qwen3 foundation models. Leveraging the Qwen3 LLMs' robust capabilities in multilingual text understanding and generation, our innovative multi-stage training pipeline combines large-scale unsupervised pre-training with supervised fine-tuning on high-quality datasets. Effective model merging strategies further ensure the robustness and adaptability of the Qwen3 Embedding series. During the training process, the Qwen3 LLMs serve not only as backbone models but also play a crucial role in synthesizing high-quality, rich, and diverse training data across multiple domains and languages, thus enhancing the training pipeline. The Qwen3 Embedding series offers a spectrum of model sizes (0.6B, 4B, 8B) for both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qwenlm/qwen3-embedding
pytorchOfficial

Models

Datasets

SINAI/ALIA-legal-administrative-cqa
dataset· 76 dl
76 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Graph Neural Networks · Natural Language Processing Techniques