Do Reasoning Models Enhance Embedding Models?
Wun Yu Chan, Shaojin Chen, Huihao Jing, Kwun Hang Lau, Elton Chun-Chai Li, Zihao Wang, Haoran Li, Yangqiu Song

TL;DR
This paper investigates whether reasoning-enhanced models improve semantic embeddings and finds that RLVR training does not significantly outperform base models in embedding tasks, due to manifold realignment effects.
Contribution
Introduces HRSA, a framework for analyzing representation similarity, and reveals how RLVR influences latent geometry without improving embedding performance.
Findings
RLVR causes local geometry reorganization but preserves global structure.
Contrastive learning aligns base and reasoning models, leading to manifold realignment.
RLVR optimizes within existing semantic landscapes rather than restructuring them.
Abstract
State-of-the-art embedding models are increasingly derived from decoder-only Large Language Model (LLM) backbones adapted via contrastive learning. Given the emergence of reasoning models trained via Reinforcement Learning with Verifiable Rewards (RLVR), a natural question arises: do enhanced reasoning translate to superior semantic representations when these models serve as embedding initializations? Contrary to expectation, our evaluation on MTEB and BRIGHT reveals a **null effect**: embedding models initialized from RLVR-tuned backbones yield no consistent performance advantage over their base counterparts when subjected to identical training recipes. To unpack this paradox, we introduce **H**ierarchical **R**epresentation **S**imilarity **A**nalysis (HRSA), a framework that decomposes similarity across representation, geometry, and function levels. HRSA reveals that while RLVR…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗lucaswychan/Qwen2.5-1.5B-Reasoning-Embeddingmodel· 3 dl3 dl
- 🤗lucaswychan/Qwen-2.5-1.5B-SimpleRL-Zoo-Reasoning-Embeddingmodel· 6 dl· ♡ 16 dl♡ 1
- 🤗lucaswychan/Qwen2.5-0.5B-Reasoning-Embeddingmodel· 8 dl8 dl
- 🤗lucaswychan/Qwen-2.5-0.5B-SimpleRL-Zoo-Reasoning-Embeddingmodel
- 🤗lucaswychan/Qwen3-4B-Reasoning-Embeddingmodel· 3 dl3 dl
- 🤗lucaswychan/Qwen3-4B-PSR-Reasoning-Embeddingmodel· 2 dl2 dl
- 🤗lucaswychan/Nemotron-Research-Reasoning-Qwen-1.5B-Reasoning-Embeddingmodel· 7 dl7 dl
- 🤗lucaswychan/DeepSeek-R1-Distill-Qwen-1.5B-Reasoning-Embeddingmodel· 2 dl2 dl
- 🤗lucaswychan/Qwen3-0.6B-Reasoning-Embeddingmodel· 12 dl12 dl
- 🤗lucaswychan/Qwen3-0.6B-Base-Reasoning-Embeddingmodel· 3 dl3 dl
- lucaswychan/allnli-hard-neg-reasoning-embeddingdataset· 25 dl25 dl
- lucaswychan/dureader-hard-neg-reasoning-embeddingdataset· 24 dl24 dl
- lucaswychan/fever-hard-neg-reasoning-embeddingdataset· 17 dl17 dl
- lucaswychan/hotpotqa-hard-neg-reasoning-embeddingdataset· 13 dl13 dl
- lucaswychan/mrtydi-reasoning-embeddingdataset· 24 dl24 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Advanced Graph Neural Networks
