NV-Retriever: Improving text embedding models with effective hard-negative mining
Gabriel de Souza P. Moreira, Radek Osmulski, Mengyao Xu, Ronay Ak,, Benedikt Schifferer, Even Oldridge

TL;DR
This paper introduces NV-Retriever, a novel hard-negative mining method for contrastive training of text embedding models, significantly improving retrieval accuracy and training efficiency in information retrieval tasks.
Contribution
It proposes positive-aware mining techniques that leverage relevance scores for better false negative removal, enhancing contrastive learning for text embeddings.
Findings
NV-Retriever-v1 scores 60.9 on MTEB Retrieval benchmark
Achieves state-of-the-art results on BEIR dataset
Faster training with improved retrieval accuracy
Abstract
Text embedding models have been popular for information retrieval applications such as semantic search and Question-Answering systems based on Retrieval-Augmented Generation (RAG). Those models are typically Transformer models that are fine-tuned with contrastive learning objectives. One of the challenging aspects of fine-tuning embedding models is the selection of high quality hard-negative passages for contrastive learning. In this paper we introduce a family of positive-aware mining methods that use the positive relevance score as an anchor for effective false negative removal, leading to faster training and more accurate retrieval models. We provide an ablation study on hard-negative mining methods over their configurations, exploring different teacher and base models. We further demonstrate the efficacy of our proposed mining methods at scale with the NV-Retriever-v1 model, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗lightonai/LateOn-Code-edgemodel· 3.3k dl· ♡ 263.3k dl♡ 26
- 🤗nvidia/NV-Embed-v2model· 82k dl· ♡ 50882k dl♡ 508
- 🤗dragonkue/multilingual-e5-small-ko-v2model· 4.2k dl· ♡ 34.2k dl♡ 3
- 🤗nvidia/omni-embed-nemotron-3bmodel· 2.4k dl· ♡ 992.4k dl♡ 99
- 🤗nvidia/llama-nemotron-embed-1b-v2model· 61k dl· ♡ 4961k dl♡ 49
- 🤗lightonai/LateOn-Codemodel· 215 dl· ♡ 25215 dl♡ 25
- 🤗nvidia/NV-Retriever-v1model· 15 dl· ♡ 2415 dl♡ 24
- 🤗markaw/NV-Embed-v2model· 89 dl89 dl
- 🤗dragonkue/multilingual-e5-small-komodel· 5.6k dl· ♡ 105.6k dl♡ 10
- 🤗exp-models/dragonkue-KoEn-E5-Tinymodel· 402 dl· ♡ 5402 dl♡ 5
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Text and Document Classification Technologies
MethodsAttention Is All You Need · Byte Pair Encoding · Layer Normalization · Label Smoothing · Linear Layer · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Multi-Head Attention · Dense Connections
