DIVE: Embedding Compression via Self-Limiting Gradient Updates

Dongfang Zhao

arXiv:2605.20689·cs.CL·May 21, 2026

DIVE: Embedding Compression via Self-Limiting Gradient Updates

Dongfang Zhao

PDF

1 Repo

TL;DR

DIVE introduces a novel embedding compression method using self-limiting gradient updates and contrastive learning, significantly improving retrieval performance across multiple datasets.

Contribution

It proposes a new compression adapter that mitigates overfitting in embedding compression by bounding perturbations and leveraging implicit views for self-supervised learning.

Findings

01

DIVE outperforms baseline adapters on all evaluated datasets.

02

Achieves better retrieval performance at various compression ratios.

03

Provides a 14M-parameter open-source implementation.

Abstract

High-dimensional embeddings from large language models impose significant storage and computational costs on vector search systems. Recent embedding compression methods, including Matryoshka-Adaptor (EMNLP 2024), Search-Adaptor (ACL 2024), and SMEC (EMNLP 2025), enable dimensionality reduction through lightweight residual adapters, but their training objectives cause severe overfitting when labeled data is scarce, degrading retrieval performance below the frozen baseline. We propose \textsc{DIVE} (\textbf{D}imensionality reduction with \textbf{I}mplicit \textbf{V}iew \textbf{E}nsembles), a compression adapter that addresses this failure through two mechanisms. First, a self-limiting hinge-based triplet loss produces zero gradient once a triplet satisfies the margin constraint, bounding the total perturbation applied to the pretrained embedding space. Second, a head-wise NT-Xent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

null
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.