EGA: Adapting Frozen Encoders for Vector Search with Bounded Out-of-Distribution Degradation
Dongfang Zhao

TL;DR
EGA introduces a residual adapter for frozen vision encoders that maintains high accuracy on unseen classes by automatically limiting updates in already correct local geometries, improving out-of-distribution vector search.
Contribution
The paper proposes Euclidean Geodesic Alignment (EGA), a novel residual adapter that stabilizes unseen-class performance while refining seen classes in vector search.
Findings
EGA achieves 96.5% gradient-free triplets at convergence, preserving unseen-class regions.
EGA outperforms existing methods on five out-of-distribution benchmarks.
The approach transfers effectively to stronger backbones beyond CLIP.
Abstract
Vector search systems built on frozen vision encoders face queries from unseen classes at deployment, yet existing adapter training collapses under this shift: high-capacity adapters with global contrastive losses silently reassign unseen-class samples to wrong seen-class clusters, dropping worst-case Label Precision by over 40 points below the frozen baseline in our tests. We propose Euclidean Geodesic Alignment (EGA), a residual adapter that couples three principles: zero initialization, local triplet loss, and hypersphere projection. These collectively induce a self-limiting dynamic: triplets that already satisfy a small margin stop producing gradients, so the adapter automatically stops updating where the local geometry is already correct. Our experiments show that at convergence of triplets are gradient-free, leaving unseen-class regions largely untouched while still…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
