Evaluating Embedding Generalization: How LLMs, LoRA, and SLERP Shape Representational Geometry
Siyaxolisa Kabane

TL;DR
This paper compares the generalization of dense text embeddings from LLMs and non-LLMs, showing how SLERP merging improves model robustness and preserves task-specific gains in embedding quality.
Contribution
It introduces a controlled experimental framework to evaluate embedding generalization and demonstrates that SLERP merging effectively mitigates over-specialization in LLM-based embeddings.
Findings
LLM-based embeddings better capture numeric patterns
SLERP merging restores base-model structure
Merged models outperform non-merged in clustering tasks
Abstract
We investigate the generalization properties of dense text embeddings when the embedding backbone is a large language model (LLM) versus when it is a non-LLM encoder, and we study the extent to which spherical linear interpolation (SLERP) model-merging mitigates over-specialization introduced by task-specific adaptation (e.g., LoRA). To make the comparison concrete and domain-agnostic, we design a controlled suite of experiments in which models embed short numerical sequences and are evaluated on their ability to cluster and classify those sequences according to well-defined number-theoretic properties. Our experimental protocol compares four families of models: (1) non-LLM encoders trained from scratch or fine-tuned for embeddings, (2) LLM-based encoders adapted with parameter-efficient methods (LoRA), (3) LLM-based encoders with LoRA followed by model souping merging into the base…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods · Natural Language Processing Techniques
