Evaluating Embedding Generalization: How LLMs, LoRA, and SLERP Shape Representational Geometry

Siyaxolisa Kabane

arXiv:2511.21703·cs.CL·December 1, 2025

Evaluating Embedding Generalization: How LLMs, LoRA, and SLERP Shape Representational Geometry

Siyaxolisa Kabane

PDF

Open Access

TL;DR

This paper compares the generalization of dense text embeddings from LLMs and non-LLMs, showing how SLERP merging improves model robustness and preserves task-specific gains in embedding quality.

Contribution

It introduces a controlled experimental framework to evaluate embedding generalization and demonstrates that SLERP merging effectively mitigates over-specialization in LLM-based embeddings.

Findings

01

LLM-based embeddings better capture numeric patterns

02

SLERP merging restores base-model structure

03

Merged models outperform non-merged in clustering tasks

Abstract

We investigate the generalization properties of dense text embeddings when the embedding backbone is a large language model (LLM) versus when it is a non-LLM encoder, and we study the extent to which spherical linear interpolation (SLERP) model-merging mitigates over-specialization introduced by task-specific adaptation (e.g., LoRA). To make the comparison concrete and domain-agnostic, we design a controlled suite of experiments in which models embed short numerical sequences and are evaluated on their ability to cluster and classify those sequences according to well-defined number-theoretic properties. Our experimental protocol compares four families of models: (1) non-LLM encoders trained from scratch or fine-tuned for embeddings, (2) LLM-based encoders adapted with parameter-efficient methods (LoRA), (3) LLM-based encoders with LoRA followed by model souping merging into the base…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Computational and Text Analysis Methods · Natural Language Processing Techniques