Improving LLM Final Representations with Inter-Layer Geometry

Tom Ulanovski; Eyal Blyachman; Maya Bechler-Speicher

arXiv:2603.22665·cs.CL·May 14, 2026

Improving LLM Final Representations with Inter-Layer Geometry

Tom Ulanovski, Eyal Blyachman, Maya Bechler-Speicher

PDF

TL;DR

This paper introduces Cayley-Encoder, a novel, efficient graph-based method for leveraging intermediate LLM layers, significantly improving prediction accuracy across multiple tasks while maintaining low complexity.

Contribution

It proposes Cayley-Encoder, a mathematically grounded, sparse graph approach that enhances inter-layer communication in LLMs, outperforming existing methods with minimal additional parameters.

Findings

01

Cayley-Encoder outperforms baselines on 13 tasks and 9 LLMs.

02

Achieves up to 40 percentage points accuracy improvement.

03

Operates effectively in few-shot regimes and surpasses LoRA fine-tuning.

Abstract

The standard in LLM-based prediction is to use the final-layer representation as the input to a downstream predictor. However, intermediate layers may encode complementary task-relevant signals. Existing approaches therefore either search for the best layer for each task or apply expensive attention-based mechanisms to learn inter-layer aggregation. In this work, we first show that such complexity is unnecessary: a lightweight Graph Neural Network over a fully connected graph of LLM layers is more efficient and achieves significantly stronger predictive performance than existing approaches. We then introduce the Cayley-Encoder, which further improves both efficiency and predictive performance by replacing the fully connected graph with a Cayley graph over SL(2, Zn). These Cayley graphs provide a mathematically grounded topology that is sparse, regular by construction, and has low…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.