Improving LLM Final Representations with Inter-Layer Geometry
Tom Ulanovski, Eyal Blyachman, Maya Bechler-Speicher

TL;DR
This paper introduces Cayley-Encoder, a novel, efficient graph-based method for leveraging intermediate LLM layers, significantly improving prediction accuracy across multiple tasks while maintaining low complexity.
Contribution
It proposes Cayley-Encoder, a mathematically grounded, sparse graph approach that enhances inter-layer communication in LLMs, outperforming existing methods with minimal additional parameters.
Findings
Cayley-Encoder outperforms baselines on 13 tasks and 9 LLMs.
Achieves up to 40 percentage points accuracy improvement.
Operates effectively in few-shot regimes and surpasses LoRA fine-tuning.
Abstract
The standard in LLM-based prediction is to use the final-layer representation as the input to a downstream predictor. However, intermediate layers may encode complementary task-relevant signals. Existing approaches therefore either search for the best layer for each task or apply expensive attention-based mechanisms to learn inter-layer aggregation. In this work, we first show that such complexity is unnecessary: a lightweight Graph Neural Network over a fully connected graph of LLM layers is more efficient and achieves significantly stronger predictive performance than existing approaches. We then introduce the Cayley-Encoder, which further improves both efficiency and predictive performance by replacing the fully connected graph with a Cayley graph over SL(2, Zn). These Cayley graphs provide a mathematically grounded topology that is sparse, regular by construction, and has low…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
