Activation Manifold Projection: Liberating Task-Specific Behaviors from LLM Architectures
Al Kari

TL;DR
This paper introduces CAST, a novel framework that enables zero-shot transfer of task-specific behaviors encoded by LoRA adapters across different LLM architectures by learning a direct activation manifold mapping.
Contribution
CAST provides a direct, nonlinear activation space transfer method that outperforms weight-space approaches and enables zero-shot adapter transfer between diverse LLMs.
Findings
CAST achieves 85-95% of LoRA performance after transfer.
It outperforms existing weight-space transfer techniques.
Enables zero-shot transfer between heterogeneous LLM architectures.
Abstract
The proliferation of Large Language Model (LLM) architectures presents a fundamental challenge: valuable, task-specific behaviors learned through fine-tuning methods like Low-Rank Adaptation (LoRA) are effectively trapped within their source model's architecture, herein referred to architectural lock-in. Existing transfer methods attempt to bridge this gap by aligning the static weight spaces of models, a brittle and indirect approach that relies on tenuous correlations between parameter geometries. This paper introduces a fundamentally different and more direct paradigm: the Cartridge Activation Space Transfer (CAST), a novel framework that liberates LoRA-encoded behaviors by learning a direct, nonlinear mapping between the activation manifolds, the geometric structures formed by the model's internal neuron activations, of two distinct LLM architectures. CAST treats a pre-trained LoRA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
