Memory Inception: Latent-Space KV Cache Manipulation for Steering LLMs
Andy Zeyi Liu, Michael Zhang, Ilana Greenberg, Adam Alnasser, Lucas Baker, John Sous

TL;DR
Memory Inception (MI) is a training-free latent-space steering method for LLMs that selectively injects key-value banks at specific layers, enabling persistent, structured, and efficient guidance without cluttering prompts.
Contribution
MI introduces a novel, training-free approach to steer LLMs by manipulating latent attention space with text-derived KV banks at selected layers.
Findings
MI achieves superior control on personality-steering tasks.
Supports mid-conversation behavior shifts without rewriting transcripts.
Outperforms prompting on structured reasoning tasks and reduces KV storage significantly.
Abstract
Steering large language models (LLMs) is usually done by either instruction prompting or activation steering. Prompting often gives strong control, but caches guidance tokens at every layer and can clutter long interactions; activation steering is compact but typically weaker and does not support large structured reminders. We introduce memory inception (MI), a training-free method that steers in latent attention space by inserting text-derived key-value (KV) banks only at selected layers. Rather than materializing reminder content throughout the prompt cache, MI treats steering as selective KV allocation, injecting latent slots only where the model routes to them. On matched personality-steering tasks, MI gives the best overall control--drift trade-off, remaining competitive with prompting while consistently outperforming CAA. On updateable guidance, MI supports mid-conversation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
