Knowledge Capsules: Structured Nonparametric Memory Units for LLMs
Bin Ju, Shenfeng Weng, Danying Zhou, Rongkai Xu, Kunkai Su

TL;DR
The paper introduces Knowledge Capsules, a structured memory system for LLMs that enhances knowledge integration stability and accuracy without retraining, outperforming retrieval-augmented methods in QA tasks.
Contribution
It presents a novel External Key Value Injection framework that directly incorporates structured knowledge into attention mechanisms, improving over existing retrieval-based approaches.
Findings
Outperforms RAG and GraphRAG on multiple QA benchmarks.
Provides improved stability and accuracy in long context and multi-hop reasoning.
Requires no parameter updates during knowledge integration.
Abstract
Large language models (LLMs) encode knowledge in parametric weights, making it costly to update or extend without retraining. Retrieval-augmented generation (RAG) mitigates this limitation by appending retrieved text to the input, but operates purely through context expansion, where external knowledge competes as tokens within the attention mechanism. As a result, its influence is indirect and often unstable, particularly in long context and multi hop reasoning scenarios. We propose Knowledge Capsules, structured nonparametric memory units that represent normalized relational knowledge and can be constructed directly from document corpora using a frozen base model. Instead of injecting knowledge as text, we introduce an External Key Value Injection (KVI) framework that compiles capsules into attention-compatible key value representations, enabling external knowledge to directly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
