ExplicitLM: Decoupling Knowledge from Parameters via Explicit Memory Banks

Chengzhang Yu; Zening Lu; Chenyang Zheng; Chiyue Wang; Yiming Zhang; Zhanpeng Jin

arXiv:2511.01581·cs.AI·November 4, 2025

ExplicitLM: Decoupling Knowledge from Parameters via Explicit Memory Banks

Chengzhang Yu, Zening Lu, Chenyang Zheng, Chiyue Wang, Yiming Zhang, Zhanpeng Jin

PDF

Open Access 3 Reviews

TL;DR

ExplicitLM introduces an external memory bank for language models, enabling direct knowledge inspection and updates, leading to improved performance on knowledge tasks and enhanced interpretability.

Contribution

The paper presents a novel architecture with a large external memory bank and a differentiable retrieval mechanism, decoupling knowledge from model parameters for better transparency and updateability.

Findings

01

Achieves up to 43.67% improvement on knowledge-intensive tasks.

02

Gains 3.62× in low-data regimes with 10k samples.

03

Retrieval correlates strongly with prediction accuracy.

Abstract

Large language models suffer from knowledge staleness and lack of interpretability due to implicit knowledge storage across entangled network parameters, preventing targeted updates and reasoning transparency. We propose ExplicitLM, a novel architecture featuring a million-scale external memory bank storing human-readable knowledge as token sequences, enabling direct inspection and modification. We design a differentiable two-stage retrieval mechanism with efficient coarse-grained filtering via product key decomposition (reducing complexity from $O (N \cdot ∣ I ∣)$ to $O (N \cdot ∣ I ∣)$ ) and fine-grained Gumbel-Softmax matching for end-to-end training. Inspired by dual-system cognitive theory, we partition knowledge into frozen explicit facts (20%) and learnable implicit patterns (80%), maintained through Exponential Moving Average updates for stability. ExplicitLM…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 2Confidence 3

Strengths

The paper does a good job setting up the problem. Knowledge augmentation/ update/ editing is an open research problem.

Weaknesses

The experiment set up is not rigorous enough to demonstrate that this approach really works. There is almost no details on what kind of model is used as baseline, other than saying it’s a standard transformer model. Second, while it does a goos job in setting up the evaluation tasks, there is no discussion about what has been used as an evaluation set. No result on standard benchmarks are presented. While the paper presents results of perfect retrieval, they also did not compare their results wi

Reviewer 02Rating 2Confidence 4

Strengths

1. The research aim of this paper is forward-looking, i.e., building a readable, inspectable, and modifiable explicit memory bank. 2. Explicit division of the memory into different parts is interesting; however, the names for these two types of memory need to be carefully considered.

Weaknesses

1. **Limited Contribution** **Explicit Memory:** Similar concepts exist in Memory3, MemoryLLM, memory modules in model editing (e.g., MEMIT), and earlier Memory Networks. * **Product Key Retrieval:** Seems to directly adopted from works like "Mixture of a Million Experts". * **Readable Memory:** Conceptually similar to retrieving raw text in RAG. Consequently, while the paper integrates several existing ideas, `the integration itself is superficial, and the resulting archi

Reviewer 03Rating 6Confidence 4

Strengths

- The paper is well written and easy to follow. It has a good and detailed technical descripton of the approach that makes it easy to understand what's happening and how ExplicitLM is optimized. The contribution approaches the problem of retrieval augmented generation in a new way by fundamentally rethinking how we can represent external knowledge bases. - Empirical results show strong improvements over transformer baseline of up to 43.6%. These observations hold in several categories including

Weaknesses

- The evaluations are limited to transformer baseline only which makes the contribution less convincing because the transformer architecture is not particularly designed of knowledge retrieval in the first place. The contribution of the paper could be strengthend by comparing ExplicitLM against other baselines such as simple RAG. Other ablations here, such as runtime or scaling performance, may further strengthen the contribution. - The dataset construction and particularly how the memory bank i

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Graph Neural Networks · Ferroelectric and Negative Capacitance Devices