Beyond Words: A Latent Memory Approach to Internal Reasoning in LLMs

Jos\'e I. Orlicki

arXiv:2502.21030·cs.CL·March 3, 2025

Beyond Words: A Latent Memory Approach to Internal Reasoning in LLMs

Jos\'e I. Orlicki

PDF

TL;DR

This paper introduces an implicit memory framework for LLMs that enhances internal reasoning efficiency by mimicking human mental representations, showing significant training loss reduction and maintaining interpretability.

Contribution

It proposes a novel implicit memory module for LLMs, integrating human-like mental representations to improve reasoning efficiency and robustness.

Findings

01

Implicit Memory Module reduces training loss by 35-57%.

02

Framework maintains interpretability with optional chain-of-thought decoding.

03

Discusses scalable mechanisms and theoretical foundations for internal reasoning.

Abstract

Recent advances in large language models (LLMs) have popularized the chain-of-thought (CoT) paradigm, in which models produce explicit reasoning steps in natural language. Although this approach improves interpretability and facilitates external auditing, it may not represent the most computationally efficient method for internal reasoning. In contrast, human cognition relies on implicit mental representations that recall past sensory and episodic information without requiring complete verbalization. In this paper, we propose a framework that integrates implicit mental representations into the internal reasoning processes of LLMs. Preliminary experiments indicate that incorporating an Implicit Memory Module (IMM) into a simple GPT model yields a reduction of between 35% and 57% in final training loss compared to a regular GPT baseline. The addition of an explicit interpretability…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Linear Layer · Layer Normalization · Multi-Head Attention · Dense Connections · Discriminative Fine-Tuning · Linear Warmup With Cosine Annealing · Softmax