Learned but Not Expressed: Capability-Expression Dissociation in Large Language Models
Toshiyuki Shigemura

TL;DR
This study reveals that large language models can recall learned information but often do not express it in typical outputs, highlighting a dissociation between memorization and generation.
Contribution
It provides empirical evidence that LLMs can reconstruct learned content but tend to suppress it in standard outputs across various tasks and models.
Findings
Zero instances of non-causal solutions in generated outputs
Memorization capability does not translate to expression in typical generation
Task-conditioned policies suppress learned content across contexts
Abstract
Large language models (LLMs) demonstrate the capacity to reconstruct and trace learned content from their training data under specific elicitation conditions, yet this capability does not manifest in standard generation contexts. This empirical observational study examines the expression of non-causal, non-implementable solution types across 300 prompt-response generations spanning narrative and problem-solving task contexts. Drawing on recent findings regarding memorization contiguity and alignment-induced discourse priors, we document a systematic dissociation between learned capability and expressed output. Across three distinct LLMs, ten task scenarios, and both creative narrative and practical advisory contexts, we documented zero instances of non-causal solution frames in generated outputs (0%, 95% CI: [0%, 1.2%]), despite verified reconstruction capability under conditional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Topic Modeling · Computational and Text Analysis Methods
