The Predictive-Causal Gap: An Impossibility Theorem and Large-Scale Neural Evidence

Kejun Liu

arXiv:2605.05029·cs.LG·May 7, 2026

The Predictive-Causal Gap: An Impossibility Theorem and Large-Scale Neural Evidence

Kejun Liu

PDF

TL;DR

This paper demonstrates a fundamental limitation in predictive neural models, showing they tend to encode environment rather than system dynamics, especially in high-dimensional, noisy, or slow environments, with implications for AI modeling.

Contribution

It proves a structural property of predictive objectives causing models to encode environment modes over system modes, supported by large-scale neural evidence and theoretical analysis.

Findings

01

Optimal encoders predominantly track environment rather than system.

02

The gap worsens with increasing environment dimension, becoming causally blind at high N.

03

Operational grounding reduces but does not eliminate the predictive-causal gap.

Abstract

We report a systematic failure mode in predictive representation learning. Across 2695 neural network configurations trained to predict linear-Gaussian dynamics, the optimal encoder tracks the environment rather than the system it is meant to model. The mean causal fidelity -- the fraction of encoder sensitivity allocated to system degrees of freedom -- is 0.49, and only 2.5% of configurations exceed 0.70. The failure intensifies with dimension: at N=100, the optimal encoder becomes causally blind (fidelity ~10^{-8}) while achieving 92% lower prediction error than the causal representation. We prove this is not an optimization artifact but a structural property of the predictive objective: when environment modes are slower or less noisy than system modes, every minimizer of the population risk encodes the former. The set of dynamics exhibiting this predictive-causal gap is open and of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.