Cross-Entropy Is Load-Bearing: A Pre-Registered Scope Test of the K-Way Energy Probe on Bidirectional Predictive Coding

Jon-Paul Cacioli

arXiv:2604.21286·cs.CL·April 24, 2026

Cross-Entropy Is Load-Bearing: A Pre-Registered Scope Test of the K-Way Energy Probe on Bidirectional Predictive Coding

Jon-Paul Cacioli

PDF

TL;DR

This study investigates the role of cross-entropy in predictive coding networks, revealing it is crucial for the observed behavior of the K-way energy probe, with implications for understanding neural network training dynamics.

Contribution

It provides the first systematic, pre-registered analysis of how removing cross-entropy affects the K-way energy probe in bidirectional predictive coding networks.

Findings

01

Removing CE halves the probe-softmax gap.

02

CE training results in much larger output logits.

03

Temperature scaling decomposes the gap into scale and ranking effects.

Abstract

Cacioli (2026) showed that the K-way energy probe on standard discriminative predictive coding networks reduces approximately to a monotone function of the log-softmax margin. The reduction rests on five assumptions, including cross-entropy (CE) at the output and effectively feedforward inference dynamics. This pre-registered study tests the reduction's sensitivity to CE removal using two conditions: standard PC trained with MSE instead of CE, and bidirectional PC (bPC; Oliviers, Tang & Bogacz, 2025). Across 10 seeds on CIFAR-10 with a matched 2.1M-parameter backbone, we find three results. The negative result replicates on standard PC: the probe sits below softmax (Delta = -0.082, p < 10^-6). On bPC the probe exceeds softmax across all 10 seeds (Delta = +0.008, p = 0.000027), though a pre-registered manipulation check shows that bPC does not produce materially greater latent movement…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.