TL;DR
This paper introduces Energy-Based Reasoning via Structured Latent Planning (EBRM), modeling reasoning as gradient optimization over latent trajectories with an energy function, addressing limitations of existing neural decoders.
Contribution
It proposes a novel latent planning framework with energy-based modeling, dual-path decoder training, and latent anchoring to improve reasoning in neural models.
Findings
Energy decreases monotonically on graph and logic tasks.
Latent planning induces structured trajectories in synthetic tasks.
Energy-based approach remains flat on arithmetic tasks, indicating limitations.
Abstract
Single-shot neural decoders commit to answers without iterative refinement, while chain-of-thought methods introduce discrete intermediate steps but lack a scalar measure of reasoning progress. We propose Energy-Based Reasoning via Structured Latent Planning (EBRM), which models reasoning as gradient-based optimization of a multi-step latent trajectory under a learned energy function . The energy decomposes into per-step compatibility, transition consistency, and trajectory smoothness terms. Training combines supervised encoder-decoder learning with contrastive energy shaping using hard negatives, while inference performs gradient descent or Langevin dynamics over and decodes from . We identify a critical failure mode: on CNF logic satisfaction, latent planning reduces accuracy from to . This degradation arises from a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
