Learning Iterative Reasoning through Energy Diffusion
Yilun Du, Jiayuan Mao, Joshua B. Tenenbaum

TL;DR
The paper presents IRED, a new energy diffusion framework that learns iterative reasoning to solve complex problems by adapting inference steps, outperforming existing methods in various reasoning tasks.
Contribution
Introduces a novel energy diffusion approach with adaptive inference and two training techniques for improved reasoning performance.
Findings
Outperforms existing reasoning methods in multiple tasks.
Enables solving more complex problems outside training distribution.
Uses energy landscape annealing for easier inference.
Abstract
We introduce iterative reasoning through energy diffusion (IRED), a novel framework for learning to reason for a variety of tasks by formulating reasoning and decision-making problems with energy-based optimization. IRED learns energy functions to represent the constraints between input conditions and desired outputs. After training, IRED adapts the number of optimization steps during inference based on problem difficulty, enabling it to solve problems outside its training distribution -- such as more complex Sudoku puzzles, matrix completion with large value magnitudes, and pathfinding in larger graphs. Key to our method's success is two novel techniques: learning a sequence of annealed energy landscapes for easier inference and a combination of score function and energy landscape supervision for faster and more stable training. Our experiments show that IRED outperforms existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Neural Networks and Applications · Fuzzy Logic and Control Systems
MethodsDiffusion
