Diverse capability and scaling of diffusion and auto-regressive models when learning abstract rules
Binxu Wang, Jiaqi Shang, Haim Sompolinsky

TL;DR
This study compares diffusion and autoregressive generative models in learning abstract rules from limited data, revealing their complementary strengths and limitations in rule learning and reasoning tasks.
Contribution
It introduces the GenRAVEN dataset and systematically evaluates how diffusion and autoregressive models learn and apply rules, highlighting their different capabilities and scaling behaviors.
Findings
Diffusion models excel at unconditional generation and memorization.
Autoregressive models perform better at rule-consistent panel completion.
Rule learning emerges at around 1000 samples per rule for both model types.
Abstract
Humans excel at discovering regular structures from limited samples and applying inferred rules to novel settings. We investigate whether modern generative models can similarly learn underlying rules from finite samples and perform reasoning through conditional sampling. Inspired by the Raven's Progressive Matrices task, we designed the GenRAVEN dataset, where each sample consists of three rows, and one of 40 relational rules governing the object position, number, or attributes applies to all rows. We trained generative models to learn the data distribution, where samples are encoded as integer arrays to focus on rule learning. We compared two generative model families: diffusion models (EDM, DiT, SiT) and autoregressive models (GPT2, Mamba). We evaluated their ability to generate structurally consistent samples and perform panel completion via unconditional and conditional sampling. We found…
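To make the setup in the abstract concrete, here is a minimal Python sketch of a sample encoded as an integer array, a rule-consistency check, and a panel completion. It is an illustration only and not the GenRAVEN format: the array layout `sample[row, panel, attribute]`, the three attributes per panel, the "progression" rule, the `-1` placeholder for the missing panel, and all function names are assumptions made for this example.

```python
import numpy as np

# Hypothetical toy encoding (NOT the GenRAVEN format):
# sample[row, panel, attribute], with 3 rows, 3 panels per row,
# and 3 integer attributes per panel (e.g. shape, size, color).
PROGRESSION_ATTR = 1  # assumed: the rule acts on attribute index 1


def row_follows_progression(row: np.ndarray, attr: int) -> bool:
    """True if attribute `attr` increases by exactly 1 from panel to panel."""
    return bool(np.all(np.diff(row[:, attr]) == 1))


def sample_is_consistent(sample: np.ndarray, attr: int) -> bool:
    """A sample is rule-consistent when the same rule holds in every row."""
    return all(row_follows_progression(row, attr) for row in sample)


def complete_last_panel(sample: np.ndarray, attr: int) -> np.ndarray:
    """Fill the bottom-right panel so that the last row obeys the rule.

    Toy assumption: attributes not governed by the rule are copied
    from the previous panel in the same row.
    """
    completed = sample.copy()
    completed[2, 2] = completed[2, 1]
    completed[2, 2, attr] = completed[2, 1, attr] + 1
    return completed


# Two complete rows plus a last row whose final panel is unknown (-1).
sample = np.array([
    [[0, 1, 4], [0, 2, 4], [0, 3, 4]],
    [[2, 0, 1], [2, 1, 1], [2, 2, 1]],
    [[1, 3, 0], [1, 4, 0], [-1, -1, -1]],
])

completed = complete_last_panel(sample, PROGRESSION_ATTR)
print(sample_is_consistent(sample, PROGRESSION_ATTR))     # incomplete sample
print(sample_is_consistent(completed, PROGRESSION_ATTR))  # completed sample
```

In this sketch the completion is computed directly from a known rule. In the study, by contrast, a trained generative model has to infer the rule from the visible panels and produce the missing one through conditional sampling; a check like `sample_is_consistent` only plays the role of an evaluator.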
Taxonomy
Topics: Neural Networks and Applications
Methods: Diffusion · Focus
