Why Code, Why Now: An Information-Theoretic Perspective on the Limits of Machine Learning
Zhimin Zhao

TL;DR
This paper introduces an information-theoretic hierarchy to explain the varying learnability of tasks like code generation and reinforcement learning, emphasizing the role of task structure over model size.
Contribution
It proposes a five-level hierarchy based on information structure to predict ML scalability and clarifies the limits of learning tasks through formal properties.
Findings
Code generation benefits from dense, local feedback at each token.
Reinforcement learning lacks such graded feedback, hindering scalability.
Diagnosing a task's position in the hierarchy predicts scaling outcomes better than model properties.
Abstract
This paper offers a new perspective on the limits of machine learning: the ceiling on progress is set not by model size or algorithm choice but by the information structure of the task itself. Code generation has progressed more reliably than reinforcement learning, largely because code provides dense, local, verifiable feedback at every token, whereas most reinforcement learning problems do not. This difference in feedback quality is not binary but graded. We propose a five-level hierarchy of learnability based on information structure and argue that diagnosing a task's position in this hierarchy is more predictive of scaling outcomes than any property of the model. The hierarchy rests on a formal distinction among three properties of computational problems (expressibility, computability, and learnability). We establish their pairwise relationships, including where implications hold…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
