Effective Frontiers: A Unification of Neural Scaling Laws
Jiaxuan Zou, Zixuan Gong, Ye Su, Huayi Tang, Yong Liu

TL;DR
This paper introduces a unified theoretical framework for neural scaling laws based on the concept of an Effective Frontier, explaining how model capacity, data size, and compute influence learning through pattern coverage and bottlenecks.
Contribution
It unifies existing neural scaling laws under a single framework using the Effective Frontier concept, providing clear explanations for different scaling regimes.
Findings
Derives precise scaling laws for model capacity, data, and compute.
Shows that different scaling laws are equilibrium solutions of a constrained optimization.
Provides a universal explanation for neural scaling behaviors across architectures.
Abstract
Neural scaling laws govern the prediction power-law improvement of test loss with respect to model capacity (), datasize (), and compute (). However, existing theoretical explanations often rely on specific architectures or complex kernel methods, lacking intuitive universality. In this paper, we propose a unified framework that abstracts general learning tasks as the progressive coverage of patterns from a long-tail (Zipfian) distribution. We introduce the Effective Frontier (), a threshold in the pattern rank space that separates learned knowledge from the unlearned tail. We prove that reducible loss is asymptotically determined by the probability mass of the tail a resource-dependent frontier truncation. Based on our framework, we derive the precise scaling laws for , , and , attributing them to capacity, coverage, and optimization bottlenecks,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Topic Modeling
