AI Progress Should Be Measured by Capability-Per-Resource, Not Scale Alone: A Framework for Gradient-Guided Resource Allocation in LLMs
David McCoy, Yulun Wu, Zachary Butzin-Dozier

TL;DR
This paper advocates for measuring AI progress by capability-per-resource rather than scale alone, proposing a gradient-guided resource allocation framework that enhances efficiency and sustainability in large language models.
Contribution
It introduces a theoretical framework and practical strategies for resource-efficient model updates based on gradient influence, challenging the scale-centric paradigm in AI development.
Findings
High-influence parameters can be identified using gradient norms.
Updating only influential parameters outperforms full tuning in efficiency.
Coordination of parameter and data selection yields multiplicative resource savings.
Abstract
This position paper challenges the "scaling fundamentalism" dominating AI research, where unbounded growth in model size and computation has led to unsustainable environmental impacts and widening resource inequality. We argue that LLM development should be fundamentally reoriented toward capability-per-resource rather than capability alone. We present a theoretical framework demonstrating that resource-allocation decisions guided by gradient influence patterns can dramatically improve efficiency throughout the AI lifecycle. Our analysis shows that in transformer-based models, where a small fraction of parameters exert outsized influence (following heavy-tailed distributions), three critical insights emerge: (1) updating only high-influence parameters strictly outperforms full-parameter tuning on a performance-per-resource basis; (2) simple gradient norms provide computationally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning in Materials Science · Big Data and Digital Economy · Advanced Neural Network Applications
