Sloth: scaling laws for LLM skills to predict multi-benchmark performance across families

Felipe Maia Polo; Seamus Somerstep; Leshem Choshen; Yuekai Sun; Mikhail Yurochkin

arXiv:2412.06540·cs.LG·December 3, 2025

Sloth: scaling laws for LLM skills to predict multi-benchmark performance across families

Felipe Maia Polo, Seamus Somerstep, Leshem Choshen, Yuekai Sun, Mikhail Yurochkin

PDF

Open Access 1 Repo

TL;DR

This paper introduces Skills Scaling Laws (SSLaws), a novel approach that predicts large language model performance across multiple benchmarks by modeling low-dimensional latent skills influenced by computational resources.

Contribution

SSLaws leverages benchmark correlations and latent skill modeling to improve performance predictions across LLM families without extensive retraining.

Findings

01

Accurately predicts LLM performance on 12 benchmarks.

02

Provides insights into skill scaling and compute efficiency.

03

Reduces need for training multiple models per family.

Abstract

Scaling laws for large language models (LLMs) predict model performance based on parameters like size and training data. However, differences in training configurations and data processing across model families lead to significant variations in benchmark performance, making it difficult for a single scaling law to generalize across all LLMs. On the other hand, training family-specific scaling laws requires training models of varying sizes for every family. In this work, we propose Skills Scaling Laws (SSLaws, pronounced as Sloth), a novel scaling law that leverages publicly available benchmark data and assumes LLM performance is driven by low-dimensional latent skills, such as reasoning and instruction following. These latent skills are influenced by computational resources like model size and training tokens, but with varying efficiencies across model families. Sloth exploits…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

felipemaiapolo/sloth
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOnline Learning and Analytics