The Scaling Laws of Skills in LLM Agent Systems
Charles Chen, Qiming Yu, Yuhang Gu, Zhuoye Huang, Hanjing Li, Hongyu Liu, Simin Liu, Jinhao Liu, Dengyun Peng, Jiangyi Wang, Zheng Yan, Fanqing Meng, Ethan Qin, Carl Che, Mengkang Hu

TL;DR
This paper uncovers scaling laws governing skill accumulation in large language model agent systems, revealing how library size impacts routing accuracy and downstream performance.
Contribution
It introduces two coupled laws describing skill library scaling effects and demonstrates how these laws can guide optimization to improve agent robustness and efficiency.
Findings
Routing accuracy decays logarithmically with library size.
Correct execution can improve downstream decisions by about 4×.
Law-guided optimization significantly enhances routing and execution performance.
Abstract
As agent systems scale, skills accumulate into large reusable libraries, yet their scaling laws remain poorly understood. Across 15 frontier LLMs, 1,141 real-world skills, and over 3M routing or execution decisions, we identify two coupled laws. Routing law: single-step routing accuracy decays logarithmically with library size ( for all models), with errors progressing from local skill competition to cross-family drift and capture by overly general "black-hole skills". Execution law: before state realization, joint routing is approximately multiplicative, whereas correct execution can improve difficult downstream decisions by about . A single parameter, the routing logarithmic decay slope , couples the two laws: routing-side fits predict execution-side rescue across models, showing that the same library property controls both pre-execution collapse and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
