From Multi-Agent to Single-Agent: When Is Skill Distillation Beneficial?
Binyan Xu, Dong Fang, Haitao Li, Kehuan Zhang

TL;DR
This paper introduces Metric Freedom, a predictor of when skill distillation from multi-agent systems is beneficial, and proposes AdaSkill, a framework that adapts distillation based on metric properties, improving efficiency.
Contribution
It presents Metric Freedom as the first a priori predictor of skill utility and develops AdaSkill, a two-stage adaptive distillation framework guided by this metric.
Findings
Metric Freedom strongly predicts skill utility (r=-0.85, p<0.0001).
Skill utility varies significantly with evaluation metric rigidity.
AdaSkill matches or exceeds MAS performance while reducing cost and latency.
Abstract
Multi-agent systems (MAS) tackle complex tasks by distributing expertise, though this often comes at the cost of heavy coordination overhead, context fragmentation, and brittle phase ordering. Distilling a MAS into a single-agent skill can bypass these costs, but this conversion lacks a principled answer for when and what to distill. Instead, the empirical outcome is surprisingly inconsistent: skill lift ranges from a 28% improvement to a 2% degradation across metrics of the exact same task. In this work, we reveal that skill utility is governed not by the task, but by the evaluation metric. We introduce Metric Freedom (F), the first a priori predictor of skill utility. F measures the topological rigidity of a metric's scoring landscape by quantifying how output diversity couples with score variance via a Mantel test. Guided by F, we propose AdaSkill, a two-stage adaptive distillation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
