Ace-Skill: Bootstrapping Multimodal Agents with Prioritized and Clustered Evolution

Feng Xiong; Zengbin Wang; Yong Wang; Xuecai Hu; Jinghan He; Liang Lin; Yuan Liu; Xiangxiang Chu

arXiv:2605.08887·cs.AI·May 12, 2026

Ace-Skill: Bootstrapping Multimodal Agents with Prioritized and Clustered Evolution

Feng Xiong, Zengbin Wang, Yong Wang, Xuecai Hu, Jinghan He, Liang Lin, Yuan Liu, Xiangxiang Chu

PDF

1 Repo

TL;DR

Ace-Skill introduces a co-evolutionary framework that enhances self-evolving multimodal agents by optimizing rollout sampling and knowledge organization, leading to significant performance improvements and knowledge transfer capabilities.

Contribution

It presents a novel joint optimization approach combining prioritized sampling and semantic clustering to improve self-evolution in multimodal agents.

Findings

01

Achieved +35.46% in Avg@4 accuracy across benchmarks.

02

Enabled a 35B MoE model to outperform proprietary counterparts.

03

Transferred knowledge effectively to smaller models in zero-shot settings.

Abstract

Self-evolving agents present a promising path toward continual adaptation by distilling task interactions into reusable knowledge artifacts. In practice, this paradigm remains hindered by two coupled bottlenecks: data inefficiency, where costly rollout effort is disproportionately spent on low-value samples rather than informative ones, and knowledge interference, where heterogeneous knowledge stored in shared repositories leads to noisy retrieval and task-misaligned guidance. Together, these issues form a self-reinforcing failure loop in which uninformative rollouts yield noisy knowledge, which in turn degrades subsequent rollouts. In this work, we introduce Ace-Skill, a co-evolutionary framework that jointly optimizes rollout allocation and knowledge organization for self-evolving multimodal agents. Specifically, Ace-Skill combines aprioritized sampler with lazy-decay proficiency…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AMAP-ML/Ace-Skill
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.