Agent-Dice: Disentangling Knowledge Updates via Geometric Consensus for Agent Continual Learning

Zheng Wu; Xingyu Lou; Xinbei Ma; Yansi Li; Weiwen Liu; Weinan Zhang; Jun Wang; Zhuosheng Zhang

arXiv:2601.03641·cs.CL·April 14, 2026

Agent-Dice: Disentangling Knowledge Updates via Geometric Consensus for Agent Continual Learning

Zheng Wu, Xingyu Lou, Xinbei Ma, Yansi Li, Weiwen Liu, Weinan Zhang, Jun Wang, Zhuosheng Zhang

PDF

1 Repo

TL;DR

Agent-Dice introduces a geometric consensus-based parameter fusion method to improve continual learning in LLM-based agents by effectively disentangling shared and conflicting knowledge updates.

Contribution

It proposes a novel two-stage knowledge disentanglement framework using geometric consensus filtering and curvature-based importance weighting.

Findings

01

Achieves superior continual learning performance with minimal computational overhead.

02

Effectively prunes conflicting gradients to prevent catastrophic forgetting.

03

Provides theoretical analysis validating the fusion scheme.

Abstract

Large Language Model (LLM)-based agents significantly extend the utility of LLMs by interacting with dynamic environments. However, enabling agents to continually learn new tasks without catastrophic forgetting remains a critical challenge, known as the stability-plasticity dilemma. In this work, we argue that this dilemma fundamentally arises from the failure to explicitly distinguish between common knowledge shared across tasks and conflicting knowledge introduced by task-specific interference. To address this, we propose Agent-Dice, a parameter fusion framework based on directional consensus evaluation. Concretely, Agent-Dice disentangles knowledge updates through a two-stage process: geometric consensus filtering to prune conflicting gradients, and curvature-based importance weighting to amplify shared semantics. We provide a rigorous theoretical analysis that establishes the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Wuzheng02/Agent-Dice
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.