TL;DR
Agent-Dice introduces a geometric consensus-based parameter fusion method to improve continual learning in LLM-based agents by effectively disentangling shared and conflicting knowledge updates.
Contribution
It proposes a novel two-stage knowledge disentanglement framework using geometric consensus filtering and curvature-based importance weighting.
Findings
Achieves superior continual learning performance with minimal computational overhead.
Effectively prunes conflicting gradients to prevent catastrophic forgetting.
Provides theoretical analysis validating the fusion scheme.
Abstract
Large Language Model (LLM)-based agents significantly extend the utility of LLMs by interacting with dynamic environments. However, enabling agents to continually learn new tasks without catastrophic forgetting remains a critical challenge, known as the stability-plasticity dilemma. In this work, we argue that this dilemma fundamentally arises from the failure to explicitly distinguish between common knowledge shared across tasks and conflicting knowledge introduced by task-specific interference. To address this, we propose Agent-Dice, a parameter fusion framework based on directional consensus evaluation. Concretely, Agent-Dice disentangles knowledge updates through a two-stage process: geometric consensus filtering to prune conflicting gradients, and curvature-based importance weighting to amplify shared semantics. We provide a rigorous theoretical analysis that establishes the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
