Agentic Unlearning: When LLM Agent Meets Machine Unlearning
Bin Wang, Fan Wang, Pingping Wang, Jinyu Cong, Yang Yu, Yilong Yin, Zhongyi Han, and Benzheng Wei

TL;DR
This paper introduces agentic unlearning, a unified framework for removing specific information from both model parameters and persistent memory in AI agents, addressing backflow issues and ensuring comprehensive unlearning.
Contribution
It proposes Synchronized Backflow Unlearning (SBU), a novel method that jointly unlearns from parameters and memory pathways using a dual-update protocol.
Findings
Reduces private information traces in models and memory
Maintains data utility with limited degradation
Effective on medical QA benchmarks
Abstract
In this paper, we introduce \textbf{agentic unlearning} which removes specified information from both model parameters and persistent memory in agents with closed-loop interaction. Existing unlearning methods target parameters alone, leaving two critical gaps: (i) parameter-memory backflow, where retrieval reactivates parametric remnants or memory artifacts reintroduce sensitive content, and (ii) the absence of a unified strategy that covers both parameter and memory pathways. We present Synchronized Backflow Unlearning (SBU), a framework that unlearns jointly across parameter and memory pathways. The memory pathway performs dependency closure-based unlearning that prunes isolated entities while logically invalidating shared artifacts. The parameter pathway employs stochastic reference alignment to guide model outputs toward a high-entropy prior. These pathways are integrated via a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Machine Learning in Healthcare · Topic Modeling
