Agentic Unlearning: When LLM Agent Meets Machine Unlearning

Bin Wang; Fan Wang; Pingping Wang; Jinyu Cong; Yang Yu; Yilong Yin; Zhongyi Han; and Benzheng Wei

arXiv:2602.17692·cs.LG·March 3, 2026

Agentic Unlearning: When LLM Agent Meets Machine Unlearning

Bin Wang, Fan Wang, Pingping Wang, Jinyu Cong, Yang Yu, Yilong Yin, Zhongyi Han, and Benzheng Wei

PDF

Open Access

TL;DR

This paper introduces agentic unlearning, a unified framework for removing specific information from both model parameters and persistent memory in AI agents, addressing backflow issues and ensuring comprehensive unlearning.

Contribution

It proposes Synchronized Backflow Unlearning (SBU), a novel method that jointly unlearns from parameters and memory pathways using a dual-update protocol.

Findings

01

Reduces private information traces in models and memory

02

Maintains data utility with limited degradation

03

Effective on medical QA benchmarks

Abstract

In this paper, we introduce \textbf{agentic unlearning} which removes specified information from both model parameters and persistent memory in agents with closed-loop interaction. Existing unlearning methods target parameters alone, leaving two critical gaps: (i) parameter-memory backflow, where retrieval reactivates parametric remnants or memory artifacts reintroduce sensitive content, and (ii) the absence of a unified strategy that covers both parameter and memory pathways. We present Synchronized Backflow Unlearning (SBU), a framework that unlearns jointly across parameter and memory pathways. The memory pathway performs dependency closure-based unlearning that prunes isolated entities while logically invalidating shared artifacts. The parameter pathway employs stochastic reference alignment to guide model outputs toward a high-entropy prior. These pathways are integrated via a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Machine Learning in Healthcare · Topic Modeling