Forget to Know, Remember to Use: Context-Aware Unlearning for Large Language Models

Yuefeng Peng; Parnian Afshar; Megan Ganji; Thomas Butler; Amir Houmansadr; Mingxian Wang; and Dezhi Hong

arXiv:2510.17620·cs.CL·October 21, 2025

Forget to Know, Remember to Use: Context-Aware Unlearning for Large Language Models

Yuefeng Peng, Parnian Afshar, Megan Ganji, Thomas Butler, Amir Houmansadr, Mingxian Wang, and Dezhi Hong

PDF

Open Access 3 Reviews

TL;DR

This paper introduces a context-aware unlearning method for large language models that effectively removes specific knowledge while preserving the model's ability to use that knowledge when reintroduced in prompts.

Contribution

It proposes a novel unlearning approach with a plug-in term to maintain contextual utility, addressing a key usability gap in existing methods.

Findings

01

Restores contextual utility to near original levels

02

Maintains effective forgetting of target knowledge

03

Preserves overall model utility

Abstract

Large language models may encode sensitive information or outdated knowledge that needs to be removed, to ensure responsible and compliant model responses. Unlearning has emerged as an efficient alternative to full retraining, aiming to remove specific knowledge while preserving overall model utility. Existing evaluations of unlearning methods focus on (1) the extent of forgetting of the target knowledge (forget set) and (2) maintaining performance on the retain set (i.e., utility). However, these evaluations overlook an important usability aspect: users may still want the model to leverage the removed information if it is re-introduced in the prompt. In a systematic evaluation of six state-of-the-art unlearning methods, we find that they consistently impair such contextual utility. To address this, we augment unlearning objectives with a plug-in term that preserves the model's ability…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 4Confidence 3

Strengths

- The proposed context-aware objective is modular, requires minimal changes, and can be plugged into various unlearning methods. - The approach yields substantial improvements (LLM-Judge ≈ +0.9) in contextual QA performance without harming forgetting or utility. - The authors conduct extensive experiments across multiple methods, and forget ratios, supported by both quantitative and qualitative analyses.

Weaknesses

- For handling outdated knowledge, knowledge editing is generally more appropriate than unlearning. Moreover, realistic cases where a model must “re-use” forgotten information are rare. - The problem is more accurately described as studying how unlearning affects in-context learning ability, rather than as a practical need to recover forgotten knowledge. - Experiments focus mainly on two small- to mid-sized instruction-tuned models (Gemma-2B-IT and Qwen3-8B) and synthetic benchmarks (TOFU), limi

Reviewer 02Rating 4Confidence 5

Strengths

The paper presents an interesting problem with a simple yet effective solution. Experimental results indicate that adding the context term noticeably improves model performance when ground truth is paired with forget samples in the prompt.

Weaknesses

My primary concern is the practical value of an unlearning approach that explicitly retain the knowledge and allow its recovery via prompt-based context. Since unlearning demand is driven by critical concerns such as privacy and compliance requirements, retaining such information, even conditionally, may still violate regulations or enable easier extraction by attackers. Should introducing context be more of a potential attack (i.e., vulnerability of existing unlearned model) than unlearning obj

Reviewer 03Rating 4Confidence 5

Strengths

The paper presents a clear problem formulation and introduces an insightful new evaluation axis, showing that standard unlearning methods can suppress the model’s ability to utilize externally supplied facts. The proposed fix is simple, practical, and easy to integrate, requiring minimal additional hyperparameters while yielding tangible empirical gains.

Weaknesses

- The TOFU dataset uses fictitious entities designed to be independent, but real data typically exhibit strong interconnections among entities and attributes. The authors should also evaluate existing unlearning methods on datasets like PISTOL (which explicitly models data interconnectivity) and on real-world pretraining data to test whether ContextQA performance remains suppressed when partial contextual links to the forget set persist. - The current ContextQA setup appears to append ground-tr

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications