# Towards Mitigating Excessive Forgetting in LLM Unlearning via Entanglement-Guidance with Proxy Constraint

**Authors:** Zhihao Liu, Jian Lou, Yuke Hu, Xiaochen Li, Yitian Chen, Tailun Chen, Zhizhen Qin, Kui Ren, Zhan Qin

arXiv: 2508.20443 · 2026-01-13

## TL;DR

This paper introduces EGUP, a novel unlearning framework for large language models that uses entanglement guidance and proxy constraints to effectively forget specific data while preserving utility.

## Contribution

EGUP is a new framework that adaptively guides unlearning in LLMs using entanglement and proxy constraints, reducing over-unlearning and utility loss.

## Key findings

- EGUP improves the unlearning-utility trade-off on TOFU and MUSE benchmarks.
- EGUP achieves near-retraining performance with better scalability and robustness.
- EGUP is compatible with existing gradient-based unlearning methods.

## Abstract

Large language models (LLMs) are trained on massive datasets that may include private or copyrighted content. Due to growing privacy and ownership concerns, data owners may request the removal of their data from trained models. Machine unlearning provides a practical solution by removing the influence of specific data without full retraining. However, most existing methods still suffer from over-unlearning due to the lack of a principled mechanism to regulate the forgetting boundary, leading to unnecessary utility degradation and heightened privacy and robustness risks. In this work, we propose EGUP (Entanglement-Guided Unlearning with Proxy Constraint), a novel framework that leverages entanglement and proxy constraint to guide the unlearning process while mitigating over-unlearning. Within each iteration, EGUP employs inter-sample entanglement to adaptively reweight the unlearning strength, assigning greater unlearning efforts to forget samples that are semantically closer to retained knowledge. Across iterations, EGUP leverages intra-sample entanglement to track the representation shift of each forget sample and dynamically adjust its unlearning effort. In addition, we incorporate a proxy constraint that approximates the model's expected outputs after unlearning, forming a reference boundary that softly regularizes the unlearning process. EGUP is compatible with existing gradient-based objectives and serves as a plug-and-play enhancement. We evaluate EGUP on the TOFU and MUSE benchmarks, demonstrating consistent improvements in the unlearning-utility trade-off across multiple LLMs. Moreover, EGUP achieves performance close to the retrained model while remaining scalable and robust.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2508.20443/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/2508.20443/full.md

## References

45 references — full list in the complete paper: https://tomesphere.com/paper/2508.20443/full.md

---
Source: https://tomesphere.com/paper/2508.20443