Multi-Objective Large Language Model Unlearning

Zibin Pan; Shuwen Zhang; Yuesheng Zheng; Chi Li; Yuheng Cheng; Junhua; Zhao

arXiv:2412.20412·cs.CL·January 7, 2025

Multi-Objective Large Language Model Unlearning

Zibin Pan, Shuwen Zhang, Yuesheng Zheng, Chi Li, Yuheng Cheng, Junhua, Zhao

PDF

Open Access 1 Repo

TL;DR

This paper introduces MOLLM, a multi-objective optimization approach for large language model unlearning that effectively removes specific data influence while maintaining overall model utility, addressing gradient explosion and forgetting issues.

Contribution

We propose MOLLM, a novel multi-objective unlearning algorithm for LLMs that improves unlearning effectiveness and utility preservation over existing gradient ascent methods.

Findings

01

MOLLM outperforms state-of-the-art GA-based methods in unlearning effectiveness.

02

MOLLM better preserves model utility after unlearning.

03

The approach effectively mitigates gradient explosion and catastrophic forgetting.

Abstract

Machine unlearning in the domain of large language models (LLMs) has attracted great attention recently, which aims to effectively eliminate undesirable behaviors from LLMs without full retraining from scratch. In this paper, we explore the Gradient Ascent (GA) approach in LLM unlearning, which is a proactive way to decrease the prediction probability of the model on the target data in order to remove their influence. We analyze two challenges that render the process impractical: gradient explosion and catastrophic forgetting. To address these issues, we propose Multi-Objective Large Language Model Unlearning (MOLLM) algorithm. We first formulate LLM unlearning as a multi-objective optimization problem, in which the cross-entropy loss is modified to the unlearning version to overcome the gradient explosion issue. A common descent update direction is then calculated, which enables the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zibinpan/mollm
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEducational Technology and Assessment

MethodsSoftmax · Attention Is All You Need