Holistic Audit Dataset Generation for LLM Unlearning via Knowledge Graph   Traversal and Redundancy Removal

Weipeng Jiang; Juan Zhai; Shiqing Ma; Ziyan Lei; Xiaofei Xie; Yige; Wang; Chao Shen

arXiv:2502.18810·cs.AI·February 27, 2025

Holistic Audit Dataset Generation for LLM Unlearning via Knowledge Graph Traversal and Redundancy Removal

Weipeng Jiang, Juan Zhai, Shiqing Ma, Ziyan Lei, Xiaofei Xie, Yige, Wang, Chao Shen

PDF

Open Access

TL;DR

This paper introduces HANKER, an automated framework that generates comprehensive audit datasets for LLM unlearning by leveraging knowledge graphs, significantly improving detection of knowledge memorization and addressing redundancy issues.

Contribution

HANKER systematically creates large, fine-grained audit datasets for LLM unlearning evaluation, overcoming limitations of existing benchmarks and revealing the impact of knowledge redundancy.

Findings

01

Generated over 69,000 and 111,000 audit cases for News and Books datasets.

02

Redundancy inflates unlearning effectiveness metrics like ROUGE and Entailment Scores.

03

Systematic deduplication is essential for accurate unlearning assessment.

Abstract

In recent years, Large Language Models (LLMs) have faced increasing demands to selectively remove sensitive information, protect privacy, and comply with copyright regulations through unlearning, by Machine Unlearning. While evaluating unlearning effectiveness is crucial, existing benchmarks are limited in scale and comprehensiveness, typically containing only a few hundred test cases. We identify two critical challenges in generating holistic audit datasets: ensuring audit adequacy and handling knowledge redundancy between forget and retain dataset. To address these challenges, we propose HANKER, an automated framework for holistic audit dataset generation leveraging knowledge graphs to achieve fine-grained coverage and eliminate redundant knowledge. Applying HANKER to the popular MUSE benchmark, we successfully generated over 69,000 and 111,000 audit cases for the News and Books…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Big Data and Digital Economy