When Forgetting Builds Reliability: LLM Unlearning for Reliable Hardware Code Generation

Yiwen Liang; Qiufeng Li; Shikai Wang; Weidong Cao

arXiv:2512.05341·cs.LG·December 8, 2025

When Forgetting Builds Reliability: LLM Unlearning for Reliable Hardware Code Generation

Yiwen Liang, Qiufeng Li, Shikai Wang, Weidong Cao

PDF

Open Access

TL;DR

This paper introduces a novel unlearning framework for large language models in hardware code generation, enabling effective removal of problematic knowledge while maintaining code quality and reliability.

Contribution

The paper presents a syntax-preserving, floor-aware unlearning method tailored for LLMs in hardware design, improving reliability and safety without sacrificing performance.

Findings

01

Supports forget sets up to 3x larger

02

Requires only a single training epoch for unlearning

03

Preserves syntactic correctness and functional integrity

Abstract

Large Language Models (LLMs) have shown strong potential in accelerating digital hardware design through automated code generation. Yet, ensuring their reliability remains a critical challenge, as existing LLMs trained on massive heterogeneous datasets often exhibit problematic memorization of proprietary intellectual property (IP), contaminated benchmarks, and unsafe coding patterns. To mitigate these risks, we propose a novel unlearning framework tailored for LLM-based hardware code generation. Our method combines (i) a syntax-preserving unlearning strategy that safeguards the structural integrity of hardware code during forgetting, and (ii) a fine-grained floor-aware selective loss that enables precise and efficient removal of problematic knowledge. This integration achieves effective unlearning without degrading LLM code generation capabilities. Extensive experiments show that our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmbedded Systems Design Techniques · Natural Language Processing Techniques · Machine Learning in Materials Science