Parameter-Efficient Token Embedding Editing for Clinical Class-Level Unlearning
Iyad Ait Hou, Shrenik Borad, Harsh Sharma, Pooja Srinivasan, Rebecca Hwa, Aya Zirikly

TL;DR
This paper presents STEU, a parameter-efficient method for class-level unlearning in clinical language models, which selectively updates token embeddings to forget specific information while preserving overall model utility.
Contribution
STEU introduces a novel, sparse embedding editing approach that unlearns targeted classes without retraining the entire model, maintaining performance with minimal parameter updates.
Findings
Achieves near-complete forgetting of target classes.
Maintains high task performance after minimal parameter modification.
Operates efficiently by updating only 0.19% of parameters.
Abstract
Machine unlearning is increasingly important for clinical language models, where privacy regulations and institutional policies may require removing sensitive information from deployed systems without retraining from scratch. In practice, deletion requests must balance effective forgetting of targeted information with preservation of model utility and minimal parameter modification. We introduce Sparse Token Embedding Unlearning (STEU), a parameter-efficient method for behavioral class-level unlearning that updates only PMI-selected token embeddings together with a small classifier head while keeping all encoder layers frozen. Across experiments on MIMIC-IV, MIMIC-III, and eICU using BioClinicalBERT, BERT-base, and DistilBERT, STEU consistently suppresses the target class while largely preserving retained task performance. In the primary MIMIC-IV setting, STEU achieves near-complete…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Artificial Intelligence in Healthcare and Education · Adversarial Robustness in Machine Learning
