Not Every Subject Should Stay: Machine Unlearning for Noisy Engagement Recognition

Alexander Vedernikov

arXiv:2605.04713·cs.CV·May 7, 2026

Not Every Subject Should Stay: Machine Unlearning for Noisy Engagement Recognition

Alexander Vedernikov

PDF

TL;DR

This paper introduces a method for post-hoc subject-level machine unlearning in engagement recognition models, effectively removing problematic subjects with significantly less retraining cost.

Contribution

It proposes a lightweight approximate unlearning approach for removing entire subjects from trained models, improving efficiency over full retraining.

Findings

01

Unlearning recovers 89.3% and 92.5% of oracle gains on two datasets.

02

Effectiveness peaks at intermediate forget-set sizes.

03

Unlearning reduces retraining costs to roughly one quarter.

Abstract

Engagement recognition datasets are typically subject-indexed and often contain noisy, subjective supervision, making post-hoc dataset revision a practical problem. Existing noisy-label and data-cleaning methods largely operate at the sample level before or during training, but do not directly address a different question: once a model has already been trained, can the influence of an entire problematic subject be removed without full retraining? We study this setting through subject-level machine unlearning as a post-hoc sanitization mechanism for engagement recognition. Starting from a baseline trained on all subjects, we rank candidate harmful subjects using a model-dependent proxy, apply a lightweight approximate unlearning update, and compare the result against an oracle model retrained from scratch on the retained subjects only. We instantiate this protocol on DAiSEE and EngageNet…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.