Online Forgetting Process for Linear Regression Models

Yuantong Li; Chi-hua Wang; Guang Cheng

arXiv:2012.01668·stat.ML·May 30, 2024

Open Access

TL;DR

This paper addresses the challenge of online data deletion in linear regression models, proposing algorithms that maintain statistical efficiency under memory constraints and data removal, with theoretical guarantees and empirical validation.

Contribution

The paper introduces the FIFD-OLS and FIFD-Adaptive Ridge algorithms for online forgetting, provides cumulative regret upper bounds for both, and shows that FIFD-Adaptive Ridge improves on ridge regression with a fixed regularization level.

Findings

01

FIFD-OLS exhibits catastrophic rank swinging due to data deletion.

02

FIFD-Adaptive Ridge effectively offsets deletion uncertainty.

03

FIFD-Adaptive Ridge outperforms ridge regression with a fixed regularization level in experiments.

Abstract

Motivated by the EU's "Right To Be Forgotten" regulation, we initiate a study of statistical data deletion problems where users' data are accessible only for a limited period of time. This setting is formulated as an online supervised learning task with a constant memory limit. We propose a deletion-aware algorithm, FIFD-OLS, for the low-dimensional case, and observe a catastrophic rank swinging phenomenon caused by the data deletion operation, which leads to statistical inefficiency. As a remedy, we propose the FIFD-Adaptive Ridge algorithm with a novel online regularization scheme that effectively offsets the uncertainty from deletion. In theory, we provide a cumulative regret upper bound for both online forgetting algorithms. In the experiments, we show that FIFD-Adaptive Ridge outperforms the ridge regression algorithm with a fixed regularization level, and…
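To make the constant-memory setting concrete, the following is a minimal sketch, not the authors' implementation. It assumes FIFD stands for "first in, first delete", so the oldest observation is dropped once the memory limit is reached. The names `fifd_fit`, `window`, and `lam` are hypothetical. Because the adaptive regularization rule of FIFD-Adaptive Ridge is not specified here, the sketch only covers plain least squares (`lam=0`) and the fixed-regularization ridge baseline (`lam>0`). It also reports the rank of the Gram matrix, which is the quantity that can change as points enter and leave the window.

```python
from collections import deque

import numpy as np


def fifd_fit(stream, window, lam=0.0):
    """Yield (estimate, gram_rank) after each arrival, keeping at most `window` points.

    Hypothetical helper: lam=0 gives least squares on the current window,
    lam>0 gives ridge regression with a FIXED regularization level.
    """
    # A bounded deque drops the oldest point when a new one arrives at capacity,
    # which models deletion of data whose access period has expired.
    buf = deque(maxlen=window)
    for x, y in stream:
        buf.append((np.asarray(x, dtype=float), float(y)))
        X = np.stack([p[0] for p in buf])
        Y = np.array([p[1] for p in buf])
        gram = X.T @ X
        d = gram.shape[0]
        if lam > 0:
            theta = np.linalg.solve(gram + lam * np.eye(d), X.T @ Y)
        else:
            # lstsq returns the minimum-norm solution when gram is rank deficient.
            theta = np.linalg.lstsq(X, Y, rcond=None)[0]
        yield theta, int(np.linalg.matrix_rank(gram))


# Synthetic, noise-free data (an assumption made only for illustration).
rng = np.random.default_rng(0)
theta_true = np.array([1.0, -2.0])
xs = rng.normal(size=(20, 2))
ys = xs @ theta_true

ols = list(fifd_fit(zip(xs, ys), window=5))
ridge = list(fifd_fit(zip(xs, ys), window=5, lam=1.0))
```

Each element of `ols` and `ridge` holds the estimate computed from the data still in memory at that step, so the sequence can be compared against `theta_true` to see how deletion and regularization affect the fit.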

Taxonomy

Topics: Advanced Bandit Algorithms Research · Machine Learning and Algorithms · Stochastic Gradient Optimization Techniques