How to sketch a learning algorithm

Sam Gunn

arXiv:2604.07328·cs.LG·April 21, 2026

How to sketch a learning algorithm

Sam Gunn

PDF

1 Repo

TL;DR

This paper introduces a data deletion scheme for deep learning models that predicts the impact of removing training data with high accuracy and efficiency, based on a new stability assumption.

Contribution

It presents a novel, efficient data deletion method for deep learning that relies on a stability assumption, supported by experiments with microgpt.

Findings

01

The scheme predicts model behavior after data removal with vanishing error and low failure probability.

02

Precomputation and prediction are only logarithmically slower than standard training and inference.

03

Stability assumption is compatible with powerful AI models, demonstrated on microgpt.

Abstract

How does the choice of training data influence an AI model? This broad question is of central importance to interpretability, privacy, and basic science. At its technical core is the data deletion problem: after a reasonable amount of precomputation, quickly predict how the model would behave in a given situation if a given subset of training data had been excluded from the learning algorithm. We present a data deletion scheme capable of predicting model outputs with vanishing error $ε$ and failure probability $δ$ in the deep learning setting. Our precomputation and prediction algorithms are only $\tilde{O} (lo g (1/ δ) / ε^{2})$ factors slower than regular training and inference, respectively. The storage requirements are those of $\tilde{O} (lo g (1/ δ) / ε^{2})$ models. Our proof is based on an assumption that we call stability. In contrast to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SamSpo1/microgpt-sketch
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.