Rescaled Influence Functions: Accurate Data Attribution in High Dimension
Ittai Rubinstein, Samuel B. Hopkins

TL;DR
This paper introduces rescaled influence functions (RIF), an improved method for data attribution in high-dimensional models that significantly outperforms traditional influence functions in accuracy and robustness against data poisoning.
Contribution
The paper proposes RIF as a drop-in replacement for influence functions, providing more accurate data attribution in high-dimensional settings with minimal computational cost.
Findings
RIF outperforms IF in real-world data attribution tasks.
RIF provides more accurate predictions of data influence.
RIF detects data poisoning attacks that fool IF.
Abstract
How does the training data affect a model's behavior? This is the question we seek to answer with data attribution. The leading practical approaches to data attribution are based on influence functions (IF). IFs utilize a first-order Taylor approximation to efficiently predict the effect of removing a set of samples from the training set without retraining the model, and are used in a wide variety of machine learning applications. However, especially in the high-dimensional regime (# params # samples), they are often imprecise and tend to underestimate the effect of sample removals, even for simple models such as logistic regression. We present rescaled influence functions (RIF), a new tool for data attribution which can be used as a drop-in replacement for influence functions, with little computational overhead but significant improvement in accuracy. We compare IF and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data
