A Versatile Influence Function for Data Attribution with Non-Decomposable Loss
Junwei Deng, Weijing Tang, Jiaqi W. Ma

TL;DR
This paper introduces the Versatile Influence Function (VIF), a new method that extends influence functions to non-decomposable losses in machine learning, enabling efficient data attribution for complex models beyond traditional decomposable loss functions.
Contribution
The paper proposes VIF, a general influence function approach that applies to any non-decomposable loss, leveraging auto-differentiation to avoid case-specific derivations, and demonstrates its effectiveness across diverse applications.
Findings
VIF closely matches brute-force leave-one-out results
VIF is up to 1000 times faster than brute-force methods
VIF works effectively on Cox regression, node embedding, and ranking tasks
Abstract
Influence function, a technique rooted in robust statistics, has been adapted in modern machine learning for a novel application: data attribution -- quantifying how individual training data points affect a model's predictions. However, the common derivation of influence functions in the data attribution literature is limited to loss functions that can be decomposed into a sum of individual data point losses, with the most prominent examples known as M-estimators. This restricts the application of influence functions to more complex learning objectives, which we refer to as non-decomposable losses, such as contrastive or ranking losses, where a unit loss term depends on multiple data points and cannot be decomposed further. In this work, we bridge this gap by revisiting the general formulation of influence function from robust statistics, which extends beyond M-estimators. Based on this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Complexity and Algorithms in Graphs · Cryptography and Data Security
