A Versatile Influence Function for Data Attribution with   Non-Decomposable Loss

Junwei Deng; Weijing Tang; Jiaqi W. Ma

arXiv:2412.01335·cs.LG·December 3, 2024

A Versatile Influence Function for Data Attribution with Non-Decomposable Loss

Junwei Deng, Weijing Tang, Jiaqi W. Ma

PDF

Open Access 1 Video

TL;DR

This paper introduces the Versatile Influence Function (VIF), a new method that extends influence functions to non-decomposable losses in machine learning, enabling efficient data attribution for complex models beyond traditional decomposable loss functions.

Contribution

The paper proposes VIF, a general influence function approach that applies to any non-decomposable loss, leveraging auto-differentiation to avoid case-specific derivations, and demonstrates its effectiveness across diverse applications.

Findings

01

VIF closely matches brute-force leave-one-out results

02

VIF is up to 1000 times faster than brute-force methods

03

VIF works effectively on Cox regression, node embedding, and ranking tasks

Abstract

Influence function, a technique rooted in robust statistics, has been adapted in modern machine learning for a novel application: data attribution -- quantifying how individual training data points affect a model's predictions. However, the common derivation of influence functions in the data attribution literature is limited to loss functions that can be decomposed into a sum of individual data point losses, with the most prominent examples known as M-estimators. This restricts the application of influence functions to more complex learning objectives, which we refer to as non-decomposable losses, such as contrastive or ranking losses, where a unit loss term depends on multiple data points and cannot be decomposed further. In this work, we bridge this gap by revisiting the general formulation of influence function from robust statistics, which extends beyond M-estimators. Based on this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

A Versatile Influence Function for Data Attribution with Non-Decomposable Loss· slideslive

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Complexity and Algorithms in Graphs · Cryptography and Data Security