Natural Geometry of Robust Data Attribution: From Convex Models to Deep Networks

Shihao Li; Jiachen Li; and Dongmei Chen

arXiv:2512.09103·cs.LG·December 11, 2025

Natural Geometry of Robust Data Attribution: From Convex Models to Deep Networks

Shihao Li, Jiachen Li, and Dongmei Chen

PDF

Open Access

TL;DR

This paper introduces a robust framework for data attribution that accounts for geometric perturbations, providing the first non-vacuous certified bounds for neural network attribution and improving stability through a novel Wasserstein metric.

Contribution

It develops a unified framework extending from convex models to deep networks, introducing the Natural Wasserstein metric to eliminate spectral amplification and stabilize attribution estimates.

Findings

01

Natural W-TRAK certifies 68.7 ext{ extbackslash}% of ranking pairs on CIFAR-10, outperforming Euclidean baselines.

02

Spectral amplification inflates Lipschitz bounds by over 10,000 times in deep networks.

03

Self-Influence effectively detects label noise with 0.970 AUROC.

Abstract

Data attribution methods identify which training examples are responsible for a model's predictions, but their sensitivity to distributional perturbations undermines practical reliability. We present a unified framework for certified robust attribution that extends from convex models to deep networks. For convex settings, we derive Wasserstein-Robust Influence Functions (W-RIF) with provable coverage guarantees. For deep networks, we demonstrate that Euclidean certification is rendered vacuous by spectral amplification -- a mechanism where the inherent ill-conditioning of deep representations inflates Lipschitz bounds by over $10, 000 \times$ . This explains why standard TRAK scores, while accurate point estimates, are geometrically fragile: naive Euclidean robustness analysis yields 0\% certification. Our key contribution is the Natural Wasserstein metric, which measures perturbations…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis