Natural Geometry of Robust Data Attribution: From Convex Models to Deep Networks
Shihao Li, Jiachen Li, and Dongmei Chen

TL;DR
This paper introduces a robust framework for data attribution that accounts for geometric perturbations, providing the first non-vacuous certified bounds for neural network attribution and improving stability through a novel Wasserstein metric.
Contribution
It develops a unified framework extending from convex models to deep networks, introducing the Natural Wasserstein metric to eliminate spectral amplification and stabilize attribution estimates.
Findings
Natural W-TRAK certifies 68.7 ext{ extbackslash}% of ranking pairs on CIFAR-10, outperforming Euclidean baselines.
Spectral amplification inflates Lipschitz bounds by over 10,000 times in deep networks.
Self-Influence effectively detects label noise with 0.970 AUROC.
Abstract
Data attribution methods identify which training examples are responsible for a model's predictions, but their sensitivity to distributional perturbations undermines practical reliability. We present a unified framework for certified robust attribution that extends from convex models to deep networks. For convex settings, we derive Wasserstein-Robust Influence Functions (W-RIF) with provable coverage guarantees. For deep networks, we demonstrate that Euclidean certification is rendered vacuous by spectral amplification -- a mechanism where the inherent ill-conditioning of deep representations inflates Lipschitz bounds by over . This explains why standard TRAK scores, while accurate point estimates, are geometrically fragile: naive Euclidean robustness analysis yields 0\% certification. Our key contribution is the Natural Wasserstein metric, which measures perturbations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis
