LoRIF: Low-Rank Influence Functions for Scalable Training Data Attribution

Shuangqi Li; Hieu Le; Jingyi Xu; Mathieu Salzmann

arXiv:2601.21929·cs.LG·May 15, 2026

TL;DR

LoRIF introduces a low-rank approach to scalable training data attribution, significantly reducing storage and computation costs while maintaining high attribution quality on large models.

Contribution

The paper proposes a low-rank influence function method that addresses the storage and memory bottlenecks of gradient-based training data attribution for large models.

Findings

01. LoRIF reduces storage by up to 20× compared to previous methods.

02. It achieves up to a 20× speedup in query time.

03. LoRIF maintains or improves attribution quality on models with up to 70B parameters.

Abstract

Training data attribution (TDA) identifies which training examples most influenced a model's prediction. Influence function methods are a theoretically grounded family of TDA methods that exploit gradients. To overcome the scalability challenge arising from gradient computation, the most popular strategy is random projection (e.g., TRAK, LoGRA). However, this still faces two bottlenecks when scaling to large training sets and high-quality attribution: (i) storing and loading projected per-example gradients for all $N$ training examples, where query latency is dominated by I/O; and (ii) forming the $D \times D$ inverse Hessian approximation, which costs $O(D^{2})$ memory. Both bottlenecks scale with the projection dimension $D$, yet increasing $D$ is necessary for attribution quality, creating a quality-scalability tradeoff. We introduce LoRIF…
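To make the two bottlenecks concrete, here is a minimal sketch of the random-projection baseline the abstract describes, not of LoRIF itself (whose details are cut off above). It assumes a Gaussian projection and a damped Gram matrix of projected gradients as the stand-in for the Hessian; all function names, the damping value, and the sign convention of the score are illustrative choices, not taken from the paper, TRAK, or LoGRA.

```python
import random

def random_projection(P, D, seed=0):
    """Gaussian projection matrix of shape D x P, scaled by 1/sqrt(D)."""
    rng = random.Random(seed)
    scale = D ** -0.5
    return [[rng.gauss(0.0, 1.0) * scale for _ in range(P)] for _ in range(D)]

def project(R, g):
    """Map a P-dimensional gradient to D dimensions."""
    return [sum(r * x for r, x in zip(row, g)) for row in R]

def gram_hessian(projected, damping):
    """Assumed Hessian stand-in: (1/N) * sum_i g_i g_i^T + damping * I.
    This is the D x D object that costs O(D^2) memory."""
    N, D = len(projected), len(projected[0])
    H = [[0.0] * D for _ in range(D)]
    for g in projected:
        for i in range(D):
            for j in range(D):
                H[i][j] += g[i] * g[j] / N
    for i in range(D):
        H[i][i] += damping
    return H

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b_i] for row, b_i in zip(A, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = s / M[i][i]
    return x

def influence_scores(train_grads, query_grad, D, damping=1e-3, seed=0):
    """Score each training example as g_i^T H^{-1} g_query in projected space."""
    R = random_projection(len(query_grad), D, seed)
    # Bottleneck (i): N x D projected gradients must be stored and re-read per query.
    projected = [project(R, g) for g in train_grads]
    # Bottleneck (ii): the D x D matrix below grows quadratically with D.
    H = gram_hessian(projected, damping)
    v = solve(H, project(R, query_grad))
    return [sum(a * b for a, b in zip(g, v)) for g in projected]
```

For example, `influence_scores(grads, grads[0], D=2)` on a list of three 4-dimensional gradients returns three scores, one per training example. In this toy version the stored list `projected` has N × D entries and `H` has D × D entries, which is where the storage and memory costs named in the abstract come from as D grows.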
