LoRIF: Low-Rank Influence Functions for Scalable Training Data Attribution
Shuangqi Li, Hieu Le, Jingyi Xu, Mathieu Salzmann

TL;DR
LoRIF introduces a low-rank approach to scalable training data attribution, significantly reducing storage and computation costs while maintaining high attribution quality on large models.
Contribution
It proposes a low-rank influence function method that overcomes scalability bottlenecks in gradient-based training data attribution for large models.
Findings
LoRIF reduces storage by up to 20 times compared to previous methods.
It achieves up to 20x speedup in query time.
LoRIF maintains or improves attribution quality on models with up to 70B parameters.
Abstract
Training data attribution (TDA) identifies which training examples most influenced a model's prediction. Influence function methods are a theoretically grounded family of TDA methods and exploit gradients. To overcome the scalability challenge arising from gradient computation, the most popular strategy is random projection (e.g., TRAK, LoGRA). However, this still faces two bottlenecks when scaling to large training sets and high-quality attribution: \emph{(i)} storing and loading projected per-example gradients for all training examples, where query latency is dominated by I/O; and \emph{(ii)} forming the inverse Hessian approximation, which costs memory. Both bottlenecks scale with the projection dimension , yet increasing is necessary for attribution quality -- creating a quality--scalability tradeoff. We introduce \textbf{LoRIF}…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
