GRAFT: Gradient-Aware Fast MaxVol Technique for Dynamic Data Sampling

Ashish Jha; Anh huy Phan; Razan Dibo; Valentin Leplat

arXiv:2508.13653·cs.LG·August 25, 2025

GRAFT: Gradient-Aware Fast MaxVol Technique for Dynamic Data Sampling

Ashish Jha, Anh huy Phan, Razan Dibo, Valentin Leplat

PDF

TL;DR

GRAFT is a scalable, in-training data subset selection method that reduces computational costs and emissions in neural network training by dynamically selecting diverse, representative data samples using low-rank subspace techniques.

Contribution

GRAFT introduces a novel gradient-aware, low-rank subspace sampling method that improves training efficiency and reduces environmental impact without sacrificing accuracy.

Findings

01

GRAFT matches or exceeds baseline accuracy.

02

Reduces training time and energy consumption.

03

Lowers CO2 emissions during training.

Abstract

Training modern neural networks on large datasets is computationally and environmentally costly. We introduce GRAFT, a scalable in-training subset selection method that (i) extracts a low-rank feature representation for each batch, (ii) applies a Fast MaxVol sampler to select a small, diverse subset that spans the batch's dominant subspace, and (iii) dynamically adjusts the subset size using a gradient-approximation criterion. By operating in low-rank subspaces and training on carefully chosen examples instead of full batches, GRAFT preserves the training trajectory while reducing wall-clock time, energy consumption, and $CO_{2}$ emissions. Across multiple benchmarks, GRAFT matches or exceeds recent selection baselines in both accuracy and efficiency, providing a favorable trade-off between accuracy, efficiency, and emissions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.