MILO: Model-Agnostic Subset Selection Framework for Efficient Model   Training and Tuning

Krishnateja Killamsetty; Alexandre V. Evfimievski; Tejaswini Pedapati,; Kiran Kate; Lucian Popa; Rishabh Iyer

arXiv:2301.13287·cs.LG·June 21, 2023

MILO: Model-Agnostic Subset Selection Framework for Efficient Model Training and Tuning

Krishnateja Killamsetty, Alexandre V. Evfimievski, Tejaswini Pedapati,, Kiran Kate, Lucian Popa, Rishabh Iyer

PDF

Open Access

TL;DR

MILO is a model-agnostic subset selection framework that accelerates deep network training and hyperparameter tuning by pre-selecting representative data subsets, achieving significant speedups without sacrificing performance.

Contribution

MILO introduces a pre-processing subset selection method that is model-agnostic and enables faster training and tuning by decoupling subset selection from model training.

Findings

01

MILO trains models 3x to 10x faster.

02

MILO tunes hyperparameters 20x to 75x faster.

03

MILO maintains comparable model performance.

Abstract

Training deep networks and tuning hyperparameters on large datasets is computationally intensive. One of the primary research directions for efficient training is to reduce training costs by selecting well-generalizable subsets of training data. Compared to simple adaptive random subset selection baselines, existing intelligent subset selection approaches are not competitive due to the time-consuming subset selection step, which involves computing model-dependent gradients and feature embeddings and applies greedy maximization of submodular objectives. Our key insight is that removing the reliance on downstream model parameters enables subset selection as a pre-processing step and enables one to train multiple models at no additional cost. In this work, we propose MILO, a model-agnostic subset selection framework that decouples the subset selection from model training while enabling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning