Learning to Generate Synthetic Training Data using Gradient Matching and   Implicit Differentiation

Dmitry Medvedev; Alexander D'yakonov

arXiv:2203.08559·cs.LG·March 17, 2022

Learning to Generate Synthetic Training Data using Gradient Matching and Implicit Differentiation

Dmitry Medvedev, Alexander D'yakonov

PDF

1 Repo

TL;DR

This paper introduces new data distillation techniques using gradient matching and implicit differentiation, which improve training efficiency and model performance with less data in deep learning tasks.

Contribution

The paper proposes novel data distillation methods based on generative teaching networks, gradient matching, and the Implicit Function Theorem, enhancing efficiency and effectiveness.

Findings

01

Methods are computationally more efficient than previous approaches.

02

Distilled data improves model performance on MNIST.

03

Techniques enable training with less data without sacrificing accuracy.

Abstract

Using huge training datasets can be costly and inconvenient. This article explores various data distillation techniques that can reduce the amount of data required to successfully train deep networks. Inspired by recent ideas, we suggest new data distillation techniques based on generative teaching networks, gradient matching, and the Implicit Function Theorem. Experiments with the MNIST image classification problem show that the new methods are computationally more efficient than previous ones and allow to increase the performance of models trained on distilled data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dm-medvedev/efficientdistillation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.