A Data-Centric Optimization Framework for Machine Learning

Oliver Rausch; Tal Ben-Nun; Nikoli Dryden; Andrei Ivanov; Shigang Li,; Torsten Hoefler

arXiv:2110.10802·cs.LG·August 31, 2022

A Data-Centric Optimization Framework for Machine Learning

Oliver Rausch, Tal Ben-Nun, Nikoli Dryden, Andrei Ivanov, Shigang Li,, Torsten Hoefler

PDF

Open Access 1 Repo

TL;DR

This paper introduces a flexible, data-centric optimization framework for deep learning that enhances training efficiency across diverse models by minimizing data movement, with interactive tools for customization and demonstrated speedups.

Contribution

It presents a novel, extensible pipeline for optimizing arbitrary neural networks through data movement reduction, applicable to various models and integrated with user-friendly interfaces.

Findings

01

Achieved competitive performance or speedups on ten different networks.

02

Discovered new optimization opportunities in EfficientNet.

03

Provided an interactive, extensible optimization pipeline for deep learning.

Abstract

Rapid progress in deep learning is leading to a diverse set of quickly changing models, with a dramatically growing demand for compute. However, as frameworks specialize performance optimization to patterns in popular networks, they implicitly constrain novel and diverse models that drive progress in research. We empower deep learning researchers by defining a flexible and user-customizable pipeline for optimizing training of arbitrary deep neural networks, based on data movement minimization. The pipeline begins with standard networks in PyTorch or ONNX and transforms computation through progressive lowering. We define four levels of general-purpose transformations, from local intra-operator optimizations to global data movement reduction. These operate on a data-centric graph intermediate representation that expresses computation and data movement at all levels of abstraction,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

spcl/daceml
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Stochastic Gradient Optimization Techniques · Machine Learning and Data Classification

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Depthwise Convolution · 1x1 Convolution · Batch Normalization · Squeeze-and-Excitation Block · Pointwise Convolution · Convolution · Sigmoid Activation · Depthwise Separable Convolution · RMSProp