FLOPs as a Direct Optimization Objective for Learning Sparse Neural   Networks

Raphael Tang; Ashutosh Adhikari; Jimmy Lin

arXiv:1811.03060·cs.LG·November 26, 2018·21 cites

FLOPs as a Direct Optimization Objective for Learning Sparse Neural Networks

Raphael Tang, Ashutosh Adhikari, Jimmy Lin

PDF

Open Access

TL;DR

This paper introduces a method to directly optimize neural networks for a specified number of FLOPs, enabling resource-efficient models tailored to different hardware constraints during training.

Contribution

It extends existing sparsity techniques by incorporating FLOPs as a direct optimization objective, allowing targeted model compression based on FLOPs constraints.

Findings

01

Successfully trains models to meet specific FLOPs targets

02

Demonstrates resource-efficient neural networks for image classification

03

Adapts to different system constraints like GPU and mobile devices

Abstract

There exists a plethora of techniques for inducing structured sparsity in parametric models during the optimization process, with the final goal of resource-efficient inference. However, few methods target a specific number of floating-point operations (FLOPs) as part of the optimization objective, despite many reporting FLOPs as part of the results. Furthermore, a one-size-fits-all approach ignores realistic system constraints, which differ significantly between, say, a GPU and a mobile phone -- FLOPs on the former incur less latency than on the latter; thus, it is important for practitioners to be able to specify a target number of FLOPs during model compression. In this work, we extend a state-of-the-art technique to directly incorporate FLOPs as part of the optimization objective and show that, given a desired FLOPs requirement, different neural networks can be successfully trained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Machine Learning and Algorithms