Fast and Accurate Model Scaling

Piotr Doll\'ar; Mannat Singh; Ross Girshick

arXiv:2103.06877·cs.CV·March 12, 2021

Fast and Accurate Model Scaling

Piotr Doll\'ar, Mannat Singh, Ross Girshick

PDF

5 Repos 10 Models

TL;DR

This paper analyzes different strategies for scaling convolutional neural networks, revealing that a simple compound approach focusing on width scaling offers better efficiency and similar accuracy compared to traditional methods.

Contribution

The paper introduces a fast compound scaling method that primarily scales width, resulting in more efficient models with lower activation growth and comparable accuracy.

Findings

01

Scaling strategies impact model parameters and runtime differently.

02

Many scaling methods achieve similar accuracy with different resource costs.

03

The proposed method achieves near square-root growth in activations, improving efficiency.

Abstract

In this work we analyze strategies for convolutional neural network scaling; that is, the process of scaling a base convolutional network to endow it with greater computational complexity and consequently representational power. Example scaling strategies may include increasing model width, depth, resolution, etc. While various scaling strategies exist, their tradeoffs are not fully understood. Existing analysis typically focuses on the interplay of accuracy and flops (floating point operations). Yet, as we demonstrate, various scaling strategies affect model parameters, activations, and consequently actual runtime quite differently. In our experiments we show the surprising result that numerous scaling strategies yield networks with similar accuracy but with widely varying properties. This leads us to propose a simple fast compound scaling strategy that encourages primarily scaling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.