GroSS: Group-Size Series Decomposition for Grouped Architecture Search

Henry Howard-Jenkins; Yiwen Li; Victor A. Prisacariu

arXiv:1912.00673 · cs.LG · July 17, 2020

TL;DR

GroSS introduces a differentiable tensor factorization method that enables simultaneous training of various grouped convolution configurations within neural networks, streamlining architecture search.

Contribution

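To the best of the authors' knowledge, GroSS is the first method to train different group sizes within a single layer, and all combinations of group sizes across layers, at the same time. This lets a whole grouped-convolution search space be trained concurrently rather than one configuration at a time.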
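To give a sense of why training configurations one at a time scales poorly, the sketch below enumerates a small grouped-convolution search space and counts weights per configuration. The layer shapes and candidate group sizes are hypothetical values chosen for illustration, not taken from the paper, and bias terms are ignored.

```python
from itertools import product

def grouped_conv_params(c_in, c_out, kernel, group_size):
    """Weight count of a grouped convolution (bias ignored).

    group_size is the number of input channels each group sees,
    so the layer has c_in // group_size groups.
    """
    if c_in % group_size != 0:
        raise ValueError("group_size must divide c_in")
    groups = c_in // group_size
    if c_out % groups != 0:
        raise ValueError("number of groups must divide c_out")
    # Every output channel connects to group_size input channels.
    return c_out * group_size * kernel * kernel

# Hypothetical 3-layer network: (c_in, c_out, kernel) per layer.
LAYERS = [(32, 32, 3), (64, 64, 3), (128, 128, 3)]
# Hypothetical candidate group sizes, shared by all layers.
GROUP_SIZES = [1, 4, 16, 32]

def total_params(config):
    """Total weights for one choice of group size per layer."""
    return sum(
        grouped_conv_params(c_in, c_out, k, g)
        for (c_in, c_out, k), g in zip(LAYERS, config)
    )

# One configuration = one group size per layer.
configs = list(product(GROUP_SIZES, repeat=len(LAYERS)))

if __name__ == "__main__":
    print("configurations:", len(configs))
    print("smallest:", total_params((1, 1, 1)))
    print("largest:", total_params((32, 32, 32)))
```

The number of configurations grows as the number of candidate group sizes raised to the number of layers, which is why training each configuration separately quickly becomes impractical.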

Findings

01 Enables training of multiple group configurations simultaneously

02 Improves efficiency of grouped convolution architecture search

03 Demonstrates effectiveness on multiple datasets and networks

Abstract

We present a novel approach which is able to explore the configuration of grouped convolutions within neural networks. Group-size Series (GroSS) decomposition is a mathematical formulation of tensor factorisation into a series of approximations of increasing rank terms. GroSS allows for dynamic and differentiable selection of factorisation rank, which is analogous to a grouped convolution. Therefore, to the best of our knowledge, GroSS is the first method to enable simultaneous training of differing numbers of groups within a single layer, as well as all possible combinations between layers. In doing so, GroSS is able to train an entire grouped convolution architecture search-space concurrently. We demonstrate this through architecture searches with performance objectives on multiple datasets and networks. GroSS enables more effective and efficient search for grouped convolutional…
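The idea of a series of terms, each tied to a group size, can be illustrated with a toy example. This is a simplified sketch and not the paper's formulation: it assumes a 1x1 convolution with equal input and output channels, nested group sizes where each divides the next, and random stand-in values. Each term is restricted to a block-diagonal pattern, so summing the first k terms gives a weight that is a valid grouped convolution at the k-th group size. All function names here are hypothetical.

```python
import random

def block_mask(channels, group_size):
    """1 where output channel o and input channel i share a group."""
    return [
        [1 if o // group_size == i // group_size else 0
         for i in range(channels)]
        for o in range(channels)
    ]

def random_term(channels, group_size, rng):
    """A series term supported only on the block-diagonal mask."""
    mask = block_mask(channels, group_size)
    return [[rng.uniform(-1.0, 1.0) * m for m in row] for row in mask]

def series_weight(terms, upto):
    """Sum the series terms with index 0..upto (inclusive)."""
    channels = len(terms[0])
    total = [[0.0] * channels for _ in range(channels)]
    for term in terms[: upto + 1]:
        for o in range(channels):
            for i in range(channels):
                total[o][i] += term[o][i]
    return total

def respects_group_size(weight, group_size):
    """True if every nonzero entry lies inside the group blocks."""
    mask = block_mask(len(weight), group_size)
    return all(
        w == 0.0 or m == 1
        for w_row, m_row in zip(weight, mask)
        for w, m in zip(w_row, m_row)
    )

rng = random.Random(0)
CHANNELS = 8
GROUP_SIZES = [1, 2, 4, 8]  # nested: each size divides the next

# One term per group size; a shared set of parameters.
terms = [random_term(CHANNELS, s, rng) for s in GROUP_SIZES]
# Truncating the series at index k yields the weight for GROUP_SIZES[k].
weights = [series_weight(terms, k) for k in range(len(GROUP_SIZES))]

if __name__ == "__main__":
    for size, w in zip(GROUP_SIZES, weights):
        print(size, respects_group_size(w, size))
```

Because every group size is read from the same set of terms, selecting a group size amounts to choosing where to truncate the series. This is the property that lets several group sizes share one set of trained parameters in this toy setting.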

Code & Models

Repositories

ActiveVisionLab/GroSS
pytorch

Taxonomy

Methods: 1x1 Convolution · Grouped Convolution · Convolution