Efficient parametrization of multi-domain deep neural networks

Sylvestre-Alvise Rebuffi; Hakan Bilen; Andrea Vedaldi

arXiv:1803.10082 · cs.CV · March 28, 2018

TL;DR

This paper introduces universal parametric families of neural networks that adapt to multiple domains with minimal parameter changes, outperforming traditional fine-tuning in transfer learning.

Contribution

The paper proposes universal parametrizations for multi-domain neural networks: each domain-specific model belongs to one shared family and differs from the others by only a small number of parameters, which improves transfer learning performance.

Findings

01. Universal parametrizations outperform traditional fine-tuning.

02. Small parameter changes suffice for effective adaptation.

03. Certain designs yield higher compression and performance.

Abstract

A practical limitation of deep neural networks is their high degree of specialization to a single task and visual domain. Recently, inspired by the successes of transfer learning, several authors have proposed to learn instead universal, fixed feature extractors that, used as the first stage of any deep network, work well for several tasks and domains simultaneously. Nevertheless, such universal features are still somewhat inferior to specialized networks. To overcome this limitation, in this paper we propose to consider instead universal parametric families of neural networks, which still contain specialized problem-specific models, but differing only by a small number of parameters. We study different designs for such parametrizations, including series and parallel residual adapters, joint adapter compression, and parameter allocations, and empirically identify the ones that yield…
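The abstract names series and parallel residual adapters without defining them. The following is a minimal NumPy sketch of one common reading, under stated assumptions: the shared layer is a 3x3 convolution with filter bank `f`, the domain-specific adapter is a 1x1 convolution with weights `alpha`, and normalization layers and nonlinearities are omitted. The function names (`conv3x3`, `conv1x1`, `series_adapter`, `parallel_adapter`) are hypothetical and are not taken from the authors' code.

```python
import numpy as np

def conv1x1(x, w):
    """1x1 convolution: x is (H, W, C_in), w is (C_in, C_out)."""
    return x @ w

def conv3x3(x, f):
    """Zero-padded 3x3 convolution with stride 1.
    x is (H, W, C_in), f is (3, 3, C_in, C_out)."""
    h, w, _ = x.shape
    xp = np.pad(x, ((1, 1), (1, 1), (0, 0)))
    out = np.zeros((h, w, f.shape[-1]))
    for i in range(3):
        for j in range(3):
            # Each filter tap is a channel-mixing matrix applied to a shifted view.
            out += xp[i:i + h, j:j + w, :] @ f[i, j]
    return out

def series_adapter(x, f, alpha):
    """Assumed series form: the adapter acts on the shared layer's output."""
    y = conv3x3(x, f)
    return y + conv1x1(y, alpha)

def parallel_adapter(x, f, alpha):
    """Assumed parallel form: the adapter acts on the same input as the shared layer."""
    return conv3x3(x, f) + conv1x1(x, alpha)

rng = np.random.default_rng(0)
channels = 4                                        # illustrative size only
x = rng.normal(size=(5, 5, channels))               # one feature map
f = rng.normal(size=(3, 3, channels, channels))     # shared across domains
alpha = rng.normal(size=(channels, channels))       # learned per domain

y_series = series_adapter(x, f, alpha)
y_parallel = parallel_adapter(x, f, alpha)

# Per-domain cost relative to duplicating the 3x3 filter bank.
adapter_fraction = alpha.size / f.size
```

In this sketch, setting `alpha` to zero recovers the shared layer exactly, so a new domain starts from the shared network and only `alpha` needs to be stored for it. With equal input and output channel counts, `alpha` holds one ninth as many values as `f`, which illustrates how domain-specific models can differ "only by a small number of parameters".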
