One ticket to win them all: generalizing lottery ticket initializations   across datasets and optimizers

Ari S. Morcos; Haonan Yu; Michela Paganini; and Yuandong Tian

arXiv:1906.02773·stat.ML·October 29, 2019·107 cites

One ticket to win them all: generalizing lottery ticket initializations across datasets and optimizers

Ari S. Morcos, Haonan Yu, Michela Paganini, and Yuandong Tian

PDF

Open Access 2 Repos

TL;DR

This paper investigates whether lottery ticket initializations can be transferred across datasets and optimizers, finding that larger datasets produce more generalizable tickets that perform well across various configurations.

Contribution

The study demonstrates that winning ticket initializations can generalize across datasets and optimizers, especially when generated from larger datasets, indicating broader applicability.

Findings

01

Winning tickets transfer well across datasets in the natural images domain.

02

Larger datasets produce more transferable winning tickets.

03

Winning tickets also generalize across different optimizers.

Abstract

The success of lottery ticket initializations (Frankle and Carbin, 2019) suggests that small, sparsified networks can be trained so long as the network is initialized appropriately. Unfortunately, finding these "winning ticket" initializations is computationally expensive. One potential solution is to reuse the same winning tickets across a variety of datasets and optimizers. However, the generality of winning ticket initializations remains unclear. Here, we attempt to answer this question by generating winning tickets for one training configuration (optimizer and dataset) and evaluating their performance on another configuration. Perhaps surprisingly, we found that, within the natural images domain, winning ticket initializations generalized across a variety of datasets, including Fashion MNIST, SVHN, CIFAR-10/100, ImageNet, and Places365, often achieving performance close to that of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Artificial Intelligence in Games · Human Pose and Action Recognition