Playing the lottery with rewards and multiple languages: lottery tickets   in RL and NLP

Haonan Yu; Sergey Edunov; Yuandong Tian; and Ari S. Morcos

arXiv:1906.02768·stat.ML·February 27, 2020·78 cites

Playing the lottery with rewards and multiple languages: lottery tickets in RL and NLP

Haonan Yu, Sergey Edunov, Yuandong Tian, and Ari S. Morcos

PDF

Open Access

TL;DR

This paper demonstrates that the lottery ticket hypothesis applies broadly across NLP and RL, showing that smaller, pruned models initialized with winning tickets can perform comparably to larger models, indicating a general phenomenon in deep learning.

Contribution

The study extends the lottery ticket hypothesis to NLP and RL, showing winning tickets exist in these domains and enable significant model size reduction without performance loss.

Findings

01

Winning tickets outperform random initializations at high pruning rates

02

Transformers can be pruned to one-third size with similar performance

03

Lottery ticket phenomenon applies beyond supervised image classification

Abstract

The lottery ticket hypothesis proposes that over-parameterization of deep neural networks (DNNs) aids training by increasing the probability of a "lucky" sub-network initialization being present rather than by helping the optimization process (Frankle & Carbin, 2019). Intriguingly, this phenomenon suggests that initialization strategies for DNNs can be improved substantially, but the lottery ticket hypothesis has only previously been tested in the context of supervised learning for natural image tasks. Here, we evaluate whether "winning ticket" initializations exist in two different domains: natural language processing (NLP) and reinforcement learning (RL).For NLP, we examined both recurrent LSTM models and large-scale Transformer models (Vaswani et al., 2017). For RL, we analyzed a number of discrete-action space tasks, including both classic control and pixel control. Consistent with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Artificial Intelligence in Games

MethodsPruning · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Sigmoid Activation · Tanh Activation · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing