Efficiency through Auto-Sizing: Notre Dame NLP's Submission to the WNGT   2019 Efficiency Task

Kenton Murray; Brian DuSell; David Chiang

arXiv:1910.07134·cs.CL·October 17, 2019

Efficiency through Auto-Sizing: Notre Dame NLP's Submission to the WNGT 2019 Efficiency Task

Kenton Murray, Brian DuSell, David Chiang

PDF

TL;DR

This paper explores auto-sizing techniques applied to Transformer models to significantly reduce parameters with minimal performance loss, demonstrating over 25% parameter reduction and only a 1.1 BLEU decrease.

Contribution

The paper introduces an auto-sizing approach tailored for Transformers, achieving substantial parameter reduction while maintaining competitive performance.

Findings

01

Over 25% parameter reduction achieved

02

Only 1.1 BLEU decrease observed

03

Auto-sizing effectively reduces model size

Abstract

This paper describes the Notre Dame Natural Language Processing Group's (NDNLP) submission to the WNGT 2019 shared task (Hayashi et al., 2019). We investigated the impact of auto-sizing (Murray and Chiang, 2015; Murray et al., 2019) to the Transformer network (Vaswani et al., 2017) with the goal of substantially reducing the number of parameters in the model. Our method was able to eliminate more than 25% of the model's parameters while suffering a decrease of only 1.1 BLEU.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Softmax