Neutron: An Implementation of the Transformer Translation Model and its Variants

Hongfei Xu; Qiuhui Liu

arXiv:1903.07402 · cs.CL · March 23, 2020 · 21 cites

TL;DR

This paper presents Neutron, a highly optimized, easy-to-modify implementation of the Transformer translation model and several of its recent variants. It offers comparable performance while keeping the code readable, and it targets both research and industrial use.

Contribution

It introduces Neutron, an optimized, easy-to-modify implementation that includes recent Transformer variants, enhancing usability and research flexibility.

Findings

01. Comparable performance to existing Transformer models

02. Easy to modify and extend with recent variants

03. Optimized for better parallelization and readability

Abstract

The Transformer translation model is easier to parallelize and performs better than recurrent seq2seq models, which has made it popular in both industry and the research community. In this work we implement Neutron, which includes the Transformer model and several of its variants from recent research. It is highly optimized and easy to modify, and it provides comparable performance with interesting features while remaining readable.

Taxonomy

Topics: Genomics and Phylogenetic Studies · RNA and protein synthesis mechanisms · Machine Learning in Bioinformatics

Methods: Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Sigmoid Activation · Tanh Activation · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing