Neutron: An Implementation of the Transformer Translation Model and its Variants
Hongfei Xu, Qiuhui Liu

TL;DR
This paper presents Neutron, a highly optimized and modular implementation of the Transformer translation model and its variants, facilitating research and industrial applications with improved performance and readability.
Contribution
It introduces Neutron, an implementation that is easy to modify, optimized, and includes recent Transformer variants, enhancing usability and research flexibility.
Findings
Comparable performance to existing Transformer models
Easy to modify and extend with recent variants
Optimized for better parallelization and readability
Abstract
The Transformer translation model is easier to parallelize and provides better performance compared to recurrent seq2seq models, which makes it popular among industry and research community. We implement the Neutron in this work, including the Transformer model and its several variants from most recent researches. It is highly optimized, easy to modify and provides comparable performance with interesting features while keeping readability.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · RNA and protein synthesis mechanisms · Machine Learning in Bioinformatics
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Sigmoid Activation · Tanh Activation · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia?
