# NMTPY: A Flexible Toolkit for Advanced Neural Machine Translation   Systems

**Authors:** Ozan Caglayan, Mercedes Garc\'ia-Mart\'inez, Adrien Bardet, Walid, Aransa, Fethi Bougares, Lo\"ic Barrault

arXiv: 1706.00457 · 2018-11-20

## TL;DR

nmtpy is a versatile Python toolkit designed for neural machine translation and sequence-to-sequence models, simplifying architecture development and enabling top-ranked results in major translation tasks.

## Contribution

It introduces a flexible, decoupled framework for neural translation models, reducing boilerplate and facilitating rapid development and experimentation.

## Key findings

- Used in top-ranked WMT submissions in 2016 and 2017.
- Simplifies architecture addition and experimentation.
- Proven effectiveness in competitive translation tasks.

## Abstract

In this paper, we present nmtpy, a flexible Python toolkit based on Theano for training Neural Machine Translation and other neural sequence-to-sequence architectures. nmtpy decouples the specification of a network from the training and inference utilities to simplify the addition of a new architecture and reduce the amount of boilerplate code to be written. nmtpy has been used for LIUM's top-ranked submissions to WMT Multimodal Machine Translation and News Translation tasks in 2016 and 2017.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1706.00457/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1706.00457/full.md

## References

32 references — full list in the complete paper: https://tomesphere.com/paper/1706.00457/full.md

---
Source: https://tomesphere.com/paper/1706.00457