YANMTT: Yet Another Neural Machine Translation Toolkit

Raj Dabre; Eiichiro Sumita

arXiv:2108.11126·cs.CL·August 26, 2021·5 cites

YANMTT: Yet Another Neural Machine Translation Toolkit

Raj Dabre, Eiichiro Sumita

PDF

Open Access

TL;DR

YANMTT is an open-source, user-friendly toolkit built on Transformers that simplifies pre-training, transfer learning, and fine-tuning of neural machine translation models, with additional features like multi-source translation and model compression.

Contribution

It introduces a lightweight, flexible toolkit for NMT that facilitates pre-training, transfer learning, and advanced functionalities with minimal code, addressing limitations of existing toolkits.

Findings

01

Enables easy pre-training of large-scale NMT models

02

Allows selective transfer of pre-trained parameters

03

Supports advanced features like multi-source NMT and model distillation

Abstract

In this paper we present our open-source neural machine translation (NMT) toolkit called "Yet Another Neural Machine Translation Toolkit" abbreviated as YANMTT which is built on top of the Transformers library. Despite the growing importance of sequence to sequence pre-training there surprisingly few, if not none, well established toolkits that allow users to easily do pre-training. Toolkits such as Fairseq which do allow pre-training, have very large codebases and thus they are not beginner friendly. With regards to transfer learning via fine-tuning most toolkits do not explicitly allow the user to have control over what parts of the pre-trained models can be transferred. YANMTT aims to address these issues via the minimum amount of code to pre-train large scale NMT models, selectively transfer pre-trained parameters and fine-tune them, perform translation as well as extract…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications