Refining the state-of-the-art in Machine Translation, optimizing NMT for the JA <-> EN language pair by leveraging personal domain expertise
Matthew Bieda

TL;DR
This paper details the development and optimization of a Neural Machine Translation system for Japanese-English translation using Transformer architecture, focusing on corpus processing, hyperparameter tuning, and evaluation metrics.
Contribution
It introduces a systematic approach to optimize NMT performance for En/Ja by combining technical tuning with linguistic expertise.
Findings
Achieved improved BLEU scores for En/Ja translation
Demonstrated the impact of corpus preprocessing on translation quality
Provided subjective linguistic evaluation alongside standard metrics
Abstract
Documenting the construction of an NMT (Neural Machine Translation) system for En/Ja based on the Transformer architecture leveraging the OpenNMT framework. A systematic exploration of corpora pre-processing, hyperparameter tuning and model architecture is carried out to obtain optimal performance. The system is evaluated using standard auto-evaluation metrics such as BLEU, and my subjective opinion as a Japanese linguist.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Layer Normalization · Adam · Label Smoothing · Dropout · Absolute Position Encodings · Position-Wise Feed-Forward Layer
