An Effective Method using Phrase Mechanism in Neural Machine Translation
Phuong Minh Nguyen, Le Minh Nguyen

TL;DR
This paper introduces PhraseTransformer, a phrase mechanism integrated into the Transformer model, and reports BLEU scores of 35.3 (Vietnamese to Chinese) and 33.2 (Chinese to Vietnamese) on the VLSP 2022 MT dataset.
Contribution
The paper presents a novel phrase mechanism, PhraseTransformer, that improves Transformer-based NMT systems specifically for Vietnamese-Chinese translation tasks.
Findings
Achieved a BLEU score of 35.3 for Vietnamese to Chinese on the VLSP 2022 MT dataset
Achieved a BLEU score of 33.2 for Chinese to Vietnamese on the same dataset
Demonstrated the effectiveness of the phrase mechanism on top of the Transformer baseline
Abstract
Machine Translation is one of the essential tasks in Natural Language Processing (NLP); it has many real-world applications and contributes to other tasks in the NLP research community. Recently, Transformer-based methods have attracted numerous researchers in this domain and achieved state-of-the-art results on most language pairs. In this paper, we report an effective method using a phrase mechanism, PhraseTransformer, to improve the strong Transformer baseline in constructing a Neural Machine Translation (NMT) system for the Vietnamese-Chinese parallel corpus. Our experiments on the MT dataset of the VLSP 2022 competition achieved a BLEU score of 35.3 on Vietnamese to Chinese and 33.2 on Chinese to Vietnamese. Our code is available at https://github.com/phuongnm94/PhraseTransformer.
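The results above are reported in BLEU. As a rough illustration of what that metric measures, the following is a minimal sketch of unsmoothed, single-reference, sentence-level BLEU (clipped n-gram precision for n = 1 to 4, combined by geometric mean and scaled by a brevity penalty). It is not the scoring script used in the VLSP 2022 competition, which would typically compute corpus-level BLEU with its own tokenization; the function names `ngrams` and `sentence_bleu` are illustrative assumptions.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all contiguous n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(candidate, reference, max_n=4):
    """Unsmoothed single-reference BLEU on a 0-100 scale (illustrative sketch)."""
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(candidate, n)
        ref_counts = ngrams(reference, n)
        # Clip each candidate n-gram count by its count in the reference.
        overlap = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        total = sum(cand_counts.values())
        if overlap == 0 or total == 0:
            # Without smoothing, any zero precision gives a score of zero.
            return 0.0
        log_precisions.append(math.log(overlap / total))
    # Brevity penalty: penalize candidates shorter than the reference.
    if len(candidate) >= len(reference):
        bp = 1.0
    else:
        bp = math.exp(1 - len(reference) / len(candidate))
    return 100 * bp * math.exp(sum(log_precisions) / max_n)


if __name__ == "__main__":
    hyp = "the cat sat on the mat".split()
    ref = "the cat sat on a mat".split()
    print(round(sentence_bleu(hyp, ref), 2))
```

For Chinese output, the score depends heavily on how the text is segmented into tokens (characters versus words), so scores are only comparable when the same tokenization is used.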
Taxonomy
Topics: Natural Language Processing Techniques · Topic Modeling · Machine Learning in Bioinformatics
Methods: Multi-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Adam · Label Smoothing · Layer Normalization · Softmax · Dense Connections
