Improving Character-level Japanese-Chinese Neural Machine Translation with Radicals as an Additional Input Feature
Jinyi Zhang, Tadahiro Matsumoto

TL;DR
This paper demonstrates that incorporating Chinese character radicals as additional input features significantly enhances character-level Japanese-Chinese neural machine translation, achieving state-of-the-art BLEU scores on the WAT2016 dataset.
Contribution
The study introduces the novel use of radicals as input features in character-level NMT, leading to notable improvements in translation quality.
Findings
Achieved a BLEU score of 40.61, surpassing previous systems.
Radical features improved BLEU by up to 1.5 points.
Enhanced translation performance on the WAT2016 Japanese-Chinese dataset.
Abstract
In recent years, Neural Machine Translation (NMT) has been proven to get impressive results. While some additional linguistic features of input words improve word-level NMT, any additional character features have not been used to improve character-level NMT so far. In this paper, we show that the radicals of Chinese characters (or kanji), as a character feature information, can be easily provide further improvements in the character-level NMT. In experiments on WAT2016 Japanese-Chinese scientific paper excerpt corpus (ASPEC-JP), we find that the proposed method improves the translation quality according to two aspects: perplexity and BLEU. The character-level NMT with the radical input feature's model got a state-of-the-art result of 40.61 BLEU points in the test set, which is an improvement of about 8.6 BLEU points over the best system on the WAT2016 Japanese-to-Chinese translation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Handwritten Text Recognition Techniques
