HanoiT: Enhancing Context-aware Translation via Selective Context

Jian Yang; Yuwei Yin; Shuming Ma; Liqun Yang; Hongcheng Guo; Haoyang; Huang; Dongdong Zhang; Yutao Zeng; Zhoujun Li; Furu Wei

arXiv:2301.06825·cs.CL·April 20, 2023

HanoiT: Enhancing Context-aware Translation via Selective Context

Jian Yang, Yuwei Yin, Shuming Ma, Liqun Yang, Hongcheng Guo, Haoyang, Huang, Dongdong Zhang, Yutao Zeng, Zhoujun Li, Furu Wei

PDF

Open Access

TL;DR

HanoiT introduces a layer-wise selection mechanism in neural machine translation to filter relevant context, significantly improving translation quality across multiple benchmarks by reducing noise from irrelevant words.

Contribution

The paper presents a novel end-to-end encoder-decoder model with a layer-wise selection mechanism for better utilization of document context in translation.

Findings

01

Model outperforms previous methods on four benchmarks

02

Selective context filtering reduces noise and improves translation accuracy

03

Layer-wise refinement enhances context relevance understanding

Abstract

Context-aware neural machine translation aims to use the document-level context to improve translation quality. However, not all words in the context are helpful. The irrelevant or trivial words may bring some noise and distract the model from learning the relationship between the current sentence and the auxiliary context. To mitigate this problem, we propose a novel end-to-end encoder-decoder model with a layer-wise selection mechanism to sift and refine the long document context. To verify the effectiveness of our method, extensive experiments and extra quantitative analysis are conducted on four document-level machine translation benchmarks. The experimental results demonstrate that our model significantly outperforms previous models on all datasets via the soft selection mechanism.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications