Mention Attention for Pronoun Translation
Gongbo Tang, Christian Hardmeier

TL;DR
This paper introduces a mention attention module in neural machine translation to improve pronoun translation by focusing on source mentions and incorporating target context, leading to better performance on pronoun translation tasks.
Contribution
The paper proposes a novel mention attention mechanism and mention classifiers to enhance pronoun translation in neural machine translation models.
Findings
Outperforms baseline Transformer in BLEU and APT scores
Improves pronoun translation accuracy without harming overall translation quality
Demonstrates effectiveness on WMT17 English-German translation task
Abstract
Most pronouns are referring expressions, computers need to resolve what do the pronouns refer to, and there are divergences on pronoun usage across languages. Thus, dealing with these divergences and translating pronouns is a challenge in machine translation. Mentions are referring candidates of pronouns and have closer relations with pronouns compared to general tokens. We assume that extracting additional mention features can help pronoun translation. Therefore, we introduce an additional mention attention module in the decoder to pay extra attention to source mentions but not non-mention tokens. Our mention attention module not only extracts features from source mentions, but also considers target-side context which benefits pronoun translation. In addition, we also introduce two mention classifiers to train models to recognize mentions, whose outputs guide the mention attention. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Dropout · Multi-Head Attention · Adam · Layer Normalization · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Softmax · Attention Is All You Need
