Modeling Target-side Inflection in Placeholder Translation
Ryokan Ri, Toshiaki Nakazawa, Yoshimasa Tsuruoka

TL;DR
This paper introduces a novel placeholder translation method that inflects user-specified terms according to context, improving grammatical correctness in output sentences, especially in Japanese-to-English scientific translation.
Contribution
It extends sequence-to-sequence models with a character-level decoder to inflect specified terms based on output context, addressing a key limitation of previous placeholder translation systems.
Findings
Model successfully inflects specified terms in Japanese-English translation
Outperforms comparable models in incorporating correct term forms
Improves grammatical correctness of translated sentences
Abstract
Placeholder translation systems enable the users to specify how a specific phrase is translated in the output sentence. The system is trained to output special placeholder tokens, and the user-specified term is injected into the output through the context-free replacement of the placeholder token. However, this approach could result in ungrammatical sentences because it is often the case that the specified term needs to be inflected according to the context of the output, which is unknown before the translation. To address this problem, we propose a novel method of placeholder translation that can inflect specified terms according to the grammatical construction of the output sentence. We extend the sequence-to-sequence architecture with a character-level decoder that takes the lemma of a user-specified term and the words generated from the word-level decoder to output the correct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
