Towards Making the Most of Dialogue Characteristics for Neural Chat Translation
Yunlong Liang, Chulun Zhou, Fandong Meng, Jinan Xu, Yufeng Chen,, Jinsong Su, Jie Zhou

TL;DR
This paper enhances neural chat translation by modeling dialogue characteristics such as coherence and speaker traits through auxiliary tasks, resulting in more coherent and speaker-relevant translations across multiple language pairs.
Contribution
It introduces four auxiliary tasks to incorporate dialogue features into neural chat translation models, improving translation quality by capturing dialogue-specific traits.
Findings
Improved translation coherence and speaker relevance.
Effective across English-German and English-Chinese language pairs.
Outperforms baseline models in comprehensive experiments.
Abstract
Neural Chat Translation (NCT) aims to translate conversational text between speakers of different languages. Despite the promising performance of sentence-level and context-aware neural machine translation models, there still remain limitations in current NCT models because the inherent dialogue characteristics of chat, such as dialogue coherence and speaker personality, are neglected. In this paper, we propose to promote the chat translation by introducing the modeling of dialogue characteristics into the NCT model. To this end, we design four auxiliary tasks including monolingual response generation, cross-lingual response generation, next utterance discrimination, and speaker identification. Together with the main chat translation task, we optimize the NCT model through the training objectives of all these tasks. By this means, the NCT model can be enhanced by capturing the inherent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis
