Naver Labs Europe's Systems for the Document-Level Generation and   Translation Task at WNGT 2019

Fahimeh Saleh; Alexandre B\'erard; Ioan Calapodescu; Laurent Besacier

arXiv:1910.14539·cs.CL·November 1, 2019

Naver Labs Europe's Systems for the Document-Level Generation and Translation Task at WNGT 2019

Fahimeh Saleh, Alexandre B\'erard, Ioan Calapodescu, Laurent Besacier

PDF

TL;DR

This paper presents a transfer learning approach leveraging large-scale document-level data to improve neural models for long text generation and document-level translation, achieving state-of-the-art results at WNGT 2019.

Contribution

It introduces a transfer learning method that fine-tunes document-based MT models for NLG and MT with metadata, outperforming previous methods without data selection or planning.

Findings

01

Outperforms previous state-of-the-art on Rotowire NLG task

02

Ranks first in all tracks at WNGT 2019

03

Demonstrates effectiveness of transfer learning across tasks

Abstract

Recently, neural models led to significant improvements in both machine translation (MT) and natural language generation tasks (NLG). However, generation of long descriptive summaries conditioned on structured data remains an open challenge. Likewise, MT that goes beyond sentence-level context is still an open issue (e.g., document-level MT or MT with metadata). To address these challenges, we propose to leverage data from both tasks and do transfer learning between MT, NLG, and MT with source-side metadata (MT+NLG). First, we train document-based MT systems with large amounts of parallel data. Then, we adapt these models to pure NLG and MT+NLG tasks by fine-tuning with smaller amounts of domain-specific data. This end-to-end NLG approach, without data selection and planning, outperforms the previous state of the art on the Rotowire NLG task. We participated to the "Document Generation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.