Learning to Select Bi-Aspect Information for Document-Scale Text Content   Manipulation

Xiaocheng Feng; Yawei Sun; Bing Qin; Heng Gong; Yibo Sun; Wei Bi,; Xiaojiang Liu; Ting Liu

arXiv:2002.10210·cs.CL·February 25, 2020·1 cites

Learning to Select Bi-Aspect Information for Document-Scale Text Content Manipulation

Xiaocheng Feng, Yawei Sun, Bing Qin, Heng Gong, Yibo Sun, Wei Bi,, Xiaojiang Liu, Ting Liu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new task of document-scale text content manipulation that preserves style while altering content, using an unsupervised neural model with interactive attention and back-translation, achieving state-of-the-art results.

Contribution

It proposes a novel unsupervised approach for document-scale content manipulation with an interactive attention mechanism and pseudo-training data generation.

Findings

01

Outperforms competitive methods in content transfer and style preservation

02

Achieves new state-of-the-art on sentence-level dataset

03

Demonstrates effectiveness of back-translation in unsupervised setting

Abstract

In this paper, we focus on a new practical task, document-scale text content manipulation, which is the opposite of text style transfer and aims to preserve text styles while altering the content. In detail, the input is a set of structured records and a reference text for describing another recordset. The output is a summary that accurately describes the partial content in the source recordset with the same writing style of the reference. The task is unsupervised due to lack of parallel data, and is challenging to select suitable records and style words from bi-aspect inputs respectively and generate a high-fidelity long document. To tackle those problems, we first build a dataset based on a basketball game report corpus as our testbed, and present an unsupervised neural model with interactive attention mechanism, which is used for learning the semantic relationship between records and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

syw1996/SCIR-TG-Data2text-Bi-Aspect
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Video Analysis and Summarization