Learning to Select Bi-Aspect Information for Document-Scale Text Content Manipulation
Xiaocheng Feng, Yawei Sun, Bing Qin, Heng Gong, Yibo Sun, Wei Bi,, Xiaojiang Liu, Ting Liu

TL;DR
This paper introduces a new task of document-scale text content manipulation that preserves style while altering content, using an unsupervised neural model with interactive attention and back-translation, achieving state-of-the-art results.
Contribution
It proposes a novel unsupervised approach for document-scale content manipulation with an interactive attention mechanism and pseudo-training data generation.
Findings
Outperforms competitive methods in content transfer and style preservation
Achieves new state-of-the-art on sentence-level dataset
Demonstrates effectiveness of back-translation in unsupervised setting
Abstract
In this paper, we focus on a new practical task, document-scale text content manipulation, which is the opposite of text style transfer and aims to preserve text styles while altering the content. In detail, the input is a set of structured records and a reference text for describing another recordset. The output is a summary that accurately describes the partial content in the source recordset with the same writing style of the reference. The task is unsupervised due to lack of parallel data, and is challenging to select suitable records and style words from bi-aspect inputs respectively and generate a high-fidelity long document. To tackle those problems, we first build a dataset based on a basketball game report corpus as our testbed, and present an unsupervised neural model with interactive attention mechanism, which is used for learning the semantic relationship between records and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Video Analysis and Summarization
