Extract, Denoise and Enforce: Evaluating and Improving Concept   Preservation for Text-to-Text Generation

Yuning Mao; Wenchang Ma; Deren Lei; Jiawei Han; Xiang Ren

arXiv:2104.08724·cs.CL·September 6, 2021

Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation

Yuning Mao, Wenchang Ma, Deren Lei, Jiawei Han, Xiang Ren

PDF

Open Access 2 Repos

TL;DR

This paper investigates whether current seq2seq models effectively preserve important input concepts in text-to-text generation and proposes a framework to improve concept preservation by explicitly guiding the generation process.

Contribution

It introduces a simple framework to automatically extract, denoise, and enforce key input concepts as lexical constraints, enhancing concept preservation in generation tasks.

Findings

01

The proposed method improves concept coverage and human ratings.

02

It performs comparably or better than unconstrained models on automatic metrics.

03

Explicit concept guidance benefits text-to-text generation quality.

Abstract

Prior studies on text-to-text generation typically assume that the model could figure out what to attend to in the input and what to include in the output via seq2seq learning, with only the parallel training data and no additional guidance. However, it remains unclear whether current models can preserve important concepts in the source input, as seq2seq learning does not have explicit focus on the concepts and commonly used evaluation metrics also treat concepts equally important as other tokens. In this paper, we present a systematic analysis that studies whether current seq2seq models, especially pre-trained language models, are good enough for preserving important input concepts and to what extent explicitly guiding generation with the concepts as lexical constraints is beneficial. We answer the above questions by conducting extensive analytical experiments on four representative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques

MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Sequence to Sequence