Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation
Yuning Mao, Wenchang Ma, Deren Lei, Jiawei Han, Xiang Ren

TL;DR
This paper investigates whether current seq2seq models effectively preserve important input concepts in text-to-text generation and proposes a framework to improve concept preservation by explicitly guiding the generation process.
Contribution
It introduces a simple framework to automatically extract, denoise, and enforce key input concepts as lexical constraints, enhancing concept preservation in generation tasks.
Findings
The proposed method improves concept coverage and human ratings.
It performs comparably or better than unconstrained models on automatic metrics.
Explicit concept guidance benefits text-to-text generation quality.
Abstract
Prior studies on text-to-text generation typically assume that the model could figure out what to attend to in the input and what to include in the output via seq2seq learning, with only the parallel training data and no additional guidance. However, it remains unclear whether current models can preserve important concepts in the source input, as seq2seq learning does not have explicit focus on the concepts and commonly used evaluation metrics also treat concepts equally important as other tokens. In this paper, we present a systematic analysis that studies whether current seq2seq models, especially pre-trained language models, are good enough for preserving important input concepts and to what extent explicitly guiding generation with the concepts as lexical constraints is beneficial. We answer the above questions by conducting extensive analytical experiments on four representative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Sequence to Sequence
