Summarize, Outline, and Elaborate: Long-Text Generation via Hierarchical Supervision from Extractive Summaries
Xiaofei Sun, Zijun Sun, Yuxian Meng, Jiwei Li, Chun Fan

TL;DR
This paper introduces SOE, a hierarchical long-text generation system that uses summarization and outlining to improve coherence, capture discourse dependencies, and enable more efficient generation of long texts.
Contribution
The paper proposes a novel hierarchical framework for long-text generation that leverages unsupervised segment summaries to guide coherent and high-level content planning.
Findings
SOE produces higher quality long texts.
The system converges faster than baseline models.
It effectively captures high-level discourse dependencies.
Abstract
The difficulty of generating coherent long texts lies in the fact that existing models overwhelmingly focus on predicting local words, and cannot make high level plans on what to generate or capture the high-level discourse dependencies between chunks of texts. Inspired by human writing processes, where a list of bullet points or a catalog is first outlined, and then each bullet point is expanded to form the whole article, we propose {\it SOE}, a pipelined system that involves of summarizing, outlining and elaborating for long text generation: the model first outlines the summaries for different segments of long texts, and then elaborates on each bullet point to generate the corresponding segment. To avoid the labor-intensive process of summary soliciting, we propose the {\it reconstruction} strategy, which extracts segment summaries in an unsupervised manner by selecting its most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
