EDU-level Extractive Summarization with Varying Summary Lengths
Yuping Wu, Ching-Hsun Tseng, Jiayu Shang, Shengzhong Mao, Goran, Nenadic, Xiao-Jun Zeng

TL;DR
This paper introduces an EDU-level extractive summarization model with varying summary lengths, demonstrating improved ROUGE scores and better summary quality compared to fixed-length sentence extraction methods.
Contribution
It proposes a novel EDU-based extractive summarization approach with a learning algorithm for variable summary lengths, justified by analysis showing higher evaluation scores than sentence-based methods.
Findings
EDU-based summaries outperform sentence-based summaries in ROUGE scores.
The proposed model maintains grammaticality and readability.
Varying summary lengths adapt better to document content diversity.
Abstract
Extractive models usually formulate text summarization as extracting fixed top- salient sentences from the document as a summary. Few works exploited extracting finer-grained Elementary Discourse Unit (EDU) with little analysis and justification for the extractive unit selection. Further, the selection strategy of the fixed top- salient sentences fits the summarization need poorly, as the number of salient sentences in different documents varies and therefore a common or best does not exist in reality. To fill these gaps, this paper first conducts the comparison analysis of oracle summaries based on EDUs and sentences, which provides evidence from both theoretical and experimental perspectives to justify and quantify that EDUs make summaries with higher automatic evaluation scores than sentences. Then, considering this merit of EDUs, this paper further proposes an EDU-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
