Attend to the beginning: A study on using bidirectional attention for extractive summarization
Ahmed Magooda, Cezary Marcjan

TL;DR
This paper introduces a bidirectional attention mechanism focusing on the beginning of documents to enhance extractive summarization, achieving state-of-the-art results on forum discussions and improving performance on generic texts.
Contribution
It proposes a novel approach that leverages attention to the start of texts, demonstrating significant improvements in summarization quality for both forum data and generic texts.
Findings
Bidirectional attention to initial comments boosts ROUGE scores.
Achieved new state-of-the-art ROUGE scores on forum discussion datasets.
Attending to introductory sentences improves summarization across different text types.
Abstract
Forum discussion data differ in both structure and properties from generic form of textual data such as news. Henceforth, summarization techniques should, in turn, make use of such differences, and craft models that can benefit from the structural nature of discussion data. In this work, we propose attending to the beginning of a document, to improve the performance of extractive summarization models when applied to forum discussion data. Evaluations demonstrated that with the help of bidirectional attention mechanism, attending to the beginning of a document (initial comment/post) in a discussion thread, can introduce a consistent boost in ROUGE scores, as well as introducing a new State Of The Art (SOTA) ROUGE scores on the forum discussions dataset. Additionally, we explored whether this hypothesis is extendable to other generic forms of textual data. We make use of the tendency of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
