Towards Abstractive Timeline Summarisation using Preference-based Reinforcement Learning
Yuxuan Ye, Edwin Simpson

TL;DR
This paper presents a preference-based reinforcement learning approach to improve abstractive timeline summarisation, outperforming extractive methods on benchmark datasets and aligning well with human preferences.
Contribution
It introduces a novel offline reinforcement learning method that fine-tunes pretrained abstractive summarizers for timeline summarisation using preference data.
Findings
Outperforms extractive methods on two benchmark datasets
Participants prefer summaries generated by the proposed method
Requires only a small number of preferences for training
Abstract
This paper introduces a novel pipeline for summarising timelines of events reported by multiple news sources. Transformer-based models for abstractive summarisation generate coherent and concise summaries of long documents but can fail to outperform established extractive methods on specialised tasks such as timeline summarisation (TLS). While extractive summaries are more faithful to their sources, they may be less readable and contain redundant or unnecessary information. This paper proposes a preference-based reinforcement learning (PBRL) method for adapting pretrained abstractive summarisers to TLS, which can overcome the drawbacks of extractive timeline summaries. We define a compound reward function that learns from keywords of interest and pairwise preference labels, which we use to fine-tune a pretrained abstractive summariser via offline reinforcement learning. We carry out…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
MethodsALIGN
