Efficient Unsupervised Sentence Compression by Fine-tuning Transformers with Reinforcement Learning
Demian Gholipour Ghalandari, Chris Hokamp, Georgiana Ifrim

TL;DR
This paper introduces a reinforcement learning-based method for unsupervised sentence compression that fine-tunes transformers, achieving better efficiency and performance without requiring ground-truth data.
Contribution
It proposes a novel reinforcement learning approach for unsupervised sentence compression by fine-tuning transformers as binary sequence labelers, improving efficiency and effectiveness.
Findings
Outperforms existing unsupervised models in quality
More efficient inference compared to guided search methods
Effective binary sequence labeling with transformer models
Abstract
Sentence compression reduces the length of text by removing non-essential content while preserving important facts and grammaticality. Unsupervised objective driven methods for sentence compression can be used to create customized models without the need for ground-truth training data, while allowing flexibility in the objective function(s) that are used for learning and inference. Recent unsupervised sentence compression approaches use custom objectives to guide discrete search; however, guided search is expensive at inference time. In this work, we explore the use of reinforcement learning to train effective sentence compression models that are also fast when generating predictions. In particular, we cast the task as binary sequence labelling and fine-tune a pre-trained transformer using a simple policy gradient approach. Our approach outperforms other unsupervised models while also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
