Efficient Unsupervised Sentence Compression by Fine-tuning Transformers   with Reinforcement Learning

Demian Gholipour Ghalandari; Chris Hokamp; Georgiana Ifrim

arXiv:2205.08221·cs.CL·May 18, 2022

Efficient Unsupervised Sentence Compression by Fine-tuning Transformers with Reinforcement Learning

Demian Gholipour Ghalandari, Chris Hokamp, Georgiana Ifrim

PDF

Open Access 1 Repo

TL;DR

This paper introduces a reinforcement learning-based method for unsupervised sentence compression that fine-tunes transformers, achieving better efficiency and performance without requiring ground-truth data.

Contribution

It proposes a novel reinforcement learning approach for unsupervised sentence compression by fine-tuning transformers as binary sequence labelers, improving efficiency and effectiveness.

Findings

01

Outperforms existing unsupervised models in quality

02

More efficient inference compared to guided search methods

03

Effective binary sequence labeling with transformer models

Abstract

Sentence compression reduces the length of text by removing non-essential content while preserving important facts and grammaticality. Unsupervised objective driven methods for sentence compression can be used to create customized models without the need for ground-truth training data, while allowing flexibility in the objective function(s) that are used for learning and inference. Recent unsupervised sentence compression approaches use custom objectives to guide discrete search; however, guided search is expensive at inference time. In this work, we explore the use of reinforcement learning to train effective sentence compression models that are also fast when generating predictions. In particular, we cast the task as binary sequence labelling and fine-tune a pre-trained transformer using a simple policy gradient approach. Our approach outperforms other unsupervised models while also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

complementizer/rl-sentence-compression
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications