SEQ^3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for   Unsupervised Abstractive Sentence Compression

Christos Baziotis; Ion Androutsopoulos; Ioannis Konstas; Alexandros; Potamianos

arXiv:1904.03651·cs.CL·June 11, 2019·22 cites

SEQ^3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for Unsupervised Abstractive Sentence Compression

Christos Baziotis, Ion Androutsopoulos, Ioannis Konstas, Alexandros, Potamianos

PDF

Open Access 1 Repo

TL;DR

The paper introduces SEQ^3, an unsupervised sequence-to-sequence autoencoder that compresses sentences by learning discrete latent representations, enabling effective sentence compression without parallel data.

Contribution

It proposes a novel differentiable autoencoder with discrete latent variables for unsupervised sentence compression, combining length constraints and pretrained language models.

Findings

01

Achieves promising results on benchmark datasets

02

Operates without parallel text-summary pairs

03

Uses continuous relaxations for gradient-based training

Abstract

Neural sequence-to-sequence models are currently the dominant approach in several natural language processing tasks, but require large parallel corpora. We present a sequence-to-sequence-to-sequence autoencoder (SEQ^3), consisting of two chained encoder-decoder pairs, with words used as a sequence of discrete latent variables. We apply the proposed model to unsupervised abstractive sentence compression, where the first and last sequences are the input and reconstructed sentences, respectively, while the middle sequence is the compressed sentence. Constraining the length of the latent word sequences forces the model to distill important information from the input. A pretrained language model, acting as a prior over the latent sequences, encourages the compressed sentences to be human-readable. Continuous relaxations enable us to sample from categorical distributions, allowing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cbaziotis/seq3
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis

MethodsSolana Customer Service Number +1-833-534-1729