TL;DR
This paper introduces a novel pairwise discriminator approach to generate sentence embeddings that improve paraphrase generation and sentiment analysis, outperforming existing methods on standard datasets.
Contribution
The paper presents a new method combining a pairwise discriminator with an encoder-decoder model to produce semantically meaningful sentence embeddings.
Findings
Outperforms state-of-the-art on paraphrase generation
Achieves better sentiment analysis accuracy
Embeddings are semantically close and statistically significant
Abstract
In this paper, we propose a method for obtaining sentence-level embeddings. While the problem of securing word-level embeddings is very well studied, we propose a novel method for obtaining sentence-level embeddings. This is obtained by a simple method in the context of solving the paraphrase generation task. If we use a sequential encoder-decoder model for generating paraphrase, we would like the generated paraphrase to be semantically close to the original sentence. One way to ensure this is by adding constraints for true paraphrase embeddings to be close and unrelated paraphrase candidate sentence embeddings to be far. This is ensured by using a sequential pair-wise discriminator that shares weights with the encoder that is trained with a suitable loss function. Our loss function penalizes paraphrase sentence embedding distances from being too large. This loss is used in combination…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
