Unsupervised Evaluation Metrics and Learning Criteria for Non-Parallel   Textual Transfer

Richard Yuanzhe Pang; Kevin Gimpel

arXiv:1810.11878·cs.CL·October 1, 2019

Unsupervised Evaluation Metrics and Learning Criteria for Non-Parallel Textual Transfer

Richard Yuanzhe Pang, Kevin Gimpel

PDF

TL;DR

This paper introduces new unsupervised evaluation metrics and training strategies for non-parallel textual transfer, improving the assessment of semantic preservation, fluency, and transfer accuracy.

Contribution

It proposes a comprehensive set of metrics and loss functions for better evaluation and training of non-parallel text transfer models, addressing current limitations.

Findings

01

Metrics correlate well with human judgments

02

Significant improvements over baseline methods

03

Effective in both sentiment and literature datasets

Abstract

We consider the problem of automatically generating textual paraphrases with modified attributes or properties, focusing on the setting without parallel data (Hu et al., 2017; Shen et al., 2017). This setting poses challenges for evaluation. We show that the metric of post-transfer classification accuracy is insufficient on its own, and propose additional metrics based on semantic preservation and fluency as well as a way to combine them into a single overall score. We contribute new loss functions and training strategies to address the different metrics. Semantic preservation is addressed by adding a cyclic consistency loss and a loss based on paraphrase pairs, while fluency is improved by integrating losses based on style-specific language models. We experiment with a Yelp sentiment dataset and a new literature dataset that we propose, using multiple models that extend prior work…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings