Evaluating prose style transfer with the Bible

Keith Carlson; Allen Riddell; Daniel Rockmore

arXiv:1711.04731·cs.CL·September 1, 2021

Evaluating prose style transfer with the Bible

Keith Carlson, Allen Riddell, Daniel Rockmore

PDF

1 Repo

TL;DR

This paper introduces a high-quality parallel corpus derived from Bible versions for evaluating prose style transfer systems, along with baseline results to facilitate future research in the area.

Contribution

It provides a standardized, publicly available Bible-based corpus for style transfer evaluation and baseline model results using BLEU and PINC metrics.

Findings

01

The corpus is highly parallel and well-aligned due to chapter and verse numbering.

02

Baseline models achieve measurable BLEU and PINC scores.

03

The corpus is useful for style transfer and potentially other NLP tasks.

Abstract

In the prose style transfer task a system, provided with text input and a target prose style, produces output which preserves the meaning of the input text but alters the style. These systems require parallel data for evaluation of results and usually make use of parallel data for training. Currently, there are few publicly available corpora for this task. In this work, we identify a high-quality source of aligned, stylistically distinct text in different versions of the Bible. We provide a standardized split, into training, development and testing data, of the public domain versions in our corpus. This corpus is highly parallel since many Bible versions are included. Sentences are aligned due to the presence of chapter and verse numbers within all versions of the text. In addition to the corpus, we present the results, as measured by the BLEU and PINC metrics, of several models trained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

keithecarlson/StyleTransferBibleData
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.