# Reference-less Quality Estimation of Text Simplification Systems

**Authors:** Louis Martin (FAIR, ALMAnaCH), Samuel Humeau (FAIR), Pierre-Emmanuel, Mazar\'e (FAIR), Antoine Bordes (FAIR), \'Eric Villemonte de La Clergerie, (ALMAnaCH), Beno\^it Sagot (ALMAnaCH)

arXiv: 1901.10746 · 2019-01-31

## TL;DR

This paper explores reference-less methods for evaluating text simplification quality, focusing on grammaticality, meaning preservation, and simplicity, and compares their effectiveness against human judgments.

## Contribution

It introduces and compares multiple reference-less evaluation approaches for text simplification, highlighting the effectiveness of n-gram metrics and length-based metrics for different quality dimensions.

## Key findings

- BLEU and METEOR correlate well with human judgments of grammaticality and meaning preservation.
- Length-based metrics are most effective for evaluating simplicity.
- Reference-less evaluation approaches can effectively assess text simplification quality without reference data.

## Abstract

The evaluation of text simplification (TS) systems remains an open challenge. As the task has common points with machine translation (MT), TS is often evaluated using MT metrics such as BLEU. However, such metrics require high quality reference data, which is rarely available for TS. TS has the advantage over MT of being a monolingual task, which allows for direct comparisons to be made between the simplified text and its original version. In this paper, we compare multiple approaches to reference-less quality estimation of sentence-level text simplification systems, based on the dataset used for the QATS 2016 shared task. We distinguish three different dimensions: gram-maticality, meaning preservation and simplicity. We show that n-gram-based MT metrics such as BLEU and METEOR correlate the most with human judgment of grammaticality and meaning preservation, whereas simplicity is best evaluated by basic length-based metrics.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.10746/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1901.10746/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/1901.10746/full.md

---
Source: https://tomesphere.com/paper/1901.10746