Sentence-level quality estimation by predicting HTER as a multi-component metric

Eleftherios Avramidis

arXiv:1707.06167 · cs.CL · July 20, 2017

TL;DR

This paper proposes a machine learning approach that predicts the amount of each of the four post-editing operations and calculates the sentence-level HTER score from them, yielding small but significant improvements over the baseline without any feature exploration.

Contribution

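It introduces a multi-output neural model (a multi-layer perceptron with 4 outputs) that jointly predicts the amount of each post-editing operation. The HTER score is calculated from these predictions, and invalid (e.g. negative) predicted values can be corrected beforehand.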

Findings

01

A multi-layer perceptron with 4 outputs yields small but significant improvements over the baseline, without any feature exploration.

02

The model jointly predicts the amounts of the 4 distinct post-editing operations, which are then used to calculate the HTER score.

03

Invalid (e.g. negative) predicted values can be corrected before the HTER score is calculated.

Abstract

This submission investigates alternative machine learning models for predicting the HTER score on the sentence level. Instead of directly predicting the HTER score, we suggest a model that jointly predicts the amount of the 4 distinct post-editing operations, which are then used to calculate the HTER score. This also gives the possibility to correct invalid (e.g. negative) predicted values prior to the calculation of the HTER score. Without any feature exploration, a multi-layer perceptron with 4 outputs yields small but significant improvements over the baseline.
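
The calculation step described in the abstract can be illustrated with a minimal sketch. It assumes the 4 operations are the standard TER edit types (insertions, deletions, substitutions, shifts), that HTER is their sum divided by the length of the post-edited reference, that invalid values are corrected by clipping negatives to zero, and that the final score is capped at 1. The function name `hter_from_operations`, the clipping strategy, and the cap are illustrative assumptions, not details confirmed by the abstract.

```python
from typing import Sequence


def hter_from_operations(predicted_ops: Sequence[float],
                         reference_length: float) -> float:
    """Combine 4 predicted edit-operation amounts into one HTER score.

    predicted_ops: predicted amounts of (insertions, deletions,
        substitutions, shifts). This ordering is an assumption.
    reference_length: number of words in the post-edited reference
        (hypothetical input; how it is obtained at prediction time
        is not specified in the abstract).
    """
    if len(predicted_ops) != 4:
        raise ValueError("expected exactly 4 operation amounts")
    if reference_length <= 0:
        raise ValueError("reference_length must be positive")

    # Correct invalid predictions: a negative number of edits is
    # impossible, so clip each component at zero (assumed strategy).
    corrected = [max(0.0, float(value)) for value in predicted_ops]

    # HTER as a multi-component metric: total edits over reference length.
    hter = sum(corrected) / reference_length

    # Cap at 1.0, a common convention for HTER labels (assumption).
    return min(hter, 1.0)


if __name__ == "__main__":
    # The negative deletion amount is clipped to 0 before summing,
    # so the score is roughly (1.2 + 0 + 2.0 + 0.1) / 10.
    print(hter_from_operations([1.2, -0.3, 2.0, 0.1], reference_length=10))
```

Correcting each component before summing is what distinguishes this setup from predicting HTER directly: a single negative operation count can be repaired without discarding the other three predictions.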
