Evaluating the Morphosyntactic Well-formedness of Generated Texts

Adithya Pratapa; Antonios Anastasopoulos; Shruti Rijhwani; Aditi; Chaudhary; David R. Mortensen; Graham Neubig; Yulia Tsvetkov

arXiv:2103.16590·cs.CL·September 13, 2021

Evaluating the Morphosyntactic Well-formedness of Generated Texts

Adithya Pratapa, Antonios Anastasopoulos, Shruti Rijhwani, Aditi, Chaudhary, David R. Mortensen, Graham Neubig, Yulia Tsvetkov

PDF

Open Access 1 Repo

TL;DR

This paper introduces L'AMBRE, a new metric for assessing the morphosyntactic correctness of generated texts using dependency parsing and language rules, improving evaluation in multilingual NLP tasks.

Contribution

It presents a novel metric based on dependency parse analysis, a method to automatically extract morphosyntactic rules, and a robust parser training approach for noisy generated texts.

Findings

01

L'AMBRE effectively evaluates morphosyntactic well-formedness.

02

The metric performs well on machine translation tasks.

03

Robust parsers improve analysis of noisy outputs.

Abstract

Text generation systems are ubiquitous in natural language processing applications. However, evaluation of these systems remains a challenge, especially in multilingual settings. In this paper, we propose L'AMBRE -- a metric to evaluate the morphosyntactic well-formedness of text using its dependency parse and morphosyntactic rules of the language. We present a way to automatically extract various rules governing morphosyntax directly from dependency treebanks. To tackle the noisy outputs from text generation systems, we propose a simple methodology to train robust parsers. We show the effectiveness of our metric on the task of machine translation through a diachronic study of systems translating into morphologically-rich languages.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adithya7/lambre
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Software Engineering Research