# CoNLL-SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection   in 52 Languages

**Authors:** Ryan Cotterell, Christo Kirov, John Sylak-Glassman, G\'eraldine, Walther, Ekaterina Vylomova, Patrick Xia, Manaal Faruqui, Sandra K\"ubler,, David Yarowsky, Jason Eisner, Mans Hulden

arXiv: 1706.09031 · 2017-07-06

## TL;DR

The paper reports on the 2017 shared task on morphological reinflection across 52 languages, demonstrating that neural models can perform well even with limited data, but different approaches predict different forms, indicating room for improvement.

## Contribution

It introduces a large-scale shared task for morphological reinflection in diverse languages and evaluates neural models' effectiveness under various resource conditions.

## Key findings

- Neural models perform well with small datasets using appropriate biases.
- Different data augmentation methods lead to different correct predictions.
- High performance achievable with limited labeled data.

## Abstract

The CoNLL-SIGMORPHON 2017 shared task on supervised morphological generation required systems to be trained and tested in each of 52 typologically diverse languages. In sub-task 1, submitted systems were asked to predict a specific inflected form of a given lemma. In sub-task 2, systems were given a lemma and some of its specific inflected forms, and asked to complete the inflectional paradigm by predicting all of the remaining inflected forms. Both sub-tasks included high, medium, and low-resource conditions. Sub-task 1 received 24 system submissions, while sub-task 2 received 3 system submissions. Following the success of neural sequence-to-sequence models in the SIGMORPHON 2016 shared task, all but one of the submissions included a neural component. The results show that high performance can be achieved with small training datasets, so long as models have appropriate inductive bias or make use of additional unlabeled data or synthetic data. However, different biasing and data augmentation resulted in disjoint sets of inflected forms being predicted correctly, suggesting that there is room for future improvement.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1706.09031/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/1706.09031/full.md

## References

71 references — full list in the complete paper: https://tomesphere.com/paper/1706.09031/full.md

---
Source: https://tomesphere.com/paper/1706.09031