Neural Transductive Learning and Beyond: Morphological Generation in the   Minimal-Resource Setting

Katharina Kann; Hinrich Sch\"utze

arXiv:1809.08733·cs.CL·May 10, 2019

Neural Transductive Learning and Beyond: Morphological Generation in the Minimal-Resource Setting

Katharina Kann, Hinrich Sch\"utze

PDF

Open Access

TL;DR

This paper introduces two novel methods, paradigm transduction and SHIP, to improve morphological generation in low-resource settings, significantly outperforming previous models on a 52-language benchmark.

Contribution

The paper presents paradigm transduction and SHIP, new approaches that enhance neural morphological generation with minimal training data, addressing limitations of existing models.

Findings

01

Outperforms previous state-of-the-art by up to 9.71% accuracy on a 52-language benchmark.

02

Paradigm transduction exploits test-time input subsets to improve generalization.

03

SHIP effectively identifies reliable source forms in low-resource scenarios.

Abstract

Neural state-of-the-art sequence-to-sequence (seq2seq) models often do not perform well for small training sets. We address paradigm completion, the morphological task of, given a partial paradigm, generating all missing forms. We propose two new methods for the minimal-resource setting: (i) Paradigm transduction: Since we assume only few paradigms available for training, neural seq2seq models are able to capture relationships between paradigm cells, but are tied to the idiosyncracies of the training set. Paradigm transduction mitigates this problem by exploiting the input subset of inflected forms at test time. (ii) Source selection with high precision (SHIP): Multi-source models which learn to automatically select one or multiple sources to predict a target inflection do not perform well in the minimal-resource setting. SHIP is an alternative to identify a reliable source if training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Domain Adaptation and Few-Shot Learning

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence