Neural Transductive Learning and Beyond: Morphological Generation in the Minimal-Resource Setting
Katharina Kann, Hinrich Sch\"utze

TL;DR
This paper introduces two novel methods, paradigm transduction and SHIP, to improve morphological generation in low-resource settings, significantly outperforming previous models on a 52-language benchmark.
Contribution
The paper presents paradigm transduction and SHIP, new approaches that enhance neural morphological generation with minimal training data, addressing limitations of existing models.
Findings
Outperforms previous state-of-the-art by up to 9.71% accuracy on a 52-language benchmark.
Paradigm transduction exploits test-time input subsets to improve generalization.
SHIP effectively identifies reliable source forms in low-resource scenarios.
Abstract
Neural state-of-the-art sequence-to-sequence (seq2seq) models often do not perform well for small training sets. We address paradigm completion, the morphological task of, given a partial paradigm, generating all missing forms. We propose two new methods for the minimal-resource setting: (i) Paradigm transduction: Since we assume only few paradigms available for training, neural seq2seq models are able to capture relationships between paradigm cells, but are tied to the idiosyncracies of the training set. Paradigm transduction mitigates this problem by exploiting the input subset of inflected forms at test time. (ii) Source selection with high precision (SHIP): Multi-source models which learn to automatically select one or multiple sources to predict a target inflection do not perform well in the minimal-resource setting. SHIP is an alternative to identify a reliable source if training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Domain Adaptation and Few-Shot Learning
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence
