Unsupervised Inflection Generation Using Neural Language Modeling
Octavia-Maria Sulea, Steve Young

TL;DR
This paper shows that neural language models can generate noun inflections for morphologically rich languages in an unsupervised way and without pre-training, which helps address morphological complexity in NLP tasks.
Contribution
It introduces a neural language modeling approach for unsupervised noun inflection generation in multiple morphologically rich languages, showing effectiveness without large-scale pre-training.
Findings
Neural models can generate full noun inflection tables successfully.
Pre-training on large corpora can reduce inflection generation performance.
The approach works for Romanian, German, and Finnish.
Abstract
Deep neural network architectures for language modeling have recently attracted a great deal of interest in NLP, driven by the advent of transfer learning and by a shift in focus from rule-based and predictive (supervised) models to generative or unsupervised models for long-standing problems such as Information Extraction and Question Answering. While this shift has worked well for languages with little inflectional morphology, such as English, challenges remain when building similar systems for morphologically rich languages, since their individual words change form in context more often. In this paper we investigate the extent to which these new unsupervised or generative techniques can help alleviate the type-token ratio disparity in morphologically rich languages. We apply an off-the-shelf neural language modeling library…
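To make the type-token ratio disparity mentioned in the abstract concrete, the following is a minimal sketch, not code from the paper. The function name `type_token_ratio` and both toy token lists are assumptions chosen for illustration: each list contains six occurrences of the same noun ("house" in English, "casă" in Romanian), but the Romanian occurrences surface as six distinct inflected forms.

```python
def type_token_ratio(tokens):
    """Return the number of distinct word forms (types) divided by the number of tokens."""
    if not tokens:
        return 0.0
    return len(set(tokens)) / len(tokens)


# Toy data (hypothetical, not taken from the paper): six occurrences of one noun.
# English distinguishes only singular and plural.
english = ["house", "house", "houses", "houses", "house", "houses"]
# Romanian also marks definiteness and case on the noun itself.
romanian = ["casă", "casa", "case", "casele", "casei", "caselor"]

ttr_english = type_token_ratio(english)    # 2 types over 6 tokens
ttr_romanian = type_token_ratio(romanian)  # 6 types over 6 tokens

print(f"English TTR:  {ttr_english:.2f}")
print(f"Romanian TTR: {ttr_romanian:.2f}")
```

For the same number of occurrences, the richer morphology yields more distinct surface forms, so each individual form is seen less often. On real corpora the ratio also depends heavily on corpus size, so this toy comparison only illustrates the direction of the effect.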
Taxonomy
Topics: Topic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
