Joint Semantic Synthesis and Morphological Analysis of the Derived Word
Ryan Cotterell, Hinrich Sch\"utze

TL;DR
This paper introduces a probabilistic model that jointly analyzes word structure into morphemes and derives their combined semantic meaning, improving segmentation accuracy and semantic composition in English and German data.
Contribution
It presents a novel joint model for morphological segmentation and semantic synthesis, enhancing both tasks and exploring neural composition methods.
Findings
Joint modeling improves segmentation accuracy by 3-5%.
Recurrent neural networks outperform additive models in semantic composition.
The model's representations relate to linguistic notions of morphological productivity.
Abstract
Much like sentences are composed of words, words themselves are composed of smaller units. For example, the English word questionably can be analyzed as question+able+ly. However, this structural decomposition of the word does not directly give us a semantic representation of the word's meaning. Since morphology obeys the principle of compositionality, the semantics of the word can be systematically derived from the meaning of its parts. In this work, we propose a novel probabilistic model of word formation that captures both the analysis of a word w into its constituents segments and the synthesis of the meaning of w from the meanings of those segments. Our model jointly learns to segment words into morphemes and compose distributional semantic vectors of those morphemes. We experiment with the model on English CELEX data and German DerivBase (Zeller et al., 2013) data. We show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis
