An Unsupervised Method for Uncovering Morphological Chains

Karthik Narasimhan; Regina Barzilay; Tommi Jaakkola

arXiv:1503.02335·cs.CL·March 10, 2015·32 cites

An Unsupervised Method for Uncovering Morphological Chains

Karthik Narasimhan, Regina Barzilay, Tommi Jaakkola

PDF

Open Access 1 Repo

TL;DR

This paper introduces an unsupervised approach combining orthographic and semantic information to analyze morphological chains, outperforming existing systems across multiple languages.

Contribution

It presents a novel log-linear model for morphological analysis that integrates semantic and orthographic features to identify morphological chains without supervision.

Findings

01

Outperforms five state-of-the-art systems on Arabic, English, and Turkish.

02

Effectively models parent-child relations in morphological chains.

03

Utilizes contrastive estimation for feasible training.

Abstract

Most state-of-the-art systems today produce morphological analysis based only on orthographic patterns. In contrast, we propose a model for unsupervised morphological analysis that integrates orthographic and semantic views of words. We model word formation in terms of morphological chains, from base words to the observed words, breaking the chains into parent-child relations. We use log-linear models with morpheme and word-level features to predict possible parents, including their modifications, for each word. The limited set of candidate parents for each word render contrastive estimation feasible. Our model consistently matches or outperforms five state-of-the-art systems on Arabic, English and Turkish.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

karthikncode/MorphoChain
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications