Semantic Structure and Interpretability of Word Embeddings

Lutfi Kerem Senel; Ihsan Utlu; Veysel Yucesoy; Aykut Koc; Tolga Cukur

arXiv:1711.00331·cs.CL·July 20, 2018

Semantic Structure and Interpretability of Word Embeddings

Lutfi Kerem Senel, Ihsan Utlu, Veysel Yucesoy, Aykut Koc, Tolga Cukur

PDF

2 Repos

TL;DR

This paper introduces a statistical approach to uncover and quantify the latent semantic structure in dense word embeddings, addressing interpretability challenges by analyzing a new dataset and proposing an alternative evaluation method.

Contribution

It presents a novel statistical method for revealing semantic structures in word embeddings and introduces SEMCAT, a new dataset for semantic grouping, along with a practical interpretability measure.

Findings

01

Semantic structures are heterogeneously distributed across embedding dimensions.

02

The proposed method effectively uncovers meaningful semantic groupings.

03

The interpretability measure correlates well with human judgment.

Abstract

Dense word embeddings, which encode semantic meanings of words to low dimensional vector spaces have become very popular in natural language processing (NLP) research due to their state-of-the-art performances in many NLP tasks. Word embeddings are substantially successful in capturing semantic relations among words, so a meaningful semantic structure must be present in the respective vector spaces. However, in many cases, this semantic structure is broadly and heterogeneously distributed across the embedding dimensions, which makes interpretation a big challenge. In this study, we propose a statistical method to uncover the latent semantic structure in the dense word embeddings. To perform our analysis we introduce a new dataset (SEMCAT) that contains more than 6500 words semantically grouped under 110 categories. We further propose a method to quantify the interpretability of the word…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsInterpretability