A Cluster-based Approach for Improving Isotropy in Contextual Embedding   Space

Sara Rajaee; Mohammad Taher Pilehvar

arXiv:2106.01183·cs.CL·June 3, 2021

A Cluster-based Approach for Improving Isotropy in Contextual Embedding Space

Sara Rajaee, Mohammad Taher Pilehvar

PDF

1 Repo

TL;DR

This paper introduces a cluster-based method to improve the isotropy of contextual word embeddings by removing dominant directions within clusters, enhancing their semantic task performance.

Contribution

It proposes a novel local, cluster-based approach to address representation degeneration in CWRs, contrasting with prior global methods.

Findings

01

Removing dominant directions improves embedding isotropy.

02

Cluster-based analysis reveals structural and tense information.

03

Method enhances performance on multiple semantic tasks.

Abstract

The representation degeneration problem in Contextual Word Representations (CWRs) hurts the expressiveness of the embedding space by forming an anisotropic cone where even unrelated words have excessively positive correlations. Existing techniques for tackling this issue require a learning process to re-train models with additional objectives and mostly employ a global assessment to study isotropy. Our quantitative analysis over isotropy shows that a local assessment could be more accurate due to the clustered structure of CWRs. Based on this observation, we propose a local cluster-based method to address the degeneration issue in contextual embedding spaces. We show that in clusters including punctuations and stop words, local dominant directions encode structural information, removing which can improve CWRs performance on semantic tasks. Moreover, we find that tense information in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Sara-Rajaee/clusterbased_isotropy_enhancement
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.