Discovering Universal Geometry in Embeddings with ICA

Hiroaki Yamagiwa; Momose Oyama; Hidetoshi Shimodaira

arXiv:2305.13175·cs.CL·November 3, 2023·1 cites

Discovering Universal Geometry in Embeddings with ICA

Hiroaki Yamagiwa, Momose Oyama, Hidetoshi Shimodaira

PDF

Open Access 1 Repo

TL;DR

This paper uses ICA to reveal a universal semantic structure in embeddings, showing that intrinsic axes are consistent across languages, algorithms, and modalities, thereby deepening understanding of embedding representations.

Contribution

The study introduces a novel ICA-based method to uncover a universal semantic geometry in embeddings, highlighting consistent intrinsic axes across diverse data.

Findings

01

Semantic axes are consistent across languages and modalities.

02

Embeddings can be decomposed into interpretable intrinsic axes.

03

Universal geometric patterns are identified in embedding spaces.

Abstract

This study utilizes Independent Component Analysis (ICA) to unveil a consistent semantic structure within embeddings of words or images. Our approach extracts independent semantic components from the embeddings of a pre-trained model by leveraging anisotropic information that remains after the whitening process in Principal Component Analysis (PCA). We demonstrate that each embedding can be expressed as a composition of a few intrinsic interpretable axes and that these semantic axes remain consistent across different languages, algorithms, and modalities. The discovery of a universal semantic structure in the geometric patterns of embeddings enhances our understanding of the representations in embeddings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shimo-lab/universal-geometry-with-ica
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBlind Source Separation Techniques · Neural Networks and Applications · Fractal and DNA sequence analysis