Residual Semantic Decomposition of Word Embeddings

Seungmin Jin

arXiv:2605.17482·cs.CL·May 19, 2026

Residual Semantic Decomposition of Word Embeddings

Seungmin Jin

PDF

TL;DR

This paper presents Residual Semantic Decomposition (RSD), a neural method for decomposing word embeddings to analyze semantic axes and residual information, balancing reconstruction and relational structure.

Contribution

RSD introduces a recursive binary decomposition approach for word embeddings, enabling local semantic axis extraction and residual analysis for ambiguous words.

Findings

01

RSD separates context anchors from controls in diagnostics.

02

Residual neighborhoods serve as qualitative diagnostics, not benchmarks.

03

Ambiguous words are not uniformly high-entropy boundary points.

Abstract

We introduce Residual Semantic Decomposition (RSD), a neural additive decomposition of word embeddings that balances embedding reconstruction with relational structure preservation. RSD supports recursive binary decomposition: each $K = 2$ fit extracts a local semantic axis, while residuals expose information not absorbed by that axis. In manually specified paired-context diagnostics over ambiguous words, RSD separates supplied context anchors above shuffled-label controls, but entropy diagnostics show that ambiguous targets are not uniformly high-entropy boundary points in static GloVe. We therefore treat residual neighborhoods as qualitative diagnostics rather than benchmark sense predictions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.