Bridging Compositional and Distributional Semantics: A Survey on Latent Semantic Geometry via AutoEncoder

Yingji Zhang; Danilo S. Carvalho; Andr\'e Freitas

arXiv:2506.20083·cs.CL·April 16, 2026

Bridging Compositional and Distributional Semantics: A Survey on Latent Semantic Geometry via AutoEncoder

Yingji Zhang, Danilo S. Carvalho, Andr\'e Freitas

PDF

TL;DR

This survey explores how autoencoder-based models can unify symbolic and distributional semantics by analyzing their latent space geometries to improve interpretability and semantic understanding.

Contribution

It introduces a novel perspective on latent space geometry in autoencoders for semantic representation learning, bridging symbolic and distributional semantics.

Findings

01

Compared VAE, VQVAE, and SAE in their latent geometries.

02

Highlighted how different autoencoders influence semantic interpretability.

03

Provided insights into enhancing semantic models with autoencoder architectures.

Abstract

Integrating compositional and symbolic properties into current distributional semantic spaces can enhance the interpretability, controllability, compositionality, and generalisation capabilities of Transformer-based auto-regressive language models (LMs). In this survey, we offer a novel perspective on latent space geometry through the lens of compositional semantics, a direction we refer to as \textit{semantic representation learning}. This direction enables a bridge between symbolic and distributional semantics, helping to mitigate the gap between them. We review and compare three mainstream autoencoder architectures-Variational AutoEncoder (VAE), Vector Quantised VAE (VQVAE), and Sparse AutoEncoder (SAE)-and examine the distinctive latent geometries they induce in relation to semantic structure and interpretability.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.