Formal Semantic Control over Language Models

Yingji Zhang

arXiv:2602.00638·cs.CL·February 3, 2026

Formal Semantic Control over Language Models

Yingji Zhang

PDF

Open Access 1 Video

TL;DR

This thesis develops methods to improve the interpretability and controllability of language models' semantic representations by shaping their latent space within a VAE framework, enabling targeted manipulation at sentence and reasoning levels.

Contribution

It introduces novel theoretical frameworks and practical techniques for disentangling and controlling semantic features in language models' latent spaces within a VAE architecture.

Findings

01

Enhanced interpretability of semantic representations

02

Improved controllability of language model outputs

03

Effective manipulation of inference behaviors in NLI tasks

Abstract

This thesis advances semantic representation learning to render language representations or models more semantically and geometrically interpretable, and to enable localised, quasi-symbolic, compositional control through deliberate shaping of their latent space geometry. We pursue this goal within a VAE framework, exploring two complementary research directions: (i) Sentence-level learning and control: disentangling and manipulating specific semantic features in the latent space to guide sentence generation, with explanatory text serving as the testbed; and (ii) Reasoning-level learning and control: isolating and steering inference behaviours in the latent space to control NLI. In this direction, we focus on Explanatory NLI tasks, in which two premises (explanations) are provided to infer a conclusion. The overarching objective is to move toward language models whose internal semantic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Formal Semantic Control over Language Models· underline

Taxonomy

TopicsMultimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Topic Modeling