LAG-XAI: A Lie-Inspired Affine Geometric Framework for Interpretable Paraphrasing in Transformer Latent Spaces

Olexander Mazurets; Olexander Barmak; Leonid Bedratyuk; Iurii Krak

arXiv:2604.06086·cs.CL·April 8, 2026

LAG-XAI: A Lie-Inspired Affine Geometric Framework for Interpretable Paraphrasing in Transformer Latent Spaces

Olexander Mazurets, Olexander Barmak, Leonid Bedratyuk, Iurii Krak

PDF

TL;DR

LAG-XAI introduces a geometric affine framework for interpretable paraphrasing in Transformer models, enabling explicit analysis of semantic transformations with high accuracy and practical applications like hallucination detection.

Contribution

It proposes a novel Lie-inspired affine geometric model that interprets paraphrasing as continuous transformations in embedding space, enhancing interpretability of Transformer latent semantics.

Findings

01

Achieves an AUC of 0.7713 on Twitter paraphrasing task.

02

Captures 80% of the non-linear baseline's classification capacity.

03

Detects 95.3% of factual distortions in hallucination detection.

Abstract

Modern Transformer-based language models achieve strong performance in natural language processing tasks, yet their latent semantic spaces remain largely uninterpretable black boxes. This paper introduces LAG-XAI (Lie Affine Geometry for Explainable AI), a novel geometric framework that models paraphrasing not as discrete word substitutions, but as a structured affine transformation within the embedding space. By conceptualizing paraphrasing as a continuous geometric flow on a semantic manifold, we propose a computationally efficient mean-field approximation, inspired by local Lie group actions. This allows us to decompose paraphrase transitions into geometrically interpretable components: rotation, deformation, and translation. Experiments on the noisy PIT-2015 Twitter corpus, encoded with Sentence-BERT, reveal a "linear transparency" phenomenon. The proposed affine operator achieves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.