LAG-XAI: A Lie-Inspired Affine Geometric Framework for Interpretable Paraphrasing in Transformer Latent Spaces
Olexander Mazurets, Olexander Barmak, Leonid Bedratyuk, Iurii Krak

TL;DR
LAG-XAI introduces a geometric affine framework for interpretable paraphrasing in Transformer models, enabling explicit analysis of semantic transformations with high accuracy and practical applications like hallucination detection.
Contribution
It proposes a novel Lie-inspired affine geometric model that interprets paraphrasing as continuous transformations in embedding space, enhancing interpretability of Transformer latent semantics.
Findings
Achieves an AUC of 0.7713 on Twitter paraphrasing task.
Captures 80% of the non-linear baseline's classification capacity.
Detects 95.3% of factual distortions in hallucination detection.
Abstract
Modern Transformer-based language models achieve strong performance in natural language processing tasks, yet their latent semantic spaces remain largely uninterpretable black boxes. This paper introduces LAG-XAI (Lie Affine Geometry for Explainable AI), a novel geometric framework that models paraphrasing not as discrete word substitutions, but as a structured affine transformation within the embedding space. By conceptualizing paraphrasing as a continuous geometric flow on a semantic manifold, we propose a computationally efficient mean-field approximation, inspired by local Lie group actions. This allows us to decompose paraphrase transitions into geometrically interpretable components: rotation, deformation, and translation. Experiments on the noisy PIT-2015 Twitter corpus, encoded with Sentence-BERT, reveal a "linear transparency" phenomenon. The proposed affine operator achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
