Gyan: An Explainable Neuro-Symbolic Language Model
Venkat Srinivasan, Vishaal Jatav, Anushka Chandrababu, Geetika Sharma

TL;DR
Gyan is an explainable, non-transformer language model that captures complete compositional context, achieves state-of-the-art results, and enhances trust and transparency in AI applications.
Contribution
Introducing Gyan, a novel non-transformer architecture that improves interpretability, captures full compositional context, and outperforms existing models on multiple datasets.
Findings
Gyan achieves SOTA on 3 standard datasets.
Gyan outperforms on two proprietary datasets.
The architecture enhances interpretability and trustworthiness.
Abstract
Transformer based pre-trained large language models have become ubiquitous. There is increasing evidence to suggest that even with large scale pre-training, these models do not capture complete compositional context and certainly not, the full human analogous context. Besides, by the very nature of the architecture, these models hallucinate, are difficult to maintain, are not easily interpretable and require enormous compute resources for training and inference. Here, we describe Gyan, an explainable language model based on a novel non-transformer architecture, without any of these limitations. Gyan achieves SOTA performance on 3 widely cited data sets and superior performance on two proprietary data sets. The novel architecture decouples the language model from knowledge acquisition and representation. The model draws on rhetorical structure theory, semantic role theory and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
