Enhancing Interpretable Clauses Semantically using Pretrained Word Representation
Rohan Kumar Yadav, Lei Jiao, Ole-Christoffer Granmo, and Morten, Goodwin

TL;DR
This paper introduces a novel method to incorporate pre-trained word embeddings into Tsetlin Machines, significantly improving their NLP performance and interpretability to match deep neural networks.
Contribution
It proposes a new approach to use pre-trained word representations in Tsetlin Machines, enhancing both accuracy and interpretability in NLP tasks.
Findings
Accuracy surpasses previous BOW-based TM models.
Performance reaches levels comparable to DNNs.
Semantic input features improve interpretability.
Abstract
Tsetlin Machine (TM) is an interpretable pattern recognition algorithm based on propositional logic, which has demonstrated competitive performance in many Natural Language Processing (NLP) tasks, including sentiment analysis, text classification, and Word Sense Disambiguation. To obtain human-level interpretability, legacy TM employs Boolean input features such as bag-of-words (BOW). However, the BOW representation makes it difficult to use any pre-trained information, for instance, word2vec and GloVe word representations. This restriction has constrained the performance of TM compared to deep neural networks (DNNs) in NLP. To reduce the performance gap, in this paper, we propose a novel way of using pre-trained word representations for TM. The approach significantly enhances the performance and interpretability of TM. We achieve this by extracting semantically related words from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)
MethodsGloVe Embeddings
