# Neural Networks as Explicit Word-Based Rules

**Authors:** Jind\v{r}ich Libovick\'y

arXiv: 1907.04613 · 2019-07-11

## TL;DR

This paper interprets convolutional neural networks for NLP as explicit word-based rules, enabling understanding and recovery of model performance through rule extraction.

## Contribution

It introduces a method to interpret CNN weights in NLP as explicit rules, bridging the gap between neural models and human-readable linguistic patterns.

## Key findings

- CNNs can be effectively interpreted as word-based rules.
- The extracted rules can recover the original model's performance.
- This approach enhances interpretability of NLP neural models.

## Abstract

Filters of convolutional networks used in computer vision are often visualized as image patches that maximize the response of the filter. We use the same approach to interpret weight matrices in simple architectures for natural language processing tasks. We interpret a convolutional network for sentiment classification as word-based rules. Using the rule, we recover the performance of the original model.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.04613/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/1907.04613/full.md

## References

7 references — full list in the complete paper: https://tomesphere.com/paper/1907.04613/full.md

---
Source: https://tomesphere.com/paper/1907.04613