Explainable Natural Language Processing with Matrix Product States
Jirawat Tangpanitanon, Chanatip Mangkang, Pradeep Bhadola, Yuichiro, Minato, Dimitris G. Angelakis, Thiparat Chotibut

TL;DR
This paper analyzes RNNs in NLP using quantum physics tools, revealing that single-layer recurrent arithmetic circuits efficiently perform sentiment analysis with limited information propagation capacity, challenging common beliefs.
Contribution
It introduces a novel analysis of RNNs via matrix product states and entanglement entropy, providing insights into their expressiveness and explainability in NLP tasks.
Findings
Single-layer RACs achieve ~99% accuracy in sentiment analysis.
EE saturation limits model complexity without affecting accuracy.
High expressiveness arises from interplay between information propagation and embeddings.
Abstract
Despite empirical successes of recurrent neural networks (RNNs) in natural language processing (NLP), theoretical understanding of RNNs is still limited due to intrinsically complex non-linear computations. We systematically analyze RNNs' behaviors in a ubiquitous NLP task, the sentiment analysis of movie reviews, via the mapping between a class of RNNs called recurrent arithmetic circuits (RACs) and a matrix product state (MPS). Using the von-Neumann entanglement entropy (EE) as a proxy for information propagation, we show that single-layer RACs possess a maximum information propagation capacity, reflected by the saturation of the EE. Enlarging the bond dimension beyond the EE saturation threshold does not increase model prediction accuracies, so a minimal model that best estimates the data statistics can be inferred. Although the saturated EE is smaller than the maximum EE allowed by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Quantum many-body systems · Neural Networks and Applications
