Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain)
Mariya Toneva, Leila Wehbe

TL;DR
This paper introduces a novel brain-based interpretation method for NLP models, revealing differences in their representations and improving BERT's language understanding by aligning it more closely with brain activity.
Contribution
It proposes a brain-inspired interpretation approach for NLP models and demonstrates how aligning models with brain data enhances their language comprehension capabilities.
Findings
Differences in context-related representations across NLP models.
Interaction effects between layer depth, context length, and attention type in transformers.
Enhanced BERT with brain-alignment outperforms the original in syntactic NLP tasks.
Abstract
Neural networks models for NLP are typically implemented without the explicit encoding of language rules and yet they are able to break one performance record after another. This has generated a lot of research interest in interpreting the representations learned by these networks. We propose here a novel interpretation approach that relies on the only processing system we have that does understand language: the human brain. We use brain imaging recordings of subjects reading complex natural text to interpret word and sequence embeddings from 4 recent NLP models - ELMo, USE, BERT and Transformer-XL. We study how their representations differ across layer depth, context length, and attention type. Our results reveal differences in the context-related representations across these models. Further, in the transformer models, we find an interaction between layer depth and context length, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Neurobiology of Language and Bilingualism · Natural Language Processing Techniques
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Cosine Annealing · Sigmoid Activation · Tanh Activation · Long Short-Term Memory · Variational Dropout · Bidirectional LSTM · Adaptive Input Representations
