# Relating Simple Sentence Representations in Deep Neural Networks and the   Brain

**Authors:** Sharmistha Jat, Hao Tang, Partha Talukdar, Tom Mitchell

arXiv: 1906.11861 · 2019-07-01

## TL;DR

This study explores the relationship between deep neural network sentence representations and brain activity, showing BERT's activations best correlate with MEG data and that synthetic brain data can enhance decoding tasks.

## Contribution

It demonstrates the correspondence between deep model layers and brain regions during sentence processing and introduces a method to generate synthetic brain data to improve decoding accuracy.

## Key findings

- BERT's activations best correlate with MEG brain data.
- Deep network representations can generate synthetic brain data.
- Synthetic data improves stimuli decoding accuracy.

## Abstract

What is the relationship between sentence representations learned by deep recurrent models against those encoded by the brain? Is there any correspondence between hidden layers of these recurrent models and brain regions when processing sentences? Can these deep models be used to synthesize brain data which can then be utilized in other extrinsic tasks? We investigate these questions using sentences with simple syntax and semantics (e.g., The bone was eaten by the dog.). We consider multiple neural network architectures, including recently proposed ELMo and BERT. We use magnetoencephalography (MEG) brain recording data collected from human subjects when they were reading these simple sentences.   Overall, we find that BERT's activations correlate the best with MEG brain data. We also find that the deep network representation can be used to generate brain data from new sentences to augment existing brain data.   To the best of our knowledge, this is the first work showing that the MEG brain recording when reading a word in a sentence can be used to distinguish earlier words in the sentence. Our exploration is also the first to use deep neural network representations to generate synthetic brain data and to show that it helps in improving subsequent stimuli decoding task accuracy.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.11861/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/1906.11861/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/1906.11861/full.md

---
Source: https://tomesphere.com/paper/1906.11861