On the Analogy between Human Brain and LLMs: Spotting Key Neurons in Grammar Perception
Sanaz Saki Norouzi, Mohammad Masjedi, Pascal Hitzler

TL;DR
This paper demonstrates that Large Language Models like Llama 3 process grammatical categories using specific neurons, similar to how the human brain handles language, revealing a neural subspace dedicated to part-of-speech recognition.
Contribution
It identifies key neurons in LLMs associated with grammatical categories and shows their activation patterns can predict part-of-speech tags, paralleling neuroscientific findings.
Findings
Key neurons correlate with part-of-speech prediction
Activation patterns reliably classify parts of speech
Evidence of a grammatical subspace in LLMs
Abstract
Artificial Neural Networks, the building blocks of AI, were inspired by the human brain's network of neurons. Over the years, these networks have evolved to replicate the complex capabilities of the brain, allowing them to handle tasks such as image and language processing. In the realm of Large Language Models, there has been a keen interest in making the language learning process more akin to that of humans. While neuroscientific research has shown that different grammatical categories are processed by different neurons in the brain, we show that LLMs operate in a similar way. Utilizing Llama 3, we identify the most important neurons associated with the prediction of words belonging to different part-of-speech tags. Using the achieved knowledge, we train a classifier on a dataset, which shows that the activation patterns of these key neurons can reliably predict part-of-speech tags on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeurobiology of Language and Bilingualism · Action Observation and Synchronization · Face Recognition and Perception
