What Are Large Language Models Mapping to in the Brain? A Case Against   Over-Reliance on Brain Scores

Ebrahim Feghhi; Nima Hadidi; Bryan Song; Idan A. Blank; Jonathan C.; Kao

arXiv:2406.01538·cs.CL·June 24, 2024·6 cites

What Are Large Language Models Mapping to in the Brain? A Case Against Over-Reliance on Brain Scores

Ebrahim Feghhi, Nima Hadidi, Bryan Song, Idan A. Blank, Jonathan C., Kao

PDF

Open Access 1 Repo

TL;DR

This paper critically examines the use of brain scores to compare large language models with human neural activity, revealing that many high scores are driven by simple features rather than true linguistic processing similarities.

Contribution

The study demonstrates that high brain scores of LLMs are largely explained by trivial features like sentence length and position, challenging assumptions about their brain-like computations.

Findings

01

Trivial features like sentence length explain high brain scores.

02

Untrained LLMs' brain scores are driven by simple features, not architecture.

03

Proper data splitting reduces inflated brain score estimates.

Abstract

Given the remarkable capabilities of large language models (LLMs), there has been a growing interest in evaluating their similarity to the human brain. One approach towards quantifying this similarity is by measuring how well a model predicts neural signals, also called "brain score". Internal representations from LLMs achieve state-of-the-art brain scores, leading to speculation that they share computational principles with human language processing. This inference is only valid if the subset of neural activity predicted by LLMs reflects core elements of language processing. Here, we question this assumption by analyzing three neural datasets used in an impactful study on LLM-to-brain mappings, with a particular focus on an fMRI dataset where participants read short passages. We first find that when using shuffled train-test splits, as done in previous studies with these datasets, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ebrahimfeghhi/beyond-brainscore
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsFocus