Do Large Language Models Mirror Cognitive Language Processing?

Yuqi Ren; Renren Jin; Tongxuan Zhang; Deyi Xiong

arXiv:2402.18023·cs.AI·January 16, 2025·3 cites

Do Large Language Models Mirror Cognitive Language Processing?

Yuqi Ren, Renren Jin, Tongxuan Zhang, Deyi Xiong

PDF

Open Access

TL;DR

This study evaluates how well large language models' text representations align with human brain signals during language processing, revealing factors that influence their cognitive similarity.

Contribution

It introduces a comprehensive analysis of LLM-brain alignment using RSA and examines the effects of training strategies and prompts on this alignment.

Findings

01

Pre-training data size and model scaling improve LLM-brain similarity.

02

Alignment training significantly enhances cognitive alignment.

03

Explicit prompts increase consistency with brain signals.

Abstract

Large Language Models (LLMs) have demonstrated remarkable abilities in text comprehension and logical reasoning, indicating that the text representations learned by LLMs can facilitate their language processing capabilities. In neuroscience, brain cognitive processing signals are typically utilized to study human language processing. Therefore, it is natural to ask how well the text embeddings from LLMs align with the brain cognitive processing signals, and how training strategies affect the LLM-brain alignment? In this paper, we employ Representational Similarity Analysis (RSA) to measure the alignment between 23 mainstream LLMs and fMRI signals of the brain to evaluate how effectively LLMs simulate cognitive language processing. We empirically investigate the impact of various factors (e.g., pre-training data size, model scaling, alignment training, and prompts) on such LLM-brain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsALIGN