Contextual Feature Extraction Hierarchies Converge in Large Language   Models and the Brain

Gavin Mischler; Yinghao Aaron Li; Stephan Bickel; Ashesh D. Mehta and; Nima Mesgarani

arXiv:2401.17671·cs.CL·December 10, 2024·5 cites

Contextual Feature Extraction Hierarchies Converge in Large Language Models and the Brain

Gavin Mischler, Yinghao Aaron Li, Stephan Bickel, Ashesh D. Mehta and, Nima Mesgarani

PDF

Open Access

TL;DR

This study investigates how high-performing large language models increasingly resemble human brain language processing, highlighting hierarchical feature extraction, contextual information, and convergence among models.

Contribution

It reveals the factors driving convergence between LLMs and brain processing, including hierarchical feature pathways and the role of context, advancing understanding of AI-human parallels.

Findings

01

Higher performance models predict neural responses better.

02

Converging hierarchical processing pathways among models.

03

Context enhances both model performance and brain similarity.

Abstract

Recent advancements in artificial intelligence have sparked interest in the parallels between large language models (LLMs) and human neural processing, particularly in language comprehension. While prior research has established similarities in the representation of LLMs and the brain, the underlying computational principles that cause this convergence, especially in the context of evolving LLMs, remain elusive. Here, we examined a diverse selection of high-performance LLMs with similar parameter sizes to investigate the factors contributing to their alignment with the brain's language processing mechanisms. We find that as LLMs achieve higher performance on benchmark tasks, they not only become more brain-like as measured by higher performance when predicting neural responses from LLM embeddings, but also their hierarchical feature extraction pathways map more closely onto the brain's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsALIGN