The potential -- and the pitfalls -- of using pre-trained language   models as cognitive science theories

Raj Sanjay Shah; Sashank Varma

arXiv:2501.12651·cs.CL·January 23, 2025·2 cites

The potential -- and the pitfalls -- of using pre-trained language models as cognitive science theories

Raj Sanjay Shah, Sashank Varma

PDF

Open Access

TL;DR

This paper critically examines the use of pre-trained language models as models of human cognition and development, highlighting challenges, lessons learned, and criteria for their credible application in cognitive science.

Contribution

It provides a comprehensive review of the assumptions, challenges, and lessons learned in using PLMs as cognitive and developmental models, proposing criteria for their credible use.

Findings

01

Identifies key challenges in using PLMs as cognitive models.

02

Highlights pitfalls and limitations of current approaches.

03

Proposes criteria for credible application of PLMs in cognitive science.

Abstract

Many studies have evaluated the cognitive alignment of Pre-trained Language Models (PLMs), i.e., their correspondence to adult performance across a range of cognitive domains. Recently, the focus has expanded to the developmental alignment of these models: identifying phases during training where improvements in model performance track improvements in children's thinking over development. However, there are many challenges to the use of PLMs as cognitive science theories, including different architectures, different training data modalities and scales, and limited model interpretability. In this paper, we distill lessons learned from treating PLMs, not as engineering artifacts but as cognitive science and developmental science models. We review assumptions used by researchers to map measures of PLM performance to measures of human performance. We identify potential pitfalls of this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsFocus