Large Language Models as Neurolinguistic Subjects: Discrepancy between Performance and Competence

Linyang He; Ercong Nie; Helmut Schmid; Hinrich Sch\"utze; Nima Mesgarani; Jonathan Brennan

arXiv:2411.07533·cs.CL·July 15, 2025

Large Language Models as Neurolinguistic Subjects: Discrepancy between Performance and Competence

Linyang He, Ercong Nie, Helmut Schmid, Hinrich Sch\"utze, Nima Mesgarani, Jonathan Brennan

PDF

Open Access

TL;DR

This paper explores the discrepancy between LLMs' linguistic performance and underlying competence by introducing a neurolinguistic assessment approach, revealing that models excel more in form than in meaning, with implications for evaluating language understanding.

Contribution

It introduces a neurolinguistic evaluation method combining minimal pair and diagnostic probing, and provides new multilingual datasets to better assess LLMs' linguistic competence.

Findings

01

LLMs show higher competence in form than in meaning.

02

Psycholinguistic and neurolinguistic assessments reveal performance-competence discrepancy.

03

Instruction tuning improves performance but not underlying competence.

Abstract

This study investigates the linguistic understanding of Large Language Models (LLMs) regarding signifier (form) and signified (meaning) by distinguishing two LLM assessment paradigms: psycholinguistic and neurolinguistic. Traditional psycholinguistic evaluations often reflect statistical rules that may not accurately represent LLMs' true linguistic competence. We introduce a neurolinguistic approach, utilizing a novel method that combines minimal pair and diagnostic probing to analyze activation patterns across model layers. This method allows for a detailed examination of how LLMs represent form and meaning, and whether these representations are consistent across languages. We found: (1) Psycholinguistic and neurolinguistic methods reveal that language performance and competence are distinct; (2) Direct probability measurement may not accurately assess linguistic competence; (3)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling