Meaning-infused grammar: Gradient Acceptability Shapes the Geometric Representations of Constructions in LLMs

Supantho Rakshit; Adele Goldberg

arXiv:2507.22286·cs.CL·September 10, 2025

Meaning-infused grammar: Gradient Acceptability Shapes the Geometric Representations of Constructions in LLMs

Supantho Rakshit, Adele Goldberg

PDF

TL;DR

This paper demonstrates that Large Language Models encode graded, meaning-infused representations of language constructions, with their internal geometric separability reflecting human-like preference strengths for different sentence structures.

Contribution

It provides empirical evidence that LLMs learn rich, graded, and meaning-infused representations of constructions, supporting the constructionist view of language.

Findings

01

Representation separability varies systematically with preference strength.

02

Prototypical exemplars are more geometrically distinct in activation space.

03

Results support geometric measures as tools for analyzing LLM representations.

Abstract

The usage-based constructionist (UCx) approach to language posits that language comprises a network of learned form-meaning pairings (constructions) whose use is largely determined by their meanings or functions, requiring them to be graded and probabilistic. This study investigates whether the internal representations in Large Language Models (LLMs) reflect the proposed function-infused gradience. We analyze representations of the English Double Object (DO) and Prepositional Object (PO) constructions in Pythia- $1.4$ B, using a dataset of $5000$ sentence pairs systematically varied by human-rated preference strength for DO or PO. Geometric analyses show that the separability between the two constructions' representations, as measured by energy distance or Jensen-Shannon divergence, is systematically modulated by gradient preference strength, which depends on lexical and functional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.