# Analysis of argument structure constructions in a deep recurrent language model

**Authors:** Pegah Ramezani, Achim Schilling, Patrick Krauss

PMC · DOI: 10.3389/fncom.2025.1474860 · 2025-06-16

## TL;DR

This paper investigates how a recurrent neural network processes different types of sentence structures, showing that even simple models can form abstract language representations.

## Contribution

The study demonstrates that a brain-constrained LSTM model can form construction-level representations of ASCs through prediction-based learning.

## Key findings

- Distinct clusters for four ASCs were observed across all hidden layers of the LSTM.
- The strongest separation of ASC clusters was found in the final hidden layer.
- The results align with prior findings in large language models, showing similar abstract representations.

## Abstract

Understanding how language and linguistic constructions are processed in the brain is a fundamental question in cognitive computational neuroscience. This study builds directly on our previous work analyzing Argument Structure Constructions (ASCs) in the BERT language model, extending the investigation to a simpler, brain-constrained architecture: a recurrent neural language model. Specifically, we explore the representation and processing of four ASCs–transitive, ditransitive, caused-motion, and resultative–in a Long Short-Term Memory (LSTM) network. We trained the LSTM on a custom GPT-4-generated dataset of 2,000 syntactically balanced sentences. We then analyzed the internal hidden layer activations using Multidimensional Scaling (MDS) and t-Distributed Stochastic Neighbor Embedding (t-SNE) to visualize sentence representations. The Generalized Discrimination Value (GDV) was calculated to quantify cluster separation. Our results show distinct clusters for the four ASCs across all hidden layers, with the strongest separation observed in the final layer. These findings are consistent with our earlier study based on a large language model and demonstrate that even relatively simple RNNs can form abstract, construction-level representations. This supports the hypothesis that hierarchical linguistic structure can emerge through prediction-based learning. In future work, we plan to compare these model-derived representations with neuroimaging data from continuous speech perception, further bridging computational and biological perspectives on language processing.

## Full-text entities

- **Diseases:** PK (MESH:C564858), LSTM (MESH:D000088562), ASC (MESH:D065309)
- **Chemicals:** Cm (MESH:D003476), Cl (MESH:D002713)
- **Species:** Felis catus (cat, species) [taxon 9685], Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12206872/full.md

---
Source: https://tomesphere.com/paper/PMC12206872