Modeling the human lexicon under temperature variations: linguistic factors, diversity and typicality in LLM word associations

Maria Andueza Rodriguez; Marie Candito; Richard Huyghe

arXiv:2603.18171·cs.CL·March 20, 2026

Modeling the human lexicon under temperature variations: linguistic factors, diversity and typicality in LLM word associations

Maria Andueza Rodriguez, Marie Candito, Richard Huyghe

PDF

Open Access

TL;DR

This study compares human and large language model word associations to understand how models replicate human lexical patterns, focusing on factors like frequency, concreteness, variability, and typicality across different model sizes and temperature settings.

Contribution

It provides a detailed analysis of how LLMs mimic human lexical behavior, highlighting the effects of model size and temperature on response variability and typicality.

Findings

01

Larger models produce more typical responses with less variability.

02

Temperature increases response variability but decreases typicality.

03

Models replicate human trends for frequency and concreteness.

Abstract

Large language models (LLMs) achieve impressive results in terms of fluency in text generation, yet the nature of their linguistic knowledge - in particular the human-likeness of their internal lexicon - remains uncertain. This study compares human and LLM-generated word associations to evaluate how accurately models capture human lexical patterns. Using English cue-response pairs from the SWOW dataset and newly generated associations from three LLMs (Mistral-7B, Llama-3.1-8B, and Qwen-2.5-32B) across multiple temperature settings, we examine (i) the influence of lexical factors such as word frequency and concreteness on cue-response pairs, and (ii) the variability and typicality of LLM responses compared to human responses. Results show that all models mirror human trends for frequency and concreteness but differ in response variability and typicality. Larger models such as Qwen tend…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text Readability and Simplification · Language and cultural evolution