TL;DR
This paper explores how large language models demonstrate human-like temporal cognition, including establishing a subjective temporal reference point and logarithmic perception of time, through detailed neuronal and representational analyses.
Contribution
It uncovers the mechanisms behind temporal cognition in LLMs, revealing hierarchical temporal representations and the influence of training data structure, offering a new experientialist perspective.
Findings
LLMs spontaneously establish a subjective temporal reference point.
LLMs follow the Weber-Fechner law in temporal perception.
Hierarchical construction of temporal representations in LLM layers.
Abstract
As Large Language Models (LLMs) continue to advance, they exhibit certain cognitive patterns similar to those of humans that are not directly specified in training data. This study investigates this phenomenon by focusing on temporal cognition in LLMs. Leveraging the similarity judgment task, we find that larger models spontaneously establish a subjective temporal reference point and adhere to the Weber-Fechner law, whereby the perceived distance logarithmically compresses as years recede from this reference point. To uncover the mechanisms behind this behavior, we conducted multiple analyses across neuronal, representational, and informational levels. We first identify a set of temporal-preferential neurons and find that this group exhibits minimal activation at the subjective reference point and implements a logarithmic coding scheme convergently found in biological systems. Probing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
