Can Large Language Models generalize analogy solving like children can?
Claire E. Stevenson, Alexandra Pafford, Han L. J. van der Maas, Melanie Mitchell

TL;DR
This paper investigates whether large language models can generalize analogy solving across different domains like humans, finding that unlike children and adults, LLMs struggle with robust transfer to new domains.
Contribution
The study provides empirical evidence that LLMs do not yet match human ability in analogical transfer across domains, highlighting a key limitation in current models.
Findings
Children and adults generalize analogies to new domains easily.
LLMs fail to transfer analogy solving to unfamiliar domains.
Humans outperform LLMs in robust analogical transfer tasks.
Abstract
In people, the ability to solve analogies such as "body : feet :: table : ?" emerges in childhood, and appears to transfer easily to other domains, such as the visual domain "( : ) :: < : ?". Recent research shows that large language models (LLMs) can solve various forms of analogies. However, can LLMs generalize analogy solving to new domains like people can? To investigate this, we had children, adults, and LLMs solve a series of letter-string analogies (e.g., a b : a c :: j k : ?) in the Latin alphabet, in a near transfer domain (Greek alphabet), and a far transfer domain (list of symbols). Children and adults easily generalized their knowledge to unfamiliar domains, whereas LLMs did not. This key difference between human and AI performance is evidence that these LLMs still struggle with robust human-like analogical transfer.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
