Feature Resemblance: Towards a Theoretical Understanding of Analogical Reasoning in Transformers
Ruichen Xu, Wenjing Yan, Ying-Jun Angela Zhang

TL;DR
This paper provides a theoretical framework for understanding how transformers perform analogical reasoning by encoding entities with similar features into aligned representations, supported by experiments on large models.
Contribution
It offers a formal analysis of the emergence of analogical reasoning in transformers, highlighting the importance of training curriculum and representational geometry.
Findings
Joint training on similarity and attribution enables analogical reasoning.
Sequential training requires learning similarity before attributes.
Two-hop reasoning reduces to explicit analogical structures in data.
Abstract
Understanding reasoning in large language models is complicated by evaluations that conflate multiple reasoning types. We isolate analogical reasoning (inferring shared properties between entities based on known similarities) and analyze its emergence in transformers. We theoretically prove three key results: (1) Joint training on similarity and attribution premises enables analogical reasoning through aligned representations; (2) Sequential training succeeds only when similarity structure is learned before specific attributes, revealing a necessary curriculum; (3) Two-hop reasoning () reduces to analogical reasoning with identity bridges (), which must appear explicitly in training data. These results reveal a unified mechanism: transformers encode entities with similar properties into similar representations, enabling property transfer through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Child and Animal Learning Development · Language and cultural evolution
