Scientific and Creative Analogies in Pretrained Language Models
Tamara Czinczoll, Helen Yannakoudakis, Pushkar Mishra, Ekaterina, Shutova

TL;DR
This paper introduces the SCAN dataset to evaluate the analogy reasoning abilities of large pretrained language models across dissimilar domains, revealing their current limitations in understanding complex analogies.
Contribution
The paper presents a new analogy dataset, SCAN, designed to test models on more realistic and complex analogical reasoning tasks across diverse domains.
Findings
State-of-the-art LMs perform poorly on SCAN analogies.
Existing datasets may overestimate models' analogy capabilities.
Challenges in analogy understanding remain significant for current models.
Abstract
This paper examines the encoding of analogy in large-scale pretrained language models, such as BERT and GPT-2. Existing analogy datasets typically focus on a limited set of analogical relations, with a high similarity of the two domains between which the analogy holds. As a more realistic setup, we introduce the Scientific and Creative Analogy dataset (SCAN), a novel analogy dataset containing systematic mappings of multiple attributes and relational structures across dissimilar domains. Using this dataset, we test the analogical reasoning capabilities of several widely-used pretrained language models (LMs). We find that state-of-the-art LMs achieve low performance on these complex analogy tasks, highlighting the challenges still posed by analogy understanding.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsMulti-Head Attention · Attention Is All You Need · Test · Byte Pair Encoding · Discriminative Fine-Tuning · Cosine Annealing · Linear Layer · Weight Decay · Residual Connection · Dense Connections
