Analogy Generation by Prompting Large Language Models: A Case Study of InstructGPT
Bhavya Bhavya, Jinjun Xiong, Chengxiang Zhai

TL;DR
This paper explores how to effectively prompt InstructGPT to generate meaningful analogies and explanations, analyzing prompt design sensitivity and comparing model sizes, with the largest model reaching human-level performance in analogy generation.
Contribution
It systematically studies prompt design for analogy generation with InstructGPT and evaluates the impact of model size and prompt variations on output quality.
Findings
Effective prompts are precise imperative statements.
Model sensitivity varies with prompt phrasing and spelling errors.
Largest InstructGPT model achieves human-level analogy quality.
Abstract
We propose a novel application of prompting Pre-trained Language Models (PLMs) to generate analogies and study how to design effective prompts for two task settings: generating a source concept analogous to a given target concept (aka Analogous Concept Generation or ACG), and generating an explanation of the similarity between a given pair of target concept and source concept (aka Analogous Explanation Generation or AEG). We found that it is feasible to prompt InstructGPT to generate meaningful analogies and the best prompts tend to be precise imperative statements especially with a low temperature setting. We also systematically analyzed the sensitivity of the InstructGPT model to prompt design, temperature, and injected spelling errors, and found that the model is particularly sensitive to certain variations (e.g., questions vs. imperative statements). Further, we conducted human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
