Navigating Semantic Relations: Challenges for Language Models in Abstract Common-Sense Reasoning
Cole Gawin, Yidan Sun, Mayank Kejriwal

TL;DR
This paper evaluates large language models' ability to perform abstract common-sense reasoning using ConceptNet, revealing significant gaps compared to humans but also potential improvements through prompt engineering.
Contribution
It introduces two prompting methods for assessing common-sense reasoning in LLMs and systematically analyzes their performance and biases using ConceptNet.
Findings
Models perform well in relation ranking but poorly in single-relation prediction.
Few-shot prompting improves accuracy when selecting from limited relations.
Prompt engineering and selective retrieval can enhance reasoning performance.
Abstract
Large language models (LLMs) have achieved remarkable performance in generating human-like text and solving reasoning tasks of moderate complexity, such as question-answering and mathematical problem-solving. However, their capabilities in tasks requiring deeper cognitive skills, such as common-sense understanding and abstract reasoning, remain under-explored. In this paper, we systematically evaluate abstract common-sense reasoning in LLMs using the ConceptNet knowledge graph. We propose two prompting approaches: instruct prompting, where models predict plausible semantic relationships based on provided definitions, and few-shot prompting, where models identify relations using examples as guidance. Our experiments with the gpt-4o-mini model show that in instruct prompting, consistent performance is obtained when ranking multiple relations but with substantial decline when the model is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Logic, Reasoning, and Knowledge · AI-based Problem Solving and Planning
