Navigating Semantic Relations: Challenges for Language Models in   Abstract Common-Sense Reasoning

Cole Gawin; Yidan Sun; Mayank Kejriwal

arXiv:2502.14086·cs.CL·February 21, 2025

Navigating Semantic Relations: Challenges for Language Models in Abstract Common-Sense Reasoning

Cole Gawin, Yidan Sun, Mayank Kejriwal

PDF

Open Access

TL;DR

This paper evaluates large language models' ability to perform abstract common-sense reasoning using ConceptNet, revealing significant gaps compared to humans but also potential improvements through prompt engineering.

Contribution

It introduces two prompting methods for assessing common-sense reasoning in LLMs and systematically analyzes their performance and biases using ConceptNet.

Findings

01

Models perform well in relation ranking but poorly in single-relation prediction.

02

Few-shot prompting improves accuracy when selecting from limited relations.

03

Prompt engineering and selective retrieval can enhance reasoning performance.

Abstract

Large language models (LLMs) have achieved remarkable performance in generating human-like text and solving reasoning tasks of moderate complexity, such as question-answering and mathematical problem-solving. However, their capabilities in tasks requiring deeper cognitive skills, such as common-sense understanding and abstract reasoning, remain under-explored. In this paper, we systematically evaluate abstract common-sense reasoning in LLMs using the ConceptNet knowledge graph. We propose two prompting approaches: instruct prompting, where models predict plausible semantic relationships based on provided definitions, and few-shot prompting, where models identify relations using examples as guidance. Our experiments with the gpt-4o-mini model show that in instruct prompting, consistent performance is obtained when ranking multiple relations but with substantial decline when the model is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Logic, Reasoning, and Knowledge · AI-based Problem Solving and Planning