Physical Commonsense Reasoning for Lower-Resourced Languages and Dialects: a Study on Basque

Jaione Bengoetxea; Itziar Gonzalez-Dios; Rodrigo Agerri

arXiv:2602.14812·cs.CL·April 14, 2026

Physical Commonsense Reasoning for Lower-Resourced Languages and Dialects: a Study on Basque

Jaione Bengoetxea, Itziar Gonzalez-Dios, Rodrigo Agerri

PDF

1 Datasets

TL;DR

This study introduces BasPhyCo, a novel dataset for physical commonsense reasoning in Basque, evaluating multilingual LLMs and revealing limited capabilities in low-resource dialectal variants.

Contribution

It presents the first non-QA physical commonsense reasoning dataset for Basque and assesses LLM performance on hierarchical reasoning tasks in a low-resource language.

Findings

01

LLMs show limited physical commonsense understanding in Basque.

02

Dialectal variants pose additional challenges for LLM reasoning.

03

Performance drops are notable in verifiability tasks for low-resource languages.

Abstract

Physical commonsense reasoning represents a fundamental capability of human intelligence, enabling individuals to understand their environment, predict future events, and navigate physical spaces. Recent years have witnessed growing interest in reasoning tasks within Natural Language Processing (NLP). However, no prior research has examined the performance of Large Language Models (LLMs) on non-question-answering (non-QA) physical commonsense reasoning tasks in low-resource languages such as Basque. Taking the Italian GITA as a starting point, this paper addresses this gap by presenting BasPhyCo, the first non-QA physical commonsense reasoning dataset for Basque, available in both standard and dialectal variants. We evaluate model performance across three hierarchical levels of commonsense understanding: (1) distinguishing between plausible and implausible narratives (accuracy), (2)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

HiTZ/BasPhyCo
dataset· 47 dl
47 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.