# MirathQA: A dataset for evaluating large language models on Hanbali Islamic inheritance reasoning tasks

**Authors:** Ameera Almasoud, Sharefah Al-Ghamdi, Reem Alqifari, Noof Alfear, Hend Al-Khalifa

PMC · DOI: 10.1016/j.dib.2026.112589 · Data in Brief · 2026-02-13

## TL;DR

MirathQA is a new dataset for testing how well Arabic language AI models handle complex Islamic inheritance law reasoning based on Hanbali jurisprudence.

## Contribution

The paper introduces MirathQA, the first structured Arabic dataset for evaluating Hanbali Islamic inheritance reasoning in large language models.

## Key findings

- MirathQA contains 1394 questions derived from 242 real-life inheritance cases with verified solutions.
- The dataset includes annotations for heirs, shares, blocked heirs, and reasoning skills like proportional reduction.
- It is split into training, validation, and test sets to ensure reproducibility and prevent leakage.

## Abstract

Islamic inheritance (Muwārīth/مواريث) refers to the distribution of a deceased person's estate among qualified heirs in accordance with Sharia laws derived from the Qur’an and Sunnah. This dataset focuses specifically on the Hanbali school of jurisprudence (المذهب الحنبلي), one of the four major Sunni schools, which has distinct rulings on residuary heirs (taʿsīb/التعصيب) and blocking mechanisms (al-ḥajb/الحجب). Despite recent advances, Arabic large language models (LLMs) often struggle with tasks requiring exact arithmetic, multi-step reasoning, and strict adherence to domain-specific rules. These challenges are particularly evident in Islamic inheritance law (ʿilm al-farāʾiḍ/علم الفرائض), where case resolution demands sequential rule application, identification of heirs and blocked heirs (ḥijb/حجب), handling of proportional reduction (ʿawl/عول) and return (radd/رد), and accounting for juristic differences. The lack of well-structured Arabic datasets further restricts systematic evaluation in this area. To address this gap, we introduce the MirathQA, comprising over 1394 questions derived from 242 real-life inheritance cases collected from different Arabic sources. Each case includes a textual description, verified solution, heirs and their corresponding shares, and annotations for blocked or modified heirs. Additional multiple-choice and true/false questions assess reasoning skills such as exclusion and proportional reduction. The dataset is divided into training (70%), validation (15%), and test (15%) splits at the case level to prevent leakage and ensure reproducibility. It is also provided in both CSV and Excel formats. This resource establishes a benchmark for evaluating Arabic LLM reasoning, while also serving education, interdisciplinary research, and legal AI applications.

## Full-text entities

- **Chemicals:** MCQ (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12950483/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12950483/full.md

## References

10 references — full list in the complete paper: https://tomesphere.com/paper/PMC12950483/full.md

---
Source: https://tomesphere.com/paper/PMC12950483