IslamicLegalBench: Evaluating LLMs Knowledge and Reasoning of Islamic Law Across 1,200 Years of Islamic Pluralist Legal Traditions

Ezieddin Elmahjub; Junaid Qadir; Abdullah Mushtaq; Rafay Naeem; Ibrahim Ghaznavi; Waleed Iqbal

arXiv:2602.21226·cs.CL·February 26, 2026

IslamicLegalBench: Evaluating LLMs Knowledge and Reasoning of Islamic Law Across 1,200 Years of Islamic Pluralist Legal Traditions

Ezieddin Elmahjub, Junaid Qadir, Abdullah Mushtaq, Rafay Naeem, Ibrahim Ghaznavi, Waleed Iqbal

PDF

Open Access

TL;DR

IslamicLegalBench systematically evaluates LLMs' ability to reason about Islamic law, revealing significant knowledge gaps, hallucinations, and risks in current models across diverse jurisprudential tasks.

Contribution

First benchmark to assess LLMs' understanding and reasoning of Islamic law across multiple schools and complex tasks, exposing critical limitations.

Findings

01

Best model achieves 68% correctness with 21% hallucination

02

Few-shot prompting yields minimal improvements

03

High error rates in moderate-complexity tasks

Abstract

As millions of Muslims turn to LLMs like GPT, Claude, and DeepSeek for religious guidance, a critical question arises: Can these AI systems reliably reason about Islamic law? We introduce IslamicLegalBench, the first benchmark evaluating LLMs across seven schools of Islamic jurisprudence, with 718 instances covering 13 tasks of varying complexity. Evaluation of nine state-of-the-art models reveals major limitations: the best model achieves only 68% correctness with 21% hallucination, while several models fall below 35% correctness and exceed 55% hallucination. Few-shot prompting provides minimal gains, improving only 2 of 9 models by >1%. Moderate-complexity tasks requiring exact knowledge show the highest errors, whereas high-complexity tasks display apparent competence through semantic reasoning. False premise detection indicates risky sycophancy, with 6 of 9 models accepting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Law · Ethics and Social Impacts of AI