Few-Shot Prompting for Extractive Quranic QA with Instruction-Tuned LLMs

Mohamed Basem; Islam Oshallah; Ali Hamdi; Ammar Mohammed

arXiv:2508.06103·cs.CL·August 11, 2025

Few-Shot Prompting for Extractive Quranic QA with Instruction-Tuned LLMs

Mohamed Basem, Islam Oshallah, Ali Hamdi, Ammar Mohammed

PDF

Open Access

TL;DR

This paper demonstrates that few-shot prompting with instruction-tuned large language models significantly improves extractive Quranic question answering, especially in low-resource, semantically complex contexts.

Contribution

It introduces a specialized Arabic prompt framework and a post-processing system, advancing extractive QA performance with instruction-tuned LLMs for Quranic texts.

Findings

01

Large language models outperform traditional fine-tuned models.

02

The best configuration achieves a pAP10 score of 0.637.

03

Prompt-based instruction tuning is effective for low-resource, semantically rich QA tasks.

Abstract

This paper presents two effective approaches for Extractive Question Answering (QA) on the Quran. It addresses challenges related to complex language, unique terminology, and deep meaning in the text. The second uses few-shot prompting with instruction-tuned large language models such as Gemini and DeepSeek. A specialized Arabic prompt framework is developed for span extraction. A strong post-processing system integrates subword alignment, overlap suppression, and semantic filtering. This improves precision and reduces hallucinations. Evaluations show that large language models with Arabic instructions outperform traditional fine-tuned models. The best configuration achieves a pAP10 score of 0.637. The results confirm that prompt-based instruction tuning is effective for low-resource, semantically rich QA tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Mathematics, Computing, and Information Processing · Handwritten Text Recognition Techniques