Comparative Analysis of 47 Context-Based Question Answer Models Across 8 Diverse Datasets

Muhammad Muneeb; David B. Ascher; Ahsan Baidar Bakht

arXiv:2512.00323 · cs.CL · December 2, 2025

Open Access

TL;DR

This study benchmarks 47 context-based question answering models across eight datasets, identifying top performers and analyzing factors affecting accuracy and efficiency, with implications for practical deployment.

Contribution

It provides a comprehensive comparison of 47 CBQA models on diverse datasets without additional fine-tuning, highlighting the best models and the factors that influence their performance.

Findings

01

The ELECTRA large discriminator model (ahotrod/electra_large_discriminator_squad2_512) was the top performer, with 43% accuracy overall.

02

Model performance decreases with longer answers and more complex contexts.

03

Genetic algorithms can enhance accuracy by combining model responses.
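
The third finding can be illustrated with a minimal sketch. It assumes each model's answers are already collected as strings, and it evolves a bitmask that selects which models take part in a majority vote. The function names, the voting rule, the GA settings, and the toy data are all illustrative assumptions, not the authors' procedure.

```python
import random
from collections import Counter

def ensemble_accuracy(mask, predictions, gold):
    """Accuracy of majority voting over the models switched on in `mask`."""
    chosen = [preds for keep, preds in zip(mask, predictions) if keep]
    if not chosen:
        return 0.0
    correct = 0
    for i, answer in enumerate(gold):
        votes = Counter(preds[i] for preds in chosen)
        # Ties go to the answer seen first, i.e. the earliest selected model.
        correct += votes.most_common(1)[0][0] == answer
    return correct / len(gold)

def evolve_mask(predictions, gold, generations=30, pop_size=20,
                mutation_rate=0.1, seed=0):
    """Search for a model subset with a small elitist genetic algorithm."""
    rng = random.Random(seed)
    n = len(predictions)

    def fitness(mask):
        return ensemble_accuracy(mask, predictions, gold)

    # Start from every single-model mask, so the result is never worse
    # than the best individual model, then pad with random masks.
    population = [tuple(j == i for j in range(n)) for i in range(n)]
    while len(population) < pop_size:
        population.append(tuple(rng.random() < 0.5 for _ in range(n)))

    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        elites = population[: max(2, len(population) // 2)]
        children = []
        while len(elites) + len(children) < len(population):
            a, b = rng.sample(elites, 2)
            child = tuple(rng.choice(pair) for pair in zip(a, b))  # uniform crossover
            child = tuple(bit != (rng.random() < mutation_rate) for bit in child)  # bit flips
            children.append(child)
        population = elites + children
    return max(population, key=fitness)

# Toy data (hypothetical): three models, each wrong on two different questions.
GOLD = ["a", "b", "c", "d", "e", "f"]
PREDICTIONS = [
    ["a", "b", "x", "d", "e", "x"],
    ["a", "x", "c", "d", "x", "f"],
    ["x", "b", "c", "x", "e", "f"],
]

best_mask = evolve_mask(PREDICTIONS, GOLD)
best_score = ensemble_accuracy(best_mask, PREDICTIONS, GOLD)
```

In this toy data each model alone answers four of six questions correctly, while a vote over all three answers every question correctly. Because the mask is tuned on the same questions it is scored on, a real experiment would need a held-out split to measure any gain honestly.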

Abstract

Context-based question answering (CBQA) models provide more accurate and relevant answers by considering contextual information. They effectively extract specific information from a given context, making them useful in applications such as user support, information retrieval, and educational platforms. In this manuscript, we benchmarked the performance of 47 CBQA models from Hugging Face on eight different datasets. This study aims to identify the best-performing model across diverse datasets without additional fine-tuning. This is valuable for practical applications because it minimizes the need to retrain models for specific datasets, streamlining their implementation in various contexts. The best-performing models were trained on the SQuAD v2 or SQuAD v1 datasets. The best-performing model was ahotrod/electra_large_discriminator_squad2_512, which yielded 43%…
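
The benchmarking setup described in the abstract, scoring a pretrained model on question, context, and answer triples with no fine-tuning, can be sketched as follows. The scoring rule here (exact match after SQuAD-style normalization), the function names, and the toy examples are assumptions for illustration; the excerpt above does not state how the paper computes accuracy.

```python
import re
import string

def normalize(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())

def exact_match_accuracy(predict, examples):
    """Fraction of examples where the normalized prediction equals the gold answer.

    `predict` is any callable taking (question, context) and returning a string.
    """
    examples = list(examples)
    if not examples:
        return 0.0
    hits = sum(
        normalize(predict(question, context)) == normalize(gold)
        for question, context, gold in examples
    )
    return hits / len(examples)

def first_sentence_predictor(question, context):
    """Stand-in for a real model: returns the first sentence of the context."""
    return context.split(".")[0]

# Hypothetical examples, not drawn from the paper's datasets.
EXAMPLES = [
    ("Where is the Eiffel Tower?", "Paris. It is the capital of France.", "paris"),
    ("Who wrote Hamlet?", "The play was written by Shakespeare.", "Shakespeare"),
]

accuracy = exact_match_accuracy(first_sentence_predictor, EXAMPLES)

# To score a real model, `predict` could wrap a Hugging Face pipeline, e.g.
#   qa = transformers.pipeline("question-answering", model=MODEL_NAME)
#   predict = lambda q, c: qa(question=q, context=c)["answer"]
# where MODEL_NAME is a placeholder for one of the benchmarked checkpoints.
```

Keeping the predictor as a plain callable means the same scoring loop can be reused for every model and dataset, which is the shape a many-model benchmark of this kind needs.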

Taxonomy

Topics: Topic Modeling · Expert finding and Q&A systems · Multimodal Machine Learning Applications