Comparative Analysis of 47 Context-Based Question Answer Models Across 8 Diverse Datasets
Muhammad Muneeb, David B. Ascher, Ahsan Baidar Bakht

TL;DR
This study benchmarks 47 context-based question answering models across eight datasets, identifying top performers and analyzing factors affecting accuracy and efficiency, with implications for practical deployment.
Contribution
It provides a comprehensive comparison of 47 CBQA models on diverse datasets without additional fine-tuning, highlighting the best models and influencing factors.
Findings
Electra large discriminator model achieved 43% accuracy overall.
Model performance decreases with longer answers and more complex contexts.
Genetic algorithms can enhance accuracy by combining model responses.
Abstract
Context-based question answering (CBQA) models provide more accurate and relevant answers by considering the contextual information. They effectively extract specific information given a context, making them functional in various applications involving user support, information retrieval, and educational platforms. In this manuscript, we benchmarked the performance of 47 CBQA models from Hugging Face on eight different datasets. This study aims to identify the best-performing model across diverse datasets without additional fine-tuning. It is valuable for practical applications where the need to retrain models for specific datasets is minimized, streamlining the implementation of these models in various contexts. The best-performing models were trained on the SQuAD v2 or SQuAD v1 datasets. The best-performing model was ahotrod/electra_large_discriminator_squad2_512, which yielded 43\%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Expert finding and Q&A systems · Multimodal Machine Learning Applications
