LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ
Marc-Antoine Allard, Matin Ansaripour, Maria Yuffa, Paul Teiletche

TL;DR
LLaMa-SciQ is an educational chatbot designed to help students solve science MCQs, using fine-tuned LLaMa-8B with retrieval augmentation and quantization, achieving high accuracy and efficiency in mathematical reasoning tasks.
Contribution
The paper introduces LLaMa-SciQ, a fine-tuned and aligned LLaMa-based chatbot with retrieval augmentation and quantization, tailored for STEM MCQ assistance, and evaluates its performance.
Findings
LLaMa-SciQ achieved 74.5% accuracy on GSM8k dataset.
Quantization caused only a 5% performance loss.
Retrieval-Augmented Generation did not improve accuracy.
Abstract
Large Language Models (LLMs) often struggle with tasks requiring mathematical reasoning, particularly multiple-choice questions (MCQs). To address this issue, we developed LLaMa-SciQ, an educational chatbot designed to assist college students in solving and understanding MCQs in STEM fields. We begin by fine-tuning and aligning the models to human preferences. After comparing the performance of Mistral-7B and LLaMa-8B, we selected the latter as the base model due to its higher evaluation accuracy. To further enhance accuracy, we implement Retrieval-Augmented Generation (RAG) and apply quantization to compress the model, reducing inference time and increasing accessibility for students. For mathematical reasoning, LLaMa-SciQ achieved 74.5% accuracy on the GSM8k dataset and 30% on the MATH dataset. However, RAG does not improve performance and even reduces it, likely due to retriever…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in Service Interactions · Topic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Attention Dropout · Dense Connections · Multi-Head Attention · Linear Warmup With Linear Decay · Weight Decay · Adam · WordPiece
