LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ

Marc-Antoine Allard; Matin Ansaripour; Maria Yuffa; Paul Teiletche

arXiv:2409.16779·cs.AI·September 26, 2024

LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ

Marc-Antoine Allard, Matin Ansaripour, Maria Yuffa, Paul Teiletche

PDF

Open Access

TL;DR

LLaMa-SciQ is an educational chatbot designed to help students solve science MCQs, using fine-tuned LLaMa-8B with retrieval augmentation and quantization, achieving high accuracy and efficiency in mathematical reasoning tasks.

Contribution

The paper introduces LLaMa-SciQ, a fine-tuned and aligned LLaMa-based chatbot with retrieval augmentation and quantization, tailored for STEM MCQ assistance, and evaluates its performance.

Findings

01

LLaMa-SciQ achieved 74.5% accuracy on GSM8k dataset.

02

Quantization caused only a 5% performance loss.

03

Retrieval-Augmented Generation did not improve accuracy.

Abstract

Large Language Models (LLMs) often struggle with tasks requiring mathematical reasoning, particularly multiple-choice questions (MCQs). To address this issue, we developed LLaMa-SciQ, an educational chatbot designed to assist college students in solving and understanding MCQs in STEM fields. We begin by fine-tuning and aligning the models to human preferences. After comparing the performance of Mistral-7B and LLaMa-8B, we selected the latter as the base model due to its higher evaluation accuracy. To further enhance accuracy, we implement Retrieval-Augmented Generation (RAG) and apply quantization to compress the model, reducing inference time and increasing accessibility for students. For mathematical reasoning, LLaMa-SciQ achieved 74.5% accuracy on the GSM8k dataset and 30% on the MATH dataset. However, RAG does not improve performance and even reduces it, likely due to retriever…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in Service Interactions · Topic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Attention Dropout · Dense Connections · Multi-Head Attention · Linear Warmup With Linear Decay · Weight Decay · Adam · WordPiece