Enhancing classroom teaching with LLMs and RAG
Elizabeth A Mullins, Adrian Portillo, Kristalys Ruiz-Rohena, Aritran, Piplai

TL;DR
This paper explores using Retrieval-Augmented Generation (RAG) with course materials to improve K-12 education, evaluating Reddit as a data source for cybersecurity information and analyzing chunk size effects on answer accuracy.
Contribution
The study demonstrates how RAG pipelines can assess data sources for educational content and highlights Reddit's limitations for cybersecurity questions.
Findings
Reddit is ineffective for cybersecurity question answering with RAG.
Optimal chunk size for context was identified but did not improve accuracy.
RAG methodology can evaluate data source suitability for educational purposes.
Abstract
Large Language Models have become a valuable source of information for our daily inquiries. However, after training, its data source quickly becomes out-of-date, making RAG a useful tool for providing even more recent or pertinent data. In this work, we investigate how RAG pipelines, with the course materials serving as a data source, might help students in K-12 education. The initial research utilizes Reddit as a data source for up-to-date cybersecurity information. Chunk size is evaluated to determine the optimal amount of context needed to generate accurate answers. After running the experiment for different chunk sizes, answer correctness was evaluated using RAGAs with average answer correctness not exceeding 50 percent for any chunk size. This suggests that Reddit is not a good source to mine for data for questions about cybersecurity threats. The methodology was successful in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsAttention Is All You Need · Linear Layer · Softmax · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Layer Normalization · Linear Warmup With Linear Decay · WordPiece · Adam
