Large Language Models for Education: ChemTAsk -- An Open-Source Paradigm for Automated Q&A in the Graduate Classroom
Ryann M. Perez, Marie Shimogawa, Yanan Chang, Hoang Anh T. Phan, Jason, G. Marmorstein, Evan S. K. Yanagawa, and E. James Petersson

TL;DR
ChemTAsk is an open-source pipeline combining large language models with retrieval-augmented generation to provide accurate, context-specific educational assistance in graduate biological chemistry courses, performing comparably to human TAs.
Contribution
This work introduces ChemTAsk, a novel open-source system integrating LLMs with RAG for educational Q&A, demonstrating its effectiveness in a real classroom setting and benchmarking different models.
Findings
ChemTAsk performs on par with human TAs in understanding and accuracy.
OpenAI models show higher tolerance to prompt deviations and better self-assessment.
Students perceive ChemTAsk as helpful, correct, and faster than TAs.
Abstract
Large language models (LLMs) show promise for aiding graduate level education, but are limited by their training data and potential confabulations. We developed ChemTAsk, an open-source pipeline that combines LLMs with retrieval-augmented generation (RAG) to provide accurate, context-specific assistance. ChemTAsk utilizes course materials, including lecture transcripts and primary publications, to generate accurate responses to student queries. Over nine weeks in an advanced biological chemistry course at the University of Pennsylvania, students could opt in to use ChemTAsk for assistance in any assignment or to understand class material. Comparative analysis showed ChemTAsk performed on par with human teaching assistants (TAs) in understanding student queries and providing accurate information, particularly excelling in creative problem-solving tasks. In contrast, TAs were more precise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExpert finding and Q&A systems
