Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants
Beatriz Borges, Negar Foroutan, Deniz Bayazit, Anna Sotnikova, Syrielle Montariol, Tanya Nazaretzky, Mohammadreza Banaei, Alireza Sakhaeirad, Philippe Servant, Seyed Parsa Neshaei, Jibril Frej, Angelika Romanou, Gail Weiss, Sepideh Mamooler, Zeming Chen, Simin Fan, Silin Gao

TL;DR
This paper assesses the vulnerability of higher education assessments to AI assistants by measuring GPT-3.5 and GPT-4's ability to answer university-level STEM questions, revealing significant risks to current evaluation methods.
Contribution
It provides a novel dataset of assessment questions and evaluates AI performance, highlighting the need to revise assessment strategies in higher education due to AI capabilities.
Findings
GPT-4 answers 65.8% of questions correctly
AI can pass core assessments in various degree programs
Assessment design must be revised to address AI capabilities
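An accuracy figure like the one above aggregates graded responses over questions and prompting strategies. The sketch below shows one way such an average could be computed. It is illustrative only: the records, the strategy labels (`zero_shot`, `chain_of_thought`), and the function names are hypothetical and are not taken from the paper or its dataset.

```python
from statistics import mean

# Hypothetical grading records: one entry per (question, prompting strategy).
# "correct" is True when the response was graded as correct. Made-up data.
records = [
    {"question": "q1", "strategy": "zero_shot", "correct": True},
    {"question": "q1", "strategy": "chain_of_thought", "correct": True},
    {"question": "q2", "strategy": "zero_shot", "correct": False},
    {"question": "q2", "strategy": "chain_of_thought", "correct": True},
    {"question": "q3", "strategy": "zero_shot", "correct": False},
    {"question": "q3", "strategy": "chain_of_thought", "correct": False},
]

def accuracy_by_strategy(records):
    """Return {strategy: fraction of its responses graded correct}."""
    grouped = {}
    for r in records:
        grouped.setdefault(r["strategy"], []).append(r["correct"])
    return {s: sum(v) / len(v) for s, v in grouped.items()}

def mean_accuracy(records):
    """Average the per-strategy accuracies into a single number."""
    return mean(accuracy_by_strategy(records).values())

if __name__ == "__main__":
    print(accuracy_by_strategy(records))
    print(mean_accuracy(records))
```

Averaging per strategy first, and then across strategies, keeps a strategy with more graded responses from dominating the overall figure. Whether the paper weights strategies this way is an assumption.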
Abstract
AI assistants are being increasingly used by students enrolled in higher education institutions. While these tools provide opportunities for improved teaching and education, they also pose significant challenges for assessment and learning outcomes. We conceptualize these challenges through the lens of vulnerability, the potential for university assessments and learning outcomes to be impacted by student use of generative AI. We investigate the potential scale of this vulnerability by measuring the degree to which AI assistants can complete assessment questions in standard university-level STEM courses. Specifically, we compile a novel dataset of textual assessment questions from 50 courses at EPFL and evaluate whether two AI assistants, GPT-3.5 and GPT-4, can adequately answer these questions. We use eight prompting strategies to produce responses and find that GPT-4 answers an average…
Taxonomy
Methods: Attention Is All You Need · Adam · Layer Normalization · Weight Decay · Position-Wise Feed-Forward Layer · Dense Connections · Attention Dropout
