ChatGPT Participates in a Computer Science Exam
Sebastian Bordt, Ulrike von Luxburg

TL;DR
This study evaluates ChatGPT's performance on a university computer science exam, finding it narrowly passes and highlighting its capabilities and limitations in understanding complex academic tasks.
Contribution
First systematic assessment of ChatGPT's exam performance in computer science, comparing GPT-3.5 and GPT-4, and analyzing its potential and limitations.
Findings
ChatGPT narrowly passes the exam with 20.5/40 points.
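GPT-4 would have obtained about 17% more exam points than GPT-3.5, reaching the performance of the average student.
The exam questions are structurally similar to material available online that might have been part of the training data, so the result does not show that ChatGPT understands computer science.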
Abstract
We asked ChatGPT to participate in an undergraduate computer science exam on "Algorithms and Data Structures". The program was evaluated on the entire exam as posed to the students. We hand-copied its answers onto an exam sheet, which was subsequently graded in a blind setup alongside those of 200 participating students. We find that ChatGPT narrowly passed the exam, obtaining 20.5 out of 40 points. This impressive performance indicates that ChatGPT can indeed succeed in challenging tasks like university exams. At the same time, the questions in our exam are structurally similar to those of other exams, solved homework problems, and teaching materials that can be found online and might have been part of ChatGPT's training data. Therefore, it would be inadequate to conclude from this experiment that ChatGPT has any understanding of computer science. We also assess the improvements…
Taxonomy
Topics: Artificial Intelligence in Healthcare and Education · Online Learning and Analytics · Explainable Artificial Intelligence (XAI)
Methods: Linear Layer · Attention Dropout · Weight Decay · Adam · Softmax · Attention Is All You Need · Residual Connection · Byte Pair Encoding · Dropout
