Superstudent intelligence in thermodynamics
Rebecca Loubet, Pascal Zittlau, Marco Hoffmann, Luisa Vollmer, Sophie Fellenz, Heike Leitte, Fabian Jirasek, Johannes Lenhard, Hans Hasse

TL;DR
OpenAI's large language model o3 outperformed university students in a challenging thermodynamics exam, demonstrating machine intelligence in complex problem-solving traditionally seen as a human intellectual feat.
Contribution
This paper reports that a large language model can solve advanced thermodynamics problems better than students, highlighting a significant advancement in AI's reasoning capabilities.
Findings
Model o3 solved all exam problems correctly in zero-shot mode.
Model o3's score exceeded the best human student scores in over 10,000 exams.
This marks a turning point in AI's ability to perform complex scientific reasoning.
Abstract
In this short note, we report and analyze a striking event: OpenAI's large language model o3 has outwitted all students in a university exam on thermodynamics. The thermodynamics exam is a difficult hurdle for most students, where they must show that they have mastered the fundamentals of this important topic. Consequently, the failure rates are very high, A-grades are rare - and they are considered proof of the students' exceptional intellectual abilities. This is because pattern learning does not help in the exam. The problems can only be solved by knowledgeably and creatively combining principles of thermodynamics. We have given our latest thermodynamics exam not only to the students but also to OpenAI's most powerful reasoning model, o3, and have assessed the answers of o3 exactly the same way as those of the students. In zero-shot mode, the model o3 solved all problems correctly,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCognitive Computing and Networks · AI-based Problem Solving and Planning · Machine Learning in Materials Science
