Perfect score on IPhO 2025 theory by Gemini agent

Yichen Huang

arXiv:2603.03352·physics.ed-ph·March 5, 2026

TL;DR

This paper presents a simple AI agent built on Gemini 3.1 Pro Preview that achieved a perfect score on the IPhO 2025 theory problems in each of five runs. The author notes that data contamination is possible because the model was released after the competition.

Contribution

The author builds and tests a simple AI agent that attains a perfect score on the IPhO 2025 theory problems, surpassing previously reported AI results, which reached gold medal level but fell behind the best human contestant.

Findings

01

The agent achieved a perfect score in each of five runs.

02

The result suggests that AI agents can handle the complex, principle-based reasoning that IPhO theory problems require.

03

Data contamination cannot be ruled out because Gemini 3.1 Pro Preview was released after the competition.

Abstract

The International Physics Olympiad (IPhO) is the world's most prestigious and renowned physics competition for pre-university students. IPhO problems require complex reasoning based on a deep understanding of physical principles in a standard general physics curriculum. On IPhO 2025 theory problems, gold medal performance by AI models was reported previously, but it fell behind that of the best human contestant. Here we build a simple agent with Gemini 3.1 Pro Preview. We ran it five times, and it achieved a perfect score every time. However, data contamination could have occurred because Gemini 3.1 Pro Preview was released after the competition.

Taxonomy

Topics: Computational Physics and Python Applications · Experimental and Theoretical Physics Studies · Science Education and Pedagogy