Can a Lightweight Automated AI Pipeline Solve Research-Level Mathematical Problems?
Lve Meng (University of Science, Technology of China, Zhongguancun Academy), Weilong Zhao (Universit\'e Paris Cit\'e), Yanzhi Zhang (Zhongguancun Academy), Haoxiang Guan (Zhongguancun Academy), Jiyan He (Zhongguancun Academy)

TL;DR
This paper demonstrates that lightweight AI pipelines using advanced language models can generate and verify research-level mathematical proofs, opening new possibilities for automated research assistance in mathematics.
Contribution
It introduces a streamlined AI pipeline with next-generation models for solving complex research problems, validated on novel datasets and made accessible through open-source tools.
Findings
Successfully generated candidate proofs for all test problems.
Verified solutions for key problem sets and submitted results publicly.
Open-sourced code and user interface for broader adoption.
Abstract
Large language models (LLMs) have recently achieved remarkable success in generating rigorous mathematical proofs, with "AI for Math" emerging as a vibrant field of research (Ju et al., 2026). While these models have mastered competition-level benchmarks like the International Mathematical Olympiad (Huang et al., 2025; Duan et al., 2025) and show promise in research applications through auto-formalization (Wang et al., 2025), their deployment via lightweight, natural-language pipelines for research problems remains underexplored. In this work, we demonstrate that next-generation models (e.g., Gemini 3 Pro, GPT-5.2 Pro), when integrated into a streamlined automated pipeline optimized for citation-based verification, can solve sophisticated research-grade problems. We evaluate our pipeline on two novel datasets: (1) the ICCM (2025) problem sets (comparable to the S.-T. Yau College Student…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematics, Computing, and Information Processing · Machine Learning in Materials Science · Polynomial and algebraic computation
