The AI Research Assistant: Promise, Peril, and a Proof of Concept
Tan Bui-Thanh

TL;DR
This paper presents a detailed case study demonstrating how human-AI collaboration can advance mathematical research, highlighting AI's strengths and limitations in a real-world discovery process.
Contribution
It provides empirical evidence of AI's capabilities and challenges in mathematical research through a comprehensive case study involving theorem discovery and proof verification.
Findings
AI excelled at algebraic manipulation and literature synthesis.
Human oversight was essential for verification and strategic guidance.
The workflow revealed patterns and failure modes in human-AI collaboration.
Abstract
Can artificial intelligence truly contribute to creative mathematical research, or does it merely automate routine calculations while introducing risks of error? We provide empirical evidence through a detailed case study: the discovery of novel error representations and bounds for Hermite quadrature rules via systematic human-AI collaboration. Working with multiple AI assistants, we extended results beyond what manual work achieved, formulating and proving several theorems with AI assistance. The collaboration revealed both remarkable capabilities and critical limitations. AI excelled at algebraic manipulation, systematic proof exploration, literature synthesis, and LaTeX preparation. However, every step required rigorous human verification, mathematical intuition for problem formulation, and strategic direction. We document the complete research workflow with unusual transparency,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
