Can Transformers Do Enumerative Geometry?
Baran Hashemi, Roderic G. Corominas, Alessandro Giacchetto

TL;DR
This paper introduces a Transformer-based method with a novel activation function and uncertainty quantification techniques to compute and analyze enumerative geometric intersection numbers, revealing emergent mathematical structures.
Contribution
It presents a new approach combining Transformers, a custom activation, and conformal prediction to model recursive geometric functions and uncover internal representations of asymptotic behaviors.
Findings
Transformer models can accurately compute intersection numbers across a wide range.
The Dynamic Range Activator enhances modeling of recursive and heteroscedastic patterns.
Transformers implicitly learn Virasoro constraints and asymptotic properties of intersection numbers.
Abstract
How can Transformers model and learn enumerative geometry? What is a robust procedure for using Transformers in abductive knowledge discovery within a mathematician-machine collaboration? In this work, we introduce a Transformer-based approach to computational enumerative geometry, specifically targeting the computation of -class intersection numbers on the moduli space of curves. By reformulating the problem as a continuous optimization task, we compute intersection numbers across a wide value range from to . To capture the recursive nature inherent in these intersection numbers, we propose the Dynamic Range Activator (DRA), a new activation function that enhances the Transformer's ability to model recursive patterns and handle severe heteroscedasticity. Given precision requirements for computing the intersections, we quantify the uncertainty of the…
Peer Reviews
Decision·ICLR 2025 Poster
* The new DRA functions, motivated by the evidence presented in the paper, are a significant contribution that may interest machine learning scientists. * Training a DynamicFormer to predict $\psi$-class intersection numbers, which then allows one to investigate a system's deeper geometry, is a significant, novel contribution that will interest mathematicians investigating enumerative geometry. * The use of Conformal Prediction to estimate uncertainty provides a concrete measure of confidence in
**Notice: These weaknesses have been adressed during the discussion phase, and apply only to the initial version of the manuscript. However, these will remain unedited for posterity.** ### Section 2 * The equations in this section use $\hbar$ without defining it in the text. It may be worth explicitly calling it the reduced Planck constant in the text. * The last paragraph mentions excluding the tensor $C$ due to a decreased impact on the computed $\psi$-class intersection numbers, observed dur
+ The idea of using transformers to do enumerative geometry is new. + Meanwhile, the authors proposed a new activation function, DRA, which found to be useful to improve the prediction performance. + The authors compared DRA with other popular activations functions in Figure 1 and Table 2. + Experiments show some evidence of transformers can learn to predict the $\psi$-class intersection numbers. + Meanwhile, the authors also presented a discussion on how transformers being able
I am not an expert in "enumerative geometry". However, I think the paper lacks many important clarifications and discussions. + The paper lacked discussion of the reasons/motivations of using transformers. At the moment, the paper seemed only a combination of a popular neural network architecture and a new mathematical problem. + The "Related Work" section is quite weak at the moment: the authors spent only one paragraph to discuss related works and then summarized their contributions. + F
The care and clarity of the writing, the fact that some extensive research has been done, the general trust in the results that this paper inspires.
What do we learn about machine learning or enumerative geometry? We seem to learn something that could be expected, a particular case of a general phenomenon.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Image Processing Techniques
MethodsDynamic Range Activator · Causal inference
