TokaMark: A Comprehensive Benchmark for MAST Tokamak Plasma Models
Cécile Rousseau, Samuel Jackson, Rodrigo H. Ordonez-Hurtado, Nicola C. Amorisco, Tobia Boschi, George K. Holt, Andrea Loreti, Eszter Székely, Alexander Whittle, Adriano Agnello, Stanislas Pamela, Alessandra Pascale, Robert Akers, Juan Bernabe Moreno, Sue Thorne

TL;DR
TokaMark is a comprehensive, open-source benchmark dataset and framework designed to evaluate AI models on real MAST tokamak plasma data, promoting reproducibility and progress in fusion energy research.
Contribution
It introduces a standardized, multi-task benchmark with curated data and tools for consistent evaluation of AI models in fusion plasma modeling.
Findings
Provides unified access to multi-modal fusion data
Includes 14 diverse tasks covering various plasma physics mechanisms
Establishes baseline models for comparison
Abstract
Development and operation of commercially viable fusion energy reactors such as tokamaks require accurate predictions of plasma dynamics from sparse, noisy, and incomplete sensor readings. The complexity of the underlying physics and the heterogeneity of experimental data pose formidable challenges for conventional numerical methods, while simultaneously highlighting the promise of modern data-native AI approaches. A major obstacle in realizing this potential is, however, the lack of curated, openly available datasets and standardized benchmarks. Existing fusion datasets are scarce, fragmented across institutions, facility-specific, and inconsistently annotated, which limits reproducibility and prevents a fair and scalable comparison of AI approaches. In this paper, we introduce TokaMark, a structured benchmark to evaluate AI models on real experimental data collected from the Mega Ampere…
Taxonomy
Topics: Magnetic confinement fusion research · Cold Fusion and Nuclear Reactions · Scientific Computing and Data Management
