TokaMind: A Multi-Modal Transformer Foundation Model for Tokamak Plasma Dynamics

Tobia Boschi; Andrea Loreti; Nicola C. Amorisco; Rodrigo H. Ordonez-Hurtado; C\'ecile Rousseau; George K. Holt; Eszter Sz\'ekely; Alexander Whittle; Samuel Jackson; Adriano Agnello; Stanislas Pamela; Alessandra Pascale; Robert Akers; Juan Bernabe Moreno; Vassil Alexandrov; and Mykhaylo Zayats

arXiv:2602.15084·physics.plasm-ph·February 18, 2026

TokaMind: A Multi-Modal Transformer Foundation Model for Tokamak Plasma Dynamics

Tobia Boschi, Andrea Loreti, Nicola C. Amorisco, Rodrigo H. Ordonez-Hurtado, C\'ecile Rousseau, George K. Holt, Eszter Sz\'ekely, Alexander Whittle, Samuel Jackson, Adriano Agnello, Stanislas Pamela, Alessandra Pascale, Robert Akers, Juan Bernabe Moreno, Vassil Alexandrov

PDF

Open Access

TL;DR

TokaMind is an open-source multi-modal transformer model for fusion plasma modeling that effectively integrates heterogeneous diagnostics, outperforming benchmarks and enabling efficient task adaptation in tokamak research.

Contribution

The paper introduces TokaMind, a novel multi-modal transformer framework for tokamak plasma data, with new multi-modal embedding techniques and superior performance on the MAST benchmark.

Findings

01

Fine-tuned TokaMind outperforms baseline on most tasks.

02

Lightweight fine-tuning can surpass training from scratch.

03

Multi-modal pretraining benefits tokamak plasma modeling.

Abstract

We present TokaMind, an open-source foundation model framework for fusion plasma modeling, based on a Multi-Modal Transformer (MMT) and trained on heterogeneous tokamak diagnostics from the publicly available MAST dataset. TokaMind supports multiple data modalities (time-series, 2D profiles, and videos) with different sampling rates, robust missing-signal handling, and efficient task adaptation via selectively loading and freezing four model components. To represent multi-modal signals, we use a training-free Discrete Cosine Transform embedding (DCT3D) and provide a clean interface for alternative embeddings (e.g., Variational Autoencoders - VAEs). We evaluate TokaMind on the recently introduced MAST benchmark TokaMark, comparing training and embedding strategies. Our results show that fine-tuned TokaMind outperforms the benchmark baseline on all but one task, and that, for several…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMagnetic confinement fusion research · Generative Adversarial Networks and Image Synthesis · Time Series Analysis and Forecasting