Phaedra: Learning High-Fidelity Discrete Tokenization for the Physical Science
Levi Lingsch, Georgios Kissas, Johannes Jakubik, Siddhartha Mishra

TL;DR
This paper introduces Phaedra, a novel discrete tokenization method tailored for scientific images, enhancing the fidelity of physical and spectral properties in PDE-related data for improved deep learning applications.
Contribution
Phaedra is a new tokenization approach inspired by classical methods, specifically designed to better capture physical and spectral properties in scientific images, outperforming existing tokenizers.
Findings
Phaedra improves reconstruction accuracy across PDE datasets.
It demonstrates strong out-of-distribution generalization.
It effectively handles real-world Earth observation and weather data.
Abstract
Tokens are discrete representations that allow modern deep learning to scale by transforming high-dimensional data into sequences that can be efficiently learned, generated, and generalized to new tasks. These have become foundational for image and video generation and, more recently, physical simulation. As existing tokenizers are designed for the explicit requirements of realistic visual perception of images, it is necessary to ask whether these approaches are optimal for scientific images, which exhibit a large dynamic range and require token embeddings to retain physical and spectral properties. In this work, we investigate the accuracy of a suite of image tokenizers across a range of metrics designed to measure the fidelity of PDE properties in both physical and spectral space. Based on the observation that these struggle to capture both fine details and precise magnitudes, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Advanced Vision and Imaging
