Zatom-1: Towards a Multimodal Foundation Model for 3D Molecules and Materials

Alex Morehead; Miruna Cretu; Antonia Panescu; Rishabh Anand; Maurice Weiler; Tynan Perez; Samuel Blau; Steven Farrell; Wahid Bhimji; Anubhav Jain; Hrushikesh Sahasrabuddhe; Pietro Lio; Tommi Jaakkola; Rafael Gomez-Bombarelli; Rex Ying; N. Benjamin Erichson; and Michael W. Mahoney

arXiv:2602.22251·cs.LG·May 14, 2026

Zatom-1: Towards a Multimodal Foundation Model for 3D Molecules and Materials

Alex Morehead, Miruna Cretu, Antonia Panescu, Rishabh Anand, Maurice Weiler, Tynan Perez, Samuel Blau, Steven Farrell, Wahid Bhimji, Anubhav Jain, Hrushikesh Sahasrabuddhe, Pietro Lio, Tommi Jaakkola, Rafael Gomez-Bombarelli, Rex Ying, N. Benjamin Erichson, and Michael W. Mahoney

PDF

1 Repo 1 Models

TL;DR

Zatom-1 is a versatile multimodal Transformer model that unifies generative and predictive tasks for 3D molecules and materials, enabling scalable pretraining and transfer learning across domains.

Contribution

It introduces a simplified Transformer architecture trained with a multimodal flow matching objective for cross-domain 3D molecular and material modeling.

Findings

01

Outperforms specialized baselines on generative and predictive benchmarks.

02

Enables fast, stable sampling with scalable pretraining.

03

Improves molecular property prediction via transfer from material modeling.

Abstract

General-purpose 3D modeling in chemistry encompasses molecules and materials, requiring both generative and predictive capabilities. However, most existing AI approaches are optimized for a single domain (molecules or materials) and a single task (generation or prediction), which limits representation sharing and transfer. We introduce Zatom-1, a cross-domain, general-purpose model architecture that unifies generative and predictive learning of 3D molecules and materials. Zatom-1 is a deliberately simplified Transformer trained with a multimodal flow matching objective that jointly models discrete atom types and continuous 3D geometries. This approach supports scalable pretraining with predictable gains as model capacity increases, while enabling fast and stable sampling. We use cross-domain generative pretraining as a universal initialization for downstream multi-task prediction of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Zatom-AI/zatom
github

Models

🤗
Zatom-AI/Zatom-1
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.