The Open Catalyst 2025 (OC25) Dataset and Models for Solid-Liquid Interfaces
Sushree Jagriti Sahoo, Mikael Maraschin, Daniel S. Levine, Zachary Ulissi, C. Lawrence Zitnick, Joel B Varley, Joseph A. Gauthier, Nitish Govindarajan, Muhammed Shuaibi

TL;DR
The paper introduces the OC25 dataset, the largest and most diverse collection of DFT calculations for solid-liquid interfaces, enabling improved machine learning models for catalysis research.
Contribution
It provides the first large-scale, diverse solid-liquid interface dataset and baseline models, advancing simulation capabilities and catalyst discovery.
Findings
State-of-the-art models achieve low energy and force errors.
The dataset covers 88 elements and various solvents.
Model performance improves with DFT force quality.
Abstract
Catalysis at solid-liquid interfaces plays a central role in the advancement of energy storage and sustainable chemical production technologies. By enabling accurate, long-time scale simulations, machine learning (ML) models have the potential to accelerate the discovery of (electro)catalysts. While prior Open Catalyst datasets (OC20 and OC22) have advanced the field by providing large-scale density functional theory (DFT) data of adsorbates on surfaces at solid-gas interfaces, they do not capture the critical role of solvent and electrolyte effects at solid-liquid interfaces. To bridge this gap, we introduce the Open Catalyst 2025 (OC25) dataset, consisting of 7,801,261 calculations across 1,511,270 unique explicit solvent environments. OC25 constitutes the largest and most diverse solid-liquid interface dataset that is currently available and provides configurational and elemental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCatalytic Processes in Materials Science
