From Regression to Classification: Exploring the Benefits of Categorical Representations of Energy in MLIPs
Ahmad Ali

TL;DR
This paper introduces a categorical, multi-class classification approach for Machine Learning Interatomic Potentials (MLIPs) that matches regression accuracy while providing a way to quantify model uncertainty through distribution entropy.
Contribution
It proposes a novel classification-based formulation for MLIPs that predicts energy and force distributions, enabling uncertainty quantification alongside accurate predictions.
Findings
Categorical approach achieves comparable accuracy to regression models.
Enables quantification of epistemic uncertainty via distribution entropy.
Provides richer supervision through multiple target distributions.
Abstract
Density Functional Theory (DFT) is a widely used computational method for estimating the energy and behavior of molecules. Machine Learning Interatomic Potentials (MLIPs) are models trained to approximate DFT-level energies and forces at dramatically lower computational cost. Many modern MLIPs rely on a scalar regression formulation; given information about a molecule, they predict a single energy value and corresponding forces while minimizing absolute error with DFT's calculations. In this work, we explore a multi-class classification formulation that predicts a categorical distribution over energy/force values, providing richer supervision through multiple targets. Most importantly, this approach offers a principled way to quantify model uncertainty. In particular, our method predicts a histogram of the energy/force distribution, converts scalar targets into histograms, and trains…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Protein Structure and Dynamics
