Out-of-Distribution Detection in Molecular Complexes via Diffusion Models for Irregular Graphs
David Graber, Victor Armegioiu, Rebecca Buller, Siddhartha Mishra

TL;DR
This paper introduces a probabilistic out-of-distribution detection framework for complex 3D graph data using a diffusion model that learns a density of the training distribution in an unsupervised manner, enabling reliable OOD detection and uncertainty estimation.
Contribution
The authors develop a unified continuous diffusion model over both 3D coordinates and categorical features, providing a principled likelihood-based OOD detection method for irregular graphs.
Findings
Likelihoods effectively identify OOD protein complexes.
Trajectory statistics enhance OOD detection sensitivity.
Likelihood correlates with prediction errors, enabling reliability estimates.
Abstract
Predictive machine learning models generally excel on in-distribution data, but their performance degrades on out-of-distribution (OOD) inputs. Reliable deployment therefore requires robust OOD detection, yet this is particularly challenging for irregular 3D graphs that combine continuous geometry with categorical identities and are unordered by construction. Here, we present a probabilistic OOD detection framework for complex 3D graph data built on a diffusion model that learns a density of the training distribution in a fully unsupervised manner. A key ingredient we introduce is a unified continuous diffusion over both 3D coordinates and discrete features: categorical identities are embedded in a continuous space and trained with cross-entropy, while the corresponding diffusion score is obtained analytically via posterior-mean interpolation from predicted class probabilities. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Bioinformatics and Genomic Networks · Protein Structure and Dynamics
