CONFIDE: Hallucination Assessment for Reliable Biomolecular Structure Prediction and Design
Zijun Gao, Mutian He, Shijia Sun, Hanqun Cao, Jingjie Zhang, Zihao Luo, Xiaorui Wang, Xiaojun Yao, Chang-Yu Hsieh, Chunbin Gu, Pheng Ann Heng

TL;DR
CONFIDE introduces a new evaluation framework combining energetic and topological assessments, significantly improving the reliability of protein structure predictions and related biomolecular modeling tasks.
Contribution
The paper presents CODE, an unsupervised metric for topological frustration, integrated with pLDDT into CONFIDE, a unified framework that enhances structure prediction evaluation.
Findings
CODE correlates with folding rates (0.82) better than pLDDT (0.33)
CONFIDE improves correlation with RMSD to 0.73 from 0.42
Outperforms existing metrics across diverse biomolecular systems
Abstract
Reliable evaluation of protein structure predictions remains challenging, as metrics like pLDDT capture energetic stability but often miss subtle errors such as atomic clashes or conformational traps reflecting topological frustration within the protein folding energy landscape. We present CODE (Chain of Diffusion Embeddings), a self evaluating metric empirically found to quantify topological frustration directly from the latent diffusion embeddings of the AlphaFold3 series of structure predictors in a fully unsupervised manner. Integrating this with pLDDT, we propose CONFIDE, a unified evaluation framework that combines energetic and topological perspectives to improve the reliability of AlphaFold3 and related models. CODE strongly correlates with protein folding rates driven by topological frustration, achieving a correlation of 0.82 compared to pLDDT's 0.33 (a relative improvement of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Computational Drug Discovery Methods · Machine Learning in Bioinformatics
