Spectral Coherence Index: A Model-Free Metric for Protein Structural Ensemble Quality Assessment
Yuda Bi, Huaiwen Zhang, Jingnan Sun, Vince D Calhoun

TL;DR
The paper introduces the Spectral Coherence Index (SCI), a model-free metric derived from spectral analysis, to assess the quality of protein structural ensembles from NMR data, effectively distinguishing experimental data from noise.
Contribution
This work presents SCI as a novel, rotation-invariant, spectral summary metric that improves protein ensemble quality assessment, validated across large, heterogeneous datasets.
Findings
SCI achieves high discrimination accuracy with AUC-ROC > 0.97.
SCI remains robust across different validation schemes and datasets.
Combining SCI with other features enhances ensemble quality classification.
Abstract
Protein structural ensembles from NMR spectroscopy capture biologically important conformational heterogeneity, but it remains difficult to determine whether observed variation reflects coordinated motion or noise-like artifacts. We evaluate the Spectral Coherence Index (SCI), a model-free, rotation-invariant summary derived from the participation-ratio effective rank of the inter-model pairwise distance-variance matrix. Under grouped primary analysis of a Main110 cohort of 110 NMR ensembles (30--403 residues; 10--30 models per entry), SCI separated experimental ensembles from matched synthetic incoherent controls with AUC-ROC and Cliff's . Relative to an internal 27-protein pilot, discrimination softened modestly, showing that pilot-era thresholds do not transfer perfectly to a larger, more heterogeneous cohort: the primary operating point …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
