Decoy Selection for Protein Structure Prediction Via Extreme Gradient Boosting and Ranking
Nasrin Akhter, Gopinath Chennupati, Hristo Djidjev, Amarda, Shehu

TL;DR
This paper introduces ML-Select, a machine learning framework that improves decoy selection in protein structure prediction by leveraging energy landscape features, outperforming existing methods across diverse test cases.
Contribution
The paper presents a novel ML-based decoy selection method that generalizes well across varied datasets and handles low-quality decoy sets more effectively than traditional approaches.
Findings
ML-Select outperforms clustering and energy ranking methods.
It maintains high performance across diverse test cases.
Shows effectiveness even with predominantly low-quality decoys.
Abstract
Identifying one or more biologically-active/native decoys from millions of non-native decoys is one of the major challenges in computational structural biology. The extreme lack of balance in positive and negative samples (native and non-native decoys) in a decoy set makes the problem even more complicated. Consensus methods show varied success in handling the challenge of decoy selection despite some issues associated with clustering large decoy sets and decoy sets that do not show much structural similarity. Recent investigations into energy landscape-based decoy selection approaches show promises. However, lack of generalization over varied test cases remains a bottleneck for these methods. We propose a novel decoy selection method, ML-Select, a machine learning framework that exploits the energy landscape associated with the structure space probed through a template-free decoy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
