Automatic Feature Selection in Markov State Models Using Genetic Algorithm
Qihua Chen, Jiangyan Feng, Shriyaa Mittal, Diwakar Shukla

TL;DR
This paper presents an automated method using genetic algorithms to optimize feature selection in Markov State Models, improving the accuracy of biomolecular dynamics predictions from molecular simulations.
Contribution
It introduces a novel GA-based approach for automatic feature selection in MSMs, addressing a key challenge in modeling complex biomolecular systems.
Findings
Successfully applied to four proteins with 28-80 residues
Demonstrates improved MSM accuracy and robustness
Potential for broad application to diverse proteins
Abstract
Markov State Models (MSMs) are a powerful framework to reproduce the long-time conformational dynamics of biomolecules using a set of short Molecular Dynamics (MD) simulations. However, precise kinetics predictions of MSMs heavily rely on the features selected to describe the system. Despite the importance of feature selection for large system, determining an optimal set of features remains a difficult unsolved problem. Here, we introduce an automatic approach to optimize feature selection based on genetic algorithms (GA), which adaptively evolves the most fitted solution according to natural selection laws. The power of the GA-based method is illustrated on long atomistic folding simulations of four proteins, varying in length from 28 to 80 residues. Due to the diversity of tested proteins, we expect that our method will be extensible to other proteins and drive MSM building to a more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Enzyme Structure and Function · RNA and protein synthesis mechanisms
