MCP: a Multi-Component learning machine to Predict protein secondary structure
Leila Khalatbari, Mohammad Reza Kangavari, Saeid Hosseini, Hongzhi, Yin, Ngai-Man Cheung

TL;DR
This paper introduces a multi-component machine learning framework that combines multiple classifiers and a novel dissimilarity measure to improve the accuracy of protein secondary structure prediction, addressing challenges like data noise and high dimensionality.
Contribution
The paper presents a novel multi-component prediction framework that integrates support vector machines and fuzzy nearest neighbor classifiers with a compound dissimilarity measure for enhanced accuracy.
Findings
Achieves higher accuracy than existing methods
Effectively handles sequence-structure relation complexity
Demonstrates robustness against data noise and class imbalance
Abstract
The Gene or DNA sequence in every cell does not control genetic properties on its own; Rather, this is done through translation of DNA into protein and subsequent formation of a certain 3D structure. The biological function of a protein is tightly connected to its specific 3D structure. Prediction of the protein secondary structure is a crucial intermediate step towards elucidating its 3D structure and function. Traditional experimental methods for prediction of protein structure are expensive and time-consuming. Therefore, various machine learning approaches have been proposed to predict the protein secondary structure. Nevertheless, the average accuracy of the suggested solutions has hardly reached beyond 80%. The possible underlying reasons are the ambiguous sequence-structure relation, noise in input protein data, class imbalance, and the high dimensionality of the encoding schemes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
