Protein Function Prediction Based on Kernel Logistic Regression with 2-order Graphic Neighbor Information
Jingwei Liu

TL;DR
This paper introduces a 2-order graphic neighbor feature extraction method for protein function prediction, demonstrating significant accuracy improvements using kernel logistic regression models, especially RBF KLR, on protein interaction data.
Contribution
It extends existing 1-order neighbor features to 2-order, incorporating chi-square feature combination, and evaluates multiple kernel logistic regression models for enhanced prediction accuracy.
Findings
2-order neighbor features significantly improve prediction accuracy.
RBF KLR achieves up to 99.05% accuracy with chi-square feature combination.
The method outperforms traditional 1-order approaches on protein-protein interaction data.
Abstract
To enhance the accuracy of protein-protein interaction function prediction, a 2-order graphic neighbor information feature extraction method based on undirected simple graph is proposed in this paper, which extends the 1-order graphic neighbor featureextraction method. And the chi-square test statistical method is also involved in feature combination. To demonstrate the effectiveness of our 2-order graphic neighbor feature, four logistic regression models (logistic regression (abbrev. LR), diffusion kernel logistic regression (abbrev. DKLR), polynomial kernel logistic regression (abbrev. PKLR), and radial basis function (RBF) based kernel logistic regression (abbrev. RBF KLR)) are investigated on the two feature sets. The experimental results of protein function prediction of Yeast Proteome Database (YPD) using the the protein-protein interaction data of Munich Information Center for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Gene expression and cancer classification · Machine Learning in Bioinformatics
