Predicting the S. cerevisiae Gene Expression Score by a Machine Learning Classifier
Piotr H. Pawłowski, Piotr Zielenkiewicz

TL;DR
This paper uses machine learning to predict gene expression scores in yeast based on various attributes.
Contribution
A novel random forest model is developed to identify key attributes influencing gene expression scores in Saccharomyces cerevisiae.
Findings
The random forest model achieved 84.1% accuracy in classifying gene expression scores.
Key attributes include experimental conditions and genetic, physical, statistical, and logistic features.
The model distinguishes low, moderate, and high expression score classes effectively.
Abstract
The topic of this work is gene expression and its score according to various factors analyzed globally using machine learning techniques. The expression score (ES) of genes characterizes their activity and, thus, their importance for cellular processes. This may depend on many different factors (attributes). To find the most important classifier, a machine learning classifier (random forest) was selected, trained, and optimized on the Waikato Environment for Knowledge Analysis WEKA platform, resulting in the most accurate attribute-dependent prediction of the ES of Saccharomyces cerevisiae genes. In this way, data from the Saccharomyces Genome Database (SGD), presenting ES values corresponding to a wide spectrum of attributes, were used, revised, classified, and balanced, and the significance of the considered attributes was evaluated. In this way, the novel random forest model…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFungal and yeast genetics research · Bioinformatics and Genomic Networks · Gene expression and cancer classification
