Automated Statistical and Machine Learning Platform for Biological Research
Luke Rimmo Lego, Samantha Gauthier, Denver Jn. Baptiste

TL;DR
This paper introduces an integrated platform combining statistical methods and machine learning, specifically Random Forests, to streamline biological data analysis and improve workflow efficiency for researchers.
Contribution
The platform uniquely unifies statistical and machine learning tools with automated features, making advanced analysis accessible without extensive programming skills.
Findings
Effective classification accuracy across diverse datasets
Automated hyperparameter optimization improves model performance
Unified interface enhances workflow efficiency
Abstract
Research increasingly relies on computational methods to analyze experimental data and predict molecular properties. Current approaches often require researchers to use a variety of tools for statistical analysis and machine learning, creating workflow inefficiencies. We present an integrated platform that combines classical statistical methods with Random Forest classification for comprehensive data analysis that can be used in the biological sciences. The platform implements automated hyperparameter optimization, feature importance analysis, and a suite of statistical tests including t tests, ANOVA, and Pearson correlation analysis. Our methodology addresses the gap between traditional statistical software, modern machine learning frameworks and biology, by providing a unified interface accessible to researchers without extensive programming experience. The system achieves this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Machine Learning in Materials Science · Gene expression and cancer classification
