MyESL: Sparse learning in molecular evolution and phylogenetic analysis
Maxwell Sanderford, Sudip Sharma, Glen Stecher, Jun Liu, Jieping Ye,, Sudhir Kumar

TL;DR
MyESL is an open-source software tool that applies sparse learning techniques, specifically LASSO, to analyze genome-scale data for evolutionary and phylogenetic studies, enabling efficient modeling on personal computers.
Contribution
This paper introduces MyESL, a novel software that efficiently implements sparse-group LASSO for large-scale genomic data analysis in evolutionary biology.
Findings
MyESL can process genome-scale datasets quickly on personal computers.
It outperforms existing software in handling large, complex genomic data.
The tool provides user-friendly outputs for functional and evolutionary genomics.
Abstract
Evolutionary sparse learning (ESL) uses a supervised machine learning approach, Least Absolute Shrinkage and Selection Operator (LASSO), to build models explaining the relationship between a hypothesis and the variation across genomic features (e.g., sites) in sequence alignments. ESL employs sparsity between and within the groups of genomic features (e.g., genomic loci) by using sparse-group LASSO. Although some software packages are available for performing sparse group LASSO, we found them less well-suited for processing and analyzing genome-scale data containing millions of features, such as bases. MyESL software fills the need for open-source software for conducting ESL analyses with facilities to pre-process the input hypotheses and large alignments, make LASSO flexible and computationally efficient, and post-process the output model to produce different metrics useful in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Machine Learning in Bioinformatics · Biomedical Text Mining and Ontologies
