Attention-based Multi-task Learning for Base Editor Outcome Prediction
Amina Mollaysa, Ahmed Allam, Michael Krauthammer

TL;DR
This paper introduces an attention-based multi-task machine learning model that predicts base editing outcomes, aiming to improve efficiency and accuracy in genome editing applications.
Contribution
The study presents a novel two-stage attention-based model with multi-task learning for predicting multiple base editor outcomes simultaneously.
Findings
Strong correlation with experimental results across datasets
Effective multi-task learning for different base editor variants
Potential to accelerate base editing design process
Abstract
Human genetic diseases often arise from point mutations, emphasizing the critical need for precise genome editing techniques. Among these, base editing stands out as it allows targeted alterations at the single nucleotide level. However, its clinical application is hindered by low editing efficiency and unintended mutations, necessitating extensive trial-and-error experimentation in the laboratory. To speed up this process, we present an attention-based two-stage machine learning model that learns to predict the likelihood of all possible editing outcomes for a given genomic target sequence. We further propose a multi-task learning schema to jointly learn multiple base editors (i.e. variants) at once. Our model's predictions consistently demonstrated a strong correlation with the actual experimental results on multiple datasets and base editor variants. These results provide further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCRISPR and Genetic Engineering · RNA and protein synthesis mechanisms · Evolution and Genetic Dynamics
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Balanced Selection
