Attention-based Multi-task Learning for Base Editor Outcome Prediction

Amina Mollaysa; Ahmed Allam; Michael Krauthammer

arXiv:2311.07636·q-bio.GN·November 16, 2023·1 cites

Attention-based Multi-task Learning for Base Editor Outcome Prediction

Amina Mollaysa, Ahmed Allam, Michael Krauthammer

PDF

Open Access

TL;DR

This paper introduces an attention-based multi-task machine learning model that predicts base editing outcomes, aiming to improve efficiency and accuracy in genome editing applications.

Contribution

The study presents a novel two-stage attention-based model with multi-task learning for predicting multiple base editor outcomes simultaneously.

Findings

01

Strong correlation with experimental results across datasets

02

Effective multi-task learning for different base editor variants

03

Potential to accelerate base editing design process

Abstract

Human genetic diseases often arise from point mutations, emphasizing the critical need for precise genome editing techniques. Among these, base editing stands out as it allows targeted alterations at the single nucleotide level. However, its clinical application is hindered by low editing efficiency and unintended mutations, necessitating extensive trial-and-error experimentation in the laboratory. To speed up this process, we present an attention-based two-stage machine learning model that learns to predict the likelihood of all possible editing outcomes for a given genomic target sequence. We further propose a multi-task learning schema to jointly learn multiple base editors (i.e. variants) at once. Our model's predictions consistently demonstrated a strong correlation with the actual experimental results on multiple datasets and base editor variants. These results provide further…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCRISPR and Genetic Engineering · RNA and protein synthesis mechanisms · Evolution and Genetic Dynamics

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Balanced Selection