Diversifying Design of Nucleic Acid Aptamers Using Unsupervised Machine Learning
Siba Moussa, Michael Kilgour, Clara Jans, Alex Hernandez-Garcia,, Miroslava Cuperlovic-Culf, Yoshua Bengio, and Lena Simine

TL;DR
This paper introduces an unsupervised machine learning approach using the Potts model to systematically generate diverse nucleic acid aptamer sequences with desired structural features, enhancing the design process.
Contribution
It presents a novel application of the Potts model for controlled diversification of aptamer sequences based on spectral energy range manipulation.
Findings
Successfully generated diverse aptamer sequences with specified motifs.
Demonstrated control over sequence diversity and similarity.
Applied method to RNA and DNA aptamers with effective results.
Abstract
Inverse design of short single-stranded RNA and DNA sequences (aptamers) is the task of finding sequences that satisfy a set of desired criteria. Relevant criteria may be, for example, the presence of specific folding motifs, binding to molecular ligands, sensing properties, etc. Most practical approaches to aptamer design identify a small set of promising candidate sequences using high-throughput experiments (e.g. SELEX), and then optimize performance by introducing only minor modifications to the empirically found candidates. Sequences that possess the desired properties but differ drastically in chemical composition will add diversity to the search space and facilitate the discovery of useful nucleic acid aptamers. Systematic diversification protocols are needed. Here we propose to use an unsupervised machine learning model known as the Potts model to discover new, useful sequences…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRNA and protein synthesis mechanisms · Advanced biosensing and bioanalysis techniques · RNA Interference and Gene Delivery
