DNA Hash Pooling and its Applications
Dennis Shasha (Courant Institute, New York University), Martyn Amos, (Computing, Mathematics, Manchester Metropolitan University)

TL;DR
This paper introduces DNA hash pooling, a novel filtering technique inspired by computer science hashing, to efficiently compare DNA populations, reducing sequencing needs and applicable to ecological and genomic studies.
Contribution
The paper presents a new DNA comparison method using filtering criteria and restriction enzymes, offering a cost-effective alternative to sequencing for ecological and genomic analysis.
Findings
Method reduces sequencing by a factor of 10 or more
In silico results show high sensitivity to experimental error
Applicable as a pre-sequencing filtering step
Abstract
In this paper we describe a new technique for the comparison of populations of DNA strands. Comparison is vital to the study of ecological systems, at both the micro and macro scales. Existing methods make use of DNA sequencing and cloning, which can prove costly and time consuming, even with current sequencing techniques. Our overall objective is to address questions such as: (i) (Genome detection) Is a known genome sequence present, at least in part, in an environmental sample? (ii) (Sequence query) Is a specific fragment sequence present in a sample? (iii) (Similarity discovery) How similar in terms of sequence content are two unsequenced samples? We propose a method involving multiple filtering criteria that result in "pools" of DNA of high or very high purity. Because our method is similar in spirit to hashing in computer science, we call it DNA hash pooling. To illustrate this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · DNA and Nucleic Acid Chemistry · Advanced biosensing and bioanalysis techniques
