Analysis of Compression Techniques for DNA Sequence Data
Shakeela Bibi, Javed Iqbal, Adnan Iftekhar, Mir Hassan

TL;DR
This paper reviews various DNA sequence compression techniques, analyzing their effectiveness in reducing data size while preserving information, highlighting the importance of efficient compression for biological data management.
Contribution
It provides a comprehensive analysis of existing DNA and protein sequence compression algorithms, comparing their performance and identifying challenges in the field.
Findings
Efficient compression techniques significantly reduce DNA data size.
Traditional methods are less suitable for biological sequence compression.
Compression algorithms help in understanding DNA characteristics and improve storage and transmission.
Abstract
Biological data mainly comprises of Deoxyribonucleic acid (DNA) and protein sequences. These are the biomolecules which are present in all cells of human beings. Due to the self-replicating property of DNA, it is a key constitute of genetic material that exist in all breathingcreatures. This biomolecule (DNA) comprehends the genetic material obligatory for the operational and expansion of all personified lives. To save DNA data of single person we require 10CD-ROMs.Moreover, this size is increasing constantly, and more and more sequences are adding in the public databases. This abundant increase in the sequence data arise challenges in the precise information extraction from this data. Since many data analyzing and visualization tools do not support processing of this huge amount of data. To reduce the size of DNA and protein sequence, many scientists introduced various types of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
