GenBit Compress Tool (GBC): A Java-Based Tool to Compress DNA Sequences and Compute Compression Ratio (bits/base) of Genomes
P. Raja Rajeswari (1), Allam Apparo (2), V.K. Kumar (3) ((1) Acharya Nagarjuna University, India, (2) Jawaharlal Nehru Technological University, India, (3) S.V.H. College of Engineering, India)

TL;DR
The paper introduces GenBit Compress, a Java-based tool that achieves superior compression ratios for entire genomes by encoding exact repeats and assigning binary bits to DNA fragments, outperforming existing algorithms in both compression efficiency and speed.
Contribution
It presents a novel DNA compression algorithm that encodes exact repeats and assigns binary bits to DNA fragments, achieving the best compression ratios and faster performance for large genomes.
Findings
Achieves the best compression ratio for large DNA sequences.
Significantly improves running time over previous DNA compressors.
Introduces a unique binary bit assignment for DNA fragments.
Abstract
We present a compression tool, "GenBit Compress", for genetic sequences, based on our newly proposed "GenBit Compress Algorithm". Our tool achieves the best compression ratios for entire genomes (DNA sequences). Significantly better compression results show that the GenBit Compress algorithm outperforms the other genome compression algorithms on non-repetitive DNA sequences in genomes. Standard compression algorithms such as gzip or compress cannot compress DNA sequences but only expand them in size. In this paper we consider the problem of DNA compression. It is well known that one of the main features of DNA sequences is that they contain substrings which are duplicated except for a few random mutations. For this reason most DNA compressors work by searching for and encoding approximate repeats. We depart from this strategy by searching for and encoding only exact repeats. Our…
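The abstract reports results as a compression ratio in bits/base. As a rough illustration of that metric only, the Java sketch below computes bits/base for a sequence stored as one ASCII character per base and for the same sequence packed with a fixed 2-bit code per base, which is the usual baseline for a four-letter alphabet. This is not the GenBit Compress algorithm: the class name, the method names, and the A/C/G/T bit mapping are all assumptions made for this example.

```java
import java.nio.charset.StandardCharsets;

// Illustrative sketch only: NOT the GenBit Compress algorithm.
// All names and the bit mapping below are assumptions for this example.
public class BitsPerBaseSketch {

    // Hypothetical fixed mapping: A=00, C=01, G=10, T=11.
    public static int code(char base) {
        switch (base) {
            case 'A': return 0;
            case 'C': return 1;
            case 'G': return 2;
            case 'T': return 3;
            default:
                throw new IllegalArgumentException("Unexpected base: " + base);
        }
    }

    // Pack a DNA string into bytes, four bases per byte,
    // with the first base of each group in the two highest bits.
    public static byte[] pack(String dna) {
        byte[] out = new byte[(dna.length() + 3) / 4];
        for (int i = 0; i < dna.length(); i++) {
            int shift = 6 - 2 * (i % 4);
            out[i / 4] |= (byte) (code(dna.charAt(i)) << shift);
        }
        return out;
    }

    // Compression ratio in bits/base: encoded size in bits divided by base count.
    public static double bitsPerBase(long encodedBytes, long baseCount) {
        return 8.0 * encodedBytes / baseCount;
    }

    public static void main(String[] args) {
        String dna = "ACGTACGTTTGA"; // 12 bases, made-up sample
        byte[] ascii = dna.getBytes(StandardCharsets.US_ASCII);
        byte[] packed = pack(dna);

        System.out.println("ASCII:  " + bitsPerBase(ascii.length, dna.length()) + " bits/base");
        System.out.println("Packed: " + bitsPerBase(packed.length, dna.length()) + " bits/base");
    }
}
```

For the 12-base sample, the ASCII form costs 8.0 bits/base and the packed form 2.0 bits/base. A DNA compressor is normally judged by how far below the 2 bits/base baseline it gets.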
