Mespotine-RLE-basic v0.9 - An overhead-reduced and improved   Run-Length-Encoding Method

Meo Mespotine

arXiv:1501.05542·cs.DS·January 23, 2015·2 cites

Mespotine-RLE-basic v0.9 - An overhead-reduced and improved Run-Length-Encoding Method

Meo Mespotine

PDF

Open Access

TL;DR

This paper introduces Mespotine-RLE-basic v0.9, an improved run-length encoding method that reduces overhead by selectively encoding only compressible characters, significantly enhancing data compression efficiency.

Contribution

The paper presents a novel RLE modification that uses a bit-list to identify compressible characters, minimizing unnecessary data storage and overhead.

Findings

01

Overhead is limited to the size of the bit-list, at most 32 bytes.

02

The method effectively compresses data with high redundancy.

03

It avoids data size increase in worst-case scenarios.

Abstract

Run Length Encoding(RLE) is one of the oldest algorithms for data-compression available, a method used for compression of large data into smaller and therefore more compact data. It compresses by looking at the data for repetitions of the same character in a row and storing the amount(called run) and the respective character(called run_value) as target-data. Unfortunately it only compresses within strict and special cases. Outside of these cases, it increases the data-size, even doubles the size in worst cases compared to the original, unprocessed data. In this paper, we will discuss modifications to RLE, with which we will only store the run for characters, that are actually compressible, getting rid of a lot of useless data like the runs of the characters, that are uncompressible in the first place. This will be achieved by storing the character first and the run second. Additionally…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGene expression and cancer classification · Advanced Biosensing Techniques and Applications · DNA and Biological Computing