Mespotine-RLE-basic v0.9 - An overhead-reduced and improved Run-Length-Encoding Method
Meo Mespotine

TL;DR
This paper introduces Mespotine-RLE-basic v0.9, an improved run-length encoding method that reduces overhead by selectively encoding only compressible characters, significantly enhancing data compression efficiency.
Contribution
The paper presents a novel RLE modification that uses a bit-list to identify compressible characters, minimizing unnecessary data storage and overhead.
Findings
Overhead is limited to the size of the bit-list, at most 32 bytes.
The method effectively compresses data with high redundancy.
It avoids data size increase in worst-case scenarios.
Abstract
Run Length Encoding(RLE) is one of the oldest algorithms for data-compression available, a method used for compression of large data into smaller and therefore more compact data. It compresses by looking at the data for repetitions of the same character in a row and storing the amount(called run) and the respective character(called run_value) as target-data. Unfortunately it only compresses within strict and special cases. Outside of these cases, it increases the data-size, even doubles the size in worst cases compared to the original, unprocessed data. In this paper, we will discuss modifications to RLE, with which we will only store the run for characters, that are actually compressible, getting rid of a lot of useless data like the runs of the characters, that are uncompressible in the first place. This will be achieved by storing the character first and the run second. Additionally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Advanced Biosensing Techniques and Applications · DNA and Biological Computing
