SSE Lossless Compression Method for the Text of the Insignificance of the Lines Order
Juncai Xu, Weidong Zhang, Qingwen Ren, Xin Xie, and Zhengyu Yang

TL;DR
This paper introduces a new lossless compression method tailored for texts where line order is irrelevant, such as word lists, achieving better compression by pre-processing with SSE before applying traditional methods.
Contribution
The paper proposes a novel pre-processing technique called SSE that enhances compression efficiency for order-insensitive texts, outperforming traditional methods.
Findings
Improved compression ratios for order-insensitive texts
SSE pre-processing enhances traditional lossless compression
Method outperforms existing approaches on tested datasets
Abstract
In some texts, the order of the rows makes no difference (e.g., a word list). Traditional lossless compression methods are not an ideal choice for such texts. This paper proposes a new method that achieves better compression results for this type of text. The texts are first pre-processed by a method named SSE and are then compressed with a traditional lossless compression method. Comparison shows that an improved compression result is achieved.
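The abstract does not say what the SSE pre-processing step does, so it cannot be reproduced here. As a rough illustration of the general pipeline only (reorder first, then apply a standard compressor), the sketch below substitutes a plain lexicographic sort for SSE and uses Python's `zlib` as the traditional lossless method. The function names `compress_order_insensitive` and `decompress_lines` are hypothetical, and the sort is an assumption standing in for the paper's method, not a description of it.

```python
import random
import zlib


def compress_order_insensitive(text: str) -> bytes:
    """Reorder lines, then deflate.

    ASSUMPTION: sorting is a stand-in pre-processing step, NOT the paper's SSE.
    It is only valid because the line order carries no information.
    """
    lines = text.splitlines()
    # Terminate every line with "\n" so that empty lines survive the round trip.
    reordered = "".join(line + "\n" for line in sorted(lines))
    return zlib.compress(reordered.encode("utf-8"), 9)


def decompress_lines(blob: bytes) -> list:
    """Recover the lines; the original order is intentionally not restored."""
    return zlib.decompress(blob).decode("utf-8").splitlines()


if __name__ == "__main__":
    # Hypothetical word list, shuffled to imitate an unordered input file.
    words = [f"word{i:05d}" for i in range(5000)]
    random.Random(0).shuffle(words)
    text = "\n".join(words)

    baseline = zlib.compress(text.encode("utf-8"), 9)
    reordered = compress_order_insensitive(text)
    print("zlib only:       ", len(baseline), "bytes")
    print("sort, then zlib: ", len(reordered), "bytes")

    # Lossless up to line order: the same lines come back.
    assert sorted(decompress_lines(reordered)) == sorted(words)
```

The result is lossless only in the sense the abstract implies: the set of lines (including duplicates) is preserved, while their original order is discarded. Whether a sort helps, and by how much, depends on the data and the back-end compressor.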
Taxonomy
Topics: Algorithms and Data Compression · semigroups and automata theory · Cellular Automata and Applications
