CONCISE: Compressed 'n' Composable Integer Set
Alessandro Colantonio, Roberto Di Pietro

TL;DR
CONCISE is a new bitmap compression scheme that significantly reduces memory usage and improves performance over WAH, enabling efficient set operations for large integer sets.
Contribution
We introduce CONCISE, a novel bitmap compression method that outperforms WAH in memory efficiency and maintains or improves computation speed.
Findings
Reduces memory usage by up to 50% compared to WAH
Maintains similar or better computation times
Effective for manipulating large sets of integers
Abstract
Bit arrays, or bitmaps, are used to significantly speed up set operations in several areas, such as data warehousing, information retrieval, and data mining, to cite a few. However, bitmaps usually use a large storage space, thus requiring compression. Nevertheless, there is a space-time tradeoff among compression schemes. The Word Aligned Hybrid (WAH) bitmap compression trades some space to allow for bitwise operations without first decompressing bitmaps. WAH has been recognized as the most efficient scheme in terms of computation time. In this paper we present CONCISE (Compressed 'n' Composable Integer Set), a new scheme that enjoys significatively better performances than those of WAH. In particular, when compared to WAH, our algorithm is able to reduce the required memory up to 50%, by having similar or better performance in terms of computation time. Further, we show that CONCISE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Advanced Data Storage Technologies · Network Packet Processing and Optimization
