TL;DR
Stream VByte is a byte-oriented integer compression method that uses SIMD instructions to decode up to twice as fast as varint-G8IU on real data sets, setting new speed records for byte-oriented compression of search engine data.
Contribution
The paper introduces Stream VByte, a byte-oriented compression technique that stores the control stream separately from the encoded data, which lets SIMD-based decoding run faster than in previous methods.
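To make the separation of control and data concrete, here is a minimal scalar sketch in Python. It is not the authors' SIMD implementation. It assumes 32-bit unsigned integers, a 2-bit length code per integer (byte length minus one), four codes packed per control byte with the first integer in the lowest bits, and little-endian data bytes. The function names `encode` and `decode` are illustrative only.

```python
def encode(values):
    """Return (control, data) byte strings for a list of 32-bit unsigned ints.

    Assumed layout: each control byte holds four 2-bit codes (length - 1),
    first integer in the least significant bits; data bytes are little-endian.
    """
    control = bytearray()
    data = bytearray()
    for start in range(0, len(values), 4):
        ctrl = 0
        for slot, v in enumerate(values[start:start + 4]):
            if not 0 <= v < 2**32:
                raise ValueError("value out of 32-bit unsigned range")
            # Smallest number of bytes (1 to 4) that can hold v.
            nbytes = max(1, (v.bit_length() + 7) // 8)
            ctrl |= (nbytes - 1) << (2 * slot)
            data += v.to_bytes(nbytes, "little")
        control.append(ctrl)
    return bytes(control), bytes(data)


def decode(control, data, count):
    """Rebuild `count` integers from the two separate streams."""
    out = []
    pos = 0
    for i in range(count):
        ctrl = control[i // 4]
        # Extract the 2-bit length code for this integer.
        nbytes = ((ctrl >> (2 * (i % 4))) & 0b11) + 1
        out.append(int.from_bytes(data[pos:pos + nbytes], "little"))
        pos += nbytes
    return out


values = [1, 300, 70000, 2**32 - 1, 5]
control, data = encode(values)
assert decode(control, data, len(values)) == values
```

Because all length codes sit together in the control stream, a decoder can read one control byte and know the layout of the next four integers before touching the data bytes. A SIMD decoder can exploit this by using the control byte to select a byte shuffle, which this scalar loop only approximates.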
Findings
Decodes over 4 billion integers per second on a 3.4 GHz processor.
Decoding can be up to twice as fast as varint-G8IU on real data sets.
Establishes new speed records for byte-oriented integer compression.
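As a rough sanity check on the throughput figure, the arithmetic below converts it to clock cycles per integer. It assumes the processor runs at its nominal 3.4 GHz and that the figure refers to a single core; neither detail is stated in the summary above.

```python
clock_hz = 3.4e9          # nominal clock rate from the findings
ints_per_second = 4e9     # "over 4 billion", so this is a lower bound

# Upper bound on cycles spent per decoded integer under these assumptions.
cycles_per_int = clock_hz / ints_per_second
print(f"{cycles_per_int:.2f} cycles per integer")  # prints "0.85 cycles per integer"
```

In other words, under these assumptions the decoder spends less than one clock cycle per integer on average, which is only possible when several integers are processed per instruction.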
Abstract
Arrays of integers are often compressed in search engines. Though there are many ways to compress integers, we are interested in the popular byte-oriented integer compression techniques (e.g., VByte or Google's Varint-GB). They are appealing due to their simplicity and engineering convenience. Amazon's varint-G8IU is one of the fastest byte-oriented compression techniques published so far. It makes judicious use of the powerful single-instruction-multiple-data (SIMD) instructions available in commodity processors. To surpass varint-G8IU, we present Stream VByte, a novel byte-oriented compression technique that separates the control stream from the encoded data. Like varint-G8IU, Stream VByte is well suited for SIMD instructions. We show that Stream VByte decoding can be up to twice as fast as varint-G8IU decoding over real data sets. In this sense, Stream VByte establishes new speed…
