Context Cascade Compression: Exploring the Upper Limits of Text Compression
Fanfan Liu, Haibo Qiu

TL;DR
This paper introduces Context Cascade Compression (C3), a novel method using cascaded LLMs to achieve high-ratio text compression with high decoding accuracy, surpassing optical compression methods and indicating upper bounds for future research.
Contribution
The paper proposes a new cascade-based text compression method with high compression ratios and demonstrates its superior performance over optical compression techniques.
Findings
Achieves 98% decoding accuracy at 20x compression ratio
Maintains around 93% accuracy at 40x compression ratio
Outperforms DeepSeek-OCR in context compression tasks
Abstract
Million-level token inputs in long-context tasks pose significant computational and memory challenges for Large Language Models (LLMs). Recently, DeepSeek-OCR conducted research into the feasibility of Contexts Optical Compression and achieved preliminary results. Inspired by this, we introduce Context Cascade Compression C3 to explore the upper limits of text compression. Our method cascades two LLMs of different sizes to handle the compression and decoding tasks. Specifically, a small LLM, acting as the first stage, performs text compression by condensing a long context into a set of latent tokens (e.g., 32 or 64 in length), achieving a high ratio of text tokens to latent tokens. A large LLM, as the second stage, then executes the decoding task on this compressed context. Experiments show that at a 20x compression ratio (where the number of text tokens is 20 times the number of latent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Handwritten Text Recognition Techniques
