Compression for Better: A General and Stable Lossless Compression Framework
Boyang Zhang, Daning Cheng, Yunquan Zhang, Fangming Liu, Wenguang Chen

TL;DR
This paper introduces LLC, a theoretical framework for stable, lossless model compression that defines error boundaries to ensure no performance loss, and demonstrates its effectiveness with quantization and decomposition techniques.
Contribution
The paper presents LLC, a novel theoretical framework for lossless model compression that systematically defines error boundaries and guides compression techniques.
Findings
LLC effectively achieves lossless compression across various models.
Quantization reformulated as a grouped knapsack problem improves efficiency.
LLC determines layer-wise rank automatically for low-rank decomposition.
Abstract
This work focus on how to stabilize and lossless model compression, aiming to reduce model complexity and enhance efficiency without sacrificing performance due to compression errors. A key challenge is effectively leveraging compression errors and defining the boundaries for lossless compression to minimize model loss. i.e., compression for better. Currently, there is no systematic approach to determining this error boundary or understanding its specific impact on model performance. We propose a general \textbf{L}oss\textbf{L}ess \textbf{C}ompression theoretical framework (\textbf{LLC}), which further delineates the compression neighborhood and higher-order analysis boundaries through the total differential, thereby specifying the error range within which a model can be compressed without loss. To verify the effectiveness of LLC, we apply various compression techniques, including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Parallel Computing and Optimization Techniques · Advanced Data Compression Techniques
MethodsFocus
