S2CFormer: Revisiting the RD-Latency Trade-off in Transformer-based Learned Image Compression
Yunuo Chen, Qian Li, Bing He, Donghui Feng, Ronghua Wu, Qi Wang, Li, Song, Guo Lu, Wenjun Zhang

TL;DR
This paper introduces S2CFormer, a new architecture for learned image compression that improves rate-distortion performance while significantly reducing decoding latency by focusing on efficient channel aggregation instead of complex spatial operations.
Contribution
The paper proposes the S2CFormer paradigm, emphasizing channel aggregation over spatial complexity, and introduces variants that achieve state-of-the-art results with faster decoding speeds.
Findings
S2C-Conv and S2C-Attention outperform previous methods in R-D performance.
S2C-Hybrid achieves a better performance-latency trade-off.
Models set new benchmarks on multiple datasets.
Abstract
Transformer-based Learned Image Compression (LIC) suffers from a suboptimal trade-off between decoding latency and rate-distortion (R-D) performance. Moreover, the critical role of the FeedForward Network (FFN)-based channel aggregation module has been largely overlooked. Our research reveals that efficient channel aggregation-rather than complex and time-consuming spatial operations-is the key to achieving competitive LIC models. Based on this insight, we initiate the ``S2CFormer'' paradigm, a general architecture that simplifies spatial operations and enhances channel operations to overcome the previous trade-off. We present two instances of the S2CFormer: S2C-Conv, and S2C-Attention. Both models demonstrate state-of-the-art (SOTA) R-D performance and significantly faster decoding speed. Furthermore, we introduce S2C-Hybrid, an enhanced variant that maximizes the strengths of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Image Processing Techniques and Applications · Medical Image Segmentation Techniques
MethodsDense Connections · Feedforward Network · Focus
