TransCL: Transformer Makes Strong and Flexible Compressive Learning
Chong Mou, Jian Zhang

TL;DR
TransCL introduces a transformer-based framework for compressive learning on large-scale images, enabling flexible CS ratios and achieving state-of-the-art results in high-resolution vision tasks.
Contribution
It proposes a novel transformer-based compressive learning framework with learnable block-based sensing and arbitrary CS ratios for large-scale images.
Findings
Achieves state-of-the-art performance in image classification and segmentation.
Performs well even at very low CS ratios like 1%.
Demonstrates robustness and flexibility across various tasks and data scales.
Abstract
Compressive learning (CL) is an emerging framework that integrates signal acquisition via compressed sensing (CS) and machine learning for inference tasks directly on a small number of measurements. It can be a promising alternative to classical image-domain methods and enjoys great advantages in memory saving and computational efficiency. However, previous attempts on CL are not only limited to a fixed CS ratio, which lacks flexibility, but also limited to MNIST/CIFAR-like datasets and do not scale to complex real-world high-resolution (HR) data or vision tasks. In this paper, a novel transformer-based compressive learning framework on large-scale images with arbitrary CS ratios, dubbed TransCL, is proposed. Specifically, TransCL first utilizes the strategy of learnable block-based compressed sensing and proposes a flexible linear projection strategy to enable CL to be performed on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Indoor and Outdoor Localization Technologies · Domain Adaptation and Few-Shot Learning
