Information-Maximized Soft Variable Discretization for Self-Supervised Image Representation Learning
Chuang Niu, Wenjun Xia, Hongming Shan, Ge Wang

TL;DR
This paper introduces IMSVD, a novel self-supervised learning method that discretizes latent variables to improve image representation learning, achieving better accuracy, efficiency, and explainability over existing methods.
Contribution
The paper presents IMSVD, a new SSL approach that softly discretizes latent variables and uses an information-theoretic objective, outperforming prior contrastive and non-contrastive methods.
Findings
IMSVD improves downstream task accuracy.
IMSVD enhances learning efficiency.
Features offer variable-level explainability.
Abstract
Self-supervised learning (SSL) has emerged as a crucial technique in image processing, encoding, and understanding, especially for developing today's vision foundation models that utilize large-scale datasets without annotations to enhance various downstream tasks. This study introduces a novel SSL approach, Information-Maximized Soft Variable Discretization (IMSVD), for image representation learning. Specifically, IMSVD softly discretizes each variable in the latent space, enabling the estimation of their probability distributions over training batches and allowing the learning process to be directly guided by information measures. Motivated by the MultiView assumption, we propose an information-theoretic objective function to learn transform-invariant, non-travail, and redundancy-minimized representation features. We then derive a joint-cross entropy loss function for self-supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Image Retrieval and Classification Techniques · Image Processing Techniques and Applications
