Contrastive Learning for Image Complexity Representation
Shipeng Liu, Liang Zhao, Dengfeng Chen, Zhanping Song

TL;DR
This paper introduces CLIC, a contrastive learning framework using MoCo v2 and RCM for representing image complexity, achieving performance comparable to supervised methods without manual annotations.
Contribution
The paper proposes a novel contrastive learning approach with RCM for image complexity representation, reducing annotation costs and improving task performance.
Findings
CLIC performs comparably to supervised methods.
RCM increases data diversity without extra data.
CLIC enhances computer vision task performance.
Abstract
Quantifying and evaluating image complexity can be instrumental in enhancing the performance of various computer vision tasks. Supervised learning can effectively learn image complexity features from well-annotated datasets. However, creating such datasets requires expensive manual annotation costs. The models may learn human subjective biases from it. In this work, we introduce the MoCo v2 framework. We utilize contrastive learning to represent image complexity, named CLIC (Contrastive Learning for Image Complexity). We find that there are complexity differences between different local regions of an image, and propose Random Crop and Mix (RCM), which can produce positive samples consisting of multi-scale local crops. RCM can also expand the train set and increase data diversity without introducing additional data. We conduct extensive experiments with CLIC, comparing it with both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Industrial Vision Systems and Defect Detection · Image Processing Techniques and Applications
MethodsSparse Evolutionary Training · Dense Connections · Batch Normalization · Momentum Contrast · Feedforward Network · InfoNCE · Contrastive Learning · Random Gaussian Blur · MoCo v2
