Weakly-supervised Contrastive Learning for Unsupervised Object Discovery
Yunqiu Lv, Jing Zhang, Nick Barnes, Yuchao Dai

TL;DR
This paper introduces a novel weakly-supervised contrastive learning approach to improve unsupervised object discovery by enhancing semantic feature extraction and using PCA for object localization, outperforming existing methods.
Contribution
It proposes a semantic-guided self-supervised learning model based on WCL and PCA for effective unsupervised object discovery, addressing limitations of previous generative and clustering methods.
Findings
Effective object localization using PCA and eigenvalues.
Enhanced semantic feature extraction via WCL and fine-tuning DINO.
Superior performance on benchmark datasets.
Abstract
Unsupervised object discovery (UOD) refers to the task of discriminating the whole region of objects from the background within a scene without relying on labeled datasets, which benefits the task of bounding-box-level localization and pixel-level segmentation. This task is promising due to its ability to discover objects in a generic manner. We roughly categorise existing techniques into two main directions, namely the generative solutions based on image resynthesis, and the clustering methods based on self-supervised models. We have observed that the former heavily relies on the quality of image reconstruction, while the latter shows limitations in effectively modeling semantic correlations. To directly target at object discovery, we focus on the latter approach and propose a novel solution by incorporating weakly-supervised contrastive learning (WCL) to enhance semantic information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Remote-Sensing Image Classification
MethodsMulti-Head Attention · Attention Is All You Need · Softmax · Layer Normalization · Linear Layer · Dense Connections · Residual Connection · Vision Transformer · Contrastive Learning · Focus
