VISER: Visual Self-Regularization
Hamid Izadinia, Pierre Garrigues

TL;DR
VISER introduces a semi-supervised learning method that leverages large unlabeled image datasets through visual self-regularization, significantly improving visual recognition tasks like object categorization and localization.
Contribution
The paper proposes Visual Self-Regularization (VISER), a novel semi-supervised approach that uses unlabeled images as local perturbations to enhance visual model robustness.
Findings
Improved object categorization accuracy on MS COCO and Visual Genome.
Effective retrieval of similar unlabeled images using cosine similarity in semantic space.
Scalable retrieval method using distributed approximate nearest neighbor algorithm.
Abstract
In this work, we propose the use of large set of unlabeled images as a source of regularization data for learning robust visual representation. Given a visual model trained by a labeled dataset in a supervised fashion, we augment our training samples by incorporating large number of unlabeled data and train a semi-supervised model. We demonstrate that our proposed learning approach leverages an abundance of unlabeled images and boosts the visual recognition performance which alleviates the need to rely on large labeled datasets for learning robust representation. To increment the number of image instances needed to learn robust visual models in our approach, each labeled image propagates its label to its nearest unlabeled image instances. These retrieved unlabeled images serve as local perturbations of each labeled image to perform Visual Self-Regularization (VISER). To retrieve such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
