Self-tuned Visual Subclass Learning with Shared Samples An Incremental Approach
Hossein Azizpour, Stefan Carlsson

TL;DR
This paper introduces an incremental, unified approach to discovering visual subclasses within semantic classes, improving recognition by modeling diverse appearances without predefining cluster numbers.
Contribution
It proposes a novel optimization technique that jointly performs clustering and classification, eliminating the need for prior cluster number or similarity measures.
Findings
Visual subclasses follow a long tail distribution.
State-of-the-art detectors struggle with tail classes.
Removing tail classes improves detection performance.
Abstract
Computer vision tasks are traditionally defined and evaluated using semantic categories. However, it is known to the field that semantic classes do not necessarily correspond to a unique visual class (e.g. inside and outside of a car). Furthermore, many of the feasible learning techniques at hand cannot model a visual class which appears consistent to the human eye. These problems have motivated the use of 1) Unsupervised or supervised clustering as a preprocessing step to identify the visual subclasses to be used in a mixture-of-experts learning regime. 2) Felzenszwalb et al. part model and other works model mixture assignment with latent variables which is optimized during learning 3) Highly non-linear classifiers which are inherently capable of modelling multi-modal input space but are inefficient at the test time. In this work, we promote an incremental view over the recognition of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods · Domain Adaptation and Few-Shot Learning
