TL;DR
This paper introduces a novel one-shot learning method that uses a dual auto-encoder to map multi-level visual features to a semantic space, interpolate among concepts, and generate augmented features for improved recognition with limited samples.
Contribution
It proposes a dual TriNet auto-encoder that directly synthesizes image features from semantic concepts, enhancing one-shot learning by leveraging multi-level visual features and semantic interpolation.
Findings
Significantly improves one-shot learning accuracy.
Effective use of multi-level features and semantic interpolation.
Outperforms existing methods on benchmark datasets.
Abstract
The ability to quickly recognize and learn new visual concepts from limited samples enables humans to swiftly adapt to new environments. This ability is enabled by semantic associations of novel concepts with those that have already been learned and stored in memory. Computers can start to ascertain similar abilities by utilizing a semantic concept space. A concept space is a high-dimensional semantic space in which similar abstract concepts appear close and dissimilar ones far apart. In this paper, we propose a novel approach to one-shot learning that builds on this idea. Our approach learns to map a novel sample instance to a concept, relates that concept to the existing ones in the concept space and generates new instances, by interpolating among the concepts, to help learning. Instead of synthesizing new image instance, we propose to directly synthesize instance features by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
