ProtoGAN: Towards Few Shot Learning for Action Recognition
Sai Kumar Dwivedi, Vikram Gupta, Rahul Mitra, Shuaib Ahmed, Arjun Jain

TL;DR
ProtoGAN introduces a generative framework that synthesizes training examples for novel action categories in few-shot and generalized few-shot learning, significantly improving recognition accuracy.
Contribution
The paper proposes ProtoGAN, a novel generative approach using class prototypes to synthesize data for unseen classes, advancing few-shot and generalized few-shot action recognition.
Findings
Outperforms state-of-the-art in FSL on three datasets.
First to report results for G-FSL in action recognition.
Provides a new benchmark for future G-FSL research.
Abstract
Few-shot learning (FSL) for action recognition is a challenging task of recognizing novel action categories which are represented by few instances in the training data. In a more generalized FSL setting (G-FSL), both seen as well as novel action categories need to be recognized. Conventional classifiers suffer due to inadequate data in FSL setting and inherent bias towards seen action categories in G-FSL setting. In this paper, we address this problem by proposing a novel ProtoGAN framework which synthesizes additional examples for novel categories by conditioning a conditional generative adversarial network with class prototype vectors. These class prototype vectors are learnt using a Class Prototype Transfer Network (CPTN) from examples of seen categories. Our synthesized examples for a novel class are semantically similar to real examples belonging to that class and is used to train…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
