Coupled Generative Adversarial Network for Continuous Fine-grained Action Segmentation
Harshala Gammulle, Tharindu Fernando, Simon Denman, Sridha Sridharan,, Clinton Fookes

TL;DR
This paper introduces a novel conditional GAN architecture that leverages multi-modal data and scene context to improve continuous fine-grained human action segmentation, outperforming existing methods on multiple datasets.
Contribution
The paper presents a dual-GAN framework with context extraction that effectively integrates multiple data modalities for enhanced action segmentation.
Findings
Outperforms state-of-the-art on 50 Salads, MERL Shopping, Georgia Tech datasets
Utilizes multi-modal data including RGB, depth, and optical flow
Demonstrates the importance of context and auxiliary information in segmentation
Abstract
We propose a novel conditional GAN (cGAN) model for continuous fine-grained human action segmentation, that utilises multi-modal data and learned scene context information. The proposed approach utilises two GANs: termed Action GAN and Auxiliary GAN, where the Action GAN is trained to operate over the current RGB frame while the Auxiliary GAN utilises supplementary information such as depth or optical flow. The goal of both GANs is to generate similar `action codes', a vector representation of the current action. To facilitate this process a context extractor that incorporates data and recent outputs from both modes is used to extract context information to aid recognition. The result is a recurrent GAN architecture which learns a task specific loss function from multiple feature modalities. Extensive evaluations on variants of the proposed model to show the importance of utilising…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis
MethodsModel-Agnostic Meta-Learning · Meta Reward Learning · Convolution · Dogecoin Customer Service Number +1-833-534-1729
