Coupled Generative Adversarial Network for Continuous Fine-grained   Action Segmentation

Harshala Gammulle; Tharindu Fernando; Simon Denman; Sridha Sridharan,; Clinton Fookes

arXiv:1909.09283·cs.CV·September 23, 2019·1 cites

Coupled Generative Adversarial Network for Continuous Fine-grained Action Segmentation

Harshala Gammulle, Tharindu Fernando, Simon Denman, Sridha Sridharan,, Clinton Fookes

PDF

Open Access

TL;DR

This paper introduces a novel conditional GAN architecture that leverages multi-modal data and scene context to improve continuous fine-grained human action segmentation, outperforming existing methods on multiple datasets.

Contribution

The paper presents a dual-GAN framework with context extraction that effectively integrates multiple data modalities for enhanced action segmentation.

Findings

01

Outperforms state-of-the-art on 50 Salads, MERL Shopping, Georgia Tech datasets

02

Utilizes multi-modal data including RGB, depth, and optical flow

03

Demonstrates the importance of context and auxiliary information in segmentation

Abstract

We propose a novel conditional GAN (cGAN) model for continuous fine-grained human action segmentation, that utilises multi-modal data and learned scene context information. The proposed approach utilises two GANs: termed Action GAN and Auxiliary GAN, where the Action GAN is trained to operate over the current RGB frame while the Auxiliary GAN utilises supplementary information such as depth or optical flow. The goal of both GANs is to generate similar `action codes', a vector representation of the current action. To facilitate this process a context extractor that incorporates data and recent outputs from both modes is used to extract context information to aid recognition. The result is a recurrent GAN architecture which learns a task specific loss function from multiple feature modalities. Extensive evaluations on variants of the proposed model to show the importance of utilising…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis

MethodsModel-Agnostic Meta-Learning · Meta Reward Learning · Convolution · Dogecoin Customer Service Number +1-833-534-1729