DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns
Ali Diba, Ali Mohammad Pazandeh, Hamed Pirsiavash, Luc Van Gool

TL;DR
DeepCAMP introduces a novel CNN that mines discriminative mid-level patches for fine-grained human action and attribute recognition, achieving state-of-the-art results without requiring part or pose annotations.
Contribution
It presents a new CNN architecture that iteratively learns and clusters patches, effectively capturing subtle features for action and attribute classification.
Findings
State-of-the-art accuracy on PASCAL VOC 2012 Action dataset
Effective attribute recognition on Berkeley Attributes of People dataset
No need for part or pose annotations
Abstract
The recognition of human actions and the determination of human attributes are two tasks that call for fine-grained classification. Indeed, often rather small and inconspicuous objects and features have to be detected to tell their classes apart. In order to deal with this challenge, we propose a novel convolutional neural network that mines mid-level image patches that are sufficiently dedicated to resolve the corresponding subtleties. In particular, we train a newly de- signed CNN (DeepPattern) that learns discriminative patch groups. There are two innovative aspects to this. On the one hand we pay attention to contextual information in an origi- nal fashion. On the other hand, we let an iteration of feature learning and patch clustering purify the set of dedicated patches that we use. We validate our method for action clas- sification on two challenging datasets: PASCAL VOC 2012…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning
