Spatial and Semantic Consistency Regularizations for Pedestrian   Attribute Recognition

Jian Jia; Xiaotang Chen; Kaiqi Huang

arXiv:2109.05686·cs.CV·September 14, 2021·1 cites

Spatial and Semantic Consistency Regularizations for Pedestrian Attribute Recognition

Jian Jia, Xiaotang Chen, Kaiqi Huang

PDF

Open Access

TL;DR

This paper introduces a Spatial and Semantic Consistency (SSC) framework for pedestrian attribute recognition that leverages inter-image relations and human priors to improve accuracy without increasing model complexity.

Contribution

The paper proposes a novel SSC framework with regularizations for spatial and semantic consistency, enhancing attribute recognition by exploiting inter-image relations and prior knowledge.

Findings

01

Outperforms state-of-the-art methods on PA100K, RAP, and PETA datasets.

02

Improves attribute localization and semantic feature extraction.

03

Achieves better accuracy without increasing model parameters.

Abstract

While recent studies on pedestrian attribute recognition have shown remarkable progress in leveraging complicated networks and attention mechanisms, most of them neglect the inter-image relations and an important prior: spatial consistency and semantic consistency of attributes under surveillance scenarios. The spatial locations of the same attribute should be consistent between different pedestrian images, \eg, the ``hat" attribute and the ``boots" attribute are always located at the top and bottom of the picture respectively. In addition, the inherent semantic feature of the ``hat" attribute should be consistent, whether it is a baseball cap, beret, or helmet. To fully exploit inter-image relations and aggregate human prior in the model learning process, we construct a Spatial and Semantic Consistency (SSC) framework that consists of two complementary regularizations to achieve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning