A CNN Cascade for Landmark Guided Semantic Part Segmentation

Aaron Jackson; Michel Valstar; Georgios Tzimiropoulos

arXiv:1609.09642·cs.CV·October 3, 2016

A CNN Cascade for Landmark Guided Semantic Part Segmentation

Aaron Jackson, Michel Valstar, Georgios Tzimiropoulos

PDF

Open Access

TL;DR

This paper introduces a CNN cascade that leverages pose-specific landmarks to improve semantic part segmentation, demonstrating significant performance gains in facial segmentation tasks.

Contribution

It is the first to explore the interplay between pose estimation and semantic segmentation using a CNN cascade architecture.

Findings

01

Large performance improvement on face datasets

02

Effective use of landmarks to guide segmentation

03

First integration of pose estimation with segmentation in CNNs

Abstract

This paper proposes a CNN cascade for semantic part segmentation guided by pose-specific information encoded in terms of a set of landmarks (or keypoints). There is large amount of prior work on each of these tasks separately, yet, to the best of our knowledge, this is the first time in literature that the interplay between pose estimation and semantic part segmentation is investigated. To address this limitation of prior work, in this paper, we propose a CNN cascade of tasks that firstly performs landmark localisation and then uses this information as input for guiding semantic part segmentation. We applied our architecture to the problem of facial part segmentation and report large performance improvement over the standard unguided network on the most challenging face datasets. Testing code and models will be published online at http://cs.nott.ac.uk/~psxasj/.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Biometric Identification and Security · Face and Expression Recognition