Face Mask Extraction in Video Sequence

Yujiang Wang; Bingnan Luo; Jie Shen; Maja Pantic

arXiv:1807.09207·cs.CV·March 2, 2021

Face Mask Extraction in Video Sequence

Yujiang Wang, Bingnan Luo, Jie Shen, Maja Pantic

PDF

TL;DR

This paper presents an end-to-end deep learning model combining ConvLSTM and FCN for detailed face mask segmentation in videos, improving accuracy by nearly 17% over baseline models on a challenging dataset.

Contribution

The introduction of a ConvLSTM-FCN model with a novel segmentation loss function for improved face mask extraction in video sequences.

Findings

01

Achieved 16.99% relative improvement in mean IoU over baseline.

02

Effectively models temporal correlations in video for better segmentation.

03

Specialized models for face, eyes, and mouth regions enhance accuracy.

Abstract

Inspired by the recent development of deep network-based methods in semantic image segmentation, we introduce an end-to-end trainable model for face mask extraction in video sequence. Comparing to landmark-based sparse face shape representation, our method can produce the segmentation masks of individual facial components, which can better reflect their detailed shape variations. By integrating Convolutional LSTM (ConvLSTM) algorithm with Fully Convolutional Networks (FCN), our new ConvLSTM-FCN model works on a per-sequence basis and takes advantage of the temporal correlation in video clips. In addition, we also propose a novel loss function, called Segmentation Loss, to directly optimise the Intersection over Union (IoU) performances. In practice, to further increase segmentation accuracy, one primary model and two additional models were trained to focus on the face, eyes, and mouth…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory