EFormer: Enhanced Transformer towards Semantic-Contour Features of   Foreground for Portraits Matting

Zitao Wang; Qiguang Miao; Peipei Zhao; Yue Xi

arXiv:2308.12831·cs.CV·December 4, 2023

EFormer: Enhanced Transformer towards Semantic-Contour Features of Foreground for Portraits Matting

Zitao Wang, Qiguang Miao, Peipei Zhao, Yue Xi

PDF

Open Access

TL;DR

EFormer enhances transformer models for portrait matting by effectively capturing both semantic and high-frequency contour features, leading to more accurate alpha matte predictions with detailed contours.

Contribution

The paper introduces a cross-attention based approach to improve contour detail modeling in transformer-based portrait matting methods.

Findings

01

EFormer outperforms previous methods on VideoMatte240K and AIM datasets.

02

The cross-attention module effectively guides focus to high-frequency contour regions.

03

The combined semantic and contour features improve matte accuracy.

Abstract

The portrait matting task aims to extract an alpha matte with complete semantics and finely-detailed contours. In comparison to CNN-based approaches, transformers with self-attention module have a better capacity to capture long-range dependencies and low-frequency semantic information of a portrait. However, the recent research shows that self-attention mechanism struggles with modeling high-frequency contour information and capturing fine contour details, which can lead to bias while predicting the portrait's contours. To deal with this issue, we propose EFormer to enhance the model's attention towards both of the low-frequency semantic and high-frequency contour features. For the high-frequency contours, our research demonstrates that cross-attention module between different resolutions can guide our model to allocate attention appropriately to these contour regions. Supported on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsColor Science and Applications · Image Enhancement Techniques · Visual Attention and Saliency Detection

MethodsSoftmax · Concatenated Skip Connection