EFormer: Enhanced Transformer towards Semantic-Contour Features of Foreground for Portraits Matting
Zitao Wang, Qiguang Miao, Peipei Zhao, Yue Xi

TL;DR
EFormer enhances transformer models for portrait matting by effectively capturing both semantic and high-frequency contour features, leading to more accurate alpha matte predictions with detailed contours.
Contribution
The paper introduces a cross-attention based approach to improve contour detail modeling in transformer-based portrait matting methods.
Findings
EFormer outperforms previous methods on VideoMatte240K and AIM datasets.
The cross-attention module effectively guides focus to high-frequency contour regions.
The combined semantic and contour features improve matte accuracy.
Abstract
The portrait matting task aims to extract an alpha matte with complete semantics and finely-detailed contours. In comparison to CNN-based approaches, transformers with self-attention module have a better capacity to capture long-range dependencies and low-frequency semantic information of a portrait. However, the recent research shows that self-attention mechanism struggles with modeling high-frequency contour information and capturing fine contour details, which can lead to bias while predicting the portrait's contours. To deal with this issue, we propose EFormer to enhance the model's attention towards both of the low-frequency semantic and high-frequency contour features. For the high-frequency contours, our research demonstrates that cross-attention module between different resolutions can guide our model to allocate attention appropriately to these contour regions. Supported on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColor Science and Applications · Image Enhancement Techniques · Visual Attention and Saliency Detection
MethodsSoftmax · Concatenated Skip Connection
