CSANet: Channel Spatial Attention Network for Robust 3D Face Alignment   and Reconstruction

Yilin Liu; Xuezhou Guo; Xinqi Wang; Fangzhou Du

arXiv:2405.19659·cs.CV·May 31, 2024

CSANet: Channel Spatial Attention Network for Robust 3D Face Alignment and Reconstruction

Yilin Liu, Xuezhou Guo, Xinqi Wang, Fangzhou Du

PDF

Open Access

TL;DR

CSANet is a novel end-to-end 3D face alignment and reconstruction network that leverages advanced attention mechanisms and specialized loss functions to improve accuracy and training stability.

Contribution

It introduces a new architecture combining channel spatial attention with a robust training strategy for 3D face modeling.

Findings

01

Outperforms baseline models quantitatively

02

Achieves superior qualitative reconstruction results

03

Demonstrates stable training and convergence

Abstract

Our project proposes an end-to-end 3D face alignment and reconstruction network. The backbone of our model is built by Bottle-Neck structure via Depth-wise Separable Convolution. We integrate Coordinate Attention mechanism and Spatial Group-wise Enhancement to extract more representative features. For more stable training process and better convergence, we jointly use Wing loss and the Weighted Parameter Distance Cost to learn parameters for 3D Morphable model and 3D vertices. Our proposed model outperforms all baseline models both quantitatively and qualitatively.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Medical Imaging and Analysis

MethodsCoordinate attention · Convolution