Local Multi-Head Channel Self-Attention for Facial Expression   Recognition

Roberto Pecoraro; Valerio Basile; Viviana Bono; Sara Gallo

arXiv:2111.07224·cs.CV·November 19, 2021

Local Multi-Head Channel Self-Attention for Facial Expression Recognition

Roberto Pecoraro, Valerio Basile, Viviana Bono, Sara Gallo

PDF

Open Access 1 Repo

TL;DR

This paper introduces LHC, a novel local multi-head channel self-attention module for CNNs, achieving state-of-the-art facial expression recognition results with lower complexity.

Contribution

The paper proposes a new channel-wise local self-attention module, LHC, specifically designed for computer vision, improving facial expression recognition performance.

Findings

01

Achieved new state-of-the-art on FER2013 dataset.

02

Lower computational complexity compared to previous methods.

03

Effective integration of LHC into CNNs enhances recognition accuracy.

Abstract

Since the Transformer architecture was introduced in 2017 there has been many attempts to bring the self-attention paradigm in the field of computer vision. In this paper we propose a novel self-attention module that can be easily integrated in virtually every convolutional neural network and that is specifically designed for computer vision, the LHC: Local (multi) Head Channel (self-attention). LHC is based on two main ideas: first, we think that in computer vision the best way to leverage the self-attention paradigm is the channel-wise application instead of the more explored spatial attention and that convolution will not be replaced by attention modules like recurrent networks were in NLP; second, a local approach has the potential to better overcome the limitations of convolution than global attention. With LHC-Net we managed to achieve a new state of the art in the famous FER2013…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bodhis4ttva/lhc_net
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Anomaly Detection Techniques and Applications

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Residual Connection · Softmax · Position-Wise Feed-Forward Layer · Adam · Dense Connections · Layer Normalization