Achieving 3D Attention via Triplet Squeeze and Excitation Block
Maan Alhazmi, Abdulrahman Altahhan

TL;DR
This paper introduces TripSE, a novel attention mechanism combining Triplet attention with Squeeze-and-Excitation, integrated into CNN architectures to enhance image classification and facial expression recognition performance.
Contribution
The paper proposes TripSE, a new attention module that improves CNN models like ResNet, DenseNet, and ConvNeXt, demonstrating significant performance gains across multiple datasets.
Findings
TripSE boosts CNN performance, especially in ConvNeXt.
ConvNeXt with TripSE achieves 78.27% accuracy on FER2013.
The proposed models outperform existing methods on several benchmarks.
Abstract
The emergence of ConvNeXt and its variants has reaffirmed the conceptual and structural suitability of CNN-based models for vision tasks, re-establishing them as key players in image classification in general, and in facial expression recognition (FER) in particular. In this paper, we propose a new set of models that build on these advancements by incorporating a new set of attention mechanisms that combines Triplet attention with Squeeze-and-Excitation (TripSE) in four different variants. We demonstrate the effectiveness of these variants by applying them to the ResNet18, DenseNet and ConvNext architectures to validate their versatility and impact. Our study shows that incorporating a TripSE block in these CNN models boosts their performances, particularly for the ConvNeXt architecture, indicating its utility. We evaluate the proposed mechanisms and associated models across four…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Face recognition and analysis · Face Recognition and Perception
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Attention Is All You Need · Concatenated Skip Connection · Global Average Pooling · Convolution · Batch Normalization · 1x1 Convolution · Kaiming Initialization · Dense Connections · Dense Block
