Distilled Pooling Transformer Encoder for Efficient Realistic Image Dehazing
Le-Anh Tran, Dong-Chul Park

TL;DR
This paper introduces DPTE-Net, a lightweight transformer-based model for realistic image dehazing that reduces computational complexity through pooling mechanisms and uses distillation and GAN training to enhance performance.
Contribution
The paper presents a novel, efficient transformer encoder with pooling, combined with distillation and GAN training, for resource-efficient image dehazing.
Findings
Achieves competitive dehazing results on benchmark datasets.
Maintains low computational complexity suitable for resource-constrained devices.
Outperforms some state-of-the-art methods in efficiency while preserving quality.
Abstract
This paper proposes a lightweight neural network designed for realistic image dehazing, utilizing a Distilled Pooling Transformer Encoder, named DPTE-Net. Recently, while vision transformers (ViTs) have achieved great success in various vision tasks, their self-attention (SA) module's complexity scales quadratically with image resolution, hindering their applicability on resource-constrained devices. To overcome this, the proposed DPTE-Net substitutes traditional SA modules with efficient pooling mechanisms, significantly reducing computational demands while preserving ViTs' learning capabilities. To further enhance semantic feature learning, a distillation-based training process is implemented which transfers rich knowledge from a larger teacher network to DPTE-Net. Additionally, DPTE-Net is trained within a generative adversarial network (GAN) framework, leveraging the strong…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Brain Tumor Detection and Classification · Image and Signal Denoising Methods
MethodsAttention Is All You Need · Linear Layer · Dropout · Multi-Head Attention · Adam · Layer Normalization · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Softmax
