Efficient Remote Sensing Segmentation With Generative Adversarial Transformer
Luyi Qiu, Dayu Yu, Xiaofeng Zhang, Chenxiao Zhang

TL;DR
This paper introduces GATrans, an efficient generative adversarial transformer model for high-precision semantic segmentation on embedded devices, combining global transformers and structural similarity losses for improved performance.
Contribution
The paper presents GATrans, a lightweight transformer-based segmentation framework that maintains high accuracy while significantly reducing model size and computational complexity.
Findings
Achieved 90.17% F1 score on Vaihingen dataset
Attained 91.92% overall accuracy
Demonstrated efficiency suitable for embedded devices
Abstract
Most deep learning methods that achieve high segmentation accuracy require deep network architectures that are too heavy and complex to run on embedded devices with limited storage and memory space. To address this issue, this paper proposes an efficient Generative Adversarial Transfomer (GATrans) for achieving high-precision semantic segmentation while maintaining an extremely efficient size. The framework utilizes a Global Transformer Network (GTNet) as the generator, efficiently extracting multi-level features through residual connections. GTNet employs global transformer blocks with progressively linear computational complexity to reassign global features based on a learnable similarity function. To focus on object-level and pixel-level information, the GATrans optimizes the objective function by combining structural similarity losses. We validate the effectiveness of our approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Linear Layer · Label Smoothing · Absolute Position Encodings · Adam · Residual Connection · Focus · Layer Normalization
