TNet: Terrace Convolutional Decoder Network for Remote Sensing Image Semantic Segmentation
Chengqian Dai, Yonghong Guo, Hongzhao Xiang, Yigui Luo

TL;DR
TNet introduces a convolutional decoder that progressively fuses multi-resolution features for improved remote sensing image segmentation, emphasizing global-local integration with simple operations.
Contribution
It proposes a novel convolutional decoder architecture that effectively combines multi-resolution features without complex modules, enhancing segmentation performance.
Findings
Achieves 85.35% mIoU on ISPRS Vaihingen dataset.
Attains 87.05% mIoU on ISPRS Potsdam dataset.
Maintains high computational efficiency.
Abstract
In remote sensing, most segmentation networks adopt the UNet architecture, often incorporating modules such as Transformers or Mamba to enhance global-local feature interactions within decoder stages. However, these enhancements typically focus on intra-scale relationships and neglect the global contextual dependencies across multiple resolutions. To address this limitation, we introduce the Terrace Convolutional Decoder Network (TNet), a simple yet effective architecture that leverages only convolution and addition operations to progressively integrate low-resolution features (rich in global context) into higher-resolution features (rich in local details) across decoding stages. This progressive fusion enables the model to learn spatially-aware convolutional kernels that naturally blend global and local information in a stage-wise manner. We implement TNet with a ResNet-18 encoder…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
