AgMTR: Agent Mining Transformer for Few-shot Segmentation in Remote Sensing
Hanbo Bi, Yingchao Feng, Yongqiang Mao, Jianning Pei, Wenhui Diao,, Hongqi Wang, Xian Sun

TL;DR
This paper introduces AgMTR, a transformer-based method for few-shot segmentation in remote sensing that uses agent-level semantic correlation to improve accuracy amidst intra-class variation and cluttered backgrounds.
Contribution
The paper proposes a novel Agent Mining Transformer (AgMTR) with local-aware agents and decoders to enhance semantic clarity and leverage unlabeled data for few-shot segmentation.
Findings
Achieves state-of-the-art results on iSAID remote sensing benchmark.
Performs competitively on natural image datasets PASCAL-5i and COCO-20i.
Demonstrates robustness in complex remote sensing scenarios.
Abstract
Few-shot Segmentation (FSS) aims to segment the interested objects in the query image with just a handful of labeled samples (i.e., support images). Previous schemes would leverage the similarity between support-query pixel pairs to construct the pixel-level semantic correlation. However, in remote sensing scenarios with extreme intra-class variations and cluttered backgrounds, such pixel-level correlations may produce tremendous mismatches, resulting in semantic ambiguity between the query foreground (FG) and background (BG) pixels. To tackle this problem, we propose a novel Agent Mining Transformer (AgMTR), which adaptively mines a set of local-aware agents to construct agent-level semantic correlation. Compared with pixel-level semantics, the given agents are equipped with local-contextual information and possess a broader receptive field. At this point, different query pixels can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote Sensing and Land Use · Atmospheric and Environmental Gas Dynamics · Metaheuristic Optimization Algorithms Research
MethodsAttention Is All You Need · Sparse Evolutionary Training · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Byte Pair Encoding · Absolute Position Encodings · Softmax · Layer Normalization · Dropout
