SpecDETR: A transformer-based hyperspectral point object detection network
Zhaoxu Li, Wei An, Gaowei Guo, Longguang Wang, Yingqian Wang, Zaiping Lin

TL;DR
This paper introduces SpecDETR, a transformer-based network designed for hyperspectral point object detection, leveraging spatial-spectral features without pre-trained backbones, and establishes a new benchmark for evaluation.
Contribution
It proposes the first specialized hyperspectral point object detection network, SpecDETR, that directly extracts joint features using a transformer, and creates a new benchmark dataset SPOD for evaluation.
Findings
SpecDETR outperforms state-of-the-art visual detectors and HTD methods.
The proposed method effectively captures spatial-spectral joint features.
Extensive experiments validate the superiority of SpecDETR.
Abstract
Hyperspectral target detection (HTD) aims to identify specific materials based on spectral information in hyperspectral imagery and can detect extremely small-sized objects, some of which occupy a smaller than one-pixel area. However, existing HTD methods are developed based on per-pixel binary classification, neglecting the three-dimensional cube structure of hyperspectral images (HSIs) that integrates both spatial and spectral dimensions. The synergistic existence of spatial and spectral features in HSIs enable objects to simultaneously exhibit both, yet the per-pixel HTD framework limits the joint expression of these features. In this paper, we rethink HTD from the perspective of spatial-spectral synergistic representation and propose hyperspectral point object detection as an innovative task framework. We introduce SpecDETR, the first specialized network for hyperspectral…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques
MethodsAttention Is All You Need · Sparse Evolutionary Training · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Absolute Position Encodings · Byte Pair Encoding · Adam · Dropout
