HyperVision: A Channel-Adaptive Ground-Based Hyperspectral Vision Pre-trained Backbone

Guanyiman Fu; Jingtao Li; Zihang Cheng; Zhuanfeng Li; Diqi Chen; Yan Xu; Fengchao Xiong; Jianfeng Lu; Jun Zhou

arXiv:2605.17286·cs.CV·May 19, 2026

HyperVision: A Channel-Adaptive Ground-Based Hyperspectral Vision Pre-trained Backbone

Guanyiman Fu, Jingtao Li, Zihang Cheng, Zhuanfeng Li, Diqi Chen, Yan Xu, Fengchao Xiong, Jianfeng Lu, Jun Zhou

PDF

1 Repo

TL;DR

HyperVision is a novel ground-based hyperspectral pre-trained backbone that handles varying spectral configurations, label scarcity, and limited dataset diversity, achieving state-of-the-art results across multiple tasks.

Contribution

It introduces a channel-adaptive embedding, multi-source pseudo-labeling, and cross-modal knowledge distillation for hyperspectral vision, enabling universal perception.

Findings

01

Achieves up to 16.3% relative improvement in hyperspectral semantic segmentation accuracy.

02

Gains 2.1% in object tracking AUC.

03

Reduces salient object detection MAE by 35.5%.

Abstract

While hyperspectral imaging provides rich spatial-spectral information across hundreds of narrow wavelength bands for precise material identification, ground-based hyperspectral pre-trained backbones remain absent, constrained by varying spectral configurations across sensors, the scarcity and inconsistency of labels, and the limited scale and scene diversity of existing datasets. To address these challenges and enable universal perception, we propose HyperVision, the first ground-based hyperspectral pre-trained backbone. First, to handle varying spectral configurations, HyperVision adopts a channel-adaptive dynamic embedding mechanism to map heterogeneous inputs into a unified token space. Second, to address the scarcity and inconsistency of labels, we introduce a multi-source pseudo-labeling method that fuses semantic representations from both spatial structures generated by SAM2 and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lronkitty/HyperVision
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.