Language-Informed Hyperspectral Image Synthesis for Imbalanced-Small Sample Classification via Semi-Supervised Conditional Diffusion Model
Yimin Zhu, Lincoln Linlin Xu

TL;DR
This paper introduces Txt2HSI-LDM(VAE), a novel language-guided hyperspectral image synthesis method using a semi-supervised diffusion model and VAE to improve classification in imbalanced small sample scenarios.
Contribution
It proposes a new language-informed hyperspectral image synthesis approach combining diffusion models, VAE, and semi-supervised learning to generate diverse samples conditioned on text descriptions.
Findings
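Synthetic samples generated by the model improve classification accuracy in imbalanced small-sample settings.
The model captures spatial layout and geometry effectively.
The method outperforms the classical and state-of-the-art methods it is compared against.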
Abstract
Data augmentation effectively addresses the imbalanced-small sample data (ISSD) problem in hyperspectral image classification (HSIC). While most methodologies extend features in the latent space, few leverage text-driven generation to create realistic and diverse samples. Recently, text-guided diffusion models have gained significant attention in natural image synthesis due to their ability to generate highly diverse, high-quality images from text prompts. Motivated by this, this paper proposes Txt2HSI-LDM(VAE), a novel language-informed hyperspectral image synthesis method to address the ISSD problem in HSIC. The proposed approach uses a denoising diffusion model, which iteratively removes Gaussian noise to generate hyperspectral samples conditioned on textual descriptions. First, to address the high dimensionality of hyperspectral data, a universal variational autoencoder (VAE) is…
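The iterative noise removal mentioned in the abstract can be illustrated with a minimal sketch of the standard DDPM formulation. This is not the authors' implementation: the linear noise schedule, the function names (`make_schedule`, `q_sample`, `predict_x0`, `p_sample_step`), and the array shapes are assumptions chosen for illustration. In a text-conditioned latent diffusion model, `eps_pred` would come from a trained network that takes the noisy latent, the timestep, and a text embedding; here it is simply passed in.

```python
import numpy as np

def make_schedule(T=1000, beta_start=1e-4, beta_end=0.02):
    """Linear beta schedule (an assumed, commonly used default)."""
    betas = np.linspace(beta_start, beta_end, T)
    alphas = 1.0 - betas
    alpha_bars = np.cumprod(alphas)  # cumulative product of alphas up to step t
    return betas, alphas, alpha_bars

def q_sample(x0, t, alpha_bars, noise):
    """Forward process: add Gaussian noise to a clean sample x0 at step t."""
    return np.sqrt(alpha_bars[t]) * x0 + np.sqrt(1.0 - alpha_bars[t]) * noise

def predict_x0(x_t, t, eps_pred, alpha_bars):
    """Invert q_sample: estimate x0 from x_t and a noise prediction."""
    return (x_t - np.sqrt(1.0 - alpha_bars[t]) * eps_pred) / np.sqrt(alpha_bars[t])

def p_sample_step(x_t, t, eps_pred, betas, alphas, alpha_bars, rng):
    """One reverse (denoising) step of DDPM ancestral sampling.

    eps_pred stands in for the output of a hypothetical text-conditioned
    noise predictor, e.g. eps_model(x_t, t, text_embedding).
    """
    mean = (x_t - betas[t] / np.sqrt(1.0 - alpha_bars[t]) * eps_pred) / np.sqrt(alphas[t])
    if t == 0:
        return mean  # no noise is added at the final step
    return mean + np.sqrt(betas[t]) * rng.standard_normal(x_t.shape)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    betas, alphas, alpha_bars = make_schedule(T=50)
    x0 = rng.standard_normal((4, 8))       # stand-in for a batch of latents
    noise = rng.standard_normal(x0.shape)
    x_t = q_sample(x0, 10, alpha_bars, noise)
    # With a perfect noise prediction, the clean sample is recovered.
    print(np.allclose(predict_x0(x_t, 10, noise, alpha_bars), x0))
```

Sampling would start from pure Gaussian noise and apply `p_sample_step` from the last timestep down to 0, querying the noise predictor with the text condition at every step.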
Taxonomy
Topics: Remote-Sensing Image Classification
Methods: Softmax · Attention Is All You Need · Diffusion
