MTSIC: Multi-stage Transformer-based GAN for Spectral Infrared Image Colorization
Tingting Liu, Yuan Liu, Jinhui Tang, Liyin Yuan, Chengyu Liu, Chunlai Li, Xiubao Sui, and Qian Chen

TL;DR
This paper introduces MTSIC, a multi-stage Transformer-based GAN that leverages spectral information and self-attention mechanisms to improve the colorization of thermal infrared images, enhancing their visual quality and semantic accuracy.
Contribution
The paper presents a novel multi-stage Transformer GAN framework that integrates spectral self-attention and multi-scale wavelet blocks for superior infrared image colorization.
Findings
Outperforms traditional colorization methods in visual quality.
Effectively preserves spectral details and semantic information.
Reduces semantic ambiguity in infrared image colorization.
Abstract
Thermal infrared (TIR) images, acquired through thermal radiation imaging, are unaffected by variations in lighting conditions and atmospheric haze. However, TIR images inherently lack color and texture information, limiting downstream tasks and potentially causing visual fatigue. Existing colorization methods primarily rely on single-band images with limited spectral information and insufficient feature extraction capabilities, which often result in image distortion and semantic ambiguity. In contrast, multiband infrared imagery provides richer spectral data, facilitating the preservation of finer details and enhancing semantic accuracy. In this paper, we propose a generative adversarial network (GAN)-based framework designed to integrate spectral information to enhance the colorization of infrared images. The framework employs a multi-stage spectral self-attention Transformer network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Layer Normalization · Dropout · Absolute Position Encodings · Convolution · Dense Connections · Byte Pair Encoding · Softmax · Residual Block · Colorization
