SpecSwin3D: Generating Hyperspectral Imagery from Multispectral Data via Transformer Networks
By: Tang Sui, Songxi Yang, Qunying Huang
Potential Business Impact:
Makes blurry satellite pictures sharp and detailed.
Multispectral and hyperspectral imagery are widely used in agriculture, environmental monitoring, and urban planning due to their complementary spatial and spectral characteristics. A fundamental trade-off persists: multispectral imagery offers high spatial but limited spectral resolution, while hyperspectral imagery provides rich spectra at lower spatial resolution. Prior hyperspectral generation approaches (e.g., pan-sharpening variants, matrix factorization, CNNs) often struggle to jointly preserve spatial detail and spectral fidelity. In response, we propose SpecSwin3D, a transformer-based model that generates hyperspectral imagery from multispectral inputs while preserving both spatial and spectral quality. Specifically, SpecSwin3D takes five multispectral bands as input and reconstructs 224 hyperspectral bands at the same spatial resolution. In addition, we observe that reconstruction errors grow for hyperspectral bands spectrally distant from the input bands. To address this, we introduce a cascade training strategy that progressively expands the spectral range to stabilize learning and improve fidelity. Moreover, we design an optimized band sequence that strategically repeats and orders the five selected multispectral bands to better capture pairwise relations within a 3D shifted-window transformer framework. Quantitatively, our model achieves a PSNR of 35.82 dB, SAM of 2.40{\deg}, and SSIM of 0.96, outperforming the baseline MHF-Net by +5.6 dB in PSNR and reducing ERGAS by more than half. Beyond reconstruction, we further demonstrate the practical value of SpecSwin3D on two downstream tasks, including land use classification and burnt area segmentation.
Similar Papers
Hyperspectral Vision Transformers for Greenhouse Gas Estimations from Space
CV and Pattern Recognition
Lets satellites see pollution better, more often.
Hyperspectral Image Classification using Spectral-Spatial Mixer Network
CV and Pattern Recognition
Helps computers identify things in pictures better.
SpectralTrain: A Universal Framework for Hyperspectral Image Classification
CV and Pattern Recognition
Trains computers to see Earth from space faster.