Underwater Image Reconstruction Using a Swin Transformer-Based Generator and PatchGAN Discriminator
By: Md. Mahbub Hasan Akash, Aria Tasnim Mridula, Sheekar Banerjee and more
Underwater imaging is essential for marine exploration, environmental monitoring, and infrastructure inspection. However, water severely degrades images through wavelength-dependent absorption and scattering, producing color distortion, low contrast, and haze. Traditional reconstruction methods and convolutional neural network-based approaches often fail to address these challenges adequately because of their limited receptive fields and inability to model global dependencies. This paper presents a novel deep learning framework that integrates a Swin Transformer architecture within a generative adversarial network (GAN) for underwater image reconstruction. Our generator employs a U-Net structure with Swin Transformer blocks to capture both the local features and the long-range dependencies that are crucial for color correction across entire images. A PatchGAN discriminator provides adversarial training to ensure that high-frequency detail is preserved. We train and evaluate our model on the EUVP dataset, which contains paired underwater images of varying quality. Quantitative results demonstrate state-of-the-art performance, with a PSNR of 24.76 dB and an SSIM of 0.89, representing significant improvements over existing methods. Visual results show effective color balance restoration, contrast improvement, and haze reduction. An ablation study confirms the superiority of our Swin Transformer design over convolutional alternatives. The proposed method offers robust underwater image reconstruction suitable for various marine applications.
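For readers unfamiliar with the PSNR figure quoted above, the following is a minimal sketch of the standard peak signal-to-noise ratio definition, 10 * log10(peak^2 / MSE). It is not the authors' evaluation code: the function name `psnr`, the flat pixel-sequence input, and the 8-bit peak value of 255 are assumptions made for illustration, and the abstract does not state the exact evaluation protocol.

```python
import math


def psnr(reference, restored, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized pixel sequences.

    Hypothetical helper: `reference` and `restored` are flat sequences of pixel
    values, and `peak` is the assumed maximum pixel value (255 for 8-bit images).
    """
    if len(reference) != len(restored) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    # Mean squared error over all pixel values.
    mse = sum((r - x) ** 2 for r, x in zip(reference, restored)) / len(reference)

    if mse == 0:
        return math.inf  # identical images have unbounded PSNR

    return 10.0 * math.log10(peak ** 2 / mse)
```

Higher values mean the restored image is closer to the reference; because the scale is logarithmic, each additional 10 dB corresponds to a tenfold reduction in mean squared error.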
Similar Papers
A Generative Data Framework with Authentic Supervision for Underwater Image Restoration and Enhancement
CV and Pattern Recognition
Makes underwater pictures clear and colorful again.
Knowledge Distillation for Underwater Feature Extraction and Matching via GAN-synthesized Images
CV and Pattern Recognition
Helps underwater robots see and map better.
Enhancing Underwater Images via Deep Learning: A Comparative Study of VGG19 and ResNet50-Based Approaches
CV and Pattern Recognition
Cleans up blurry underwater pictures for better viewing.