CATformer: Contrastive Adversarial Transformer for Image Super-Resolution
By: Qinyi Tian, Spence Cox, Laura E. Dalton
Potential Business Impact:
Makes blurry pictures sharp and clear.
Super-resolution remains a promising technique to enhance the quality of low-resolution images. This study introduces CATformer (Contrastive Adversarial Transformer), a novel neural network integrating diffusion-inspired feature refinement with adversarial and contrastive learning. CATformer employs a dual-branch architecture combining a primary diffusion-inspired transformer, which progressively refines latent representations, with an auxiliary transformer branch designed to enhance robustness to noise through learned latent contrasts. These complementary representations are fused and decoded using deep Residual-in-Residual Dense Blocks for enhanced reconstruction quality. Extensive experiments on benchmark datasets demonstrate that CATformer outperforms recent transformer-based and diffusion-inspired methods both in efficiency and visual image quality. This work bridges the performance gap among transformer-, diffusion-, and GAN-based methods, laying a foundation for practical applications of diffusion-inspired transformers in super-resolution.
Similar Papers
CTLformer: A Hybrid Denoising Model Combining Convolutional Layers and Self-Attention for Enhanced CT Image Reconstruction
Image and Video Processing
Cleans up blurry medical scans for better health checks.
CAT: A Conditional Adaptation Tailor for Efficient and Effective Instance-Specific Pansharpening on Real-World Data
CV and Pattern Recognition
Makes satellite pictures clearer, faster, and work anywhere.
UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers
CV and Pattern Recognition
Makes AI create much bigger, clearer pictures.