Score: 0

Frequency-Integrated Transformer for Arbitrary-Scale Super-Resolution

Published: April 26, 2025 | arXiv ID: 2504.18818v1

By: Xufei Wang , Fei Ge , Jinchen Zhu and more

Potential Business Impact:

Makes blurry pictures sharp and clear.

Business Areas:

Facial Recognition Data and Analytics, Software

Methods based on implicit neural representation have demonstrated remarkable capabilities in arbitrary-scale super-resolution (ASSR) tasks, but they neglect the potential value of the frequency domain, leading to sub-optimal performance. We proposes a novel network called Frequency-Integrated Transformer (FIT) to incorporate and utilize frequency information to enhance ASSR performance. FIT employs Frequency Incorporation Module (FIM) to introduce frequency information in a lossless manner and Frequency Utilization Self-Attention module (FUSAM) to efficiently leverage frequency information by exploiting spatial-frequency interrelationship and global nature of frequency. FIM enriches detail characterization by incorporating frequency information through a combination of Fast Fourier Transform (FFT) with real-imaginary mapping. In FUSAM, Interaction Implicit Self-Attention (IISA) achieves cross-domain information synergy by interacting spatial and frequency information in subspace, while Frequency Correlation Self-attention (FCSA) captures the global context by computing correlation in frequency. Experimental results demonstrate FIT yields superior performance compared to existing methods across multiple benchmark datasets. Visual feature map proves the superiority of FIM in enriching detail characterization. Frequency error map validates IISA productively improve the frequency fidelity. Local attribution map validates FCSA effectively captures global context.

FSATFusion: Frequency-Spatial Attention Transformer for Infrared and Visible Image Fusion

CV and Pattern Recognition

Makes blurry night pictures clear and detailed.

12 Jun 2025 1

88%

FRAMER: Frequency-Aligned Self-Distillation with Adaptive Modulation Leveraging Diffusion Priors for Real-World Image Super-Resolution

CV and Pattern Recognition

Makes blurry pictures sharp and clear.

1 Dec 2025 0

88%

SuperF: Neural Implicit Fields for Multi-Image Super-Resolution

CV and Pattern Recognition

Makes blurry pictures sharp using many views.

9 Dec 2025 1

View PDF Login to Bookmark

Page Count

11 pages

Frequency-Integrated Transformer for Arbitrary-Scale Super-Resolution

Makes blurry pictures sharp and clear.

Technical Abstract

FSATFusion: Frequency-Spatial Attention Transformer for Infrared and Visible Image Fusion

FRAMER: Frequency-Aligned Self-Distillation with Adaptive Modulation Leveraging Diffusion Priors for Real-World Image Super-Resolution

SuperF: Neural Implicit Fields for Multi-Image Super-Resolution