Mixture of Ranks with Degradation-Aware Routing for One-Step Real-World Image Super-Resolution
By: Xiao He , Zhijun Tu , Kun Cheng and more
Potential Business Impact:
Makes blurry pictures sharp using smart computer parts.
The demonstrated success of sparsely-gated Mixture-of-Experts (MoE) architectures, exemplified by models such as DeepSeek and Grok, has motivated researchers to investigate their adaptation to diverse domains. In real-world image super-resolution (Real-ISR), existing approaches mainly rely on fine-tuning pre-trained diffusion models through Low-Rank Adaptation (LoRA) module to reconstruct high-resolution (HR) images. However, these dense Real-ISR models are limited in their ability to adaptively capture the heterogeneous characteristics of complex real-world degraded samples or enable knowledge sharing between inputs under equivalent computational budgets. To address this, we investigate the integration of sparse MoE into Real-ISR and propose a Mixture-of-Ranks (MoR) architecture for single-step image super-resolution. We introduce a fine-grained expert partitioning strategy that treats each rank in LoRA as an independent expert. This design enables flexible knowledge recombination while isolating fixed-position ranks as shared experts to preserve common-sense features and minimize routing redundancy. Furthermore, we develop a degradation estimation module leveraging CLIP embeddings and predefined positive-negative text pairs to compute relative degradation scores, dynamically guiding expert activation. To better accommodate varying sample complexities, we incorporate zero-expert slots and propose a degradation-aware load-balancing loss, which dynamically adjusts the number of active experts based on degradation severity, ensuring optimal computational resource allocation. Comprehensive experiments validate our framework's effectiveness and state-of-the-art performance.
Similar Papers
S'MoRE: Structural Mixture of Residual Experts for LLM Fine-tuning
Computation and Language
Makes smart computer programs learn faster and better.
MoRE: 3D Visual Geometry Reconstruction Meets Mixture-of-Experts
CV and Pattern Recognition
Builds 3D shapes from pictures more accurately.
Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts
Machine Learning (CS)
Makes big AI models learn faster and better.