Score: 1

InstructMoLE: Instruction-Guided Mixture of Low-rank Experts for Multi-Conditional Image Generation

Published: December 25, 2025 | arXiv ID: 2512.21788v1

By: Jinqi Xiao , Qing Yan , Liming Jiang and more

Potential Business Impact:

Makes AI images follow instructions better.

Business Areas:

Machine Learning Artificial Intelligence, Data and Analytics, Software

Parameter-Efficient Fine-Tuning of Diffusion Transformers (DiTs) for diverse, multi-conditional tasks often suffers from task interference when using monolithic adapters like LoRA. The Mixture of Low-rank Experts (MoLE) architecture offers a modular solution, but its potential is usually limited by routing policies that operate at a token level. Such local routing can conflict with the global nature of user instructions, leading to artifacts like spatial fragmentation and semantic drift in complex image generation tasks. To address these limitations, we introduce InstructMoLE, a novel framework that employs an Instruction-Guided Mixture of Low-Rank Experts. Instead of per-token routing, InstructMoLE utilizes a global routing signal, Instruction-Guided Routing (IGR), derived from the user's comprehensive instruction. This ensures that a single, coherently chosen expert council is applied uniformly across all input tokens, preserving the global semantics and structural integrity of the generation process. To complement this, we introduce an output-space orthogonality loss, which promotes expert functional diversity and mitigates representational collapse. Extensive experiments demonstrate that InstructMoLE significantly outperforms existing LoRA adapters and MoLE variants across challenging multi-conditional generation benchmarks. Our work presents a robust and generalizable framework for instruction-driven fine-tuning of generative models, enabling superior compositional control and fidelity to user intent.

DynMoLE: Boosting Mixture of LoRA Experts Fine-Tuning with a Hybrid Routing Mechanism

Computation and Language

Makes AI smarter and learn faster.

1 Apr 2025 1

90%

Dynamic Mixture of Curriculum LoRA Experts for Continual Multimodal Instruction Tuning

CV and Pattern Recognition

Helps AI learn new things without forgetting old ones.

13 Jun 2025 1

89%

Mixture of Ranks with Degradation-Aware Routing for One-Step Real-World Image Super-Resolution

CV and Pattern Recognition

Makes blurry pictures sharp using smart computer parts.

20 Nov 2025 2

View PDF Login to Bookmark

Repos / Data Links

github.com

Page Count

24 pages

InstructMoLE: Instruction-Guided Mixture of Low-rank Experts for Multi-Conditional Image Generation

Makes AI images follow instructions better.

Technical Abstract

DynMoLE: Boosting Mixture of LoRA Experts Fine-Tuning with a Hybrid Routing Mechanism

Dynamic Mixture of Curriculum LoRA Experts for Continual Multimodal Instruction Tuning

Mixture of Ranks with Degradation-Aware Routing for One-Step Real-World Image Super-Resolution