MoA: Heterogeneous Mixture of Adapters for Parameter-Efficient Fine-Tuning of Large Language Models
By: Jie Cao, Tianwei Lin, Hongyang He, and more
Potential Business Impact:
Makes AI smarter by mixing different kinds of small add-on learning modules.
Recent studies integrate Low-Rank Adaptation (LoRA) and Mixture-of-Experts (MoE) to further enhance the performance of parameter-efficient fine-tuning (PEFT) methods in Large Language Model (LLM) applications. Existing methods employ homogeneous MoE-LoRA architectures composed of LoRA experts with similar or identical structures and capacities. However, these approaches often suffer from representation collapse and expert load imbalance, which limit the potential of LLMs. To address these challenges, we propose a heterogeneous Mixture-of-Adapters (MoA) approach. This method dynamically integrates PEFT adapter experts with diverse structures, leveraging their complementary representational capabilities to foster expert specialization and thereby improve the transfer of pre-trained knowledge to downstream tasks. MoA supports two variants: (i) Soft MoA achieves fine-grained integration by performing a weighted fusion of all expert outputs; (ii) Sparse MoA activates adapter experts sparsely based on their contribution, incurring only negligible performance degradation. Experimental results demonstrate that heterogeneous MoA outperforms homogeneous MoE-LoRA methods in both performance and parameter efficiency. Our project is available at https://github.com/DCDmllm/MoA.
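The two routing variants described in the abstract can be made concrete with a short sketch. The following is a minimal, illustrative PyTorch implementation, not the authors' released code: the module names (LoRAAdapter, BottleneckAdapter, HeteroMoALayer), the particular mix of experts, and the top-k value are assumptions chosen to mirror the description above, with Soft MoA as a weighted fusion of all expert outputs and Sparse MoA keeping only the highest-scoring experts per token.

```python
# Illustrative sketch of heterogeneous Mixture-of-Adapters routing.
# All class names and hyperparameters here are hypothetical choices,
# not the paper's published implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F


class LoRAAdapter(nn.Module):
    """Low-rank residual update: up(down(x)), initialized to zero."""
    def __init__(self, d_model: int, rank: int = 8):
        super().__init__()
        self.down = nn.Linear(d_model, rank, bias=False)
        self.up = nn.Linear(rank, d_model, bias=False)
        nn.init.zeros_(self.up.weight)  # start as an identity update

    def forward(self, x):
        return self.up(self.down(x))


class BottleneckAdapter(nn.Module):
    """Nonlinear bottleneck adapter: a structurally different expert type."""
    def __init__(self, d_model: int, hidden: int = 16):
        super().__init__()
        self.down = nn.Linear(d_model, hidden)
        self.up = nn.Linear(hidden, d_model)
        nn.init.zeros_(self.up.weight)
        nn.init.zeros_(self.up.bias)

    def forward(self, x):
        return self.up(F.gelu(self.down(x)))


class HeteroMoALayer(nn.Module):
    """Routes tokens over structurally diverse adapter experts.

    mode='soft'   -> weighted fusion of ALL expert outputs (Soft MoA)
    mode='sparse' -> only the top-k experts per token fire (Sparse MoA)
    """
    def __init__(self, d_model: int, mode: str = "soft", top_k: int = 2):
        super().__init__()
        self.experts = nn.ModuleList([
            LoRAAdapter(d_model, rank=4),
            LoRAAdapter(d_model, rank=16),
            BottleneckAdapter(d_model, hidden=8),
            BottleneckAdapter(d_model, hidden=32),
        ])
        self.router = nn.Linear(d_model, len(self.experts))
        self.mode, self.top_k = mode, top_k

    def forward(self, x):  # x: (batch, seq, d_model)
        gate = F.softmax(self.router(x), dim=-1)                   # (b, s, E)
        outs = torch.stack([e(x) for e in self.experts], dim=-1)   # (b, s, d, E)
        if self.mode == "sparse":
            # Zero out all but the top-k gate weights, then renormalize.
            topv, topi = gate.topk(self.top_k, dim=-1)
            mask = torch.zeros_like(gate).scatter_(-1, topi, 1.0)
            gate = gate * mask
            gate = gate / gate.sum(dim=-1, keepdim=True).clamp_min(1e-9)
        # Fuse expert outputs with the gate weights; add as a residual.
        return x + torch.einsum("bsde,bse->bsd", outs, gate)


x = torch.randn(2, 5, 64)
print(HeteroMoALayer(64, mode="soft")(x).shape)    # torch.Size([2, 5, 64])
print(HeteroMoALayer(64, mode="sparse")(x).shape)  # torch.Size([2, 5, 64])
```

Note the design intent this sketch tries to capture: because the experts differ in structure and capacity, the router can specialize them rather than collapsing onto interchangeable copies, which is the failure mode the abstract attributes to homogeneous MoE-LoRA.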
Similar Papers
TT-LoRA MoE: Unifying Parameter-Efficient Fine-Tuning and Sparse Mixture-of-Experts
Machine Learning (CS)
Makes AI learn many tasks using less computer power.
Enhancing Multi-modal Models with Heterogeneous MoE Adapters for Fine-tuning
Machine Learning (CS)
Makes AI understand different kinds of information together.
DynMoLE: Boosting Mixture of LoRA Experts Fine-Tuning with a Hybrid Routing Mechanism
Computation and Language
Makes AI smarter and helps it learn faster.