FedSODA: Federated Fine-tuning of LLMs via Similarity Group Pruning and Orchestrated Distillation Alignment
By: Manning Zhu, Songtao Guo, Pengzhan Zhou, and more
Potential Business Impact:
Lets phones learn without sending private data.
Federated fine-tuning (FFT) of large language models (LLMs) has recently emerged as a promising solution for enabling domain-specific adaptation while preserving data privacy. Despite its benefits, FFT on resource-constrained clients is hindered by the high computational and memory demands of full-model fine-tuning, which limits its practical deployment. This paper presents FedSODA, a resource-efficient FFT framework that enables clients to adapt LLMs without accessing or storing the full model. Specifically, we first propose a similarity group pruning (SGP) module, which prunes redundant layers from the full LLM while retaining the most critical layers to preserve model performance. We further introduce an orchestrated distillation alignment (ODA) module to reduce gradient divergence between the sub-LLM and the full LLM during FFT. By leveraging QLoRA, clients only need to deploy quantized sub-LLMs and fine-tune lightweight adapters, significantly reducing local resource requirements. We conduct extensive experiments on three open-source LLMs across a variety of downstream tasks. The results demonstrate that FedSODA reduces communication overhead by an average of 70.6%, decreases storage usage by 75.6%, and improves task accuracy by 3.1%, making it well suited for practical FFT under resource constraints.
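To make the pruning idea concrete, here is a minimal PyTorch sketch of similarity-based group pruning. The abstract only states that redundant layers are scored and pruned as a group; the scoring rule below (cosine similarity between each layer's input and output hidden states on a calibration batch) and all function names are illustrative assumptions, not the paper's exact method.

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def layer_similarities(layers, hidden):
    """Score each layer by the cosine similarity between its input and
    output hidden states: similarity near 1 means the layer barely
    changes the representation, i.e. it is likely redundant."""
    scores = []
    for layer in layers:
        out = layer(hidden)
        scores.append(
            F.cosine_similarity(hidden.flatten(1), out.flatten(1)).mean().item()
        )
        hidden = out
    return scores

def prune_group(layers, scores, group_size=4):
    """Drop the contiguous group of `group_size` layers with the highest
    mean similarity; the surviving layers form the client-side sub-LLM."""
    start = max(range(len(scores) - group_size + 1),
                key=lambda i: sum(scores[i:i + group_size]))
    return torch.nn.ModuleList(
        layer for i, layer in enumerate(layers)
        if not (start <= i < start + group_size)
    )

# Toy usage with generic transformer blocks standing in for LLM layers:
layers = torch.nn.ModuleList(
    torch.nn.TransformerEncoderLayer(d_model=64, nhead=4, batch_first=True)
    for _ in range(12)
)
calib = torch.randn(2, 16, 64)  # stand-in calibration batch
sub_llm_layers = prune_group(layers, layer_similarities(layers, calib))
```

The alignment objective can be sketched in the same hedged spirit: the abstract says ODA reduces gradient divergence between the sub-LLM and the full LLM, which is approximated here by a standard temperature-scaled distillation term mixed into the task loss. The weights `alpha` and `tau` are assumed hyperparameters, and the paper's actual orchestration may differ.

```python
def alignment_loss(sub_logits, full_logits, labels, alpha=0.5, tau=2.0):
    """Task cross-entropy plus a KL term pulling the sub-LLM's output
    distribution toward the full LLM's, so local updates on the pruned
    model stay close to full-model fine-tuning."""
    ce = F.cross_entropy(sub_logits.view(-1, sub_logits.size(-1)),
                         labels.view(-1))
    kd = F.kl_div(F.log_softmax(sub_logits / tau, dim=-1),
                  F.softmax(full_logits / tau, dim=-1),
                  reduction="batchmean") * tau * tau
    return (1.0 - alpha) * ce + alpha * kd
```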
Similar Papers
FFT-MoE: Efficient Federated Fine-Tuning for Foundation Models via Large-scale Sparse MoE under Heterogeneous Edge
Machine Learning (CS)
Teaches AI to learn from many computers without sharing secrets.
OvA-LP: A Simple and Efficient Framework for Federated Learning on Non-IID Data
Machine Learning (CS)
Makes AI learn from many computers without mixing data.
Communication-Efficient Wireless Federated Fine-Tuning for Large-Scale AI Models
Machine Learning (CS)
Trains big computer brains with less data sent.