SPADE: Structured Pruning and Adaptive Distillation for Efficient LLM-TTS

Published: September 25, 2025 | arXiv ID: 2509.20802v1

By: Tan Dat Nguyen, Jaehun Kim, Ji-Hoon Kim, and more

Potential Business Impact:

Makes AI voice generation faster and lighter while keeping quality close to the original.

Business Areas:

Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

The goal of this paper is to introduce SPADE, a framework for Structured Pruning and Adaptive Distillation for Efficient Large Language Model-based text-to-speech (LLM-TTS). Recent LLM-TTS systems achieve strong controllability and zero-shot generalization, but their large parameter counts and high latency limit real-world deployment. SPADE addresses this by combining (i) a pruning step guided by a word-error-rate-based layer importance index to remove non-essential Transformer layers, with (ii) multi-level knowledge distillation to restore autoregressive coherence. On zero-shot benchmarks, SPADE preserves near-parity perceptual quality while halving Transformer depth, reducing VRAM usage by up to 20%, and achieving up to 1.7x faster real-time factor with less than 5% of the original training data. These results show that compact LLM-TTS models can maintain naturalness and speaker similarity while enabling practical real-time speech generation. Audio samples are available at https://mm.kaist.ac.kr/projects/SPADE/.
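The pruning step described in the abstract can be illustrated with a small sketch. The abstract does not define the layer importance index, so the code below assumes one plausible form: the increase in word error rate (WER) observed when a single Transformer layer is skipped. The function names, the `wer_without_layer` callback, and the toy WER values are all hypothetical and are not taken from the paper.

```python
from typing import Callable, Dict, List


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the processed ref prefix and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, h in enumerate(hyp, start=1):
            curr[j] = min(
                prev[j] + 1,             # deletion (reference word missing)
                curr[j - 1] + 1,         # insertion (extra hypothesis word)
                prev[j - 1] + (r != h),  # substitution, or match at cost 0
            )
        prev = curr
    return prev[-1] / len(ref)


def layer_importance(
    num_layers: int,
    baseline_wer: float,
    wer_without_layer: Callable[[int], float],
) -> Dict[int, float]:
    """ASSUMED index: WER increase when layer i is skipped (higher = more important)."""
    return {i: wer_without_layer(i) - baseline_wer for i in range(num_layers)}


def layers_to_keep(importance: Dict[int, float], keep: int) -> List[int]:
    """Keep the `keep` most important layers, returned in their original order."""
    ranked = sorted(importance, key=importance.get, reverse=True)
    return sorted(ranked[:keep])


# Toy numbers, invented for illustration only (not results from the paper).
# In practice each value would come from synthesizing speech with layer i
# skipped, transcribing it with an ASR model, and calling word_error_rate.
toy_wer = {0: 0.40, 1: 0.06, 2: 0.05, 3: 0.12, 4: 0.055, 5: 0.30}
importance = layer_importance(6, baseline_wer=0.05, wer_without_layer=toy_wer.get)
kept = layers_to_keep(importance, keep=3)  # halve a 6-layer stack
print(kept)  # [0, 3, 5]
```

Under this assumption, the layers whose removal barely changes WER are pruned first. The pruned model would then be fine-tuned with knowledge distillation from the original model, which this sketch does not cover.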

Audio and Speech Processing

89%

SPADE: A Large Language Model Framework for Soil Moisture Pattern Recognition and Anomaly Detection in Precision Agriculture

Artificial Intelligence

Helps farmers know when to water crops.

10 Sep 2025

88%

A Hybrid Early-Exit Algorithm for Large Language Models Based on Space Alignment Decoding (SPADE)

Computation and Language

Makes smart computer programs faster and cheaper.

23 Jul 2025

Page Count

5 pages
