Score: 0

Rare Word Recognition and Translation Without Fine-Tuning via Task Vector in Speech Models

Published: December 26, 2025 | arXiv ID: 2512.21894v1

By: Ruihao Jing , Cheng Gong , Yu Jiang and more

Rare words remain a critical bottleneck for speech-to-text systems. While direct fine-tuning improves recognition of target words, it often incurs high cost, catastrophic forgetting, and limited scalability. To address these challenges, we propose a training-free paradigm based on task vectors for rare word recognition and translation. By defining task vectors as parameter differences and introducing word-level task vector arithmetic, our approach enables flexible composition of rare-word capabilities, greatly enhancing scalability and reusability. Extensive experiments across multiple domains show that the proposed method matches or surpasses fine-tuned models on target words, improves general performance by about 5 BLEU, and mitigates catastrophic forgetting.

Category
Electrical Engineering and Systems Science:
Audio and Speech Processing