Rare Word Recognition and Translation Without Fine-Tuning via Task Vector in Speech Models
By: Ruihao Jing , Cheng Gong , Yu Jiang and more
Rare words remain a critical bottleneck for speech-to-text systems. While direct fine-tuning improves recognition of target words, it often incurs high cost, catastrophic forgetting, and limited scalability. To address these challenges, we propose a training-free paradigm based on task vectors for rare word recognition and translation. By defining task vectors as parameter differences and introducing word-level task vector arithmetic, our approach enables flexible composition of rare-word capabilities, greatly enhancing scalability and reusability. Extensive experiments across multiple domains show that the proposed method matches or surpasses fine-tuned models on target words, improves general performance by about 5 BLEU, and mitigates catastrophic forgetting.
Similar Papers
On Fairness of Task Arithmetic: The Role of Task Vectors
Machine Learning (CS)
Fixes AI to be fairer to everyone.
Pseudo2Real: Task Arithmetic for Pseudo-Label Correction in Automatic Speech Recognition
Audio and Speech Processing
Fixes speech recognition for new accents.
Towards stable AI systems for Evaluating Arabic Pronunciations
Computation and Language
Teaches computers to understand Arabic letter sounds.