Efficient Model Editing with Task-Localized Sparse Fine-tuning
By: Leonardo Iurada, Marco Ciccone, Tatiana Tommasi
Potential Business Impact:
Makes AI models learn new things without forgetting old ones.
Task arithmetic has emerged as a promising approach for editing models by representing task-specific knowledge as composable task vectors. However, existing methods rely on network linearization to derive task vectors, leading to computational bottlenecks during training and inference. Moreover, linearization alone does not ensure weight disentanglement, the key property that enables conflict-free composition of task vectors. To address this, we propose TaLoS, which builds sparse task vectors with minimal interference without requiring explicit linearization or sharing information across tasks. We find that pre-trained models contain a subset of parameters with consistently low gradient sensitivity across tasks, and that sparsely updating only these parameters promotes weight disentanglement during fine-tuning. Our experiments show that TaLoS improves training and inference efficiency while outperforming current methods in task addition and negation. By enabling modular parameter editing, our approach fosters practical deployment of adaptable foundation models in real-world applications.
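To make the mechanics concrete, below is a minimal PyTorch sketch of the two ideas in the abstract: task vectors as weight differences composed by addition and negation, and sparse fine-tuning restricted to parameters with low gradient sensitivity across tasks. Everything in it is illustrative rather than the paper's actual procedure: the toy model, the synthetic per-task data, the squared-gradient sensitivity proxy, and the 30% keep fraction are all assumptions.

import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy stand-in for a pre-trained model; theta_0 is the pre-trained state.
model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 4))
theta_0 = {n: p.detach().clone() for n, p in model.named_parameters()}
loss_fn = nn.CrossEntropyLoss()

def task_batch(task_seed):
    # Hypothetical per-task data; real tasks would supply their own datasets.
    g = torch.Generator().manual_seed(task_seed)
    return (torch.randn(64, 16, generator=g),
            torch.randint(0, 4, (64,), generator=g))

def sensitivity(task_seed):
    # Sensitivity proxy (assumption): squared gradient of the task loss
    # evaluated at the pre-trained weights.
    model.zero_grad()
    x, y = task_batch(task_seed)
    loss_fn(model(x), y).backward()
    return {n: p.grad.detach().pow(2) for n, p in model.named_parameters()}

# Shared mask: keep only entries in the bottom 30% of sensitivity for EVERY
# task (the 30% fraction is illustrative, not the paper's calibrated sparsity).
sens = [sensitivity(s) for s in (1, 2)]
mask = {}
for n in theta_0:
    keep = torch.ones_like(theta_0[n], dtype=torch.bool)
    for s in sens:
        keep &= s[n] <= torch.quantile(s[n].flatten(), 0.30)
    mask[n] = keep.float()

def finetune_sparse(task_seed, steps=50, lr=1e-2):
    # Fine-tune from theta_0, zeroing gradients outside the shared
    # low-sensitivity mask; return the sparse task vector tau = theta - theta_0.
    for n, p in model.named_parameters():
        p.data.copy_(theta_0[n])
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    for _ in range(steps):
        x, y = task_batch(task_seed)
        opt.zero_grad()
        loss_fn(model(x), y).backward()
        for n, p in model.named_parameters():
            p.grad.mul_(mask[n])  # parameters outside the mask stay frozen
        opt.step()
    return {n: p.detach() - theta_0[n] for n, p in model.named_parameters()}

tau = {s: finetune_sparse(s) for s in (1, 2)}

# Task arithmetic: addition composes both skills, negation removes one.
alpha = 1.0
theta_add = {n: theta_0[n] + alpha * (tau[1][n] + tau[2][n]) for n in theta_0}
theta_neg = {n: theta_0[n] - alpha * tau[1][n] for n in theta_0}

kept = sum(m.sum().item() for m in mask.values())
total = sum(m.numel() for m in mask.values())
print(f"trainable fraction under the shared mask: {kept / total:.2%}")

The intuition the sketch captures: if the low-sensitivity set is stable across tasks, a single shared mask can be reused, and confining updates to it is what limits interference when the resulting task vectors are added or negated.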
Similar Papers
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers
Machine Learning (CS)
Teaches computers to forget or learn new things.
Cross-Model Transfer of Task Vectors via Few-Shot Orthogonal Alignment
CV and Pattern Recognition
Lets AI learn new skills from different AI.
Variational Task Vector Composition
Machine Learning (CS)
Combines computer skills without slowing them down.