SingLoRA: Low Rank Adaptation Using a Single Matrix
By: David Bensaïd, Noam Rotstein, Roy Velich, and more
Potential Business Impact:
Makes fine-tuning large AI models more stable while using about half the added parameters.
Low-Rank Adaptation (LoRA) has significantly advanced parameter-efficient fine-tuning of large pretrained models. LoRA augments the pretrained weights of a model by adding the product of two smaller matrices that together form a low-rank update. Recent research has shown that scale disparities between these two matrices often cause unstable training dynamics, leading to suboptimal performance. In this paper, we propose SingLoRA, which reformulates low-rank adaptation by learning the weight update as the product of a single low-rank matrix and its transpose. This simple design inherently removes inter-matrix scale conflicts, ensuring stable optimization, and roughly halves the parameter count. We analyze SingLoRA within the infinite-width neural network framework, showing that it guarantees stable feature learning by construction. Extensive experiments on multiple tasks validate these benefits. In common sense reasoning, fine-tuning LLaMA 7B on MNLI with SingLoRA achieves 91.3% accuracy - surpassing LoRA (89.1%) and LoRA+ (90.2%) - while using only 60% of their parameter budget. In image generation, fine-tuning Stable Diffusion with SingLoRA significantly improves image fidelity on DreamBooth, achieving a DINO similarity score of 0.151, compared to 0.148 for DoRA and 0.143 for LoRA.
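To make the contrast concrete, here is a minimal PyTorch sketch (not the authors' implementation) of the idea described in the abstract: where LoRA learns two matrices and adds B A to the frozen weight, SingLoRA learns a single matrix A and adds the symmetric update A Aᵀ. The class name `SingLoRALinear`, the `rank` and `alpha` arguments, and the assumption of a square weight matrix are illustrative simplifications; the paper's handling of non-square weights and its ramp-up schedule are omitted.

```python
import torch
import torch.nn as nn


class SingLoRALinear(nn.Module):
    """Frozen linear layer plus a symmetric low-rank update A @ A.T (square weight assumed)."""

    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        assert base.in_features == base.out_features, "this sketch assumes a square weight matrix"
        self.base = base
        for p in self.base.parameters():          # freeze the pretrained weights
            p.requires_grad_(False)
        # Single trainable low-rank factor; standard LoRA would need two (A and B).
        self.A = nn.Parameter(0.01 * torch.randn(base.out_features, rank))
        self.scale = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        delta = self.scale * (self.A @ self.A.T)  # symmetric low-rank update A @ A.T
        return self.base(x) + x @ delta.T         # frozen path + adapted path


# Tiny usage example.
layer = SingLoRALinear(nn.Linear(64, 64), rank=4)
print(layer(torch.randn(2, 64)).shape)                                # torch.Size([2, 64])
print(sum(p.numel() for p in layer.parameters() if p.requires_grad))  # 64 * 4 = 256 trainable params
```

Because the same matrix appears on both sides of the update, there is no second factor whose scale can drift relative to the first, which is the stability argument summarized in the abstract, and the trainable parameters per layer drop from roughly 2·d·r (LoRA's A and B) to d·r.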
Similar Papers
DenseLoRA: Dense Low-Rank Adaptation of Large Language Models
Computation and Language
Makes smart computer programs learn faster.
RandLoRA: Full-rank parameter-efficient fine-tuning of large models
Computation and Language
Makes smart computer programs learn better, faster.
RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models
Machine Learning (CS)
Makes AI learn faster and better.