TensLoRA: Tensor Alternatives for Low-Rank Adaptation
By: Axel Marmoret, Reda Bensaid, Jonathan Lys, and more
Potential Business Impact:
Makes AI learn better with less computer memory.
Low-Rank Adaptation (LoRA) is widely used to efficiently adapt Transformers by adding trainable low-rank matrices to the attention projections. While effective, standard LoRA treats these matrices as independent for each attention projection (Query, Key, and Value) and each layer. Recent extensions have considered joint, tensor-based adaptations, but only in limited forms and without a systematic framework. We introduce TensLoRA, a unified framework that aggregates LoRA updates into higher-order tensors and models a broad family of tensor-based low-rank adaptations. Our formulation generalizes existing tensor-based methods and enables mode-specific compression rates, allowing parameter budgets to be tailored to the modality and task. Experiments on vision and language benchmarks reveal that the tensor construction directly impacts performance, sometimes exceeding standard LoRA at similar parameter counts.
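To make the contrast concrete, here is a minimal PyTorch sketch (not the authors' code) of the underlying idea: standard LoRA learns an independent rank-r update B @ A for each projection in each layer, whereas a tensor-based variant stacks all updates into one higher-order tensor and factorizes it jointly, so each mode (output dim, input dim, projection, layer) gets its own rank. The specific Tucker-style factorization, variable names, and rank values below are illustrative assumptions, not the paper's exact construction.

```python
import torch

d, L, P = 64, 12, 3          # hidden dim, number of layers, projections (Q, K, V)

# --- Standard LoRA: independent rank-r factors per (layer, projection) ---
r = 4
A = torch.randn(L, P, r, d) * 0.01   # down-projections
B = torch.zeros(L, P, d, r)          # up-projections (zero-initialized)
delta_lora = torch.einsum('lpdr,lprk->lpdk', B, A)   # (L, P, d, d) updates

# --- Tensor alternative: one joint Tucker-style factorization of the full
# --- update tensor, with a separate (mode-specific) rank per mode ---
r_out, r_in, r_proj, r_layer = 4, 4, 2, 3
core    = torch.randn(r_out, r_in, r_proj, r_layer) * 0.01
U_out   = torch.randn(d, r_out)      # output-dimension factor
U_in    = torch.randn(d, r_in)       # input-dimension factor
U_proj  = torch.randn(P, r_proj)     # Q/K/V factor (shared across projections)
U_layer = torch.randn(L, r_layer)    # layer factor (shared across depth)

# Reconstruct the full (d, d, P, L) update tensor from the factors.
delta_tens = torch.einsum('wxyz,iw,jx,py,lz->ijpl',
                          core, U_out, U_in, U_proj, U_layer)

# Update applied to, e.g., the Query projection of layer 0:
delta_q0 = delta_tens[:, :, 0, 0]    # shape (d, d)

print(delta_lora.shape, delta_tens.shape, delta_q0.shape)
```

Because the projection and layer modes are compressed jointly, the per-mode ranks (r_out, r_in, r_proj, r_layer) can be tuned independently, which is how a parameter budget can be tailored to the task rather than fixed by a single rank r.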
Similar Papers
TLoRA: Tri-Matrix Low-Rank Adaptation of Large Language Models
Machine Learning (CS)
Makes AI learn new things with less effort.
Serial Low-rank Adaptation of Vision Transformer
CV and Pattern Recognition
Makes computer vision models use less memory.
MetaLoRA: Tensor-Enhanced Adaptive Low-Rank Fine-tuning
Machine Learning (CS)
Teaches computers to learn new things faster.