TeRA: Vector-based Random Tensor Network for High-Rank Adaptation of Large Language Models
By: Yuxuan Gu, Wuyang Zhou, Giorgos Iacovides, and more
Potential Business Impact:
Makes smart computer programs learn new tasks while training far fewer of their internal settings.
Parameter-Efficient Fine-Tuning (PEFT) methods, such as Low-Rank Adaptation (LoRA), have significantly reduced the number of trainable parameters needed in fine-tuning large language models (LLMs). Subsequent developments of LoRA-style adapters have diverged into two main directions: (1) enhancing model expressivity with high-rank adapters, and (2) pushing for further parameter reduction, as exemplified by vector-based methods. However, these approaches present a trade-off, as achieving the expressivity of high-rank weight updates typically comes at the cost of sacrificing the extreme parameter efficiency offered by vector-based techniques. To address this issue, we propose a vector-based random Tensor network for high-Rank Adaptation (TeRA), a novel PEFT method that achieves high-rank weight updates while retaining the parameter efficiency of vector-based PEFT adapters. This is achieved by parameterizing the tensorized weight update matrix as a Tucker-like tensor network (TN), in which large randomly initialized factors are frozen and shared across layers, while only small layer-specific scaling vectors, formed by entries in diagonal factor matrices, are trained. This design effectively decouples the rank of the weight update matrix from the number of trainable parameters. Comprehensive experiments demonstrate that TeRA matches or even outperforms high-rank adapters, while requiring a trainable parameter count similar to vector-based methods. Theoretical analysis and ablation studies further validate the effectiveness of our approach.
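The mechanism described in the abstract, a Tucker-like tensor network whose large random core and factors are frozen and shared across layers while only small per-layer diagonal scaling vectors are trained, can be illustrated with a short sketch. The code below is a minimal, hypothetical PyTorch illustration written only from the abstract: the names (TeRALinearSketch, make_shared), the choice of a fourth-order tensorization, the ranks, and the zero-update initialization are all assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn


class TeRALinearSketch(nn.Module):
    """Frozen nn.Linear plus a Tucker-like, vector-trained weight update (sketch).

    The update dW (shape m x n, with m = m1*m2 and n = n1*n2) is tensorized to
    (m1, m2, n1, n2) and parameterized as
        dW = G x1 (U1 diag(s1)) x2 (U2 diag(s2)) x3 (U3 diag(s3)) x4 (U4 diag(s4)),
    where the core G and the factors U_k are frozen random tensors shared across
    layers, and only the small scaling vectors s_k are trained per layer.
    """

    def __init__(self, base: nn.Linear, shared: dict, alpha: float = 1.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad_(False)          # pretrained weights stay frozen
        self.shared = shared                 # frozen core/factors, reused by every layer
        ranks = [u.shape[1] for u in shared["U"]]
        # Layer-specific trainable part: one small vector per mode.
        # Zeroing the first vector makes dW = 0 at initialization (an assumption;
        # the paper's actual initialization scheme may differ).
        self.scales = nn.ParameterList(
            [nn.Parameter(torch.zeros(ranks[0]))]
            + [nn.Parameter(torch.ones(r)) for r in ranks[1:]]
        )
        self.alpha = alpha

    def delta_weight(self) -> torch.Tensor:
        G = self.shared["G"]                                   # (r1, r2, r3, r4), frozen
        factors = [u * s for u, s in zip(self.shared["U"], self.scales)]  # U_k diag(s_k)
        # Tucker reconstruction, then matricization back to an (m, n) update.
        t = torch.einsum("abcd,ia,jb,kc,ld->ijkl", G, *factors)
        m = factors[0].shape[0] * factors[1].shape[0]
        n = factors[2].shape[0] * factors[3].shape[0]
        return t.reshape(m, n)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.alpha * (x @ self.delta_weight().T)


def make_shared(m1, m2, n1, n2, r=(64, 64, 64, 64)):
    """Randomly initialized, frozen core and factor matrices shared across layers."""
    core = torch.randn(*r) / (r[0] ** 0.5)
    factors = [torch.randn(d, rk) / (d ** 0.5) for d, rk in zip((m1, m2, n1, n2), r)]
    return {"G": core, "U": factors}


# Example: adapt a 1024 -> 1024 projection (1024 = 32 * 32).
shared = make_shared(32, 32, 32, 32)
layer = TeRALinearSketch(nn.Linear(1024, 1024), shared)
print(sum(p.numel() for p in layer.parameters() if p.requires_grad))  # 256 trainable scalars
```

With the illustrative ranks (64, 64, 64, 64), each adapted layer trains only 256 scalars, yet the matricized update can have rank up to min(m, n, 64*64), which is one way to picture the decoupling of update rank from trainable-parameter count that the abstract describes.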
Similar Papers
Towards Higher Effective Rank in Parameter-efficient Fine-tuning using Khatri-Rao Product
Machine Learning (CS)
Makes AI learn better without needing more power.
HyperAdaLoRA: Accelerating LoRA Rank Allocation During Training via Hypernetworks without Sacrificing Performance
Machine Learning (CS)
Makes AI learn faster without needing more power.
1LoRA: Summation Compression for Very Low-Rank Adaptation
Computer Vision and Pattern Recognition
Makes big computer brains learn faster with less effort.