Multivariate Polynomial Codes for Efficient Matrix Chain Multiplication in Distributed Systems
By: Jesús Gómez-Vilardebò
We study the problem of computing matrix chain multiplications in a distributed computing cluster. In such systems, performance is often limited by the straggler problem, where the slowest worker dominates the overall computation latency. To resolve this issue, several coded computing strategies have been proposed, primarily focusing on the simplest case: the multiplication of two matrices. These approaches successfully alleviate the straggler effect, but they do so at the expense of higher computational complexity and increased storage needs at the workers. However, in many real-world applications, computations naturally involve long chains of matrix multiplications rather than just a single two-matrix product. Extending univariate polynomial coding to this setting has been shown to amplify the costs -- both computation and storage overheads grow significantly, limiting scalability. In this work, we propose two novel multivariate polynomial coding schemes specifically designed for matrix chain multiplication in distributed environments. Our results show that while multivariate codes introduce additional computational cost at the workers, they can dramatically reduce storage overhead compared to univariate extensions. This reveals a fundamental trade-off between computation and storage efficiency, and highlights the potential of multivariate codes as a practical solution for large-scale distributed linear algebra tasks.
Similar Papers
Channel Coding based on Skew Polynomials and Multivariate Polynomials
Information Theory
Makes computers store data safer from mistakes.
Distributed Matrix Multiplication-Friendly Algebraic Function Fields
Information Theory
Makes computer math faster for certain tasks.
Quantum Private Distributed Matrix Multiplication With Degree Tables
Information Theory
Makes private math calculations faster using quantum tricks.