Global Convergence of Four-Layer Matrix Factorization under Random Initialization
By: Minrui Luo, Weihang Xu, Xiang Gao, and more
Potential Business Impact:
Makes deep learning on computers work better.
Gradient descent dynamics on the deep matrix factorization problem are extensively studied as a simplified theoretical model for deep neural networks. Although the convergence theory for two-layer matrix factorization is well established, no global convergence guarantee for general deep matrix factorization under random initialization has been established to date. To address this gap, we provide a polynomial-time global convergence guarantee for randomly initialized gradient descent on four-layer matrix factorization, given certain conditions on the target matrix and a standard balanced regularization term. Our analysis employs new techniques to show saddle-avoidance properties of gradient descent dynamics, and extends previous theories to characterize the evolution of the eigenvalues of the layer weights.
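To make the setting concrete, the sketch below shows randomly initialized gradient descent on a four-layer matrix factorization objective with a balanced regularization term. It is not the authors' code: the specific loss form, the regularizer sum_i ||W_{i+1}^T W_{i+1} - W_i W_i^T||_F^2, and all hyperparameters (width, step size, number of steps) are assumptions chosen for illustration.

```python
# Minimal sketch (assumed setup, not the paper's implementation): gradient
# descent on a four-layer factorization W4 W3 W2 W1 ~ target, with a
# balanced regularization term penalizing imbalance between adjacent layers.
import jax
import jax.numpy as jnp


def loss(params, target, lam=0.1):
    W1, W2, W3, W4 = params
    # Reconstruction error of the end-to-end product W4 W3 W2 W1.
    residual = W4 @ W3 @ W2 @ W1 - target
    fit = jnp.sum(residual ** 2)
    # Balanced regularization (a common choice, assumed here):
    # sum_i ||W_{i+1}^T W_{i+1} - W_i W_i^T||_F^2.
    balance = sum(
        jnp.sum((Wn.T @ Wn - Wp @ Wp.T) ** 2)
        for Wp, Wn in zip((W1, W2, W3), (W2, W3, W4))
    )
    return fit + lam * balance


def train(target, width=16, steps=2000, lr=1e-3, seed=0):
    m, n = target.shape
    keys = jax.random.split(jax.random.PRNGKey(seed), 4)
    # Layer shapes: W1 is width x n, W2 and W3 are width x width, W4 is m x width.
    dims = [n, width, width, width, m]
    # Random (small Gaussian) initialization of all four layers.
    params = [0.1 * jax.random.normal(k, (dims[i + 1], dims[i]))
              for i, k in enumerate(keys)]
    grad_fn = jax.jit(jax.grad(loss))
    for _ in range(steps):
        grads = grad_fn(params, target)
        params = [W - lr * g for W, g in zip(params, grads)]
    return params


if __name__ == "__main__":
    # Illustrative usage: fit a random rank-2 symmetric target.
    key = jax.random.PRNGKey(1)
    A = jax.random.normal(key, (8, 2))
    target = A @ A.T
    W1, W2, W3, W4 = train(target)
    print(float(jnp.linalg.norm(W4 @ W3 @ W2 @ W1 - target)))
```

Under the paper's conditions on the target matrix, this kind of randomly initialized, regularized gradient descent is what the stated polynomial-time global convergence guarantee concerns.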
Similar Papers
Global Convergence of Four-Layer Matrix Factorization under Random Initialization
Optimization and Control
Makes deep learning on computers work better.
Understanding Incremental Learning with Closed-form Solution to Gradient Flow on Overparameterized Matrix Factorization
Machine Learning (CS)
Teaches computers to learn things step-by-step.
Global Convergence Analysis of Vanilla Gradient Descent for Asymmetric Matrix Completion
Machine Learning (CS)
Makes computers fill in missing data faster.