
PCA recovery thresholds in low-rank matrix inference with sparse noise

Published: November 14, 2025 | arXiv ID: 2511.11927v1

By: Urte Adomaityte, Gabriele Sicuro, Pierpaolo Vivo

Potential Business Impact:

Finds hidden patterns in messy data.

Business Areas:

Predictive Analytics, Artificial Intelligence, Data and Analytics, Software

We study the high-dimensional inference of a rank-one signal corrupted by sparse noise. The noise is modelled as the adjacency matrix of a weighted undirected graph with finite average connectivity in the large-size limit. Using the replica method from statistical physics, we analytically compute the typical value of the top eigenvalue, the density of the top eigenvector's components, and the overlap between the signal vector and the top eigenvector. The solution is given in terms of recursive distributional equations for auxiliary probability density functions, which can be solved efficiently using a population dynamics algorithm. Specialising the noise matrix to Poissonian and random regular degree distributions, we analytically identify the critical signal strength at which recovery of the signal via the top eigenvector becomes possible, thus generalising the celebrated BBP transition to the sparse-noise case. In the large-connectivity limit, known results for dense noise are recovered. The analytical results agree with numerical diagonalisation of large matrices.
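The numerical check mentioned in the abstract, diagonalising a large spiked matrix and measuring the overlap between its top eigenvector and the signal, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the Erdős-Rényi graph with mean degree `c`, the Gaussian edge weights scaled by `1/sqrt(c)`, the unit-norm Gaussian signal, the additive form `theta * x x^T + noise`, and the function names themselves.

```python
import numpy as np


def spiked_sparse_matrix(n, c, theta, rng):
    """Return (matrix, signal) for a rank-one spike plus sparse symmetric noise.

    Assumed model (illustrative only): each edge is present with probability
    c/n and carries an independent Gaussian weight of variance 1/c.
    """
    mask = rng.random((n, n)) < c / n              # edge indicators
    weights = rng.normal(size=(n, n)) / np.sqrt(c)  # edge weights
    upper = np.triu(mask * weights, k=1)            # keep strict upper triangle
    noise = upper + upper.T                         # symmetrise, zero diagonal

    x = rng.normal(size=n)
    x /= np.linalg.norm(x)                          # unit-norm signal vector
    return theta * np.outer(x, x) + noise, x


def top_eigen_overlap(n=400, c=4.0, theta=5.0, seed=0):
    """Top eigenvalue and squared overlap between top eigenvector and signal."""
    rng = np.random.default_rng(seed)
    matrix, x = spiked_sparse_matrix(n, c, theta, rng)
    eigenvalues, eigenvectors = np.linalg.eigh(matrix)  # ascending order
    v = eigenvectors[:, -1]                              # top eigenvector
    return float(eigenvalues[-1]), float(np.dot(v, x) ** 2)
```

Sweeping `theta` at fixed `c` and plotting the squared overlap gives a rough empirical picture of a recovery transition in this toy model; the location of the threshold depends on the normalisation assumed above and should not be read as the paper's result.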

Spectral Thresholds in Correlated Spiked Models and Fundamental Limits of Partial Least Squares

Statistics Theory

Finds hidden connections in messy, big data.

20 Oct 2025 0

88%

Nonlinear Laplacians: Tunable principal component analysis under directional prior information

Machine Learning (Stat)

Find hidden patterns in messy data better.

18 May 2025 1

88%

Evaluating Singular Value Thresholds for DNN Weight Matrices based on Random Matrix Theory

Machine Learning (Stat)

Cleans up computer brains for better learning.

15 Dec 2025 0


Page Count

24 pages
