Almost-Optimal Local-Search Methods for Sparse Tensor PCA
By: Max Lovig , Conor Sheehan , Konstantinos Tsirkas and more
Potential Business Impact:
Makes computers find patterns in data better.
Local-search methods are widely employed in statistical applications, yet interestingly, their theoretical foundations remain rather underexplored, compared to other classes of estimators such as low-degree polynomials and spectral methods. Of note, among the few existing results recent studies have revealed a significant "local-computational" gap in the context of a well-studied sparse tensor principal component analysis (PCA), where a broad class of local Markov chain methods exhibits a notable underperformance relative to other polynomial-time algorithms. In this work, we propose a series of local-search methods that provably "close" this gap to the best known polynomial-time procedures in multiple regimes of the model, including and going beyond the previously studied regimes in which the broad family of local Markov chain methods underperforms. Our framework includes: (1) standard greedy and randomized greedy algorithms applied to the (regularized) posterior of the model; and (2) novel random-threshold variants, in which the randomized greedy algorithm accepts a proposed transition if and only if the corresponding change in the Hamiltonian exceeds a random Gaussian threshold-rather that if and only if it is positive, as is customary. The introduction of the random thresholds enables a tight mathematical analysis of the randomized greedy algorithm's trajectory by crucially breaking the dependencies between the iterations, and could be of independent interest to the community.
Similar Papers
Computational lower bounds in latent models: clustering, sparse-clustering, biclustering
Statistics Theory
Finds limits of computer problem-solving abilities.
Accelerating Randomized Algorithms for Low-Rank Matrix Approximation
Computation
Makes computers find patterns in data faster.
Few-Round Distributed Principal Component Analysis: Closing the Statistical Efficiency Gap by Consensus
Methodology
Improves computer analysis of huge amounts of data.