Robust Randomized Low-Rank Approximation with Row-Wise Outlier Detection
By: Aidan Tiruvan
Potential Business Impact:
Cleans messy data to find true patterns.
Robust low-rank approximation under row-wise adversarial corruption can be achieved with a single pass, randomized procedure that detects and removes outlier rows by thresholding their projected norms. We propose a scalable, non-iterative algorithm that efficiently recovers the underlying low-rank structure in the presence of row-wise adversarial corruption. By first compressing the data with a Johnson Lindenstrauss projection, our approach preserves the geometry of clean rows while dramatically reducing dimensionality. Robust statistical techniques based on the median and median absolute deviation then enable precise identification and removal of outlier rows with abnormally high norms. The subsequent rank-k approximation achieves near-optimal error bounds with a one pass procedure that scales linearly with the number of observations. Empirical results confirm that combining random sketches with robust statistics yields efficient, accurate decompositions even in the presence of large fractions of corrupted rows.
Similar Papers
Efficient adaptive randomized algorithms for fixed-threshold low-rank matrix approximation
Numerical Analysis
Makes computers find important patterns in big data faster.
Clustering-based Low-Rank Matrix Approximation: An Adaptive Theoretical Analysis with Application to Data Compression
Machine Learning (CS)
Makes medical images smaller without losing detail.
Minimal Order Recovery through Rank-adaptive Identification
Systems and Control
Finds hidden patterns in messy data.