Robust Multilinear Principal Component Analysis
By: Mehdi Hirari, Fabio Centofanti, Mia Hubert and more
Potential Business Impact:
Fixes messy data so computers can understand it.
Multilinear Principal Component Analysis (MPCA) is an important tool for analyzing tensor data. It performs dimension reduction similar to PCA for multivariate data. However, standard MPCA is sensitive to outliers. It is highly influenced by observations deviating from the bulk of the data, called casewise outliers, as well as by individual outlying cells in the tensors, so-called cellwise outliers. This latter type of outlier is highly likely to occur in tensor data, as tensors typically consist of many cells. This paper introduces a novel robust MPCA method that can handle both types of outliers simultaneously, and can cope with missing values as well. This method uses a single loss function to reduce the influence of both casewise and cellwise outliers. The solution that minimizes this loss function is computed using an iteratively reweighted least squares algorithm with a robust initialization. Graphical diagnostic tools are also proposed to identify the different types of outliers that have been found by the new robust MPCA method. The performance of the method and associated graphical displays is assessed through simulations and illustrated on two real datasets.
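To make the dimension-reduction step concrete, here is a minimal sketch of classical, non-robust MPCA for a stack of matrices (order-2 tensors), written with NumPy. It is not the robust method proposed in the paper: it has no robust loss, no reweighting, and no handling of missing values. The function names (`top_eigvecs`, `mpca_fit`, `mpca_transform`, `mpca_reconstruct`), the truncated-identity initialization, and the fixed iteration count are assumptions made for illustration only.

```python
import numpy as np

def top_eigvecs(S, r):
    """Return the r eigenvectors of the symmetric matrix S with largest eigenvalues."""
    _, V = np.linalg.eigh(S)          # eigh returns eigenvalues in ascending order
    return V[:, ::-1][:, :r]

def mpca_fit(X, ranks, n_iter=20):
    """Classical MPCA for X of shape (N, I1, I2); returns the mean and two projection matrices.

    Assumption: a truncated identity is used as a simple starting value for U2.
    """
    r1, r2 = ranks
    mean = X.mean(axis=0)
    Xc = X - mean                      # center every cell across the N samples
    U2 = np.eye(X.shape[2])[:, :r2]
    for _ in range(n_iter):
        # Mode-1 update: scatter of the samples after projecting mode 2 onto U2.
        P = Xc @ U2                                    # shape (N, I1, r2)
        U1 = top_eigvecs(np.einsum('nik,njk->ij', P, P), r1)
        # Mode-2 update: scatter of the samples after projecting mode 1 onto U1.
        Q = np.swapaxes(Xc, 1, 2) @ U1                 # shape (N, I2, r1)
        U2 = top_eigvecs(np.einsum('nik,njk->ij', Q, Q), r2)
    return mean, U1, U2

def mpca_transform(X, mean, U1, U2):
    """Project each centered sample onto the two subspaces: shape (N, r1, r2)."""
    return U1.T @ (X - mean) @ U2

def mpca_reconstruct(scores, mean, U1, U2):
    """Map the low-dimensional scores back to the original tensor space."""
    return mean + U1 @ scores @ U2.T

# Synthetic data with exact multilinear rank (2, 2), so the fit should be lossless.
rng = np.random.default_rng(0)
A = rng.normal(size=(6, 2))
B = rng.normal(size=(5, 2))
cores = rng.normal(size=(50, 2, 2))
X = A @ cores @ B.T + rng.normal(size=(6, 5))   # the last term is a common mean matrix

mean, U1, U2 = mpca_fit(X, ranks=(2, 2))
scores = mpca_transform(X, mean, U1, U2)
X_hat = mpca_reconstruct(scores, mean, U1, U2)
```

Because the sample mean and the eigenvectors above are least-squares quantities, a single grossly wrong cell or an entire deviating sample can shift them noticeably. This is the sensitivity that motivates replacing the squared-error objective with a robust loss minimized by iteratively reweighted least squares, as the abstract describes.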
Similar Papers
Outlier-aware Tensor Robust Principal Component Analysis with Self-guided Data Augmentation
Numerical Analysis
Cleans messy data better for clearer pictures.
Highly robust factored principal component analysis for matrix-valued outlier accommodation and explainable detection via matrix minimum covariance determinant
Methodology
Finds bad data points in complex pictures.
Principal Component Analysis When n < p: Challenges and Solutions
Methodology
Makes computer analysis better with messy, complex data.