Uncertainty-Aware PCA for Arbitrarily Distributed Data Modeled by Gaussian Mixture Models
By: Daniel Klötzl , Ozan Tastekin , David Hägele and more
Potential Business Impact:
Shows hidden patterns in messy data.
Multidimensional data is often associated with uncertainties that are not well-described by normal distributions. In this work, we describe how such distributions can be projected to a low-dimensional space using uncertainty-aware principal component analysis (UAPCA). We propose to model multidimensional distributions using Gaussian mixture models (GMMs) and derive the projection from a general formulation that allows projecting arbitrary probability density functions. The low-dimensional projections of the densities exhibit more details about the distributions and represent them more faithfully compared to UAPCA mappings. Further, we support including user-defined weights between the different distributions, which allows for varying the importance of the multidimensional distributions. We evaluate our approach by comparing the distributions in low-dimensional space obtained by our method and UAPCA to those obtained by sample-based projections.
Similar Papers
Probabilistic Geometric Principal Component Analysis with application to neural data
Machine Learning (CS)
Finds hidden patterns in brain data.
Probabilistic PCA on tensors
Statistics Theory
Finds patterns in many connected data points.
Classification EM-PCA for clustering and embedding
Machine Learning (Stat)
Makes computer groups find patterns faster and better.