Score: 0

Uncertainty-Aware PCA for Arbitrarily Distributed Data Modeled by Gaussian Mixture Models

Published: August 19, 2025 | arXiv ID: 2508.13990v1

By: Daniel Klötzl , Ozan Tastekin , David Hägele and more

Potential Business Impact:

Shows hidden patterns in messy data.

Business Areas:
Predictive Analytics Artificial Intelligence, Data and Analytics, Software

Multidimensional data is often associated with uncertainties that are not well-described by normal distributions. In this work, we describe how such distributions can be projected to a low-dimensional space using uncertainty-aware principal component analysis (UAPCA). We propose to model multidimensional distributions using Gaussian mixture models (GMMs) and derive the projection from a general formulation that allows projecting arbitrary probability density functions. The low-dimensional projections of the densities exhibit more details about the distributions and represent them more faithfully compared to UAPCA mappings. Further, we support including user-defined weights between the different distributions, which allows for varying the importance of the multidimensional distributions. We evaluate our approach by comparing the distributions in low-dimensional space obtained by our method and UAPCA to those obtained by sample-based projections.

Country of Origin
🇩🇪 Germany

Page Count
10 pages

Category
Statistics:
Machine Learning (Stat)