Score: 0

Inference on covariance structure in high-dimensional multi-view data

Published: September 2, 2025 | arXiv ID: 2509.02772v1

By: Lorenzo Mauri, David B. Dunson

Potential Business Impact:

Finds patterns in different kinds of data.

Business Areas:
A/B Testing Data and Analytics

This article focuses on covariance estimation for multi-view data. Popular approaches rely on factor-analytic decompositions that have shared and view-specific latent factors. Posterior computation is conducted via expensive and brittle Markov chain Monte Carlo (MCMC) sampling or variational approximations that underestimate uncertainty and lack theoretical guarantees. Our proposed methodology employs spectral decompositions to estimate and align latent factors that are active in at least one view. Conditionally on these factors, we choose jointly conjugate prior distributions for factor loadings and residual variances. The resulting posterior is a simple product of normal-inverse gamma distributions for each variable, bypassing MCMC and facilitating posterior computation. We prove favorable increasing-dimension asymptotic properties, including posterior contraction and central limit theorems for point estimators. We show excellent performance in simulations, including accurate uncertainty quantification, and apply the methodology to integrate four high-dimensional views from a multi-omics dataset of cancer cell samples.

Country of Origin
🇺🇸 United States

Page Count
32 pages

Category
Statistics:
Methodology