SO(3)-invariant PCA with application to molecular data

Published: October 21, 2025 | arXiv ID: 2510.18827v1

By: Michael Fraiman, Paulina Hoyos, Tamir Bendory and more

Potential Business Impact:

Makes 3D pictures of tiny things easier to see.

Business Areas:

Bioinformatics Biotechnology, Data and Analytics, Science and Engineering

Principal component analysis (PCA) is a fundamental technique for dimensionality reduction and denoising; however, its application to three-dimensional data with arbitrary orientations -- common in structural biology -- presents significant challenges. A naive approach requires augmenting the dataset with many rotated copies of each sample, incurring prohibitive computational costs. In this paper, we extend PCA to 3D volumetric datasets with unknown orientations by developing an efficient and principled framework for SO(3)-invariant PCA that implicitly accounts for all rotations without explicit data augmentation. By exploiting underlying algebraic structure, we demonstrate that the computation involves only the square root of the total number of covariance entries, resulting in a substantial reduction in complexity. We validate the method on real-world molecular datasets, demonstrating its effectiveness and opening up new possibilities for large-scale, high-dimensional reconstruction problems.
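To give a sense of the sizes the abstract refers to, here is a minimal arithmetic sketch. It assumes each sample is a cubic grid of `side` voxels per axis flattened into a vector, which the abstract does not specify; the function names, the grid size, and the rotation count are hypothetical and chosen only for illustration. It does not implement the paper's method; it only counts entries.

```python
import math


def covariance_sizes(side: int) -> dict:
    """Entry counts for a cubic volume with `side` voxels per axis (assumed layout)."""
    dim = side ** 3          # length of one flattened sample
    cov_entries = dim ** 2   # entries in the full dim x dim covariance matrix
    return {
        "dim": dim,
        "cov_entries": cov_entries,
        # The quantity the abstract names: the square root of the covariance entry count.
        "sqrt_cov_entries": math.isqrt(cov_entries),
    }


def augmented_samples(n_samples: int, n_rotations: int) -> int:
    """Dataset size for the naive approach: each sample stored at every sampled rotation."""
    return n_samples * n_rotations


if __name__ == "__main__":
    print(covariance_sizes(64))
    print(augmented_samples(10_000, 1_000))
```

Under these assumptions, a 64-voxel cube gives a full covariance with 68,719,476,736 entries, whose square root is 262,144, and augmenting 10,000 samples with 1,000 rotations each would yield 10,000,000 samples.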

Highly robust factored principal component analysis for matrix-valued outlier accommodation and explainable detection via matrix minimum covariance determinant

Methodology

Finds bad data points in complex pictures.

30 Sep 2025

87%

Beyond Regularization: Inherently Sparse Principal Component Analysis

Methodology

Finds hidden patterns in complex information.

4 Oct 2025

87%

Probabilistic Geometric Principal Component Analysis with application to neural data

Machine Learning (CS)

Finds hidden patterns in brain data.

22 Sep 2025

Page Count

5 pages
