Cryo-EM as a Stochastic Inverse Problem
By: Diego Sanchez Espinosa, Erik H Thiede, Yunan Yang
Potential Business Impact:
Shows how tiny parts of bodies move.
Cryo-electron microscopy (Cryo-EM) enables high-resolution imaging of biomolecules, but structural heterogeneity remains a major challenge in 3D reconstruction. Traditional methods assume a discrete set of conformations, limiting their ability to recover continuous structural variability. In this work, we formulate cryo-EM reconstruction as a stochastic inverse problem (SIP) over probability measures, where the observed images are modeled as the push-forward of an unknown distribution over molecular structures via a random forward operator. We pose the reconstruction problem as the minimization of a variational discrepancy between observed and simulated image distributions, using statistical distances such as the KL divergence and the Maximum Mean Discrepancy. The resulting optimization is performed over the space of probability measures via a Wasserstein gradient flow, which we numerically solve using particles to represent and evolve conformational ensembles. We validate our approach using synthetic examples, including a realistic protein model, which demonstrates its ability to recover continuous distributions over structural states. We analyze the connection between our formulation and Maximum A Posteriori (MAP) approaches, which can be interpreted as instances of the discretize-then-optimize (DTO) framework. We further provide a consistency analysis, establishing conditions under which DTO methods, such as MAP estimation, converge to the solution of the underlying infinite-dimensional continuous problem. Beyond cryo-EM, the framework provides a general methodology for solving SIPs involving random forward operators.
Similar Papers
Reconstructing Heterogeneous Biomolecules via Hierarchical Gaussian Mixtures and Part Discovery
Quantitative Methods
Shows tiny body parts in 3D, even when broken.
Multiscale guidance of AlphaFold3 with heterogeneous cryo-EM data
Machine Learning (CS)
Helps scientists see how moving body parts work.
Two Datasets Are Better Than One: Method of Double Moments for 3-D Reconstruction in Cryo-EM
CV and Pattern Recognition
Reconstructs tiny cell parts from blurry pictures.