Score: 1

Enhancing Diffusion-Based Sampling with Molecular Collective Variables

Published: October 13, 2025 | arXiv ID: 2510.11923v1

By: Juno Nam , Bálint Máté , Artur P. Toshev and more

BigTech Affiliations: Meta Massachusetts Institute of Technology

Potential Business Impact:

Finds new molecule shapes faster than before.

Business Areas:
Bioinformatics Biotechnology, Data and Analytics, Science and Engineering

Diffusion-based samplers learn to sample complex, high-dimensional distributions using energies or log densities alone, without training data. Yet, they remain impractical for molecular sampling because they are often slower than molecular dynamics and miss thermodynamically relevant modes. Inspired by enhanced sampling, we encourage exploration by introducing a sequential bias along bespoke, information-rich, low-dimensional projections of atomic coordinates known as collective variables (CVs). We introduce a repulsive potential centered on the CVs from recent samples, which pushes future samples towards novel CV regions and effectively increases the temperature in the projected space. Our resulting method improves efficiency, mode discovery, enables the estimation of free energy differences, and retains independent sampling from the approximate Boltzmann distribution via reweighting by the bias. On standard peptide conformational sampling benchmarks, the method recovers diverse conformational states and accurate free energy profiles. We are the first to demonstrate reactive sampling using a diffusion-based sampler, capturing bond breaking and formation with universal interatomic potentials at near-first-principles accuracy. The approach resolves reactive energy landscapes at a fraction of the wall-clock time of standard sampling methods, advancing diffusion-based sampling towards practical use in molecular sciences.

Country of Origin
🇺🇸 United States

Page Count
30 pages

Category
Physics:
Chemical Physics