Enhanced Sampling, Public Dataset and Generative Model for Drug-Protein Dissociation Dynamics
By: Maodong Li , Jiying Zhang , Bin Feng and more
Potential Business Impact:
Predicts how drugs unstick from bodies.
Drug-protein binding and dissociation dynamics are fundamental to understanding molecular interactions in biological systems. While many tools for drug-protein interaction studies have emerged, especially artificial intelligence (AI)-based generative models, predictive tools on binding/dissociation kinetics and dynamics are still limited. We propose a novel research paradigm that combines molecular dynamics (MD) simulations, enhanced sampling, and AI generative models to address this issue. We propose an enhanced sampling strategy to efficiently implement the drug-protein dissociation process in MD simulations and estimate the free energy surface (FES). We constructed a program pipeline of MD simulations based on this sampling strategy, thus generating a dataset including 26,612 drug-protein dissociation trajectories containing about 13 million frames. We named this dissociation dynamics dataset DD-13M and used it to train a deep equivariant generative model UnbindingFlow, which can generate collision-free dissociation trajectories. The DD-13M database and UnbindingFlow model represent a significant advancement in computational structural biology, and we anticipate its broad applicability in machine learning studies of drug-protein interactions. Our ongoing efforts focus on expanding this methodology to encompass a broader spectrum of drug-protein complexes and exploring novel applications in pathway prediction.
Similar Papers
Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows
Biomolecules
Finds new medicines by watching how proteins change.
Concept-Driven Deep Learning for Enhanced Protein-Specific Molecular Generation
Machine Learning (CS)
Designs new medicines that fit the body.
BioMD: All-atom Generative Model for Biomolecular Dynamics Simulation
Chemical Physics
Simulates long protein changes for new medicines.